ID A0A8S1BC46_ARCPL Unreviewed; 414 AA.
AC A0A8S1BC46;
DT 12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT 12-OCT-2022, sequence version 1.
DT 28-JAN-2026, entry version 12.
DE RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN ORFNames=APLA_LOCUS17408 {ECO:0000313|EMBL:CAB3260337.1};
OS Arctia plantaginis (Wood tiger moth) (Phalaena plantaginis).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Erebidae; Arctiinae; Arctia.
OX NCBI_TaxID=874455 {ECO:0000313|EMBL:CAB3260337.1, ECO:0000313|Proteomes:UP000494106};
RN [1] {ECO:0000313|EMBL:CAB3260337.1, ECO:0000313|Proteomes:UP000494106}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wallbank WR R., Pardo Diaz C., Kozak K., Martin S., Jiggins C., Moest M.,
RA Warren A I., Byers J.R.P. K., Montejo-Kovacevich G., Yen C E.;
RL Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAB3260337.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CADEBC010000733; CAB3260337.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A8S1BC46; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000494106; Unassembled WGS sequence.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000494106}.
FT DOMAIN 113..160
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 202..366
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 18..27
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 45..60
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 414 AA; 44806 MW; 8A3BE78DF7E1479A CRC64;
MITLKFKGEK GERGFDGLPG LPGATGPAGP PGPSGALSEA IQYLPGPPGP PGSPGPPGPP
GVSIVGPKGE PGFSHYEEYP VHGSPKIYGR PRPTHAHSGA GQAEGTSKSV PSAAVYQTTE
EMLKLASSNP VGALAYVVEE QALFVKINSG WQYVLLGSLV TQAAPAPATP SPPPPPMPAA
SLVHVPPISN YVENTPVVGP SIHLAALNEP LSGSMHGIRR ADYACYRQGR RAGFRGTFRA
LLTSKIQNLN SIVRYSDRHL PVVNTYGEML FRSFSDMFHG NDALSPEARI YSFDGRNIMT
DPHWPQKVIW HGSRTNGERA LDAYCNEWQN GDPTNRGLAS SLQGHKMLAQ ERYPCSNHFA
VLCIEVASEL HPRRKREVAR TNATGVLDDE DYLYNAEEYQ QLLNEIFAQP FREN
//