GenomeNet

Database: UniProt
Entry: A0A8S1BC46_ARCPL
ID   A0A8S1BC46_ARCPL        Unreviewed;       414 AA.
AC   A0A8S1BC46;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 12.
DE   RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=APLA_LOCUS17408 {ECO:0000313|EMBL:CAB3260337.1};
OS   Arctia plantaginis (Wood tiger moth) (Phalaena plantaginis).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC   Erebidae; Arctiinae; Arctia.
OX   NCBI_TaxID=874455 {ECO:0000313|EMBL:CAB3260337.1, ECO:0000313|Proteomes:UP000494106};
RN   [1] {ECO:0000313|EMBL:CAB3260337.1, ECO:0000313|Proteomes:UP000494106}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Wallbank WR R., Pardo Diaz C., Kozak K., Martin S., Jiggins C., Moest M.,
RA   Warren A I., Byers J.R.P. K., Montejo-Kovacevich G., Yen C E.;
RL   Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAB3260337.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CADEBC010000733; CAB3260337.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A8S1BC46; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000494106; Unassembled WGS sequence.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000494106}.
FT   DOMAIN          113..160
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          202..366
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..112
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..15
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        18..27
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        45..60
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   414 AA;  44806 MW;  8A3BE78DF7E1479A CRC64;
     MITLKFKGEK GERGFDGLPG LPGATGPAGP PGPSGALSEA IQYLPGPPGP PGSPGPPGPP
     GVSIVGPKGE PGFSHYEEYP VHGSPKIYGR PRPTHAHSGA GQAEGTSKSV PSAAVYQTTE
     EMLKLASSNP VGALAYVVEE QALFVKINSG WQYVLLGSLV TQAAPAPATP SPPPPPMPAA
     SLVHVPPISN YVENTPVVGP SIHLAALNEP LSGSMHGIRR ADYACYRQGR RAGFRGTFRA
     LLTSKIQNLN SIVRYSDRHL PVVNTYGEML FRSFSDMFHG NDALSPEARI YSFDGRNIMT
     DPHWPQKVIW HGSRTNGERA LDAYCNEWQN GDPTNRGLAS SLQGHKMLAQ ERYPCSNHFA
     VLCIEVASEL HPRRKREVAR TNATGVLDDE DYLYNAEEYQ QLLNEIFAQP FREN
//