GenomeNet

Database: UniProt
Entry: A0A8S1BTF0_ARCPL
LinkDB: A0A8S1BTF0_ARCPL
Original site: A0A8S1BTF0_ARCPL 
ID   A0A8S1BTF0_ARCPL        Unreviewed;       785 AA.
AC   A0A8S1BTF0;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 15.
DE   RecName: Full=Collagen alpha-1(XV) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=APLA_LOCUS17402 {ECO:0000313|EMBL:CAB3260330.1};
OS   Arctia plantaginis (Wood tiger moth) (Phalaena plantaginis).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC   Erebidae; Arctiinae; Arctia.
OX   NCBI_TaxID=874455 {ECO:0000313|EMBL:CAB3260330.1, ECO:0000313|Proteomes:UP000494106};
RN   [1] {ECO:0000313|EMBL:CAB3260330.1, ECO:0000313|Proteomes:UP000494106}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Wallbank WR R., Pardo Diaz C., Kozak K., Martin S., Jiggins C., Moest M.,
RA   Warren A I., Byers J.R.P. K., Montejo-Kovacevich G., Yen C E.;
RL   Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAB3260330.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CADEBC010000733; CAB3260330.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A8S1BTF0; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000494106; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1116; MACROPHAGE RECEPTOR WITH COLLAGENOUS STRUCTURE; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000494106}.
FT   DOMAIN          501..546
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          583..748
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          21..490
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        32..45
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        93..104
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        148..163
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        195..204
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        237..253
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        276..291
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        307..324
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        331..348
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        360..373
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        391..404
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        408..425
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        436..450
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   785 AA;  80764 MW;  23FC09D62A863946 CRC64;
     MSQGQCRCSL SDISQILEVM PELKGLPGPQ GPTGADGTTG TPGKTGQMGE PGPPGPLGAK
     GDRGERGEIG APGPEGQVGP KGDPGMDGSP GLQGPPGPPG PPSPVSAAII ESTGLYGSNN
     PGILVPTGER GQMGLPGPQG ERGYQGSKGE RGLHGPKGDK GERGYVGSRG PHGAKGERGQ
     PGRDGTPGLP GAHGRPSEKG EKGARGLPGP PGLSVPVASE NSVPSDVTHG AALLKGIKGD
     RGEKGDKGEK GTRGMEGPQG FPGNDGKPGE RGDIGPSGLP GSQGPSGLNG PKGDKGEAGP
     PGPVAISRDE ALILTKGDKG ESGPRGKRGH PGPPGPRGPP GLPGPPGTPG NNGISGDIGL
     PGWTGPPGAA GQTGPPGPKG EKGDPGISAL DLEKVKGEKG DRGFDGLPGP PGKEGPRGPA
     GPPGSPATNI QYISVPGPPG PPGPPGPPSI YPNEVPDALT DTPGINRLEP GTAYRDPLDP
     LGESADFEDD DDGRTIVGTI LFKTTDSLIR LGTNSPLGTL AYVIQEQALL VRVNNGWQYV
     AMGSLLAIHT PPAGSPTRTP LQNILETSSL VHHKNPAVEG PVLRLAALNE PHTGDMHGVS
     SSNYECRRQA QRANMDGTFR AFISSRVQTI DSIVSWVDRE IPVVNTRGDV LFNSWGEMFD
     GSGALFAHAP RIYSFSGQNV LMDPLWPTKA VWHGANPNGE IAMDAYCDAW HSSNPDKYGL
     ASSLRSNKLL DQETYSCSSR LIVLCVEATP VDTVRRKKRS KYRNSAEKLQ FLKDIEERNE
     TRLKL
//
DBGET integrated database retrieval system