GenomeNet

Database: UniProt
Entry: A0A811UZ85_CERCA
LinkDB: A0A811UZ85_CERCA
Original site: A0A811UZ85_CERCA 
ID   A0A811UZ85_CERCA        Unreviewed;       453 AA.
AC   A0A811UZ85;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   28-JAN-2026, entry version 14.
DE   SubName: Full=(Mediterranean fruit fly) hypothetical protein {ECO:0000313|EMBL:CAD7004014.1};
GN   ORFNames=CCAP1982_LOCUS12437 {ECO:0000313|EMBL:CAD7004014.1};
OS   Ceratitis capitata (Mediterranean fruit fly) (Tephritis capitata).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Tephritoidea;
OC   Tephritidae; Ceratitis; Ceratitis.
OX   NCBI_TaxID=7213 {ECO:0000313|EMBL:CAD7004014.1, ECO:0000313|Proteomes:UP000606786};
RN   [1] {ECO:0000313|EMBL:CAD7004014.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=EGII {ECO:0000313|EMBL:CAD7004014.1};
RA   Whitehead M.;
RL   Submitted (NOV-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAD7004014.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAJHJT010000034; CAD7004014.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A811UZ85; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000606786; Unassembled WGS sequence.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000606786}.
FT   DOMAIN          378..420
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   REGION          119..367
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        124..145
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        252..275
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        297..309
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        340..352
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   453 AA;  45530 MW;  ECFA5B7234C63D1D CRC64;
     MSCQQRVWIY GYGTIYAHRR AATGCEGANV AIGCKLGEWG QGHPGIHGST GLKGEPGQPG
     VPGMAGAPGA PGLKGDRGLP GELGPIGPPG PPGEVIYADN SLNGTSAAAG LGQCQCPAGP
     PGAPGMRGPQ GYEGPPGLNG EPGPAGSNGL TGLPGSKGEK GEKGVRGLSG PKGDRGPEGP
     PGQAFFAGGY EGMAMNGTKG EKGEKGMRGR RGKPGQAGPI GPPGKPGPMG DIGHSGRPGL
     QGPKGDSGTK GQKGEAGGRE GLKGDKGDRG QDGRDGLPGP PGLPAAAGEG VQYIPMPGAP
     GPPGPPGQPG MPGLSINGPK GEPGMDSRSF YGDPSYYGRP GPPGPPGPPG PPSHSRHEDE
     ETPYFSASSW NMRIVPGAVT FPNIDEMTKK SAMNPPGTLA YITEEEALLV RVNKGWQYIA
     FLGPLGLSRL SSVPLVFHLT KIVKLREIGK TSK
//
DBGET integrated database retrieval system