GenomeNet

Database: UniProt
Entry: A0A9Q0BR46_9MUSC
ID   A0A9Q0BR46_9MUSC        Unreviewed;       824 AA.
AC   A0A9Q0BR46;
DT   13-SEP-2023, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2023, sequence version 1.
DT   28-JAN-2026, entry version 10.
DE   RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=M5D96_005922 {ECO:0000313|EMBL:KAI8041657.1};
OS   Drosophila gunungcola (fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=103775 {ECO:0000313|EMBL:KAI8041657.1, ECO:0000313|Proteomes:UP001059596};
RN   [1] {ECO:0000313|EMBL:KAI8041657.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Sukarami {ECO:0000313|EMBL:KAI8041657.1};
RX   PubMed=36930539; DOI=10.1093/gbe/evad048;
RA   Negi A., Liao B.Y., Yeh S.D.;
RT   "Long-read-based Genome Assembly of Drosophila gunungcola Reveals Fewer
RT   Chemosensory Genes in Flower-breeding Species.";
RL   Genome Biol. Evol. 15:evad048-evad048(2023).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAI8041657.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAMKOV010000003; KAI8041657.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A9Q0BR46; -.
DR   Proteomes; UP001059596; Unassembled WGS sequence.
DR   GO; GO:0005587; C:collagen type IV trimer; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000048; Multiplexin collagen isoform Ap3; 1.
DR   FunFam; 3.40.1620.70:FF:000001; Multiplexin collagen isoform Ap3; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1098; MULTIPLEXIN, ISOFORM R; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP001059596}.
FT   DOMAIN          511..559
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          609..774
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          44..111
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          139..446
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          459..485
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          782..805
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        67..88
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        92..101
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        148..165
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        198..209
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        218..231
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        237..254
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        339..351
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        353..376
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        402..414
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        459..468
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   824 AA;  86234 MW;  2AF8BB507F98DC2E CRC64;
     MPPPRATVAP TTAEMDSLFV EGSGESIPFE DSTEVNLESE DFWNSADEAT DIFDASGMQP
     PGQTQYTHER PYRGIKGEKG ERGPKGDSIR GPPGPPGPPG PKGETAAYPP FVETTSAGAK
     YTGECTCNAS DILEAIKDNE SLRETLRGVP GTPGKDGKPG TPGHTGATGV PGARGARGSE
     GAQGLKGEPG VDGLPGVVGP PGPPGPPGLP ENYDESLMVN SMGTFRGTTQ PGAKGVSGEK
     GDAGPKGERG DPGHKGAHGP SGAKGEPGEP GTPGLPGLPG QAGQPGGLEG LASVNVNGTK
     GEKGEKGMRG RRGGSGPTGP IGPPGKPGAM GDIGHSGRPG MTGPKGEMGP KGPKGDTGGR
     EGVKGDKGDR GQDGRDGLPG PPGMPSTGGG DGDSSGVQYI PMPGPPGPPG PPGLPGLSIS
     GPKGDPGMDS RSPFFGDASY YGRPGARSSL DELKALRELQ DLRDRPDGTA ETPRQTGHSH
     KHEETLGLME GEEPTYSASS SNMNMKIVPG AVTFQNIDEM TKKSALNPPG TLAYITEEEA
     LLVRVNKGWQ YIALGTLVPI ATPAPPTTVA PSMRFDLQSK NLLNSPPPLI NTPTFTTAPE
     YETWYPRMLR VAALNEPSTG DLQGIRGADF ACYRQGRRAG LLGTFKAFLS SRVQNLDTIV
     RPADRDLPVV NTRGDVLFNS WKGIFNGQGG FFSQAPRIYS FSGKNVMTDS SWPMKMVWHG
     SLPNGERSMD TYCDAWHSGD HLKSGYASNL DGHKLLEQKR QSCDSKLIIL CVEALSQDRK
     RKRRDLGGTS YRNSRSYSHS HDESESLEFS TAEEYAAHLE NLLL
//