ID A0A9Q0BR46_9MUSC Unreviewed; 824 AA.
AC A0A9Q0BR46;
DT 13-SEP-2023, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 1.
DT 28-JAN-2026, entry version 10.
DE RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0008006|Google:ProtNLM};
GN ORFNames=M5D96_005922 {ECO:0000313|EMBL:KAI8041657.1};
OS Drosophila gunungcola (fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=103775 {ECO:0000313|EMBL:KAI8041657.1, ECO:0000313|Proteomes:UP001059596};
RN [1] {ECO:0000313|EMBL:KAI8041657.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Sukarami {ECO:0000313|EMBL:KAI8041657.1};
RX PubMed=36930539; DOI=10.1093/gbe/evad048;
RA Negi A., Liao B.Y., Yeh S.D.;
RT "Long-read-based Genome Assembly of Drosophila gunungcola Reveals Fewer
RT Chemosensory Genes in Flower-breeding Species.";
RL Genome Biol. Evol. 15:evad048-evad048(2023).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAI8041657.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JAMKOV010000003; KAI8041657.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A9Q0BR46; -.
DR Proteomes; UP001059596; Unassembled WGS sequence.
DR GO; GO:0005587; C:collagen type IV trimer; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000048; Multiplexin collagen isoform Ap3; 1.
DR FunFam; 3.40.1620.70:FF:000001; Multiplexin collagen isoform Ap3; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1098; MULTIPLEXIN, ISOFORM R; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP001059596}.
FT DOMAIN 511..559
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 609..774
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 44..111
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..446
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 459..485
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 782..805
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..88
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 92..101
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..165
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 198..209
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 218..231
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 237..254
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 339..351
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 353..376
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 402..414
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 459..468
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 824 AA; 86234 MW; 2AF8BB507F98DC2E CRC64;
MPPPRATVAP TTAEMDSLFV EGSGESIPFE DSTEVNLESE DFWNSADEAT DIFDASGMQP
PGQTQYTHER PYRGIKGEKG ERGPKGDSIR GPPGPPGPPG PKGETAAYPP FVETTSAGAK
YTGECTCNAS DILEAIKDNE SLRETLRGVP GTPGKDGKPG TPGHTGATGV PGARGARGSE
GAQGLKGEPG VDGLPGVVGP PGPPGPPGLP ENYDESLMVN SMGTFRGTTQ PGAKGVSGEK
GDAGPKGERG DPGHKGAHGP SGAKGEPGEP GTPGLPGLPG QAGQPGGLEG LASVNVNGTK
GEKGEKGMRG RRGGSGPTGP IGPPGKPGAM GDIGHSGRPG MTGPKGEMGP KGPKGDTGGR
EGVKGDKGDR GQDGRDGLPG PPGMPSTGGG DGDSSGVQYI PMPGPPGPPG PPGLPGLSIS
GPKGDPGMDS RSPFFGDASY YGRPGARSSL DELKALRELQ DLRDRPDGTA ETPRQTGHSH
KHEETLGLME GEEPTYSASS SNMNMKIVPG AVTFQNIDEM TKKSALNPPG TLAYITEEEA
LLVRVNKGWQ YIALGTLVPI ATPAPPTTVA PSMRFDLQSK NLLNSPPPLI NTPTFTTAPE
YETWYPRMLR VAALNEPSTG DLQGIRGADF ACYRQGRRAG LLGTFKAFLS SRVQNLDTIV
RPADRDLPVV NTRGDVLFNS WKGIFNGQGG FFSQAPRIYS FSGKNVMTDS SWPMKMVWHG
SLPNGERSMD TYCDAWHSGD HLKSGYASNL DGHKLLEQKR QSCDSKLIIL CVEALSQDRK
RKRRDLGGTS YRNSRSYSHS HDESESLEFS TAEEYAAHLE NLLL
//