GenomeNet

Database: UniProt
Entry: A0A7J8B6V9_ROUAE
LinkDB: A0A7J8B6V9_ROUAE
Original site: A0A7J8B6V9_ROUAE 
ID   A0A7J8B6V9_ROUAE        Unreviewed;      1292 AA.
AC   A0A7J8B6V9;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   08-OCT-2025, entry version 17.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|EMBL:KAF6394458.1};
GN   ORFNames=HJG63_003140 {ECO:0000313|EMBL:KAF6394458.1};
OS   Rousettus aegyptiacus (Egyptian fruit bat) (Pteropus aegyptiacus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Chiroptera; Yinpterochiroptera; Pteropodoidea;
OC   Pteropodidae; Rousettinae; Rousettus.
OX   NCBI_TaxID=9407 {ECO:0000313|EMBL:KAF6394458.1, ECO:0000313|Proteomes:UP000593571};
RN   [1] {ECO:0000313|EMBL:KAF6394458.1, ECO:0000313|Proteomes:UP000593571}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MRouAeg1 {ECO:0000313|EMBL:KAF6394458.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:KAF6394458.1};
RX   PubMed=32699395;
RA   Jebb D., Huang Z., Pippel M., Hughes G.M., Lavrichenko K., Devanna P.,
RA   Winkler S., Jermiin L.S., Skirmuntt E.C., Katzourakis A., Burkitt-Gray L.,
RA   Ray D.A., Sullivan K.A.M., Roscito J.G., Kirilenko B.M., Davalos L.M.,
RA   Corthals A.P., Power M.L., Jones G., Ransome R.D., Dechmann D.K.N.,
RA   Locatelli A.G., Puechmaille S.J., Fedrigo O., Jarvis E.D., Hiller M.,
RA   Vernes S.C., Myers E.W., Teeling E.C.;
RT   "Six reference-quality genomes reveal evolution of bat adaptations.";
RL   Nature 583:578-584(2020).
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAF6394458.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JACASE010000020; KAF6394458.1; -; Genomic_DNA.
DR   Proteomes; UP000593571; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KAF6394458.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000593571};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..33
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           34..1292
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5029471833"
FT   DOMAIN          41..229
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          231..983
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        309..321
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        366..382
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        411..426
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        455..465
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        497..508
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        556..571
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        604..616
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        617..626
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        646..660
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        666..675
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        685..700
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        761..773
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        794..807
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        864..882
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        894..906
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        916..925
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        936..952
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        962..974
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1292 AA;  130153 MW;  96460A786FF948FA CRC64;
     MAPRWPWLQP PRRHLLDVLA PLVLLLGVRG ASAEPESAGT EVGLLQLLGE PLPQHVAQVD
     DPDVGPAFVF GADSSGGQAA RAHLPSPFFR DFALLFHVRP AAEGPGVLLA VTDAAQAVVS
     VGVKLSGVRD GHQDVQLLYT EPGAARTRVA ASFRLPAFAG QWTRLALSVG GDSAALFVDC
     EELQRVPLAR SPQRLQLEPG AGLFVAQAGA ADPDRFQGAI AELRVRGDPQ VSPLHCLEDD
     ADDDRDRASG DFGSGLQASP ELAGEEAGMS LQPGLPEAPP VTSPPLAGGS NSEDSRTEEI
     EEETTASSLG AQTLPGSQTA ATGDGNVRSP EDSLEEDPAG PLVPGPDARL VPGPQGPPGP
     PGKDGAPGRD GEPGDPGKDG RPGDTGPQGF PGTPGDVGPK GEKGDPGVGP RGPPGPQGPP
     GPPGPPFSQD KLTFIDMEGS GFGGDLESLR GPRGFPGPPG PPGVPGLPGQ PGRFGANSSA
     TPGPAGMPGV PGRDGQPGPP GPPGPPGSPG REGRPGEAGQ KGSLGKDGAP GLKGSRGDPG
     PAGAPGENGL TGAPGPAGPP GPPGPPGPPG PGLAAGFDDM EGSGGPFWPT TRGAEGLQGP
     PGQPGAKGDP GIAGPPGAKG GAGADGPPGF PGLPGREGVA GAQGPKGEKG TPGEKGDPGK
     DGVGQPGPPG PPGVPGPVIY LSEQDGQPGF AGFPGPAGPK GDPGSGGQPG PPGMKGEKGE
     PGMVYGPDGH GLAPAPKGAK GEPGPRGPLG PYGRPGHKGE IGFPGRPGRP GINGLKGEKG
     EPGDASIGFG VRGPPGPPGP PGPPGVPGTP VYDSNAFVES GPPGPPGLPG HQGPSGPKGD
     KGEVGPPGAP GQFPLDLFQL GAEMKGEKGD RGHAGQKGER GEPGAGGFFG SSIPGPPGPP
     GYPGIPGPKG ESTPGQPGPP GPQGPPGIGY EGRQGPPGPP GPPGPPGPPS FPGPYRQTIS
     VPGPPGPPGP PGPPGTMGTS SGVRIWPTYQ TMLDRVPELP EGWLIYVAER EELYVRVRHG
     FRKVLLEART SLPRGTENEV AALQPPLVQL HEGNPHPRGE LPPSTARPWR ADDVLASPPR
     WRDPQPYPGV PHYGSYVQPW PVRPTGTPAH PHQDFRPVLH LVALNGPQPG GLRGIRGADL
     QCFQQARAAG LAGTFRAFLS SRLQDLHSIV RRADRAAVPV VNLRDEELFP SWGALFSGSE
     AQLKPGARIL SFDGRDVLQH PAWPQKSVWH GSDPSGRRLT ESYCETWRTE AMTATGQASS
     LLAGKLLEQK AASCHNAFIV LCIENSLTTA SK
//
DBGET integrated database retrieval system