GenomeNet

Database: UniProt
Entry: H2M8W2_ORYLA
LinkDB: H2M8W2_ORYLA
Original site: H2M8W2_ORYLA 
ID   H2M8W2_ORYLA            Unreviewed;       677 AA.
AC   H2M8W2;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   27-MAR-2024, entry version 73.
DE   SubName: Full=Collagen, type IX, alpha 3 {ECO:0000313|Ensembl:ENSORLP00000014936.1};
GN   Name=col9a3 {ECO:0000313|Ensembl:ENSORLP00000014936.1};
OS   Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC   Oryzias.
OX   NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000014936.1, ECO:0000313|Proteomes:UP000001038};
RN   [1] {ECO:0000313|Ensembl:ENSORLP00000014936.1, ECO:0000313|Proteomes:UP000001038}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000014936.1,
RC   ECO:0000313|Proteomes:UP000001038};
RX   PubMed=17554307; DOI=10.1038/nature05846;
RA   Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA   Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA   Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA   Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA   Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA   Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT   "The medaka draft genome and insights into vertebrate genome evolution.";
RL   Nature 447:714-719(2007).
RN   [2] {ECO:0000313|Ensembl:ENSORLP00000014936.1}
RP   IDENTIFICATION.
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000014936.1};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_004070849.1; XM_004070801.2.
DR   AlphaFoldDB; H2M8W2; -.
DR   STRING; 8090.ENSORLP00000014936; -.
DR   Ensembl; ENSORLT00000014937.2; ENSORLP00000014936.1; ENSORLG00000011918.2.
DR   GeneID; 101155514; -.
DR   KEGG; ola:101155514; -.
DR   CTD; 1299; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000161930; -.
DR   HOGENOM; CLU_001074_18_2_1; -.
DR   InParanoid; H2M8W2; -.
DR   OMA; MINEQIA; -.
DR   OrthoDB; 5363474at2759; -.
DR   Proteomes; UP000001038; Chromosome 7.
DR   Bgee; ENSORLG00000011918; Expressed in sexually immature organism and 9 other cell types or tissues.
DR   GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR   GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1034; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 7.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..677
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003565589"
FT   REGION          19..228
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          258..377
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          404..513
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          545..677
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        100..117
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        139..156
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        190..206
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        306..321
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   677 AA;  64073 MW;  21E6CC4D8C961FEB CRC64;
     MTVFSAVGAL LLCQLLASSS AQRPGPRGPK GLPGPPGLPG RDGTDGKPGP PGFSGKPGPK
     GKAGTPGLPG KAGLPGLPGI DGLTGPDGPP GKDGPAGEQG EPGPPGPPGP PGRGRPGAPG
     LPGTNGFPGA VGPQGPVGPE GLPGLPGPPG PDGPPGLPGT LQDLNGDLLC PAICPPGPPG
     PPGMPGFKGH TGHKGDKGEQ GKDGEKGDQG PTGPPGIPGT VGLQGPRGLR GLQGPIGPGG
     DRGFPGFRGK PGIAGIIGKT GDPGERGPQG FKGPKGEVGK IGPKGAPGGA GPKGDPGMPG
     RDGKDGTQGL DGEKGDAGRH GTVGEKGPNG LPGLAGKVGA KGSKGEVGDP GKSGETGPSG
     EPGLPGEIGI PGERGIAGPR GVAGGIGPAG NPGPLGVKGF QGIKGALGDP GLPGPTGIRG
     EFGDRGPVGA SGAKGDMGVA GSDGLPGENG EPGAFGPVGQ KGESGKRGEL GPKGVTGPQG
     ELGARGPPGK QGPIGFQGEQ GVPGLPGKRG VPGKLASEQH IRELCGSMID DQIAQLAANL
     RRPLAPGMVG RPGPAGSPGE PGSAGSIGHP GPRGPPGYRG LPGELGDPGP RGDVGDQGDK
     GSVGKAVDGP PGDQGHQGQP GVPGIVKDGR DGSPGDPGEP GEAGRVGRTG HQGPPGICDT
     SACQGASVAG KSSNPKN
//
DBGET integrated database retrieval system