GenomeNet

Database: UniProt
Entry: H2MHG7_ORYLA
LinkDB: H2MHG7_ORYLA
Original site: H2MHG7_ORYLA 
ID   H2MHG7_ORYLA            Unreviewed;      1233 AA.
AC   H2MHG7;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   27-MAR-2024, entry version 70.
DE   SubName: Full=Collagen type XVIII alpha 1 chain a {ECO:0000313|Ensembl:ENSORLP00000018074.2};
OS   Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC   Oryzias.
OX   NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000018074.2, ECO:0000313|Proteomes:UP000001038};
RN   [1] {ECO:0000313|Ensembl:ENSORLP00000018074.2, ECO:0000313|Proteomes:UP000001038}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000018074.2,
RC   ECO:0000313|Proteomes:UP000001038};
RX   PubMed=17554307; DOI=10.1038/nature05846;
RA   Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA   Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA   Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA   Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA   Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA   Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT   "The medaka draft genome and insights into vertebrate genome evolution.";
RL   Nature 447:714-719(2007).
RN   [2] {ECO:0000313|Ensembl:ENSORLP00000018074.2}
RP   IDENTIFICATION.
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000018074.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00090}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2MHG7; -.
DR   STRING; 8090.ENSORLP00000018074; -.
DR   Ensembl; ENSORLT00000018075.2; ENSORLP00000018074.2; ENSORLG00000014422.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; KOG3546; Eukaryota.
DR   GeneTree; ENSGT00940000158212; -.
DR   HOGENOM; CLU_004003_1_0_1; -.
DR   InParanoid; H2MHG7; -.
DR   TreeFam; TF315821; -.
DR   Proteomes; UP000001038; Chromosome 21.
DR   Bgee; ENSORLG00000014422; Expressed in sexually immature organism and 14 other cell types or tissues.
DR   GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR   GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR   GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR   GO; GO:0001755; P:neural crest cell migration; IEA:Ensembl.
DR   GO; GO:0001501; P:skeletal system development; IBA:GO_Central.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.10.2000.10; Frizzled cysteine-rich domain; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR020067; Frizzled_dom.
DR   InterPro; IPR036790; Frizzled_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24637; COLLAGEN; 1.
DR   PANTHER; PTHR24637:SF422; GENE, 37797-RELATED; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF01392; Fz; 1.
DR   SMART; SM00063; FRI; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF63501; Frizzled cysteine-rich domain; 1.
DR   PROSITE; PS50038; FZ; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00090}; Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1233
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5017274190"
FT   DOMAIN          238..356
FT                   /note="FZ"
FT                   /evidence="ECO:0000259|PROSITE:PS50038"
FT   REGION          98..127
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          579..637
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          717..769
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          784..899
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          970..1053
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        589..604
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        748..766
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        785..805
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        820..869
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        878..895
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        972..990
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1013..1027
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        253..299
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
FT   DISULFID        290..328
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
SQ   SEQUENCE   1233 AA;  133414 MW;  D3034615E4850956 CRC64;
     MPATKTKFGL LLFLTQWVVY SEAWFWSWSG TTTRLPQTLE YEGSGSPTGS GELPSEITTK
     AEGDMVHEGH GTPNIKQTWV LTTEDPQMST VTPATLRETD DASESVTARI PPQSTSGNGT
     STLTALGSDP SRILKNASKI ESEVELAVEL DSKHGGFSSS GLLPVAENMA SLVYEDGAVK
     QADRRGLDNA SGAAFHVKKV GLRLSERVST IASKQEFSKA SVRAVGIGKG IHAQTENASS
     ISCLVLDAAL PFCSSKFGET FAVPNYFNHS SLEEVQALLK EWAWLLESRC HHSLEWFFCL
     LLVPKCDPAT PMLTLPCRSF CEVLLDSCWA LLEGQHLPVE CHTLPDASHG YQCLSVCNQN
     EDNGVSLLQL IGDPPPSEIS QVYGPDNAIG YVFGPDAKMG QLAVAHFPTP FYRDFSLNFH
     LKPTSNEGGV VFSITDSSQQ VIVVGVKLST VERQTQDVIF YYSETGSEQS YEAARFQVTS
     MTDTWTRFSI SVRDDKVAFY FNCDLSPVVK RFERSPDDME LERGSGVFVG QTGGPDPERF
     LVCICFHPTQ PNVVFYTQCH RTTDFSKWNL GDGELLAGFG YQGKKGDPGQ PGPPGPPGPP
     GPSTEYVPSS DGTVVSKVQG PRGPPGPPGP QGQPGEDGEP VCRWFHCQHT FFLKKHHIFN
     KVFRSFSIIS LFQVLILIHA VTFFSHLLLC VQGEKGEPGL IIGPDGNPLY LGGLTGQKGD
     IGPQGPGPPG LKGEIGMPGR PVSIHQNGVP GPPGPPGPPG PPGPPFAIDR FNVRTASDFD
     IYTLKNQWKG ERGDPGVKGE KGEPGAGHDP QYGALQGPLG PPGLPGPKGD SIMGPPGPQG
     PPGSPGIGYD GRPGPPGPPG PPGPPWSQTL PESYKPNYPV SVPGPPGPPG PPGHPGHSSG
     VMVLRSYDTL IATARSQEEG SLIYILDKAD LYLRVRDGLR QVMLGEYNPF FRDFDNEVAE
     AQPPPVVLYP HSPDHSQNNG ASQYSQGGSV IRPIETPPPV ERRYPPQYEP RFPDQQRPSQ
     TDGRLINQPS DGKYAVTPQR RPNPPVPGPV ETSTLGVSGL RLIALNTPQS GNMKGIRGPD
     YLCFEQARAA GLRGTFRAFL SSKVQDLHTI VRHSDREKPP IVNLKNQVLF DSWQSIFGES
     VSKMRKNVPI YSFDGRDILR DSAWPEKMVW HGSSKKGHRQ TDQYCEAWRA GDQAVTGLAS
     SLQSSHLLQQ TPTSCSRACI VLCIENAMVS PSK
//
DBGET integrated database retrieval system