GenomeNet

Database: UniProt
Entry: H2M7Q7_ORYLA
ID   H2M7Q7_ORYLA            Unreviewed;      1024 AA.
AC   H2M7Q7;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   27-MAR-2024, entry version 65.
DE   SubName: Full=Collagen type VI alpha 2 chain {ECO:0000313|Ensembl:ENSORLP00000014515.2};
GN   Name=COL6A2 {ECO:0000313|Ensembl:ENSORLP00000014515.2};
OS   Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC   Oryzias.
OX   NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000014515.2, ECO:0000313|Proteomes:UP000001038};
RN   [1] {ECO:0000313|Ensembl:ENSORLP00000014515.2, ECO:0000313|Proteomes:UP000001038}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000014515.2,
RC   ECO:0000313|Proteomes:UP000001038};
RX   PubMed=17554307; DOI=10.1038/nature05846;
RA   Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA   Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA   Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA   Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA   Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA   Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT   "The medaka draft genome and insights into vertebrate genome evolution.";
RL   Nature 447:714-719(2007).
RN   [2] {ECO:0000313|Ensembl:ENSORLP00000014515.2}
RP   IDENTIFICATION.
RC   STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000014515.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2M7Q7; -.
DR   STRING; 8090.ENSORLP00000014515; -.
DR   Ensembl; ENSORLT00000014516.2; ENSORLP00000014515.2; ENSORLG00000011584.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000155682; -.
DR   HOGENOM; CLU_009158_2_0_1; -.
DR   InParanoid; H2M7Q7; -.
DR   OrthoDB; 2906665at2759; -.
DR   TreeFam; TF331207; -.
DR   Proteomes; UP000001038; Chromosome 17.
DR   Bgee; ENSORLG00000011584; Expressed in muscle tissue and 12 other cell types or tissues.
DR   GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR   GO; GO:0007409; P:axonogenesis; IEA:Ensembl.
DR   GO; GO:0055001; P:muscle cell development; IEA:Ensembl.
DR   GO; GO:0007517; P:muscle organ development; IEA:Ensembl.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF00092; VWA; 3.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 3.
DR   SUPFAM; SSF53300; vWA-like; 3.
DR   PROSITE; PS50234; VWFA; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..32
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           33..1024
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5017384317"
FT   DOMAIN          46..240
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          617..804
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          836..1019
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          255..590
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        411..434
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1024 AA;  108747 MW;  4D6428FFAA2CF6F2 CRC64;
     MDGSSSERQE RRRMPGLEAF ILCLFFGALT RAQKAECSRK NECPIDVYFT IDTSETIALQ
     ESPPGALVES IKSFTAELVK RMQDEELRGV VRVKWNTGGL HFSQTQRIIS RIGNNTDFLN
     RLKPIQYLGK GTYIDCALKK MSEEMDQFPS SPSALRFAVV ITDGHVTGNP CRGIKVAAEE
     ARDKGIRIFA VASSTNIDET GLREIASSPA SVYRDEFMAV DLSSGARIHV QTIERIIKTM
     KQVAYTECYK VSCLETDGPP GPKGHRGQKG AKGDIGQPGQ KGERGRPGDP GIEGPIGQPG
     IKGEPGQMGD KGEMGSQGKK GVAGIAGRNG TDGQKGKIGR IGAPGCKGDS GDRGPDGHPG
     DVGERGPLGT DGDKGDSGRP GRSGPPGESG APGPKGERGS PGSPGLPGQK GRRGERGRTG
     VRGEPGRRGD YGKKGARGPP GPTGEKGEMG PEGLRGLPGE AGIQGSKGDN GLPGPRGAAG
     KPGGPGKNGT RGDPGDAGPR GEPGPPGPKG DVGRPGFGYP GPRGPPGEKG EKGNPGPRGS
     RGECGQKGGP GDKGRPGEPG EPGSMGEPGP RGQRGEAGRD GDPGPEGDPG LTECDVMTYI
     RETCGCCDCE KRCGPLDIVF VIDSSESVGL TNFTLEKNFV INTISRLGSF AKSPDSETGT
     RVGVVQYSHS GTFQAISLND SKIDSLAAFK EEVKRLEWIA GGTWTPSALK YAYDNLIRDS
     RRAKAKVTVV VITDGRFDPR DNDTLLTYLC RDPSVDVSAI GIGDMFDQIG ENENLNSIAC
     QREGRVTGMR RFADLVAEEF IDKIETVLCP DPVIVCPDLP CKSEPAVASC VQRPVDIVFL
     LDGSERMGLE NHRRAKEFIE NVARRLTLAN TATDDRRARL ALLQYGSPVE QKVEFPLTHE
     LGVISDSLAN VNYMDSSSAL GSAIIYAVNN LVIKQDGRRL SRRNAEVAFV FITDGITATE
     QLEEGVSAMK RAEGIPTVIA MGSDTDEEVL HKVSLGDTSA IFRGDDYSML NKPAFFERFV
     RWIC
//