ID H2M7Q7_ORYLA Unreviewed; 1024 AA.
AC H2M7Q7;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Collagen type VI alpha 2 chain {ECO:0000313|Ensembl:ENSORLP00000014515.2};
GN Name=COL6A2 {ECO:0000313|Ensembl:ENSORLP00000014515.2};
OS Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC Oryzias.
OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000014515.2, ECO:0000313|Proteomes:UP000001038};
RN [1] {ECO:0000313|Ensembl:ENSORLP00000014515.2, ECO:0000313|Proteomes:UP000001038}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000014515.2,
RC ECO:0000313|Proteomes:UP000001038};
RX PubMed=17554307; DOI=10.1038/nature05846;
RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT "The medaka draft genome and insights into vertebrate genome evolution.";
RL Nature 447:714-719(2007).
RN [2] {ECO:0000313|Ensembl:ENSORLP00000014515.2}
RP IDENTIFICATION.
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000014515.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H2M7Q7; -.
DR STRING; 8090.ENSORLP00000014515; -.
DR Ensembl; ENSORLT00000014516.2; ENSORLP00000014515.2; ENSORLG00000011584.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000155682; -.
DR HOGENOM; CLU_009158_2_0_1; -.
DR InParanoid; H2M7Q7; -.
DR OrthoDB; 2906665at2759; -.
DR TreeFam; TF331207; -.
DR Proteomes; UP000001038; Chromosome 17.
DR Bgee; ENSORLG00000011584; Expressed in muscle tissue and 12 other cell types or tissues.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0007409; P:axonogenesis; IEA:Ensembl.
DR GO; GO:0055001; P:muscle cell development; IEA:Ensembl.
DR GO; GO:0007517; P:muscle organ development; IEA:Ensembl.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00092; VWA; 3.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 3.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS50234; VWFA; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..1024
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017384317"
FT DOMAIN 46..240
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 617..804
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 836..1019
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 255..590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 411..434
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1024 AA; 108747 MW; 4D6428FFAA2CF6F2 CRC64;
MDGSSSERQE RRRMPGLEAF ILCLFFGALT RAQKAECSRK NECPIDVYFT IDTSETIALQ
ESPPGALVES IKSFTAELVK RMQDEELRGV VRVKWNTGGL HFSQTQRIIS RIGNNTDFLN
RLKPIQYLGK GTYIDCALKK MSEEMDQFPS SPSALRFAVV ITDGHVTGNP CRGIKVAAEE
ARDKGIRIFA VASSTNIDET GLREIASSPA SVYRDEFMAV DLSSGARIHV QTIERIIKTM
KQVAYTECYK VSCLETDGPP GPKGHRGQKG AKGDIGQPGQ KGERGRPGDP GIEGPIGQPG
IKGEPGQMGD KGEMGSQGKK GVAGIAGRNG TDGQKGKIGR IGAPGCKGDS GDRGPDGHPG
DVGERGPLGT DGDKGDSGRP GRSGPPGESG APGPKGERGS PGSPGLPGQK GRRGERGRTG
VRGEPGRRGD YGKKGARGPP GPTGEKGEMG PEGLRGLPGE AGIQGSKGDN GLPGPRGAAG
KPGGPGKNGT RGDPGDAGPR GEPGPPGPKG DVGRPGFGYP GPRGPPGEKG EKGNPGPRGS
RGECGQKGGP GDKGRPGEPG EPGSMGEPGP RGQRGEAGRD GDPGPEGDPG LTECDVMTYI
RETCGCCDCE KRCGPLDIVF VIDSSESVGL TNFTLEKNFV INTISRLGSF AKSPDSETGT
RVGVVQYSHS GTFQAISLND SKIDSLAAFK EEVKRLEWIA GGTWTPSALK YAYDNLIRDS
RRAKAKVTVV VITDGRFDPR DNDTLLTYLC RDPSVDVSAI GIGDMFDQIG ENENLNSIAC
QREGRVTGMR RFADLVAEEF IDKIETVLCP DPVIVCPDLP CKSEPAVASC VQRPVDIVFL
LDGSERMGLE NHRRAKEFIE NVARRLTLAN TATDDRRARL ALLQYGSPVE QKVEFPLTHE
LGVISDSLAN VNYMDSSSAL GSAIIYAVNN LVIKQDGRRL SRRNAEVAFV FITDGITATE
QLEEGVSAMK RAEGIPTVIA MGSDTDEEVL HKVSLGDTSA IFRGDDYSML NKPAFFERFV
RWIC
//