ID A0A1U7Q9Q0_MESAU Unreviewed; 1486 AA.
AC A0A1U7Q9Q0;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=Collagen alpha-1(II) chain {ECO:0000313|RefSeq:XP_005067220.1};
GN Name=Col2a1 {ECO:0000313|RefSeq:XP_005067220.1};
OS Mesocricetus auratus (Golden hamster).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Cricetinae; Mesocricetus.
OX NCBI_TaxID=10036 {ECO:0000313|Proteomes:UP000189706, ECO:0000313|RefSeq:XP_005067220.1};
RN [1] {ECO:0000313|RefSeq:XP_005067220.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005067220.1; XM_005067163.3.
DR STRING; 10036.ENSMAUP00000022527; -.
DR eggNOG; KOG3544; Eukaryota.
DR OrthoDB; 2970887at2759; -.
DR Proteomes; UP000189706; Unplaced.
DR GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR GO; GO:0005585; C:collagen type II trimer; IEA:Ensembl.
DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0042289; F:MHC class II protein binding; IEA:Ensembl.
DR GO; GO:0048407; F:platelet-derived growth factor binding; IEA:Ensembl.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR GO; GO:0043394; F:proteoglycan binding; IEA:Ensembl.
DR GO; GO:0097065; P:anterior head development; IEA:Ensembl.
DR GO; GO:0001502; P:cartilage condensation; IEA:Ensembl.
DR GO; GO:0060351; P:cartilage development involved in endochondral bone morphogenesis; IEA:Ensembl.
DR GO; GO:0071773; P:cellular response to BMP stimulus; IEA:Ensembl.
DR GO; GO:0007417; P:central nervous system development; IEA:Ensembl.
DR GO; GO:0002062; P:chondrocyte differentiation; IEA:Ensembl.
DR GO; GO:0030199; P:collagen fibril organization; IEA:Ensembl.
DR GO; GO:0060272; P:embryonic skeletal joint morphogenesis; IEA:Ensembl.
DR GO; GO:0001958; P:endochondral ossification; IEA:Ensembl.
DR GO; GO:0097192; P:extrinsic apoptotic signaling pathway in absence of ligand; IEA:Ensembl.
DR GO; GO:0003007; P:heart morphogenesis; IEA:Ensembl.
DR GO; GO:0042472; P:inner ear morphogenesis; IEA:Ensembl.
DR GO; GO:0060174; P:limb bud formation; IEA:Ensembl.
DR GO; GO:2001240; P:negative regulation of extrinsic apoptotic signaling pathway in absence of ligand; IEA:Ensembl.
DR GO; GO:0030903; P:notochord development; IEA:Ensembl.
DR GO; GO:0071599; P:otic vesicle development; IEA:Ensembl.
DR GO; GO:0006029; P:proteoglycan metabolic process; IEA:Ensembl.
DR GO; GO:0010468; P:regulation of gene expression; IEA:Ensembl.
DR GO; GO:0060021; P:roof of mouth development; IEA:Ensembl.
DR GO; GO:0007605; P:sensory perception of sound; IEA:Ensembl.
DR GO; GO:0001894; P:tissue homeostasis; IEA:Ensembl.
DR GO; GO:0007601; P:visual perception; IEA:Ensembl.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119,
KW ECO:0000313|RefSeq:XP_005067220.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000189706};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1486
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010521063"
FT DOMAIN 32..89
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1252..1486
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 96..178
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 190..1235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 132..148
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 156..173
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..250
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..364
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 431..445
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 909..923
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1199..1216
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1486 AA; 141859 MW; 53F9278DFD874850 CRC64;
MTRLGAPQSL VLLSLLIAAV LRCQGQDDED AGSCLQDGQI YKDKDVWKPT SCRICVCDTG
NILCDDIICE DKDCLNAEVP FGECCPICPA DLTPASGQLG PKGQKGEPGD LKDIVGPKGP
PGPQGPAGEQ GPRGDRGDKG EKGAPGPRGR DGEPGTPGNP GPPGPPGPPG PPGLIGNFAA
QMAGGFDEKA GGAQMGVMQG PMGPMGPRGP PGPAGAPGPQ GFQGNPGEPG EPGVSGPMGP
RGPPGPPGKP GDDGEAGKPG KSGERGLPGP QGARGFPGTP GLPGVKGHRG YPGLDGAKGE
AGAPGVKGES GSPGENGSPG PMGPRGLPGE RGRTGPAGAA GARGNDGQPG PAGPPGPVGP
AGGPGFPGAP GAKGEAGPTG ARGPEGAQGP RGEPGNPGSP GPAGASGNPG TDGIPGAKGS
AGAPGIAGAP GFPGPRGPPG PQGATGPLGP KGQMGEPGIA GFKGDQGPKG ETGPAGPQGA
PGPAGEEGKR GARGEPGGAG PIGPPGERGA PGNRGFPGQD GLAGPKGAPG ERGPSGLAGP
KGANGDPGRP GEPGLPGARG LTGRPGDAGP QGKVGPSGAP GEDGRPGPPG PQGARGQPGV
MGFPGPKGAN GEPGKAGEKG LAGAPGLRGL PGKDGETGAA GPPGPAGPAG ERGEQGAPGP
SGFQGLPGPP GPPGEGGKQG DQGIPGEAGA PGLVGPRGER GFPGERGSPG AQGLQGARGL
PGTPGTDGPK GAPGADGPPG AQGPPGLQGM PGERGAAGIA GPKGDRGDVG EKGPEGAPGK
DGGRGLTGPI GPPGPAGANG EKGEVGPPGP SGSTGARGAP GERGETGPPG PAGFAGPPGA
DGQPGAKGDQ GEAGQKGDAG APGPQGPSGA PGPQGPTGVT GPKGARGAQG PPGATGFPGA
AGRVGPPGSN GNPGPPGPPG PSGKDGPKGA RGDTGSPGRA GDPGLQGPAG PSGEKGEPGD
DGPPGSDGPP GPQGLAGQRG IVGLPGQRGE RGFPGLPGPS GEPGKQGAPG ASGDRGPPGP
VGPPGLTGPA GEPGREGSPG ADGPPGRDGA AGVKGDRGET GALGAPGAPG PPGAPGPAGP
TGKQGDRGES GAQGPMGPSG PAGARGIPGP QGPRGDKGES GEPGERGVKG HRGFTGLQGL
PGPPGPSGDQ GASGPAGPSG PRGPPGPVGP SGKDGSNGIP GPIGPPGPRG RSGETGPAGP
PGNPGPPGPP GPPGPGIDMS AFAGLGQREK GPDPLQYMRA DEADSSLRQH DVEVDATLKS
LNNQIESIRS PDGSRKNPAR TCQDLKLCHP EWKSGDYWID PNQGCTLDAM KVFCNMETGE
TCVYPNPASV PKKNWWSSKG KEKKHIWFGE TMNGGFHFSY GDGHLAPNTA NVQMTFLRLL
STEGSQNITY HCKNSIAYLD EAAGNLKKAL LIQGSNDVEM RAEGNSRFTY TALKDGCTKH
TGKWGKTVIE YRSQKTSRLP IIDIAPMDIG GPEQEFGVDI GPVCFL
//