ID X5HZZ7_ACISC Unreviewed; 1421 AA.
AC X5HZZ7;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Type2 collagen alpha1 chain {ECO:0000313|EMBL:BAO58967.1};
GN Name=Col2a1 {ECO:0000313|EMBL:BAO58967.1};
OS Acipenser schrenckii (Amur sturgeon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Chondrostei; Acipenseriformes; Acipenseridae; Acipenser.
OX NCBI_TaxID=111304 {ECO:0000313|EMBL:BAO58967.1};
RN [1] {ECO:0000313|EMBL:BAO58967.1}
RP NUCLEOTIDE SEQUENCE.
RA Azuma N., Takagi Y., Ura K.;
RT "Molecular cloning of mRNA of Col2a1 in Acipenser schrenckii.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB922837; BAO58967.1; -; mRNA.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 6.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 2: Evidence at transcript level;
KW Collagen {ECO:0000313|EMBL:BAO58967.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1421
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004957402"
FT DOMAIN 1187..1421
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 32..1156
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..83
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..107
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 285..299
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..380
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..611
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1135..1151
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1421 AA; 134790 MW; D21C09C01F745365 CRC64;
MFSFVDSRTV LLLAAIQLCL LAVVKCQDVE VQQPGRKGQK GEPGDITDVV GPRGPGGPMG
PPGEQGPRGE RGDKGDKGGP GPRGRDGEPG TPGNPGPPGP PGPNGPPGLG GNFAAQMAGG
FDEKAGGAQM GVMQGPMGPM GPRGPPGPTG APGPQGFQGN PGEPGEPGAA GPLGPRGPPG
PSGKPGEDGE AGKPGKSGER GSPGPQGARG FPGTPGLPGI KGHRGYPGLD GAKGEAGAAG
SKGEAGSSGE NGAPGPMGPR GLPGERGRNG PSGAAGARGN DGLPGPAGPP GPVGPAGAPG
FPGSPGSKGE AGPTGARGPE GAQGPRGESG TPGSPGPSGA SGNPGTDGIP GAKGSAGAPG
IAGAPGFPGP RGPPGPQGAT GPLGPKGQQG DPGIPGFKGE HGPKGEHGPA GPQGAPGPAG
EEGKRGARGE PGAAGPLGPP GERGAPGNRG FPGQDGLAGP KGAPGERGQP GVGGPKGANG
DPGRPGEPGL PGARGLTGRP GDAGPQGKGG PSGAAGEDGR PGPPGPQGAR GQPGVMGFPG
PKGANGEPGK AGEKGLVGPP GLRGLSGKDG ETGAAGPPGP SGPAGERGEQ GPPGPSGFQG
LPGPPGPPGE GGKPGDQGVP GEAGAAGRAG PRGERGFPGE RGSPGAQGLQ GPRGLPGTPG
TDGPKGATGP SGALGAQGPP GLQGMPGERG ASGIAGAKGD RGDVGEKGPE GASGKDGSRG
LTGPIGPPGP AGPNGEKGES GPSGPPGAAG TRGAPGDRGE NGPPGPAGFA GPPGADGQPG
AKGEQGEGGQ KGDAGAPGPQ GPSGAPGPQG PTGVSGPKGA RGAQGPPGAT GFPGAAGRVG
PPGPNGNPGP SGPAGSAGKD GPKGVRGDAG PPGRAGDAGL QGAAGPPGEK GEPGEDGPPG
PDGPSGPQGL GGNRGIVGLP GQRGERGFPG LPGPSGEPGK QGAPGGAGDR GPPGPVGPPG
LSGPSGEPGR EGNPGSDGPP GRDGSAGIKG DRGQTGPAGA PGAPGAPGSP GPVGPTGKQG
DRGESGAQGP AGPSGPAGAR GMAGPQGPRG DKGEAGETGE RGQKGHRGFT GLQGLPGPPG
TAGDQGAAGP AGPTGARGPP GPVGPHGKDG SNGQPGPIGP PGPRGRSGEV GPAGPPGNAG
PPGPPGPPGP GIDMSAFAGL AAPEKAPDPM RYMRADEASS SLRQHDAEVD ATLKSINNQI
ENIRSPEGSK KNPARTCRDL KLCHPDWKSG DYWIDPNQGC AVDAIKVFCN MESGETCVYP
NPASIPRKNW WTSKSADCKH VWFGETMNGG FHFSYGDDSL APNTASIQMT FLRLLSTEAS
QNLTYHCKNS IAYMDQSAGN LKKAVLLQGS NDVEIRAEGN SRFTYNVLED GCTKHTDRWG
KTVIEYKSQK TSRLPIVDIA PLDIGGSDQE FGVDIGPVCY L
//