GenomeNet

Database: UniProt
Entry: X5HZZ7_ACISC
ID   X5HZZ7_ACISC            Unreviewed;      1421 AA.
AC   X5HZZ7;
DT   11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT   11-JUN-2014, sequence version 1.
DT   27-MAR-2024, entry version 30.
DE   SubName: Full=Type2 collagen alpha1 chain {ECO:0000313|EMBL:BAO58967.1};
GN   Name=Col2a1 {ECO:0000313|EMBL:BAO58967.1};
OS   Acipenser schrenckii (Amur sturgeon).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Chondrostei; Acipenseriformes; Acipenseridae; Acipenser.
OX   NCBI_TaxID=111304 {ECO:0000313|EMBL:BAO58967.1};
RN   [1] {ECO:0000313|EMBL:BAO58967.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Azuma N., Takagi Y., Ura K.;
RT   "Molecular cloning of mRNA of Col2a1 in Acipenser schrenckii.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AB922837; BAO58967.1; -; mRNA.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   2: Evidence at transcript level;
KW   Collagen {ECO:0000313|EMBL:BAO58967.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..1421
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004957402"
FT   DOMAIN          1187..1421
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          32..1156
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        67..83
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        91..107
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        285..299
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        366..380
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        597..611
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1135..1151
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1421 AA;  134790 MW;  D21C09C01F745365 CRC64;
     MFSFVDSRTV LLLAAIQLCL LAVVKCQDVE VQQPGRKGQK GEPGDITDVV GPRGPGGPMG
     PPGEQGPRGE RGDKGDKGGP GPRGRDGEPG TPGNPGPPGP PGPNGPPGLG GNFAAQMAGG
     FDEKAGGAQM GVMQGPMGPM GPRGPPGPTG APGPQGFQGN PGEPGEPGAA GPLGPRGPPG
     PSGKPGEDGE AGKPGKSGER GSPGPQGARG FPGTPGLPGI KGHRGYPGLD GAKGEAGAAG
     SKGEAGSSGE NGAPGPMGPR GLPGERGRNG PSGAAGARGN DGLPGPAGPP GPVGPAGAPG
     FPGSPGSKGE AGPTGARGPE GAQGPRGESG TPGSPGPSGA SGNPGTDGIP GAKGSAGAPG
     IAGAPGFPGP RGPPGPQGAT GPLGPKGQQG DPGIPGFKGE HGPKGEHGPA GPQGAPGPAG
     EEGKRGARGE PGAAGPLGPP GERGAPGNRG FPGQDGLAGP KGAPGERGQP GVGGPKGANG
     DPGRPGEPGL PGARGLTGRP GDAGPQGKGG PSGAAGEDGR PGPPGPQGAR GQPGVMGFPG
     PKGANGEPGK AGEKGLVGPP GLRGLSGKDG ETGAAGPPGP SGPAGERGEQ GPPGPSGFQG
     LPGPPGPPGE GGKPGDQGVP GEAGAAGRAG PRGERGFPGE RGSPGAQGLQ GPRGLPGTPG
     TDGPKGATGP SGALGAQGPP GLQGMPGERG ASGIAGAKGD RGDVGEKGPE GASGKDGSRG
     LTGPIGPPGP AGPNGEKGES GPSGPPGAAG TRGAPGDRGE NGPPGPAGFA GPPGADGQPG
     AKGEQGEGGQ KGDAGAPGPQ GPSGAPGPQG PTGVSGPKGA RGAQGPPGAT GFPGAAGRVG
     PPGPNGNPGP SGPAGSAGKD GPKGVRGDAG PPGRAGDAGL QGAAGPPGEK GEPGEDGPPG
     PDGPSGPQGL GGNRGIVGLP GQRGERGFPG LPGPSGEPGK QGAPGGAGDR GPPGPVGPPG
     LSGPSGEPGR EGNPGSDGPP GRDGSAGIKG DRGQTGPAGA PGAPGAPGSP GPVGPTGKQG
     DRGESGAQGP AGPSGPAGAR GMAGPQGPRG DKGEAGETGE RGQKGHRGFT GLQGLPGPPG
     TAGDQGAAGP AGPTGARGPP GPVGPHGKDG SNGQPGPIGP PGPRGRSGEV GPAGPPGNAG
     PPGPPGPPGP GIDMSAFAGL AAPEKAPDPM RYMRADEASS SLRQHDAEVD ATLKSINNQI
     ENIRSPEGSK KNPARTCRDL KLCHPDWKSG DYWIDPNQGC AVDAIKVFCN MESGETCVYP
     NPASIPRKNW WTSKSADCKH VWFGETMNGG FHFSYGDDSL APNTASIQMT FLRLLSTEAS
     QNLTYHCKNS IAYMDQSAGN LKKAVLLQGS NDVEIRAEGN SRFTYNVLED GCTKHTDRWG
     KTVIEYKSQK TSRLPIVDIA PLDIGGSDQE FGVDIGPVCY L
//