ID H3BYV8_TETNG Unreviewed; 973 AA.
AC H3BYV8;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Collagen type II alpha 1 chain {ECO:0000313|Ensembl:ENSTNIP00000001174.1};
GN Name=COL2A1 {ECO:0000313|Ensembl:ENSTNIP00000001174.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000001174.1, ECO:0000313|Proteomes:UP000007303};
RN [1] {ECO:0000313|Proteomes:UP000007303}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|Ensembl:ENSTNIP00000001174.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H3BYV8; -.
DR Ensembl; ENSTNIT00000003550.1; ENSTNIP00000001174.1; ENSTNIG00000014825.1.
DR GeneTree; ENSGT00940000155224; -.
DR InParanoid; H3BYV8; -.
DR OMA; RGQQIIN; -.
DR Proteomes; UP000007303; Unassembled WGS sequence.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..973
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003580432"
FT DOMAIN 37..95
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT REGION 105..973
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..154
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 330..344
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 368..382
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 973 AA; 90620 MW; BAFD0171A09B7165 CRC64;
MFIFMDSRIV LLLVASQVCL LTVVRCQEED DRKFSGGGCM QDDQRYSDKD VWKPEPCRIC
VCDTGTVLCD EIVCEELKDC PNPEIPFGEC CPICAADQSP PMGCNGSAGP RGRDGEPGTP
GNPGNPGPPG PPGPPGLGGG PMGPRGPPGP SGAPHLLPCS IFIFPGRPAD RQGPMGPRGP
PGPSGKPGED GEAGKPGKSG ERGPTGPQGA RGFPGTPGLP GIKGHRGYPG LDGAKGETGA
VGSKGESGAP GENGAPGPLG PRGLPGERGR PGPSGVAGAR GNDGLSGPAG PPGPVGPSGA
PGFPGSPGAK GEAGPTGARG PEGAQGPRGE SGTPGSSGPS GASGNPGTDG IPGAKGSAGA
PGIAGAPGFP GPRGPPGPQG ATGPLGPKGT SGDPGIPGFK GEAGPKGEIG PAGLQGPPGQ
QGEEGKRGPR GEPGAAGPLG PPGERGAPGN RGFPGQDGLA GSKGAPGERG PSGASGPKGA
NGDPGRPGES GLPGARGLTG RPGDAGPQGK VGPSGAPGED GRPGPPGPQG ARGQPGVMGF
PGPKGASGEP GKSGEKGLAG APGLRGLPGK DGETGAAGPP GPAGPAGERG EQGQPGPSGF
QGLPGPPGPP GEGGKPGDQG VPGEAGASGT TGPRGERGFP GERGAAGPQG LQGPRGLPGT
PGTDGPKGAI GPHGSLGAQG PPGLQGMPGE RGGAGIPGPK GDRGDIGEKG PEGAPGKDGA
RGLTGPIGPP GPSGPNGEKG ETGPAGPSGA PGTRGTPGDR GETGSPGPAG FAGPPGADGQ
PGIKGEQGET GQKGDAGAPG PQGPSGAPGP AGPTGVFGPK GARGAQGPPG ATGFPGAAGR
VGPPGPNGNP GPAGPAGSPG KDGPKGIRGD AGPPGRQGDA GLRGPAGPSG EKGDAGEDGP
VGPPGPSGPQ GLAGQRGIVG LPGQRGERGF PGLPGPSGEP GKQGAPGTGG DRGPPGPVGP
PGLTGPAGEL GRE
//