GenomeNet

Database: UniProt
Entry: H3BYV8_TETNG
LinkDB: H3BYV8_TETNG
Original site: H3BYV8_TETNG 
ID   H3BYV8_TETNG            Unreviewed;       973 AA.
AC   H3BYV8;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 59.
DE   SubName: Full=Collagen type II alpha 1 chain {ECO:0000313|Ensembl:ENSTNIP00000001174.1};
GN   Name=COL2A1 {ECO:0000313|Ensembl:ENSTNIP00000001174.1};
OS   Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS   nigroviridis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX   NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000001174.1, ECO:0000313|Proteomes:UP000007303};
RN   [1] {ECO:0000313|Proteomes:UP000007303}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=15496914; DOI=10.1038/nature03025;
RA   Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA   Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA   Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA   Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA   Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA   Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA   Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA   McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA   Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA   Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA   Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT   "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT   early vertebrate proto-karyotype.";
RL   Nature 431:946-957(2004).
RN   [2] {ECO:0000313|Ensembl:ENSTNIP00000001174.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H3BYV8; -.
DR   Ensembl; ENSTNIT00000003550.1; ENSTNIP00000001174.1; ENSTNIG00000014825.1.
DR   GeneTree; ENSGT00940000155224; -.
DR   InParanoid; H3BYV8; -.
DR   OMA; RGQQIIN; -.
DR   Proteomes; UP000007303; Unassembled WGS sequence.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF58; COLLAGEN ALPHA-1(II) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF00093; VWC; 1.
DR   SMART; SM00214; VWC; 1.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..973
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003580432"
FT   DOMAIN          37..95
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   REGION          105..973
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        121..154
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        330..344
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        368..382
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   973 AA;  90620 MW;  BAFD0171A09B7165 CRC64;
     MFIFMDSRIV LLLVASQVCL LTVVRCQEED DRKFSGGGCM QDDQRYSDKD VWKPEPCRIC
     VCDTGTVLCD EIVCEELKDC PNPEIPFGEC CPICAADQSP PMGCNGSAGP RGRDGEPGTP
     GNPGNPGPPG PPGPPGLGGG PMGPRGPPGP SGAPHLLPCS IFIFPGRPAD RQGPMGPRGP
     PGPSGKPGED GEAGKPGKSG ERGPTGPQGA RGFPGTPGLP GIKGHRGYPG LDGAKGETGA
     VGSKGESGAP GENGAPGPLG PRGLPGERGR PGPSGVAGAR GNDGLSGPAG PPGPVGPSGA
     PGFPGSPGAK GEAGPTGARG PEGAQGPRGE SGTPGSSGPS GASGNPGTDG IPGAKGSAGA
     PGIAGAPGFP GPRGPPGPQG ATGPLGPKGT SGDPGIPGFK GEAGPKGEIG PAGLQGPPGQ
     QGEEGKRGPR GEPGAAGPLG PPGERGAPGN RGFPGQDGLA GSKGAPGERG PSGASGPKGA
     NGDPGRPGES GLPGARGLTG RPGDAGPQGK VGPSGAPGED GRPGPPGPQG ARGQPGVMGF
     PGPKGASGEP GKSGEKGLAG APGLRGLPGK DGETGAAGPP GPAGPAGERG EQGQPGPSGF
     QGLPGPPGPP GEGGKPGDQG VPGEAGASGT TGPRGERGFP GERGAAGPQG LQGPRGLPGT
     PGTDGPKGAI GPHGSLGAQG PPGLQGMPGE RGGAGIPGPK GDRGDIGEKG PEGAPGKDGA
     RGLTGPIGPP GPSGPNGEKG ETGPAGPSGA PGTRGTPGDR GETGSPGPAG FAGPPGADGQ
     PGIKGEQGET GQKGDAGAPG PQGPSGAPGP AGPTGVFGPK GARGAQGPPG ATGFPGAAGR
     VGPPGPNGNP GPAGPAGSPG KDGPKGIRGD AGPPGRQGDA GLRGPAGPSG EKGDAGEDGP
     VGPPGPSGPQ GLAGQRGIVG LPGQRGERGF PGLPGPSGEP GKQGAPGTGG DRGPPGPVGP
     PGLTGPAGEL GRE
//
DBGET integrated database retrieval system