GenomeNet

Database: UniProt
Entry: H2T1T8_TAKRU
LinkDB: H2T1T8_TAKRU
Original site: H2T1T8_TAKRU 
ID   H2T1T8_TAKRU            Unreviewed;      1456 AA.
AC   H2T1T8;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   28-MAR-2018, entry version 31.
DE   SubName: Full=Collagen, type I, alpha 1b {ECO:0000313|Ensembl:ENSTRUP00000018626};
GN   Name=LOC101063107 {ECO:0000313|Ensembl:ENSTRUP00000018626};
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae;
OC   Takifugu.
OX   NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000018626, ECO:0000313|Proteomes:UP000005226};
RN   [1] {ECO:0000313|Ensembl:ENSTRUP00000018626}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21551351;
RA   Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A.,
RA   Hosoya S., Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.;
RT   "Integration of the genetic map and genome assembly of fugu
RT   facilitates insights into distinct features of genome evolution in
RT   teleosts and mammals.";
RL   Genome Biol. Evol. 3:424-442(2011).
RN   [2] {ECO:0000313|Ensembl:ENSTRUP00000018626}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2012) to UniProtKB.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   Ensembl; ENSTRUT00000018702; ENSTRUP00000018626; ENSTRUG00000007520.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; ENOG410XNMM; LUCA.
DR   GeneTree; ENSGT00900000140789; -.
DR   Proteomes; UP000005226; Unplaced.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   InterPro; IPR001007; VWF_dom.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 6.
DR   Pfam; PF00093; VWC; 1.
DR   ProDom; PD002078; Fib_collagen_C; 1.
DR   SMART; SM00038; COLFI; 1.
DR   SMART; SM00214; VWC; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
DR   PROSITE; PS01208; VWFC_1; 1.
DR   PROSITE; PS50184; VWFC_2; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000005226};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005226};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     22       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        23   1456       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5003573596.
FT   DOMAIN       31     89       VWFC. {ECO:0000259|PROSITE:PS50184}.
FT   DOMAIN     1221   1456       Fibrillar collagen NC1.
FT                                {ECO:0000259|PROSITE:PS51461}.
SQ   SEQUENCE   1456 AA;  137350 MW;  C385AD834A0FCB7F CRC64;
     MFSFVDIRLA LLLSAAVLLV RGQGEDDSTF GSCTLEGQLY NDKDVWKPEP CQICVCDSGT
     VMCDEVICED TSECADPIIP EGECCPICPD AEGTQSPVDE EIIRSEVGRS STGQRGPTGP
     AGPPGRDGLD GRPGPAGPPG PPGPPGLGGN FSPQMGYVDH TKSGGGPAIP GPMGPMGSRG
     SPGPPGSSGP QGFTGPAGEP GEPGSPGPMG PRGPSGPPGK NGDDGEAGKS GRPGERGVAG
     TPGARGIPGT AGLPGIKGHR GFSGLDGAKG DSGPAGPKGE PGISGENGIP GSMGARGLPG
     ERGRPGAPGP SGARGNDGNS GPSGPPGPTG PSGPPGFPGG AGAKGETGPQ GGRGSDGPAG
     SRGEPGNPGP AGAAGAAGVP GSDGSPGAKG APGAAGIAGS PGFPGSRGPA GAQGAVGAPG
     PKGNNGDPGP SGPKGEPGVK GEPGPVGVQG LAGPSGEEGK RGPRGEPGGG GPRGPPGERG
     APGGRGFPGG DGAAGGKGAP GERGSPGPAG AQGATGESGN PGAPGAPGSK GVTGSPGSAG
     PDGKAGPAGV PGQDGRSGPP GSGGARGQPG VMGFPGPKGT AGEPGKVGER GAVGVAGAVG
     APGKDGDAGA PGPAGVAGPA GEKGEQGPAG PPGFQGLPGP QGATGETGKP GEQGVAGEVG
     SPGPSGPRGD RGFPGERGAP GPAGPTGPRG SPGPSGNDGP KGEPGAAGNP GSAGGPGMQG
     MPGERGAAGL PGAKGERGEA GGKGGDGAAG KDGSRGMTGP MGAPGPSGAQ GEKGEPGPVG
     VAGPTGPRGA PGDRGEAGPA GHAGFAGPPG ADGQPGAKGE SGETGPKGDA GPPGTTGPAG
     SSGPQGPAGP SGPKGATGGA GAPGATGFPG PAGRVGPPGP AGVAGPPGPI GAVGKDGARG
     ARGETGPAGR PGEAGAVGAP GPSGEKGSPG ADGAPGPAGI SGPQGIGGQR GTVGIPGQRG
     ERGFPGLAGP AGEPGKQGSS GPVGERGPPG PAGPPGLSGA PGEPGREGSS GHDGAPGRDG
     APGPKGDRGE SGVAGPPGPP GAPGAPGAVG PSGKTGDRGE AGPAGPAGPS GPAGVRGPAG
     PAGAKGDRGE AGEAGERGHK GHRGFSGMSG LPGPAGSHGE RGPAGASGPA GPRGPAGSSG
     SPGKDGMNGL PGVMGPPGPR GRNGEMGAAG PPGPPGLPGP PGAPGGGFDF ISQPIQEKAP
     DPLRGGYYRA DDPNMMHDRD MEVDTTLKTL TQKVEKIRSP DGTQKSPARM CRDLRMCHPE
     WKSGMYWVDP NQGSTLDAIK VHCNMETGET CIYPSDSSIP MKNWYLSKNM KEKKHVWFSE
     SMTGGFQFQY GMAGADSDDV NIQMTFMRLM SNQASQNITY HCKNSIAYMD STTGNLKKAL
     LLQGSNDVEI RAEGNSRFTY SVSEDGCTSH TGSWGKTVID YKTSKTSRLP IIDIAPMDVG
     APDQEFGVEV GPVCFL
//
DBGET integrated database retrieval system