GenomeNet

Database: UniProt
Entry: A0A3B4TQX3_SERDU
LinkDB: A0A3B4TQX3_SERDU
Original site: A0A3B4TQX3_SERDU 
ID   A0A3B4TQX3_SERDU        Unreviewed;      1322 AA.
AC   A0A3B4TQX3;
DT   05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 1.
DT   27-MAR-2024, entry version 27.
DE   SubName: Full=Collagen type I alpha 2 chain {ECO:0000313|Ensembl:ENSSDUP00000008659.1};
OS   Seriola dumerili (Greater amberjack) (Caranx dumerili).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Carangaria; Carangiformes; Carangidae; Seriola.
OX   NCBI_TaxID=41447 {ECO:0000313|Ensembl:ENSSDUP00000008659.1, ECO:0000313|Proteomes:UP000261420};
RN   [1] {ECO:0000313|Ensembl:ENSSDUP00000008659.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 41447.ENSSDUP00000008659; -.
DR   Ensembl; ENSSDUT00000008826.1; ENSSDUP00000008659.1; ENSSDUG00000006239.1.
DR   GeneTree; ENSGT00940000155639; -.
DR   OMA; SFYWIDP; -.
DR   Proteomes; UP000261420; Unplaced.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0009612; P:response to mechanical stimulus; IEA:Ensembl.
DR   GO; GO:0001501; P:skeletal system development; IEA:Ensembl.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 8.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..1322
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5017305595"
FT   DOMAIN          1090..1322
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          22..1076
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        56..70
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        209..223
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1046..1064
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1322 AA;  124129 MW;  7C267C609E333E14 CRC64;
     MLSFVDTRIL LLLAVTSYLA SCQGPRGEKG PRGDRNFAAQ YDGVKAPDPG PGPMGIMGAR
     GPPGPPGPPG SQGHTGHAGE PGEPGQTGPV GPRGPPGPPG KSGEDGNNGR PGKPGDRGAP
     GPQGARGFPG TPGLPGMKGH RGYTGLDGRK GEPGAAGAKG EPGAHGAAGS PGLAGSRGLP
     GERGRAGPAG PAGARGADGN VGPAGPAGPL GAAGPPGFPG GPGPKGEIGP VGATGPSGPQ
     GSRGEPGPNG AVGPVGPAGN PGANGLNGAK GAAGTPGVAG APGFPGPRGG PGPQGPQGAT
     GPRGLAGDPG AQGVKGDGGP KGEPGNSGPQ GAPGPQGEEG KRGPTGELGA TGPAGNRGAR
     GASGSRGMPG SEGRTGPIGM PGARGSTGSA GPRGPPGDAG RAGEPGPAGL RGLPGSPGSS
     GPPGKEGPAG PAGQDGRTGP PGPTGPRGQP GNIGFPGPKG PSGEPGKPGE KGATGPTGLR
     GAPGADGNNG ATGAMGPAGG PGDKGEQGPS GAPGFQGLPG PAGPAGEAGK AGDRGIPGDQ
     GVAGPAGSKG ERGNPGAAGA SGAQGPIGAR GPVGAPGADG GKGEPGVAGN AGGPGPQGPG
     GMPGERGAAG PPGPKGEKGE NGHRGPDGNA GRDGSRGLPG PAGPPGPTGA NGDKGESGAF
     GPAGPAGPRG ASGERGEVGP AGAPGFAGPP GADGQPGARG ERGPGGGKGE LGPAGPAGPA
     GQSGPAGPSG PSGPGGARGD TGPPGLTGFP GAAGRVGAAG PSGIVGPPGA AGPAGKDGPR
     GLRGDSGPAG PSGEQGMVGP PGPAGDKGPS GEAGPPGAPG VPGSIGPLGI QGFVGLPGTR
     GDRGSPGGAG ALQGEAGRVG PAGPPGARGP PGNIGLPGMT GPQGEAGREG NAGNDGPPGR
     PGIPGFKGDR GEPGPAGSMG LAGAPGPAGP TGGAGRPGNR GEAGPGGPAG PVGAAGARGA
     AGPAGPRGEK GVAGDKGERG MKGLRGHPGL QGMPGPSGPS GDTGAAGANG PSGPRGASGP
     HGPAGKDGRA GGHGTIGSPG ARGPPGYVGP AGPPGPPGLP GPPGPAGGGY DVSGYDEYRA
     DQPALRAKDY EVDATIKSLN TQIENLLTPE GSRKNPARTC RDIKLSHPDW SSFYWIDPNQ
     GCINDAIKVF CDFTTRETCI YAHPESIARK NWFRSTEGKK HVWFGETING GTEFTYNDET
     LSPQSMATQL AFMRLLSNQA SQNITYHCKN SVAYMDGESG NLKKAVVLQG SNDVELRAEG
     NSRFTFSVLE DGCTRHTGEW SKTVIEYRTN KPSRLPILDI APLDIGGADQ EFGLDIGPVC
     FK
//
DBGET integrated database retrieval system