GenomeNet

Database: UniProt
Entry: A0A3Q0GJ22_ALLSI
LinkDB: A0A3Q0GJ22_ALLSI
Original site: A0A3Q0GJ22_ALLSI 
ID   A0A3Q0GJ22_ALLSI        Unreviewed;      1522 AA.
AC   A0A3Q0GJ22;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   27-MAR-2024, entry version 24.
DE   SubName: Full=Collagen alpha-3(V) chain isoform X1 {ECO:0000313|RefSeq:XP_025059676.1};
GN   Name=COL5A3 {ECO:0000313|RefSeq:XP_025059676.1};
OS   Alligator sinensis (Chinese alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=38654 {ECO:0000313|Proteomes:UP000189705, ECO:0000313|RefSeq:XP_025059676.1};
RN   [1] {ECO:0000313|RefSeq:XP_025059676.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_025059676.1; XM_025203891.1.
DR   KEGG; asn:102383354; -.
DR   InParanoid; A0A3Q0GJ22; -.
DR   OrthoDB; 2970887at2759; -.
DR   Proteomes; UP000189705; Unplaced.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   SMART; SM00038; COLFI; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|RefSeq:XP_025059676.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000189705};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..1522
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018224290"
FT   DOMAIN          1292..1521
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          79..291
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          364..1267
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        95..116
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        145..166
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        238..266
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        489..503
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        727..741
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        844..858
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1070..1085
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1212..1228
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1247..1264
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1522 AA;  149837 MW;  3097C7D0DE7BA060 CRC64;
     MLWLDCMPVA TLPLARGLWP VVSTEGITVF GARLMDEEVF QGDVQQLLIV PDPAAAQVYC
     EHYMPGCDVP LAYPLQAPFP EQQSRSEPTS TPRRKGKKGK GKGKGQGRRK GKGKKKRNKE
     QLEVPTPLMA LGQDESLVAP SEDEHSTPVP TTNPVPTVTT SPGHTNPHED LVLGRDGNPS
     ATGSWPQEYE EYEEATEPLG PGRFGPAEPD DTMVWTMQKG SLKGEKGEPG AVEPGQQFEG
     PPGPPGPSGE MGPAGPPGPP GFPGDPGDWG PAGRPGVPGA DGARGPPGTM IMLPFQFGGD
     THKGPAVSFQ EAQAQAILQQ AKLAMKGPPG PMGLTGRPGP LGLPGYPGLK GDTGDMGLPG
     PRGIQGLMGP PGRLGKRGRS GADGARGLPG DTGPKGDRGF DGLPGLPGEK GHKGEVGKPG
     SPGPPGEMGP KGTDGLPGPR GQPGEPGMRG LAGSRGVPGS PGPLGPSGID GATGPKGNQG
     IMGEPGPPGQ QGNPGPQGLP GPQGPVGLPG EKGVLGKPGI PGVAGADGPP GHPGKEGPTG
     DKGLQGPPGT AGPVGYPGPR GVKGTFGARG LKGSKGEKGE DGFPGFKGDM GTKGDRGDQG
     PLGLRGEDGP EGLKGQMGTA GEPGPPGLAG EKGKLGVPGL PGYPGRQGPK GSTGFQGSVG
     LAGEKGKRGK AGQAGQTGPR GSPGLPGERG QPGSTGKPGP KGESGHDGAP GTAGEKGPQG
     PQGSSGFPGP KGPPGPPGKD GLPGHPGQRG EPGFHGKTGP PGPTGVVGPQ GHSGETGPMG
     ERGHPGPPGS PGEQGLPGAA GREGTKGDPG PAGTPGRSGP PGVHGFPGAR GASGEIGPAG
     LKGGEGPPGP PGPTGSPGER GPSGPAGGIG LPGRGGAQGP LGPAGEKGSP GERGPLGSAG
     HDGIQGPVGL PGAAGPPGPA GEDGDKGEMG LPGQKGSKGD KGEPHSSGLW STPCSWGDHA
     HASPFWGPDT SLVSLQGEPG EAGDPGPAGE PGHPGTKGDV GEKGDAGPSG AAGAPGKRGP
     PGDDGAKGDL GPIGFPGDPG PPGDPGVSGL DGAPGDKGDG GNPGALGPPG ASGEPGPPGP
     PGRRGPAGPM GREGHQGDKG TKGEPGTEGP PGKMGPVGAQ GPPGRLGPDG LHGIPGPAGE
     QGLLGAPGQA GPPGPMGPVG LPGLKGDPGH KGDKGHAGLI GLIGPPGEMG EKGDRGLPGV
     QGPMGPKGNP GMTGPLGPPG PPGPPGLSGP VGQKGSKGSP GALGPRGDTG PPGPPGPPGP
     SMELLEPLPL AGGRRQRRGA PGLSPKSTEG LEEVHAVLSS LQAEVEQLRR PHGTPDSPGR
     ACAELWLSHP HLPDGEYWID PNQGCSRDAF RVFCNFTAGG ETCLFPDKKF EAVRLAAWSR
     EKPESWFSSF KRGQKFSYVD VDGHIVPVPQ VTFLRLLSAS AYQTFALTCQ NAAAWFDASA
     SSFTRALRLR GANGEELGHS HPSAPIHALA DGCQVRRGQA RTVLEVRGPH VEWLPLADVA
     VTDFGGAGQK FGFELGPVCF VG
//
DBGET integrated database retrieval system