ID A0A3Q0GJ22_ALLSI Unreviewed; 1522 AA.
AC A0A3Q0GJ22;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Collagen alpha-3(V) chain isoform X1 {ECO:0000313|RefSeq:XP_025059676.1};
GN Name=COL5A3 {ECO:0000313|RefSeq:XP_025059676.1};
OS Alligator sinensis (Chinese alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=38654 {ECO:0000313|Proteomes:UP000189705, ECO:0000313|RefSeq:XP_025059676.1};
RN [1] {ECO:0000313|RefSeq:XP_025059676.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_025059676.1; XM_025203891.1.
DR KEGG; asn:102383354; -.
DR InParanoid; A0A3Q0GJ22; -.
DR OrthoDB; 2970887at2759; -.
DR Proteomes; UP000189705; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF42; COLLAGEN ALPHA-1(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 5.
DR SMART; SM00038; COLFI; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|RefSeq:XP_025059676.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000189705};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..1522
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018224290"
FT DOMAIN 1292..1521
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 79..291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 364..1267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..116
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 145..166
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..266
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..503
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 727..741
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 844..858
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1070..1085
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1212..1228
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1247..1264
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1522 AA; 149837 MW; 3097C7D0DE7BA060 CRC64;
MLWLDCMPVA TLPLARGLWP VVSTEGITVF GARLMDEEVF QGDVQQLLIV PDPAAAQVYC
EHYMPGCDVP LAYPLQAPFP EQQSRSEPTS TPRRKGKKGK GKGKGQGRRK GKGKKKRNKE
QLEVPTPLMA LGQDESLVAP SEDEHSTPVP TTNPVPTVTT SPGHTNPHED LVLGRDGNPS
ATGSWPQEYE EYEEATEPLG PGRFGPAEPD DTMVWTMQKG SLKGEKGEPG AVEPGQQFEG
PPGPPGPSGE MGPAGPPGPP GFPGDPGDWG PAGRPGVPGA DGARGPPGTM IMLPFQFGGD
THKGPAVSFQ EAQAQAILQQ AKLAMKGPPG PMGLTGRPGP LGLPGYPGLK GDTGDMGLPG
PRGIQGLMGP PGRLGKRGRS GADGARGLPG DTGPKGDRGF DGLPGLPGEK GHKGEVGKPG
SPGPPGEMGP KGTDGLPGPR GQPGEPGMRG LAGSRGVPGS PGPLGPSGID GATGPKGNQG
IMGEPGPPGQ QGNPGPQGLP GPQGPVGLPG EKGVLGKPGI PGVAGADGPP GHPGKEGPTG
DKGLQGPPGT AGPVGYPGPR GVKGTFGARG LKGSKGEKGE DGFPGFKGDM GTKGDRGDQG
PLGLRGEDGP EGLKGQMGTA GEPGPPGLAG EKGKLGVPGL PGYPGRQGPK GSTGFQGSVG
LAGEKGKRGK AGQAGQTGPR GSPGLPGERG QPGSTGKPGP KGESGHDGAP GTAGEKGPQG
PQGSSGFPGP KGPPGPPGKD GLPGHPGQRG EPGFHGKTGP PGPTGVVGPQ GHSGETGPMG
ERGHPGPPGS PGEQGLPGAA GREGTKGDPG PAGTPGRSGP PGVHGFPGAR GASGEIGPAG
LKGGEGPPGP PGPTGSPGER GPSGPAGGIG LPGRGGAQGP LGPAGEKGSP GERGPLGSAG
HDGIQGPVGL PGAAGPPGPA GEDGDKGEMG LPGQKGSKGD KGEPHSSGLW STPCSWGDHA
HASPFWGPDT SLVSLQGEPG EAGDPGPAGE PGHPGTKGDV GEKGDAGPSG AAGAPGKRGP
PGDDGAKGDL GPIGFPGDPG PPGDPGVSGL DGAPGDKGDG GNPGALGPPG ASGEPGPPGP
PGRRGPAGPM GREGHQGDKG TKGEPGTEGP PGKMGPVGAQ GPPGRLGPDG LHGIPGPAGE
QGLLGAPGQA GPPGPMGPVG LPGLKGDPGH KGDKGHAGLI GLIGPPGEMG EKGDRGLPGV
QGPMGPKGNP GMTGPLGPPG PPGPPGLSGP VGQKGSKGSP GALGPRGDTG PPGPPGPPGP
SMELLEPLPL AGGRRQRRGA PGLSPKSTEG LEEVHAVLSS LQAEVEQLRR PHGTPDSPGR
ACAELWLSHP HLPDGEYWID PNQGCSRDAF RVFCNFTAGG ETCLFPDKKF EAVRLAAWSR
EKPESWFSSF KRGQKFSYVD VDGHIVPVPQ VTFLRLLSAS AYQTFALTCQ NAAAWFDASA
SSFTRALRLR GANGEELGHS HPSAPIHALA DGCQVRRGQA RTVLEVRGPH VEWLPLADVA
VTDFGGAGQK FGFELGPVCF VG
//