ID A0A287CUZ8_ICTTR Unreviewed; 680 AA.
AC A0A287CUZ8;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Collagen type IX alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000025096.1};
GN Name=COL9A1 {ECO:0000313|Ensembl:ENSSTOP00000025096.1};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000025096.1, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000025096.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01077895; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A287CUZ8; -.
DR Ensembl; ENSSTOT00000039546.1; ENSSTOP00000025096.1; ENSSTOG00000005082.3.
DR GeneTree; ENSGT00940000157935; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 8.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..680
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012538502"
FT REGION 28..518
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 540..680
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 31..45
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 56..76
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..158
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..213
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 550..566
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 643..660
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 680 AA; 64752 MW; 2C75084BB3AF04B5 CRC64;
MAWIAPDHRA LGLRLLLSGL CLCAAQRGPP GEQGPPGPPG PPGVPGIDGI DGDRGPKGPP
GPPGPPGEPG KPGAPGKPGT PGADGLTGPD GSPGSVGPRG QKGEPGVPGS RGFPGRGIPG
PPGPPGTAGL PGELGRVGPI GDPGKRGPPG PPGPPGPSGT IGFHDGDPLC PNSCPPGRSG
YPGLPGMRGH KGAKGEIGEP GRQGHKGEEG DQGELGEVGA QGPPGAQGLR GITGTVGDKG
EKGARGLDGE PGPQGLPGAP GDQGQRGPPG EAGPKGDRGA QGPRGIPGPP GPKGDTGLPG
VDGRDGIPGM PGTKGEPGKP GPPGDAGLQG LPGVPGIPGA KGVAGEKGNT GAPGKPGQLG
NSGKPGQQGP PGEVGPRGPR GLPGSRGETG PVGSPGLPGK PGSFGSPGLP GLPGPPGLPG
MKGDRGVFGE QGPKGEQGAS GEEGEAGERG DLGDIGLPGP KGSMGNPGEP GLRGPEGSRG
LPGAEGPRGP PGPRGVQGEQ GATGLPGIQG PPGRAPTDQH IKQVCMRVIQ EHFAEMAASL
KRPDSGASGL PGRPGPPGPP GPPGENGFPG QMGIRGLPGI KGPPGALGLR GPKGDLGEKG
ERGPPGRGPK GLPGAIGLPG DPGPASYGRN GRDGERGPPG VAGIPGVPGP PGPPGPPGFC
EPASCTLQAG QRAFSKGPDQ
//