GenomeNet

Database: UniProt
Entry: A0A287CUZ8_ICTTR
LinkDB: A0A287CUZ8_ICTTR
Original site: A0A287CUZ8_ICTTR 
ID   A0A287CUZ8_ICTTR        Unreviewed;       680 AA.
AC   A0A287CUZ8;
DT   22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT   22-NOV-2017, sequence version 1.
DT   27-MAR-2024, entry version 22.
DE   SubName: Full=Collagen type IX alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000025096.1};
GN   Name=COL9A1 {ECO:0000313|Ensembl:ENSSTOP00000025096.1};
OS   Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS   tridecemlineatus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC   Xerinae; Marmotini; Ictidomys.
OX   NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000025096.1, ECO:0000313|Proteomes:UP000005215};
RN   [1] {ECO:0000313|Proteomes:UP000005215}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   The Broad Institute Genome Assembly & Analysis Group;
RG   Computational R&D Group;
RG   and Sequencing Platform;
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT   "The Draft Genome of Spermophilus tridecemlineatus.";
RL   Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSSTOP00000025096.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGTP01077895; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A287CUZ8; -.
DR   Ensembl; ENSSTOT00000039546.1; ENSSTOP00000025096.1; ENSSTOG00000005082.3.
DR   GeneTree; ENSGT00940000157935; -.
DR   Proteomes; UP000005215; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 8.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..680
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012538502"
FT   REGION          28..518
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          540..680
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        31..45
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        56..76
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        144..158
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        192..213
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        550..566
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        643..660
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   680 AA;  64752 MW;  2C75084BB3AF04B5 CRC64;
     MAWIAPDHRA LGLRLLLSGL CLCAAQRGPP GEQGPPGPPG PPGVPGIDGI DGDRGPKGPP
     GPPGPPGEPG KPGAPGKPGT PGADGLTGPD GSPGSVGPRG QKGEPGVPGS RGFPGRGIPG
     PPGPPGTAGL PGELGRVGPI GDPGKRGPPG PPGPPGPSGT IGFHDGDPLC PNSCPPGRSG
     YPGLPGMRGH KGAKGEIGEP GRQGHKGEEG DQGELGEVGA QGPPGAQGLR GITGTVGDKG
     EKGARGLDGE PGPQGLPGAP GDQGQRGPPG EAGPKGDRGA QGPRGIPGPP GPKGDTGLPG
     VDGRDGIPGM PGTKGEPGKP GPPGDAGLQG LPGVPGIPGA KGVAGEKGNT GAPGKPGQLG
     NSGKPGQQGP PGEVGPRGPR GLPGSRGETG PVGSPGLPGK PGSFGSPGLP GLPGPPGLPG
     MKGDRGVFGE QGPKGEQGAS GEEGEAGERG DLGDIGLPGP KGSMGNPGEP GLRGPEGSRG
     LPGAEGPRGP PGPRGVQGEQ GATGLPGIQG PPGRAPTDQH IKQVCMRVIQ EHFAEMAASL
     KRPDSGASGL PGRPGPPGPP GPPGENGFPG QMGIRGLPGI KGPPGALGLR GPKGDLGEKG
     ERGPPGRGPK GLPGAIGLPG DPGPASYGRN GRDGERGPPG VAGIPGVPGP PGPPGPPGFC
     EPASCTLQAG QRAFSKGPDQ
//
DBGET integrated database retrieval system