ID I3M5I0_ICTTR Unreviewed; 921 AA.
AC I3M5I0;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Collagen type IX alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000004593.3};
GN Name=COL9A1 {ECO:0000313|Ensembl:ENSSTOP00000004593.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000004593.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000004593.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01077895; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_005334482.1; XM_005334425.1.
DR AlphaFoldDB; I3M5I0; -.
DR STRING; 43179.ENSSTOP00000004593; -.
DR Ensembl; ENSSTOT00000005117.3; ENSSTOP00000004593.3; ENSSTOG00000005082.3.
DR GeneID; 101976372; -.
DR CTD; 1297; -.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000157935; -.
DR HOGENOM; CLU_001074_18_1_1; -.
DR InParanoid; I3M5I0; -.
DR OrthoDB; 2968414at2759; -.
DR TreeFam; TF332900; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005594; C:collagen type IX trimer; IEA:Ensembl.
DR GO; GO:0030246; F:carbohydrate binding; IEA:Ensembl.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 11.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..921
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011522638"
FT DOMAIN 50..244
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 260..759
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 781..905
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 272..286
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 297..317
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..399
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..454
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..807
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 884..901
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 921 AA; 92029 MW; 8C24463BDC044E8C CRC64;
MKNRWKIPVF FFVCVFLGSW ASAAVKRRPR FPVNSNSNGE NELCPKIRIG QDDLPGFDLI
SQFQVDKAAS RRAIQRVVGS TSLQVAYKLG NNVDFRIPTR HLYPSGLPEE YSFLTTFRMT
GSTLEKNWSI WQIQDSSGKE QVGVKINGQT QSVAFSYKGL DGSLQTAAFS NLPSLFDSQW
HKIMIGVERS SATLFVDCNR IESLPIKPRG QIDVDGFAVL GKLADNPQVS VPFELQWMLI
HCDPMRPSRE TCHELPVRIT PTQTTDQRGP PGEQGPPGPP GPPGVPGIDG IDGDRGPKGP
PGPPGPPGEP GKPGAPGKPG TPGADGLTGP DGSPGSVGPR GQKGEPGVPG SRGFPGRGIP
GPPGPPGTAG LPGELGRVGP IGDPGKRGPP GPPGPPGPSG TIGFHDGDPL CPNSCPPGRS
GYPGLPGMRG HKGAKGEIGE PGRQGHKGEE GDQGELGEVG AQGPPGAQGL RGITGTVGDK
GEKGARGLDG EPGPQGLPGA PGDQGQRGPP GEAGPKGDRG AQGPRGIPGP PGPKGDTGLP
GVDGRDGIPG MPGTKGEPGK PGPPGDAGLQ GLPGVPGIPG AKGVAGEKGN TGAPGKPGQL
GNSGKPGQQG PPGEVGPRGP RGLPGSRGET GPVGSPGLPG KPGSFGSPGL PGLPGPPGLP
GMKGDRGVFG EQGPKGEQGA SGEEGEAGER GDLGDIGLPG PKGSMGNPGE PGLRGPEGSR
GLPGAEGPRG PPGPRGVQGE QGATGLPGIQ GPPGRAPTDQ HIKQVCMRVI QEHFAEMAAS
LKRPDSGASG LPGRPGPPGP PGPPGENGFP GQMGIRGLPG IKGPPGALGL RGPKGDLGEK
GERGPPGRGP KGLPGAIGLP GDPGPASYGR NGRDGERGPP GVAGIPGVPG PPGPPGPPGF
CEPASCTLQA GQRAFSKGPD Q
//