GenomeNet

Database: UniProt
Entry: I3M5I0_ICTTR
LinkDB: I3M5I0_ICTTR
Original site: I3M5I0_ICTTR 
ID   I3M5I0_ICTTR            Unreviewed;       921 AA.
AC   I3M5I0;
DT   11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT   22-NOV-2017, sequence version 2.
DT   27-MAR-2024, entry version 65.
DE   SubName: Full=Collagen type IX alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000004593.3};
GN   Name=COL9A1 {ECO:0000313|Ensembl:ENSSTOP00000004593.3};
OS   Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS   tridecemlineatus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC   Xerinae; Marmotini; Ictidomys.
OX   NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000004593.3, ECO:0000313|Proteomes:UP000005215};
RN   [1] {ECO:0000313|Proteomes:UP000005215}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   The Broad Institute Genome Assembly & Analysis Group;
RG   Computational R&D Group;
RG   and Sequencing Platform;
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT   "The Draft Genome of Spermophilus tridecemlineatus.";
RL   Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSSTOP00000004593.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGTP01077895; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_005334482.1; XM_005334425.1.
DR   AlphaFoldDB; I3M5I0; -.
DR   STRING; 43179.ENSSTOP00000004593; -.
DR   Ensembl; ENSSTOT00000005117.3; ENSSTOP00000004593.3; ENSSTOG00000005082.3.
DR   GeneID; 101976372; -.
DR   CTD; 1297; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000157935; -.
DR   HOGENOM; CLU_001074_18_1_1; -.
DR   InParanoid; I3M5I0; -.
DR   OrthoDB; 2968414at2759; -.
DR   TreeFam; TF332900; -.
DR   Proteomes; UP000005215; Unassembled WGS sequence.
DR   GO; GO:0005594; C:collagen type IX trimer; IEA:Ensembl.
DR   GO; GO:0030246; F:carbohydrate binding; IEA:Ensembl.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR   Gene3D; 2.60.120.200; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 11.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..921
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5011522638"
FT   DOMAIN          50..244
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          260..759
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          781..905
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        272..286
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        297..317
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        385..399
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        433..454
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        791..807
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        884..901
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   921 AA;  92029 MW;  8C24463BDC044E8C CRC64;
     MKNRWKIPVF FFVCVFLGSW ASAAVKRRPR FPVNSNSNGE NELCPKIRIG QDDLPGFDLI
     SQFQVDKAAS RRAIQRVVGS TSLQVAYKLG NNVDFRIPTR HLYPSGLPEE YSFLTTFRMT
     GSTLEKNWSI WQIQDSSGKE QVGVKINGQT QSVAFSYKGL DGSLQTAAFS NLPSLFDSQW
     HKIMIGVERS SATLFVDCNR IESLPIKPRG QIDVDGFAVL GKLADNPQVS VPFELQWMLI
     HCDPMRPSRE TCHELPVRIT PTQTTDQRGP PGEQGPPGPP GPPGVPGIDG IDGDRGPKGP
     PGPPGPPGEP GKPGAPGKPG TPGADGLTGP DGSPGSVGPR GQKGEPGVPG SRGFPGRGIP
     GPPGPPGTAG LPGELGRVGP IGDPGKRGPP GPPGPPGPSG TIGFHDGDPL CPNSCPPGRS
     GYPGLPGMRG HKGAKGEIGE PGRQGHKGEE GDQGELGEVG AQGPPGAQGL RGITGTVGDK
     GEKGARGLDG EPGPQGLPGA PGDQGQRGPP GEAGPKGDRG AQGPRGIPGP PGPKGDTGLP
     GVDGRDGIPG MPGTKGEPGK PGPPGDAGLQ GLPGVPGIPG AKGVAGEKGN TGAPGKPGQL
     GNSGKPGQQG PPGEVGPRGP RGLPGSRGET GPVGSPGLPG KPGSFGSPGL PGLPGPPGLP
     GMKGDRGVFG EQGPKGEQGA SGEEGEAGER GDLGDIGLPG PKGSMGNPGE PGLRGPEGSR
     GLPGAEGPRG PPGPRGVQGE QGATGLPGIQ GPPGRAPTDQ HIKQVCMRVI QEHFAEMAAS
     LKRPDSGASG LPGRPGPPGP PGPPGENGFP GQMGIRGLPG IKGPPGALGL RGPKGDLGEK
     GERGPPGRGP KGLPGAIGLP GDPGPASYGR NGRDGERGPP GVAGIPGVPG PPGPPGPPGF
     CEPASCTLQA GQRAFSKGPD Q
//
DBGET integrated database retrieval system