GenomeNet

Database: UniProt
Entry: I3N1R0_ICTTR
LinkDB: I3N1R0_ICTTR
Original site: I3N1R0_ICTTR 
ID   I3N1R0_ICTTR            Unreviewed;      1141 AA.
AC   I3N1R0;
DT   11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT   22-NOV-2017, sequence version 2.
DT   27-MAR-2024, entry version 65.
DE   SubName: Full=Collagen type XXVIII alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000018306.2};
GN   Name=COL28A1 {ECO:0000313|Ensembl:ENSSTOP00000018306.2};
OS   Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS   tridecemlineatus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC   Xerinae; Marmotini; Ictidomys.
OX   NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000018306.2, ECO:0000313|Proteomes:UP000005215};
RN   [1] {ECO:0000313|Proteomes:UP000005215}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   The Broad Institute Genome Assembly & Analysis Group;
RG   Computational R&D Group;
RG   and Sequencing Platform;
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT   "The Draft Genome of Spermophilus tridecemlineatus.";
RL   Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSSTOP00000018306.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGTP01068069; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGTP01068070; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGTP01068071; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGTP01068072; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGTP01068073; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGTP01068074; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGTP01068075; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; I3N1R0; -.
DR   STRING; 43179.ENSSTOP00000018306; -.
DR   Ensembl; ENSSTOT00000026598.2; ENSSTOP00000018306.2; ENSSTOG00000023884.2.
DR   eggNOG; KOG1217; Eukaryota.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000161647; -.
DR   HOGENOM; CLU_009158_0_0_1; -.
DR   InParanoid; I3N1R0; -.
DR   OrthoDB; 2906665at2759; -.
DR   TreeFam; TF331207; -.
DR   Proteomes; UP000005215; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:Ensembl.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd01472; vWA_collagen; 1.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF878; COLLAGEN ALPHA-1(XXVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00014; Kunitz_BPTI; 1.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00131; KU; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF57362; BPTI-like; 1.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR   PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1141
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012090592"
FT   DOMAIN          48..226
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          797..975
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1088..1138
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   REGION          241..770
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1008..1036
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        585..599
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        738..753
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1141 AA;  118409 MW;  7A4123E40D1621B3 CRC64;
     MWSRYFAFCL LLLPIFMSQT VYGQRKKGTK SNLLARKNDL QDSVCFIDVL FIVDSSESSK
     LVLFDKQKNF VDSLSDKIFQ LTPGHSLKYD IKLAALQFSS SVQIDPPFSS WKDLQTFKQR
     VKSMNFIGQG TFSYYAISNA TRLLKREGRK DGVKVVLLMT DGMDHPKNPD VQSISESARN
     AGISFITIGL LTVNKTKLHL ISGDAPSEPI LLLSDPTIVD KIRDRLDILF EKKCERKICE
     CEKGDPGEPG PPGKHGNPGI KGERGPKGNP GDAQKGETGE RGPGGIPGYK GDKGEPGECG
     KPGIKGDKGS PGPSGPKGPK GLQGIGGPPG DPGPKGFQGN KGEPGPPGPY GPPGAPGIGQ
     QGIKGERGQE GRTGAPGPIG VGQPGQPGPR GPEGAPGERG LPGEGFPGPK GEKGSEGPIG
     PQGLQGLSIK GDKGDLGPVG PQGPMGIPGT GSQGEQGIQG PIGPPGPQGP PGQGLPGSKG
     EVGPIGPTGP RGPVGIGAQG PKGEPGSKGS PGQTGEPGED GAAGKKGEAG LPGTRGPEGP
     PGKGQPGPKG DEGKKGSKGN QGHMGFPGPA GPKGEQGIMG PFGMPGPSIP GPPGPKGDRG
     GPGMPGFKGE PGLFIRGPKG AQGPQGPKGA PGLKGDGYPG VSGPRGLPGP PGPMGLRGVG
     DTGAKGEPGV RGPPGPAGPR GIGTQGPKGD IGQKGLPGPP GPPGYGSQGI KGEQGPQGFP
     GPKGTRGQGL PGQKGEHGEQ GDVGKKGEKG ELGDPGSPGK QGLQGPKGDA GLTREDIIEL
     IIKICGCGRK CKETPLELVF VIDSSESVGL NNFQIIQNFV KSLADRISVD LATARIGIIN
     YSHKVEIVAD LRQFSSKDDF KLAVDNMQYL GEGTYTATAL QAANDMFKSA RPGIKKVALV
     ITDGQTDSRD KKNLTEVVKE AKDTGVEIFV IGVVKKSDPN FDMFLKEMNL IATDPQHVYH
     FDDFLTLLDT LKQKLSTKIC EDFPSYLAKV FGSSSSQPEF GVLGEELGVS TPEPPEEISE
     SFSVPGPKHE ENEPPEPTWA HRLVVTSSSE AAATLGPLLS TLESMKTRTP SPGLILHQDN
     LVHKDPRCLE DLKPGNCVDY VVRWYYDKQV NSCARFWYSG CSGSGNRFNS EKECQEICIQ
     G
//
DBGET integrated database retrieval system