ID I3M377_ICTTR Unreviewed; 1195 AA.
AC I3M377;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 62.
DE RecName: Full=Cingulin {ECO:0000256|ARBA:ARBA00044075};
GN Name=Cgn {ECO:0000313|Ensembl:ENSSTOP00000003488.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000003488.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000003488.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Probably plays a role in the formation and regulation of the
CC tight junction (TJ) paracellular permeability barrier.
CC {ECO:0000256|ARBA:ARBA00043864}.
CC -!- SIMILARITY: Belongs to the cingulin family.
CC {ECO:0000256|ARBA:ARBA00038467}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01066266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01066267; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; I3M377; -.
DR STRING; 43179.ENSSTOP00000003488; -.
DR Ensembl; ENSSTOT00000003892.3; ENSSTOP00000003488.3; ENSSTOG00000003867.3.
DR eggNOG; ENOG502R9EI; Eukaryota.
DR GeneTree; ENSGT00940000162698; -.
DR HOGENOM; CLU_002036_0_0_1; -.
DR InParanoid; I3M377; -.
DR TreeFam; TF332247; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0016459; C:myosin complex; IEA:InterPro.
DR InterPro; IPR002928; Myosin_tail.
DR PANTHER; PTHR46349:SF4; CINGULIN; 1.
DR PANTHER; PTHR46349; CINGULIN-LIKE PROTEIN 1-RELATED; 1.
DR Pfam; PF01576; Myosin_tail_1; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215}.
FT DOMAIN 814..1145
FT /note="Myosin tail"
FT /evidence="ECO:0000259|Pfam:PF01576"
FT REGION 72..285
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1152..1174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 354..416
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 72..193
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 202..230
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1152..1170
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1195 AA; 136793 MW; 23F3B80F0774C979 CRC64;
AMAEPRGPVD HGVQIRFITE PEGGAEMDTI RRGVRRPTKD ARANTYGVAV RVQGIAGQPF
VVLNSGEKGS DSFGVQIKGS DNQGAPGTLS SDSELPENPY SQVKGFPASS QGSTSDEEPG
SYWNGRLPRS QSQASLAGPI PMDSSNRSNS LLELAPKETS PDSTIDTAPL SSVDSLINKF
DSQVGGQTRG RTGRRTRTLP HEQRKRSQSL DSRLPQDPLE EQERQSPDHW TPSAKYDNHV
GSLKQPSQSP SPVRGLSHPH PAQDWVIQSF EEPQERARDP TMLQFKSTPD LLRDQQETAP
PGSVDHVKAT IYGILKEGSS ESEASVRRKV SLVLEQMEPL VMASPDSTKT IAGQSEVTRK
VEELQKKLEE EVKKRQKLEP SRVRLERQLE EKAEECDRLQ DLLERKKGDA QQSTKELQNM
KLLLDQGDRL RHGLETQVME LQNKLKQGQG SEPAKEVLLK ELLETRELLE EVLEGKQRVE
EQLRLREREL TALKGALKEE VASRDQEMEQ VRQQYQRDTE QLRRSMQDAT QDHAVLEAER
QKMSALVRGL QRELEETSEE TEHWQTMFQK NKEELRATKQ ELLQLRMEKE EMEEELGEKV
EVLQREVEHA RASTRDNLQL EELKELRWAQ DELKELRAQR QNQEAAGRHR DQELEKQLAV
LRVEADRVRE LEQQNAQLQK TLQQLKQDCE EASKAKVAAE AETAVLGQRR AAVETTLRET
QEENDEFRRR ILGLEQQLKE AHGLAEGGEA VEARLRDRVQ RLEAEKLRLQ EALNEAQEEE
GSLVAAKRAL EVRLEEAQRG LARLGQEQQA LSRALEEEGK QREVLRRSKA ELEEQKRLLD
RTVDRLNKEL EQIGDDSKQA LKQLQAQLED YKEKARREVA DAQRQAKDWA SEAEKTSGGL
SRLQDEIQRL RQALQASQAE RDTALLDKEL LVQRLQGLEQ EAENKKRSQD DRTRQLKNLE
EKVSRLEAEL DEEKNTVELL SDRVNRGRDQ VDQLRAELMQ ERSARQDLEC DKISLERQNK
DLKTRLTNLE GFQKPSASFS QLESQNQMLQ ERLQAEEREK TVLQSTNRKL ERRVKELSIQ
IDDERQHVND QKDQLSLRVK ALKRQVDEAE EEIERLDGLR KKAQRELEEQ HEVNEQLQAR
IKSLEKEAWR KTSRSAAEST LKHEGLSSDE EFDGVYDPSS IASLLTESNL QTSSC
//