ID I3JGB8_ORENI Unreviewed; 1202 AA.
AC I3JGB8;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=Cingulin {ECO:0000256|ARBA:ARBA00044075};
GN Name=tuft1 {ECO:0000313|Ensembl:ENSONIP00000007909.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000007909.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000007909.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Probably plays a role in the formation and regulation of the
CC tight junction (TJ) paracellular permeability barrier.
CC {ECO:0000256|ARBA:ARBA00043864}.
CC -!- SIMILARITY: Belongs to the cingulin family.
CC {ECO:0000256|ARBA:ARBA00038467}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; I3JGB8; -.
DR Ensembl; ENSONIT00000007914.2; ENSONIP00000007909.2; ENSONIG00000006277.2.
DR GeneTree; ENSGT00940000154489; -.
DR HOGENOM; CLU_002036_0_0_1; -.
DR InParanoid; I3JGB8; -.
DR OMA; HWREMFQ; -.
DR Proteomes; UP000005207; Linkage group LG11.
DR GO; GO:0016459; C:myosin complex; IEA:InterPro.
DR Gene3D; 1.10.287.1490; -; 1.
DR InterPro; IPR002928; Myosin_tail.
DR PANTHER; PTHR46349:SF4; CINGULIN; 1.
DR PANTHER; PTHR46349; CINGULIN-LIKE PROTEIN 1-RELATED; 1.
DR Pfam; PF01576; Myosin_tail_1; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT DOMAIN 907..1156
FT /note="Myosin tail"
FT /evidence="ECO:0000259|Pfam:PF01576"
FT REGION 23..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 80..282
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 570..630
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 807..832
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 356..567
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 159..174
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 203..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 592..611
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 612..630
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1202 AA; 137735 MW; DD4686CF4D2C218D CRC64;
MTTPSSGQTT PVDYGVQIRF INDYQNGGGQ PAPQPKTKTQ TTSKYGVAVR VQGIAGQPYV
VLKDGQKGDS YGVQLKTEYS GYNSLPRNRG KAEPGTEGGY IGGGEQGGLL RRAQSHGSLL
DRDGGPSNEE YQLSRPPGDG KSGSYGNLDG GIGVRGDREQ GWHVSDRERG TERNIWHGSY
QAGLNRSLGS VRDSPEPPPQ PNELHSNKRQ TSVNKLINRF DSVNAGGQQR GLPPAQQHPG
ATSPLTTANP YTSPPSSTHS SLRRSQGNVT KIPGQPANQW ASLEPQRPNL TEAQVTPDLL
LDQGQSAELS SEEEQAMKTI YNILRQGSSE TDVIVRHKVN LIFQNIQHLK PKQSMREEWM
SEKRKLEEKV VALQTALLEE RKDSVSNSDP ALKAELESCL DENLQLQEML DRKKNELNET
QSELTQLRMD RENAEKRVRD MEDQMAEFQD ELRRENSNKT DLVSSQAQLM EVLQLKKKLE
ETLRQREREL TALKGALKEE VASHDKEIEA MREQYSADMD KLRSSMEQVS QSHAGIEAER
LRVNASVRSL QQQLEDCRDE SSHWMEQFHS TRDELRTTKQ EEPPSHPETQ LLQTRLEKEE
FEEELKELQD KMTSMKQQTP DPNHTQTLSQ ELQQCKADLQ KGKSEVEKMR VEFDKKVMEV
ISLKKSHQNQ EAELKYEIDR LKGQLQKAKE DFTKAQEKNK RLPDPATISE LEQKLNEAQS
EVTQLKKKLS LTEEELDTSK TQLSRAHLDI NSFQDAQQEQ EEATARLKEK ISRLEAQLQS
NATESSEAEL ALHTEVRGLR SELDEAKRKA SRLSQEHREL SQRLEDTEKD KEALKQTISQ
LEDTKRQQER ALEKVNKDYD SLNVSLREET QALRVQLEEQ KERARKEMQE VQRHENDAHS
ELEKSKMNIR KLEEEISRQK KELLLACEER DNHQLDKELL ANRLRHLEGE LEASKNSHNE
KTREIRILED KLKRLELEVE EERSSVDMLT DRVARSRDQI DQLRSELMQE RSSKQDLELD
KNAMERHLKE LRSRVVDMEG QSRSSVGVSQ LESKIQELED RLRSEEREKN AVLATQRRLE
RKLKDLNMTL DEERQTHTEQ RDQLALRVKA LKRQVDEGET ELERLDGMRR KAQRDLEEQM
ELKEALQTRV TALETELKRK TQAAIRPALD SSALSSDDDD SLYDSSTITS ILTETNLQTS
SC
//