ID A0A151M598_ALLMI Unreviewed; 1295 AA.
AC A0A151M598;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Cingulin {ECO:0000256|ARBA:ARBA00044075};
GN Name=CGN {ECO:0000313|EMBL:KYO19694.1};
GN ORFNames=Y1Q_0012347 {ECO:0000313|EMBL:KYO19694.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO19694.1};
RN [1] {ECO:0000313|EMBL:KYO19694.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO19694.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- FUNCTION: Probably plays a role in the formation and regulation of the
CC tight junction (TJ) paracellular permeability barrier.
CC {ECO:0000256|ARBA:ARBA00043864}.
CC -!- SIMILARITY: Belongs to the cingulin family.
CC {ECO:0000256|ARBA:ARBA00038467}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO19694.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03006582; KYO19694.1; -; Genomic_DNA.
DR STRING; 8496.A0A151M598; -.
DR eggNOG; ENOG502R9EI; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0016459; C:myosin complex; IEA:InterPro.
DR Gene3D; 1.10.287.1490; -; 1.
DR InterPro; IPR002928; Myosin_tail.
DR PANTHER; PTHR46349:SF4; CINGULIN; 1.
DR PANTHER; PTHR46349; CINGULIN-LIKE PROTEIN 1-RELATED; 1.
DR Pfam; PF01576; Myosin_tail_1; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 1040..1245
FT /note="Myosin tail"
FT /evidence="ECO:0000259|Pfam:PF01576"
FT REGION 1..28
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 40..59
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 97..421
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1253..1276
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 97..127
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..221
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..329
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..417
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1253..1269
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1295 AA; 147564 MW; 3E1C75692691F41F CRC64;
MHQESGALQT PRMDRHANGA MAEKQNPVDY GVQIRFIDDL TEPKKPHKMR SKAAKPSSYG
VAVRVQGIDG QPFVVLNSGE RGSDSFGVQI KSESAYSSLP AAQPSPASSP SRYPKEASGS
LSSDSDLPEN PYAARRARYS TSDEEQSSRP RGKGPPASPA RSPSPQGGRR PLLPKKPFSD
ELRRTQSHSS LLGPESGEPF GPSRPSTQTN STAQPRQLGK AKSGSMVNIA VPKGPEASEK
ASSAFGRRPV DKQPPSDPES SRPGPPAATG DIDTKPLSSV DSLISKFDGK VQQRGRTARR
SRILPDERKR SQSLDGRVSY HDTADSRELT TTKHGPGFQK GLKPPAVAMH ASSLPRTRRE
DLKTSMASRS QPTRDWANNG PEEPLVEQQQ SQVQSELQLK STPDLLKDQQ ETSPAGSSEH
MKELIYAILK DGSKDSEVML KRKANLLLEK VQELAVPPED AGAPSPQQAE LARRVEELQH
KLDEEMKLRQ KLELTRESPR SSAVQNLELQ LARVEEECQH LQGALEKKSQ ELQSSLRELS
EVKRAREQAE ARLGDCEEQL MGMQEELDRL RQGLGDVPEG DALLKDLLET REELEEVLSA
KQRQEEHLRL RERELTALKG ALKEEVASHD KELDQLRQQY QHDVDQLRRS MEDVSQDQAD
LETERQKINS IVRNLQRELA ESTEETGHWR DMFQKNKEEL RSTKQELMQV KLEREEFEEE
LRELRERFSA VRLQADQARS NTVGTGELEA LKKELRQTQE ERRELTAEKQ AQDELLRQRE
RELLALKVTL QETAGSHSRE LEQLREQFQR SLQQLQKDSD EAVKGKVVLE SEREVAEQTR
RAVEATLRET QEHNDDLRRK VLGLEAQLKD YQHVTEDWEG VEARLKDKIA KLEAERRQME
ETLGEASDQE QELLLVKRAL ENRLDEAQRS LSRLSLEHQE LSSSYQEELR QKEQLKRLKM
EMEEQKRLLD RTIEKLTKEL EQMAEESHHS LALLQSQLED YKEKSRKELG DSQKQAKDRG
AELEKAQFSL ARLQDEVTRL KQALQDSQAE RESAVLDREV LAQRLQHLEQ QVEVKKRSQD
DRSRQVKALE IDQLRAELLQ ERSSRQDLEC DKISLERQNK DLKNRLASSE GLQKPSASLT
QLESRVQELQ DKLQAEEREK NVLVSSNRKL ERKVKELTIQ IDDERQHVND QKDQLSLRVK
ALKRQVDEAE EEIERLEGAR KKAQWELEEQ HELNEQLQNR VKTLEKEAWR KAARSAAEAS
LKDDQLSSDE EFDSAYGPSS IASLLTEANL QTSSC
//