ID A0A2K5W2Q6_MACFA Unreviewed; 1267 AA.
AC A0A2K5W2Q6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Cingulin like 1 {ECO:0000313|Ensembl:ENSMFAP00000031346.2};
GN Name=CGNL1 {ECO:0000313|Ensembl:ENSMFAP00000031346.2};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000031346.2, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000031346.2, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000031346.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A2K5W2Q6; -.
DR Ensembl; ENSMFAT00000005561.2; ENSMFAP00000031346.2; ENSMFAG00000036711.2.
DR VEuPathDB; HostDB:ENSMFAG00000036711; -.
DR GeneTree; ENSGT00940000154489; -.
DR Proteomes; UP000233100; Chromosome 7.
DR Bgee; ENSMFAG00000036711; Expressed in adult mammalian kidney and 13 other cell types or tissues.
DR GO; GO:0005923; C:bicellular tight junction; IEA:UniProtKB-SubCell.
DR GO; GO:0016459; C:myosin complex; IEA:InterPro.
DR InterPro; IPR002928; Myosin_tail.
DR PANTHER; PTHR46349:SF2; CINGULIN-LIKE PROTEIN 1; 1.
DR PANTHER; PTHR46349; CINGULIN-LIKE PROTEIN 1-RELATED; 1.
DR Pfam; PF01576; Myosin_tail_1; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Reference proteome {ECO:0000313|Proteomes:UP000233100}.
FT DOMAIN 1017..1221
FT /note="Myosin tail"
FT /evidence="ECO:0000259|Pfam:PF01576"
FT REGION 91..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 187..206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 242..305
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 363..391
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 869..895
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1229..1267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 363..389
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1267 AA; 144967 MW; 0BA6E82510FA04A1 CRC64;
MELYFGEYQH VQQEYGVHLR LASDDTQKSR SSQNSKAGSY GVSIRVQGID GHPYIVLNNT
ERCLAGTSFS ENGPPFPPAV INNLPLHSSN GSVLKESSEE LQLPENPYAQ PSPVRNLKQS
PLHEGKNGVL DRKDGSMKPS HVLNFQRHPE LLQPYDPEKN ELNLQNHQPP ESNWLKTLTE
EGINNKKPWT CFPKPSNSQP TSPSLDDLAK SGVTAIRLCS SVVIEDPKKQ TSVCVNVQSC
TKERVGEEAP STSGRPLTAQ SPHAHPETKK TRPDVLPFRR QDSAGPVLDG ARSRRSSSSS
TTPTSANSLY RFLLDDQDCA IHADNVNRHE NRRYIPFLPG TGRDIDTGSI PGVDQLIEKF
DQKPGLQRRG RTGKRNRINP DDRKRSRSVD SAFPFGLQGN SEYLTEFSRN LGKSNEHLLR
PSQVCPQRPL SQERRGKQSV GRTFAKLQGA AHGAPCAHSR PPQQNIDGKV LETKGTQEGT
VIRAPSLGAQ SKKEEEVKTA TATLMLQNRA AATSPDSGAK KISVKAFPSA SNTQATPDLL
KGQQELTQQT NEETAKQILY NYLKEGSADN DDATKRKVNL VFEKIQTLKS RAAGSAQGNN
QASNSTSEVK DLLEQKSKLT TEVAELQRQL QLGVENQQNI KEERERMRAN LEELRSQHNK
KVEENSMLQQ RLEESEGELR KNLEELFQVK MEREQHQTEI RDLQDQLSEM HDELDSAKRS
EDREKGALIE ELLQAKQDLQ DLLIAKEEQE DLLRKREREL TALKGALKEE VSSHDQEMDK
LKEQYDAELQ ALRESVEEAT KNVEVLASRS NTSEQDQAGT EMRVKLLQEE NEKLQGRSEE
LEQRVAQLQR QIEDLKGDEA KAKETLKKYE RNLSRTTQEQ KQLSEKLKEE SEQKEQLRRL
KNEMENERWH LGKTIEKLQK EMADIVEASR TSTLELQNQL DEYKEKNRRE LAEMQRQLKE
KTLEAEKSRL TAMKMQDEMR LMEEELRDYQ RAQDEALTKR QLLEQTLKDL EYELEAKSHL
KDDRSRLVKQ MEDKVSQLEM ELEEERNNSD LLSERISRSR EQMEQVRNEL LQERAARQDL
ECDKISLERQ NKDLKSRIIH LEGSYRSSKE GLVVQMEARI AELEDRLESE ERDRASLQLS
NRRLERKVKE LVMQVDDEHL SLTDQKDQLS LRLKAMKRQV EEAEEEIDRL ESSKKKLQRE
LEEQMDMNEH LQGQLNSMKK DLRLKKLPNK VLDDMDDDDD LSTDGGSLYE APLSYTFSKD
STTASQI
//