ID A0A0D9RSQ0_CHLSB Unreviewed; 3114 AA.
AC A0A0D9RSQ0;
DT 27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT 27-MAY-2015, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSCSAP00000011639.1};
GN Name=CENPF {ECO:0000313|Ensembl:ENSCSAP00000011639.1};
OS Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Chlorocebus.
OX NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000011639.1, ECO:0000313|Proteomes:UP000029965};
RN [1] {ECO:0000313|Ensembl:ENSCSAP00000011639.1, ECO:0000313|Proteomes:UP000029965}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAP00000011639.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AQIB01130106; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01130107; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007986647.1; XM_007988456.1.
DR STRING; 60711.ENSCSAP00000011639; -.
DR Ensembl; ENSCSAT00000013647.1; ENSCSAP00000011639.1; ENSCSAG00000015553.1.
DR GeneID; 103230102; -.
DR KEGG; csab:103230102; -.
DR CTD; 1063; -.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR GeneTree; ENSGT00730000111187; -.
DR OMA; EQPNEQH; -.
DR OrthoDB; 5363462at2759; -.
DR BioGRID-ORCS; 103230102; 0 hits in 9 CRISPR screens.
DR Proteomes; UP000029965; Chromosome 25.
DR Bgee; ENSCSAG00000015553; Expressed in fibroblast and 3 other cell types or tissues.
DR GO; GO:0005930; C:axoneme; IEA:Ensembl.
DR GO; GO:0005813; C:centrosome; IEA:Ensembl.
DR GO; GO:0036064; C:ciliary basal body; IEA:Ensembl.
DR GO; GO:0097539; C:ciliary transition fiber; IEA:Ensembl.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0030496; C:midbody; IEA:Ensembl.
DR GO; GO:0016363; C:nuclear matrix; IEA:Ensembl.
DR GO; GO:0031965; C:nuclear membrane; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0000940; C:outer kinetochore; IEA:Ensembl.
DR GO; GO:0045120; C:pronucleus; IEA:Ensembl.
DR GO; GO:0000922; C:spindle pole; IEA:Ensembl.
DR GO; GO:0140297; F:DNA-binding transcription factor binding; IEA:Ensembl.
DR GO; GO:0070840; F:dynein complex binding; IEA:Ensembl.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR GO; GO:0051310; P:metaphase chromosome alignment; IEA:Ensembl.
DR GO; GO:0000278; P:mitotic cell cycle; IEA:Ensembl.
DR GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:Ensembl.
DR GO; GO:0015031; P:protein transport; IEA:Ensembl.
DR GO; GO:0010389; P:regulation of G2/M transition of mitotic cell cycle; IEA:Ensembl.
DR GO; GO:0016202; P:regulation of striated muscle tissue development; IEA:Ensembl.
DR GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 3.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000029965}.
FT DOMAIN 1..307
FT /note="Centromere protein Cenp-F N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1893..2035
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2131..2270
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2313..2452
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2970..3013
FT /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 2891..2958
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2987..3114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 13..131
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 164..191
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 280..685
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 832..873
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 899..996
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1029..1159
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1196..1244
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1286..1313
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1549..1646
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1790..1845
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1890..2078
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2107..2278
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2324..2891
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 2928..2956
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2999..3013
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3029..3080
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3114 AA; 357742 MW; 0CF157C215AFFDAC CRC64;
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LDSLEAALQK QKQKVENEKT
EGTNLKRENQ RLMEICESLE KTKQKISHEL QVKESQVNFQ EGQLNSGKKQ IEKLEQELKR
CKSELERSQQ AAQSADVSLN PCSTPQKIFT TPLTPSQYYS GSKYEDLKEK YNKEVEERKR
LEAEVKALQA KKASQTLPQT TMNHRDIARH QASSSVFSWQ QEKTPSHLSS NFQKTPIRRD
FSASYFSGEQ EVTPSRSTLQ IGKRDANSSF FDNSSSPHLL EQLKVQNQEL RSKINELELC
LQGQEKEMKG QVNKFQELQL QLEKAKVELI EKEKVLNKCR DELVRTTAQY DQASTKCTAL
EQKLKKLTED LSCQRQNAES ARCSLEQKIK EKEKEFQEEL SRQQRSFQTL DQECIQVKAR
LTQELQQAKN MHNVLQAELD KVTAVKQQLE KNLEEFKQKL CRAEQASQAS QIKEDELRRS
VEEMKKENNL LKSQSEQKAR EVCHLEAELK NVKQCLNQSQ NFAEEMKVKN TSQETMLRDL
QEKINQRENS LTLEKLKLAV AELEKQRDCS QDLLKKREHH IEQLNDKLSK TEKESKALLS
ALELKNKEYE ELKEEKTLFS CWKSENEKLL TQMESEKENL QSKINHLETC LKTQQIKSHE
YNERVRTLEM DRENLSVEIR NLHNVIDSKS VEVETQKLAY VELQQKAEFS DQKHQKEIEN
MCLKTSQLTG QVEDLEHKLQ LLSNEIMDKD RCYQDLHAEY ESLRDLLKSK DASLVTNEDH
QRSLLAFDEQ PAMHKSFANI IEEQGNMLSE RSECHLEADQ SPKNSAILQN RVDSLEFSLE
SQKQMNSDLQ KQCEELVQIK GEIEENLMKA EQMHQSFVAE TSQRISKLQE DTSAHQNVVA
ETLSALENKE KELQLLNDKL ATEQAEIQEL KQSNHLLEDS LKELQLLSET LSLEKKEMSS
IISLNKREFE ELTQENETLK EINASLNQEK MNLIQKSESF ANYIDEREKS ISELSDQYKQ
ENLILLQRCE ETGNAYENLS QKYKAAQEKN SKLECLLNEC TSLCENRKNE LEQLKEAFAK
EHQEFLTKLA FAEERNQNLM LELETVQQDL RSEMTDTQNN SKSETDGLKQ EIMTLKEEQN
KMQKEVNDLL QENEQLMKVM KTKHECQNLE SEPIRNCVKE RESEMNQCNF KPQMDLEVKE
ISLDSYNAQL VQLEAMLRNM ELKLQESEKE KECLQHELQI IRGDLETRNL QDMQSQEISG
LKDCEVDAEE RYISVLHELS TSQNDNAHLE CSLQTAMNKL NELEKICEIL QAEKCELVTE
LNDSRSECIT ATRKMAEEVG KLVNEVKILN DDSGLLHGEL VEDIPGGEFG EQPNEQHPMC
LAPLDESNSC EHLTLSNKEV QMHFAELQEK FSSLQSEHKI LHDQHCQMSS KMSELQTYID
SLKAENLVLS TNLRNFQGDL VKETQPGLEE GLVPSLSSSC VPDSPSLSSL GDSSFYKALL
EQTGEMSLLN NLEGTVSANQ CSVDEVFCSS LLEENLTKKE TPSAPAKGVE ELESLCEAYR
QSLEKLEEKM ESQGIMKNKE IQELERLLSS ERQELDCLRK QYLSENEQWQ QKLTSVTLEM
ESKLAAEKKQ TEQLSLELEV ARLQLQGLDL SSRSLLGIDT EDAIQGRNES CDISKEHTSE
TTERTPKHDV HQICDKDAQQ DLRLDIEKIT ETGAVKLTVE CSGEQYPDTN YETPGKDKTQ
GSSECISELS FSGANASVPM DFLGNQENIQ NLQLRVKETS NENLRLLHVI EERDRKVESL
LNEMKELDSK LHLQEVQLMT KIEACIELEK IVGELKKENS DLSEKLEYFS CDNQELLQRV
ESSEGLNSNL EMHADKSSHE DIEDNVAKVN DSWKERFLDV ENELSRIRSE KANIEHQALS
LEADLEIVQT EKLCLEKDNE NNQKVIACLE EELSVVTSER NQLRGELDTM SKKNMELDQL
SEKMKEKTQE LESHRSEYLH CIQVAEAEVK EKTELLQTLS SDVSELLKDK THLQEKLQSL
EKDSQALSLT KCELENQIAQ LNKEKELLVK ESESLQARLN ESDYEKLNVS KALEAALVEK
GEFALRLSST QEEVHQLRRG IEKLRVRIEA DERKQLHVAE KLKERELEND SLKDKVENLE
RELQMSEENQ ELVILDAENS KAEVETLKTQ IEEMARSLKV FELDLVTLRS EKENLTKQIQ
EKQGQVSELD KLLSSVKSLL EEKEQAEIQI KEESKTAVEM LQNQLKELNE AVAALCGDQE
TMKATEQSLD PSVEEAHPLR NSIEKLRARL ETDEKKQLCV LEQLKESEHH ADLLKSRVEN
LERELEIAGK NQKHAALEAE NSKGEVETLK AKIEGMTQSL RDLELDLATV RSEKENLTNK
LQKEQERISE LKIINSSFEN ILREKEQEKV QMKEKSNTAM EMLQTQLKEL NERVTALHKD
QEACKAKEQN LSSQVDCLEL EKAQLLQGLD EAKDNNIVLQ SSVNGLIQEV EDGKQKLEKK
DEEISRLKNQ IQDQEQLVSK LSQVEGEHQL WKKQNLELGN LTVELEQKIQ VLQSKNTSLQ
DTLEVLQSSY KNLENELELT KMDKISFVEK VNTMTAKEIE LQREMHEMAQ KTAELQEELS
GEKNRLTGEL ELLLEEMKSS KDQLKELTLE NSELKKSLDC MHKEQVEKEG KVREEIAEYQ
LRLHEAEKKH QALLLDTNKQ YEIEIHTYRE KLTSKEKCLS SQKLEMDLLK SSKEELNNSL
KATTQVLEEL KKTKMDNLKY VNQLKKENER AQGKIKLLIK SCKQLEEEKE ILQKELSKLQ
AAQEKQKTGT VMDTKVDELT TEIKELKESL EEKTKEADEY LDKYCSLLIS HEKLEKAKEM
LETQVAHLCS QQSKPDTRGS PLLDPVVPGP SPILSAAEKR LSSGQKKASG KRQRSSGIWE
NGRGPTPSTP ETFSKKSKKA VMSGIHPAED TEGTEFEPEG LPEVVKKGFA DIPTGKTSPY
ILRRTTMATR TSPRLAAQKL ALSPLSLGKE NLAESSKPTA GGSRSQKVKV AQQSPVDSDT
ILREPTTKSL LVNNLPERSP TDSPREGLRV KRGRLAPNPK AGLEPKGSDN CKVQ
//