GenomeNet

Database: UniProt
Entry: A0A0D9RSQ0_CHLSB
LinkDB: A0A0D9RSQ0_CHLSB
Original site: A0A0D9RSQ0_CHLSB 
ID   A0A0D9RSQ0_CHLSB        Unreviewed;      3114 AA.
AC   A0A0D9RSQ0;
DT   27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT   27-MAY-2015, sequence version 1.
DT   27-MAR-2024, entry version 42.
DE   SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSCSAP00000011639.1};
GN   Name=CENPF {ECO:0000313|Ensembl:ENSCSAP00000011639.1};
OS   Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC   Cercopithecidae; Cercopithecinae; Chlorocebus.
OX   NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000011639.1, ECO:0000313|Proteomes:UP000029965};
RN   [1] {ECO:0000313|Ensembl:ENSCSAP00000011639.1, ECO:0000313|Proteomes:UP000029965}
RP   NUCLEOTIDE SEQUENCE.
RA   Warren W., Wilson R.K.;
RL   Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAP00000011639.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AQIB01130106; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AQIB01130107; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_007986647.1; XM_007988456.1.
DR   STRING; 60711.ENSCSAP00000011639; -.
DR   Ensembl; ENSCSAT00000013647.1; ENSCSAP00000011639.1; ENSCSAG00000015553.1.
DR   GeneID; 103230102; -.
DR   KEGG; csab:103230102; -.
DR   CTD; 1063; -.
DR   eggNOG; ENOG502QVMD; Eukaryota.
DR   GeneTree; ENSGT00730000111187; -.
DR   OMA; EQPNEQH; -.
DR   OrthoDB; 5363462at2759; -.
DR   BioGRID-ORCS; 103230102; 0 hits in 9 CRISPR screens.
DR   Proteomes; UP000029965; Chromosome 25.
DR   Bgee; ENSCSAG00000015553; Expressed in fibroblast and 3 other cell types or tissues.
DR   GO; GO:0005930; C:axoneme; IEA:Ensembl.
DR   GO; GO:0005813; C:centrosome; IEA:Ensembl.
DR   GO; GO:0036064; C:ciliary basal body; IEA:Ensembl.
DR   GO; GO:0097539; C:ciliary transition fiber; IEA:Ensembl.
DR   GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR   GO; GO:0030496; C:midbody; IEA:Ensembl.
DR   GO; GO:0016363; C:nuclear matrix; IEA:Ensembl.
DR   GO; GO:0031965; C:nuclear membrane; IEA:Ensembl.
DR   GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR   GO; GO:0000940; C:outer kinetochore; IEA:Ensembl.
DR   GO; GO:0045120; C:pronucleus; IEA:Ensembl.
DR   GO; GO:0000922; C:spindle pole; IEA:Ensembl.
DR   GO; GO:0140297; F:DNA-binding transcription factor binding; IEA:Ensembl.
DR   GO; GO:0070840; F:dynein complex binding; IEA:Ensembl.
DR   GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR   GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR   GO; GO:0051310; P:metaphase chromosome alignment; IEA:Ensembl.
DR   GO; GO:0000278; P:mitotic cell cycle; IEA:Ensembl.
DR   GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:Ensembl.
DR   GO; GO:0015031; P:protein transport; IEA:Ensembl.
DR   GO; GO:0010389; P:regulation of G2/M transition of mitotic cell cycle; IEA:Ensembl.
DR   GO; GO:0016202; P:regulation of striated muscle tissue development; IEA:Ensembl.
DR   GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR   InterPro; IPR043513; Cenp-F.
DR   InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR   InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR   InterPro; IPR018463; Centromere_CenpF_N.
DR   PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR   PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR   Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR   Pfam; PF10473; CENP-F_leu_zip; 3.
DR   Pfam; PF10481; CENP-F_N; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000029965}.
FT   DOMAIN          1..307
FT                   /note="Centromere protein Cenp-F N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   DOMAIN          1893..2035
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2131..2270
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2313..2452
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2970..3013
FT                   /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT                   /evidence="ECO:0000259|Pfam:PF10490"
FT   REGION          2891..2958
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2987..3114
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          13..131
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          164..191
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          280..685
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          832..873
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          899..996
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1029..1159
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1196..1244
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1286..1313
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1549..1646
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1790..1845
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1890..2078
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2107..2278
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2324..2891
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        2928..2956
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2999..3013
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3029..3080
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3114 AA;  357742 MW;  0CF157C215AFFDAC CRC64;
     MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LDSLEAALQK QKQKVENEKT
     EGTNLKRENQ RLMEICESLE KTKQKISHEL QVKESQVNFQ EGQLNSGKKQ IEKLEQELKR
     CKSELERSQQ AAQSADVSLN PCSTPQKIFT TPLTPSQYYS GSKYEDLKEK YNKEVEERKR
     LEAEVKALQA KKASQTLPQT TMNHRDIARH QASSSVFSWQ QEKTPSHLSS NFQKTPIRRD
     FSASYFSGEQ EVTPSRSTLQ IGKRDANSSF FDNSSSPHLL EQLKVQNQEL RSKINELELC
     LQGQEKEMKG QVNKFQELQL QLEKAKVELI EKEKVLNKCR DELVRTTAQY DQASTKCTAL
     EQKLKKLTED LSCQRQNAES ARCSLEQKIK EKEKEFQEEL SRQQRSFQTL DQECIQVKAR
     LTQELQQAKN MHNVLQAELD KVTAVKQQLE KNLEEFKQKL CRAEQASQAS QIKEDELRRS
     VEEMKKENNL LKSQSEQKAR EVCHLEAELK NVKQCLNQSQ NFAEEMKVKN TSQETMLRDL
     QEKINQRENS LTLEKLKLAV AELEKQRDCS QDLLKKREHH IEQLNDKLSK TEKESKALLS
     ALELKNKEYE ELKEEKTLFS CWKSENEKLL TQMESEKENL QSKINHLETC LKTQQIKSHE
     YNERVRTLEM DRENLSVEIR NLHNVIDSKS VEVETQKLAY VELQQKAEFS DQKHQKEIEN
     MCLKTSQLTG QVEDLEHKLQ LLSNEIMDKD RCYQDLHAEY ESLRDLLKSK DASLVTNEDH
     QRSLLAFDEQ PAMHKSFANI IEEQGNMLSE RSECHLEADQ SPKNSAILQN RVDSLEFSLE
     SQKQMNSDLQ KQCEELVQIK GEIEENLMKA EQMHQSFVAE TSQRISKLQE DTSAHQNVVA
     ETLSALENKE KELQLLNDKL ATEQAEIQEL KQSNHLLEDS LKELQLLSET LSLEKKEMSS
     IISLNKREFE ELTQENETLK EINASLNQEK MNLIQKSESF ANYIDEREKS ISELSDQYKQ
     ENLILLQRCE ETGNAYENLS QKYKAAQEKN SKLECLLNEC TSLCENRKNE LEQLKEAFAK
     EHQEFLTKLA FAEERNQNLM LELETVQQDL RSEMTDTQNN SKSETDGLKQ EIMTLKEEQN
     KMQKEVNDLL QENEQLMKVM KTKHECQNLE SEPIRNCVKE RESEMNQCNF KPQMDLEVKE
     ISLDSYNAQL VQLEAMLRNM ELKLQESEKE KECLQHELQI IRGDLETRNL QDMQSQEISG
     LKDCEVDAEE RYISVLHELS TSQNDNAHLE CSLQTAMNKL NELEKICEIL QAEKCELVTE
     LNDSRSECIT ATRKMAEEVG KLVNEVKILN DDSGLLHGEL VEDIPGGEFG EQPNEQHPMC
     LAPLDESNSC EHLTLSNKEV QMHFAELQEK FSSLQSEHKI LHDQHCQMSS KMSELQTYID
     SLKAENLVLS TNLRNFQGDL VKETQPGLEE GLVPSLSSSC VPDSPSLSSL GDSSFYKALL
     EQTGEMSLLN NLEGTVSANQ CSVDEVFCSS LLEENLTKKE TPSAPAKGVE ELESLCEAYR
     QSLEKLEEKM ESQGIMKNKE IQELERLLSS ERQELDCLRK QYLSENEQWQ QKLTSVTLEM
     ESKLAAEKKQ TEQLSLELEV ARLQLQGLDL SSRSLLGIDT EDAIQGRNES CDISKEHTSE
     TTERTPKHDV HQICDKDAQQ DLRLDIEKIT ETGAVKLTVE CSGEQYPDTN YETPGKDKTQ
     GSSECISELS FSGANASVPM DFLGNQENIQ NLQLRVKETS NENLRLLHVI EERDRKVESL
     LNEMKELDSK LHLQEVQLMT KIEACIELEK IVGELKKENS DLSEKLEYFS CDNQELLQRV
     ESSEGLNSNL EMHADKSSHE DIEDNVAKVN DSWKERFLDV ENELSRIRSE KANIEHQALS
     LEADLEIVQT EKLCLEKDNE NNQKVIACLE EELSVVTSER NQLRGELDTM SKKNMELDQL
     SEKMKEKTQE LESHRSEYLH CIQVAEAEVK EKTELLQTLS SDVSELLKDK THLQEKLQSL
     EKDSQALSLT KCELENQIAQ LNKEKELLVK ESESLQARLN ESDYEKLNVS KALEAALVEK
     GEFALRLSST QEEVHQLRRG IEKLRVRIEA DERKQLHVAE KLKERELEND SLKDKVENLE
     RELQMSEENQ ELVILDAENS KAEVETLKTQ IEEMARSLKV FELDLVTLRS EKENLTKQIQ
     EKQGQVSELD KLLSSVKSLL EEKEQAEIQI KEESKTAVEM LQNQLKELNE AVAALCGDQE
     TMKATEQSLD PSVEEAHPLR NSIEKLRARL ETDEKKQLCV LEQLKESEHH ADLLKSRVEN
     LERELEIAGK NQKHAALEAE NSKGEVETLK AKIEGMTQSL RDLELDLATV RSEKENLTNK
     LQKEQERISE LKIINSSFEN ILREKEQEKV QMKEKSNTAM EMLQTQLKEL NERVTALHKD
     QEACKAKEQN LSSQVDCLEL EKAQLLQGLD EAKDNNIVLQ SSVNGLIQEV EDGKQKLEKK
     DEEISRLKNQ IQDQEQLVSK LSQVEGEHQL WKKQNLELGN LTVELEQKIQ VLQSKNTSLQ
     DTLEVLQSSY KNLENELELT KMDKISFVEK VNTMTAKEIE LQREMHEMAQ KTAELQEELS
     GEKNRLTGEL ELLLEEMKSS KDQLKELTLE NSELKKSLDC MHKEQVEKEG KVREEIAEYQ
     LRLHEAEKKH QALLLDTNKQ YEIEIHTYRE KLTSKEKCLS SQKLEMDLLK SSKEELNNSL
     KATTQVLEEL KKTKMDNLKY VNQLKKENER AQGKIKLLIK SCKQLEEEKE ILQKELSKLQ
     AAQEKQKTGT VMDTKVDELT TEIKELKESL EEKTKEADEY LDKYCSLLIS HEKLEKAKEM
     LETQVAHLCS QQSKPDTRGS PLLDPVVPGP SPILSAAEKR LSSGQKKASG KRQRSSGIWE
     NGRGPTPSTP ETFSKKSKKA VMSGIHPAED TEGTEFEPEG LPEVVKKGFA DIPTGKTSPY
     ILRRTTMATR TSPRLAAQKL ALSPLSLGKE NLAESSKPTA GGSRSQKVKV AQQSPVDSDT
     ILREPTTKSL LVNNLPERSP TDSPREGLRV KRGRLAPNPK AGLEPKGSDN CKVQ
//
DBGET integrated database retrieval system