ID G5BUR9_HETGA Unreviewed; 2892 AA.
AC G5BUR9;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE SubName: Full=Centromere protein F {ECO:0000313|EMBL:EHB13029.1};
GN ORFNames=GW7_11981 {ECO:0000313|EMBL:EHB13029.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB13029.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB13029.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH171970; EHB13029.1; -; Genomic_DNA.
DR SMR; G5BUR9; -.
DR STRING; 10181.G5BUR9; -.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR InParanoid; G5BUR9; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR Gene3D; 1.10.287.1490; -; 1.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 2.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813}.
FT DOMAIN 1..234
FT /note="Centromere protein Cenp-F N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 2028..2167
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2210..2349
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2752..2795
FT /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 138..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 178..201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2176..2195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2668..2892
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 4..52
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 91..118
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 450..542
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 578..700
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 765..792
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 832..1010
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1051..1085
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1136..1170
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1219..1278
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1316..1385
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1462..1559
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1703..1751
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1808..1835
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1885..1982
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2011..2175
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 181..201
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2668..2694
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2702..2736
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2738..2753
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2781..2795
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2811..2854
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2892 AA; 329377 MW; 9128E9A58010D1DB CRC64;
MEICENLEKS KQKISHELQV KESQVNFQEG QLNSGRKQIE KLEQELKRCK SGLERSHPAA
PPADCSLCAC SSLQIFAAPP TPGQPYSGSK YEDLREKYNK EVEERRRLEA EVKVLQAKRA
SQAVPPTTMN HRDIARHQAS SSVFSWQQER TPNQLPSHPQ KTPSRRDFSA SHIFGEEEVT
PSRSVLQTGK RDTNSSCHDN SCSSPLLDQL KGQNQELRSE VNELQLRLQG QEKEMESQAH
RFQELQLQLE KTKVELIEKD KILKKNRDEL ARTTAQYEQA ATQCTALEQK LKKLTEDLNC
QRQNAESARC ALEQRVKKKE KALQEELSRQ QRAFQTLDQE CMQMKARLTQ ELQQAKNALC
GLEAELHKVT SVKQQLEKNL EELKQKFSRM EQASKDRQVE EEALRRSSQE TKKENGFLRT
QSEQRAREVS HLEEELRKVQ ACLSQSQNFL EEMRAKNTSQ ETMLRDLQEK INQQENSLTL
EKLKLALADL EKQRDCSQDL LRKREHHIEQ LNEKLSKIEK ESKTLLSALE LTKKEYEELN
EEKTQFSHWK SENEKLLYQM EAEKESLQSK INHLETCLKT QQIKSHEYNE RIRTLELERD
NLSVEMRNLC SMVDSKTREA EMQKQAYEEL QQKAELSDQK HKKEIENLCL QTSQLTGQVE
DLEQKLQLLS RELMDKAQLY QDLQAEYENL RDLLKSKDSS LSQGDHQRSC LAFEQQSAMS
TSFVNAMGEQ GSSSSGRSQC LLDADQSPKS SAILHNRVVS LGFSLESQKQ MNSNLQKQCE
ELVHIKGEIE ENLIKAEQMH ESFVAETSQR ICKLQEDTSV HQNVVAESLV ALEHKEKELQ
LWKARLESEQ VEIEELKKSN QLLEDSRKEL QLLSETLGSE KKEMSSVISL SKKEIEELTQ
ENGTLKEINE ILNQEKMNLV QKNEKFSNCI EEQEKSISEL SDQYKQEKLI LLQRCEETRS
AFEDLSEKYK AAQEKNLRLE CLLNECNSVC ESRKNELEQL KETFAKEHQE FLTKLTFAEE
QNQKLILEFE VVQQALKSEI RDLHSSSKSE ADGLRQETMT LKEEQNKMQK EVNDLLQENE
QLTKLTKTIH EGQHLKLEPL RDPVTERENE INTCNFQLQM DLGVRDIAPN NYNAQMVQLE
ALIRNTELKL QESEKDKECL QQELQTLRGE LGAGNVQDCS SQEMSGLMNC EVDAEGKYSA
VLCELSPRQG DSAHSQCSLQ AALSKLHELE KMCEMLQVEK LQLVSELKDS RAECITATSR
MAEEVEKLVN EMKMLNDENS LPQAEPVGEM LASEFDRQQN EQTPVSLNPL DDSNSYEQLT
LSNKEVQTHF AELQEKFSSL QSEHKILHDQ RCQMSSKMSE LRSYVDTLKA ENSALSVNLR
NLQGDLAKGM KPEERILSLS SCVTNSPSET SFGEYSFHKD VLEQTGDTSF LSNLGESVSA
NASLEEETLT MKEAPATLER SVEELKTLCQ GYLQSLKNLQ EKMESQAMMK DKEIQELRQL
LSSERTELDR LRKQYLSQNE QWQQKLTSVT LEMESKLAAE KQQTEHLSLE LEVARLQLQG
LELTSQSWLG ADIDDAIQNH QNGSCDIKES EEYTSEITEK TPKQDSHQIC EEVVQECLSV
ESEETTGSPL VPRGAREVPS ETNYETAVGD QSQGCPEHTY ELSTGGSCAL SPVEFVENQV
SVQTLQLQVK EALNENLRLL HAIEDRDRKV ENLLNEIKEL DSKLDLQELQ LKTKIETCLQ
LEKIVEELQK EKCNLSVKSE SFSCGDQELG LRVETSLRSN LEMGTDDSSQ EVLKENIAKE
DDNWKERFLD VENELNRMKA ENARIEHRAL TMEADLEVAQ TEKLCLEKDN GNSQKVIIHL
EEELSVATSE RNRLHGELDA VTKENKTLAQ MSEVMKEKVQ ALEAEVRDRA AALQTLSLQV
SELSENKRNL QEQLRRLEGD SQTLSSAAQE LESQVRRLDK EKESLARESE SLQAKLSELD
QERLTVSRAL EAALMERDEF AERLGSTQEE VHQLRKGMEK LRVRIEADEK KQLHVLGKLK
GSERDNDVLR DKVEGLQREL QMSEENQELV VLDAENAKAE METLRAQLDE LAESLRGSQV
DLAVLRSEKG DLTKQLQEKQ GQVSELDGLL SSLKSLLEEK EQARAQMQEE SKTAVEMIQA
QLRELTEEVA ALCQDQEPPR AEEQNLDAPG NEVPRLRGSI GRLRARVEAD REKQLHVLQQ
LKESECHADL LKGSVESLER ELELSGRSQE HAVREAKDSR AEVETLKAEI QQIAQNLRDL
EVDLVNTKSE KENLTKGLQE EQARVSELEI LNSSLENLLQ EKEHQNVQIK EESEVAVEML
QAQLKVLDEK MITLCNDQAA CKRKEQSLSS QVDSLELEKA QLLQDLEGAK NNYIILQSSV
NGLMQEVEDG KQELERKDEE LSQLKGQVQD QEQLASQLSQ VEGERQLWQK QKAEMGTLAV
ELEQKARELQ SKNHTLQESL EGLQNSCREL ESELTLTKVE NMSLVEKVNT MIVKEAELQG
EMQNMVEKTT ELKEEFSGEK NRLAAELNVL SEEIKSSKTD HLKYVDQLKK ENERAQGKIK
LLLKSCRQLE EEKKMLQKEL SDLEAAQKQK TGALVDVNVD ELMAEIRELK ETLEEKAKEA
DEYLDKYCSL LISYEKLEKA KEMLETQVTR LSSQHPKHGL SGSSLQSSTT PEPCPAPSVG
EGRSSSGRSK ASGKRQRCSG NWESRGGLTP STPETFPKKS RKVVKSGSIH PAEDEEEFEP
EGLSEVVKKG FADIPTGKTS PYVLRRTTVA TRTSPRLAAL KSALSPLSVG KENLEESSKP
TASGSRSQKV KVAQQSPVGS AEPTVTSVSV RHFSGRSPAD SPGDGPRTRR SPLVPSPDAR
PQPMNSENCR VQ
//