ID H2Q149_PANTR Unreviewed; 3114 AA.
AC H2Q149;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 02-DEC-2020, entry version 53.
DE SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSPTRP00000003323};
GN Name=CENPF {ECO:0000313|Ensembl:ENSPTRP00000003323,
GN ECO:0000313|VGNC:VGNC:490};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000003323, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000003323, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|Ensembl:ENSPTRP00000003323}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (FEB-2012) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04071460; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_001171564.2; XM_001171564.3.
DR STRING; 9598.ENSPTRP00000003323; -.
DR PaxDb; H2Q149; -.
DR Ensembl; ENSPTRT00000003607; ENSPTRP00000003323; ENSPTRG00000001977.
DR GeneID; 457733; -.
DR KEGG; ptr:457733; -.
DR CTD; 1063; -.
DR VGNC; VGNC:490; CENPF.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR GeneTree; ENSGT00730000111187; -.
DR HOGENOM; CLU_000551_0_0_1; -.
DR InParanoid; H2Q149; -.
DR OMA; YNAQLVQ; -.
DR OrthoDB; 205405at2759; -.
DR TreeFam; TF101133; -.
DR Proteomes; UP000002277; Chromosome 1.
DR Bgee; ENSPTRG00000001977; Expressed in testis and 7 other tissues.
DR GO; GO:0005930; C:axoneme; IEA:Ensembl.
DR GO; GO:0005813; C:centrosome; IEA:Ensembl.
DR GO; GO:0036064; C:ciliary basal body; IEA:Ensembl.
DR GO; GO:0097539; C:ciliary transition fiber; IEA:Ensembl.
DR GO; GO:0000940; C:condensed chromosome outer kinetochore; IEA:Ensembl.
DR GO; GO:0030496; C:midbody; IEA:Ensembl.
DR GO; GO:0005635; C:nuclear envelope; IEA:Ensembl.
DR GO; GO:0016363; C:nuclear matrix; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0045120; C:pronucleus; IEA:Ensembl.
DR GO; GO:0000922; C:spindle pole; IEA:Ensembl.
DR GO; GO:0070840; F:dynein complex binding; IEA:Ensembl.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0008022; F:protein C-terminus binding; IEA:Ensembl.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR GO; GO:0008134; F:transcription factor binding; IEA:Ensembl.
DR GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR GO; GO:0051310; P:metaphase plate congression; IEA:Ensembl.
DR GO; GO:0000278; P:mitotic cell cycle; IEA:Ensembl.
DR GO; GO:0045892; P:negative regulation of transcription, DNA-templated; IEA:Ensembl.
DR GO; GO:0015031; P:protein transport; IEA:Ensembl.
DR GO; GO:0010389; P:regulation of G2/M transition of mitotic cell cycle; IEA:Ensembl.
DR GO; GO:0016202; P:regulation of striated muscle tissue development; IEA:Ensembl.
DR GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874; PTHR18874; 2.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 3.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000002277}.
FT DOMAIN 1..307
FT /note="CENP-F_N"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1893..2035
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2131..2270
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2313..2452
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2970..3013
FT /note="CENP-F_C_Rb_bdg"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 211..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1667..1690
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1710..1747
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2891..3000
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3022..3114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1722..1747
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2916..2953
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2962..2984
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3029..3080
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3114 AA; 357717 MW; 615A3A9CB5383EC4 CRC64;
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LDSLEAALQK QKQKVENEKT
EGTNLKRENQ RLMEICESLE KTKQKISHEL QVKESQVNFQ EGQLNSGKKQ IEKLEQELKR
CKSELERSQQ AAQSADVSLN PCNTPQKIFT TPLTPSQYYS GSKYEDLKEK YNKEVEERKR
LEAEVKALQA KKASQTLPQA TMNHRDIARH QASSSVFSWQ QEKTPSHLSS NSQRTPIRRD
FSASYFSGEQ EVTPSRSTLQ IGKRDANSSF FDNSSSPHLL DQLKAQNQEL RNKINELELR
LQGHEKEMKG QVNKFQELQL QLEKAKVELI EKEKVLNKCR DELVRTTAQY DQASTKYTAL
EQKLKKLTED LSCQRQNAES ARCSLEQKIK EKEKEFQEEL SRQQRSFQTL DQECIQMKAR
LTQELQQAKN MHNVLQAELD KLTSVKQQLE NNLEEFKQKL CRAEQAFQAS QIKENELRRS
MEEMKKENNL LKSQSEQKAR EVCHLEAELK NIKQCLNQSQ NFAEEMKAKN TSQETMLRDL
QEKINQQENS LTLEKLKLAV ADLEKQRDCS QDLLKKREHH IEQLNDKLSK TEKESKALLS
ALELKKKEYE ELKEEKTLFS CWKSENEKLL TQMESEKENL QSKINHLETC LKTQQIKSHE
YNERVRTLEM DRENLSVEIR NLHNVLDSKS VEVETQKLAY VELQQKAEFS DQKHQKEIEN
MCLKTSQLTG QVEDLEHKLQ LLSNEIMDKD RCYQDLHAEY ESLRDLLKSK DASLVTNEDH
QRSLLAFDQQ PAMHHSFANI IGEQGSMPSE RSECHLEADQ SPKNSAILQN RVDSLEFSLE
SQKQMNSDLQ KQCEELVQIK GEIEENLMKA EQMHQSFVAE TSQRISKLQE DTSAHQNVVA
ETLSALENKE KELQLLNDKL ETEQAEIQEL KKSNHLLEDS LKELQLLSET LSLEKKEMSS
IISLNKREIE ELTQENGTLK EMNASLNQEK MNLIQKSESF ANYIDEREKS ISELSDQYKQ
EKLILLQRCE ETGNAYEDLS QKYKAAQEKN SKLECLLNEC TSLCENRKNE LEQLKEAFAK
EHQEFLTKLA FAEERNQNLM LELETVQQAL RSEMTDNQNN SKSEPGGLKQ EIMTLKEEQN
KMQKEVNDLL QENEQLMKVM KTKHECQNLE SEPIRNSVKE RESERNQCNF KPQMDLEVKE
ISLDSYNAQL VQLEAMLRSK ELKLQESEKE KECLQHELQT IRGDLESSNL QDMQSQEISG
LKDCEIDAEE KYISGPHELS TSQNDNAHLQ CSLQTTMNKL NELEKICEIL QAEKYELVTE
LNDSRSECIT ATRKMAEEVG KLLNEVKILN DDSGLLHGEL VEDIPGGEFG EQPNEQHPVS
LAPLDESNSY EHLTLSDKEV QMHFAELQEK FSSLQSEHKI LHDQHCQMSS KMSELQTYVD
SLKAENLVLS TNLRNFQGDL VKEMQLGLEE GLVPSLSSSC VPDSSSLSSL GDSSFYRALL
EQTGDMSLLS NLEGTVSANQ CSVDEVFCSS LQEENLTRKE TPSAPAKGVE ELESLCEVYR
QSLEKLEEKM ESQGIMKNKE IQELEQLLSS ERQELDCLRK QYLSENEQWQ QKLTSVTLEM
ESKLAAEKKQ TEQLSLELEV ARLQLQGLDL SSRSLLGIDT EDAIQGRNES CDISKEHTSE
TTERTPKHDV HQICDKDAQQ DLNLDIEKIT ETGAVKPTGE CSGEQSPDTN YEPPGEDKTQ
GSSECISELS FSGPNASVPM DFLGNQENIQ NLQLRVKETS NENLRLLHVI EDRDRKVESL
LNEMKELDSK LHLQEVQLMT KIEACIELEK IVGELKKENS DLSEKLEYFS CDNQELLQRV
ETSEGLNSDL EMHADKSSRE DIEDNVAKVN DSWKERFLDV ENELSRIRSE KANIEHQALY
LEADLEIVQT EKLCLEKDNE NKQKVIVCLE EELSVVTSER NQLRGELDTM SKKTMELDQL
SEKMKEKTRE LESHQSECLH CIQVAEAEVK EKTELLQTLS SDVSELLKDK THLQEKLQSL
EKDSQALSFT KCELENQIAQ LNKEKELLVK ESESLQARLS ESDYEKLNVS KALEAALVEK
GEFALRLSST QEEVHQLRRG IEKLRVRIEA DEKKQLHVAE ELKEREREND SLKDKVENLE
RELQMSEENQ ELVILDAENS KAEVETLKTQ IEEMARSLKV FELDLVTLRS EKENLTKQIQ
EKQGQLSELD KLLSSFKSLL EEKEQAEIQI KEESKTAVEM LQNQLKELNE AVAALCGDQE
IMKATEQSLD PPIEEEHQLR NSIEKLRARL EADEKKQLCV LQQLKESEHH ADLLKGRVEN
LERELEIART NQEHAAHEAE NSKGEVEALK AKIEGMTQSL RDLELDLVTI RSEKENLTNE
LQKEQERISE LEIINASFEN ILQEKEQEKV EMKEKSSTAM EMLQTQLKEL SERVAALHND
QEACKAKEQN LSSQVECLEL EKAQLLQGLD EAKNNYIVLQ SSVNGLIQEV EDGKQKLEKK
DEEISRLKNQ IQDEEQLVSK LSQVEGEHQL WKEQNLELRN LTVELEQKIQ VLQSKNASLQ
DTLEVLQSSY KNLENELELT KMDKMSFVEK VNKMTAKETE LQREMHEMAQ KTAELQEELS
GEKNRLAGEL QLLLEEIKSS KDQLKELTLE NSELKKSLDC MHKDQVEKEG KVREEIAEYQ
LRLHEAEKKH QALLLDTNKQ YEVEIQTYRE KLTSKEECLS SQKLEIDLLK SSKEELNNSL
KATTQILEEL KKTKMDNLKY VNQLKKENER AQGKMKLLIK SCKQLEEEKE ILQKELSQLQ
AAQEKQKTGT VMDTKVDELT TEIKELKETL EEKTKEADEY LDKYCSLLIS HEKLEKAKEM
LETQVAHLCS QQSKQDSRGS PLLDPVVPGP SPIPSVTEKR LSSGQNKASG KRQRSSGIWE
NGRGPTPATP ESFSKKSKKA VMSGIHPAED MEGTESEPEG LPEVVKKGFA DIPTGKTSPY
ILRRTTMATR TSPRLAAQKL ALSPLSLGKE NLAESSKPTA GGSRSQKVKV AQQSPVDSGT
ILREPTTKSV PVNNLPERSP TDSPREGLRV KRGRLVPSPK AGLESKGSEN CKVQ
//