ID F6X517_HORSE Unreviewed; 3100 AA.
AC F6X517;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 2.
DT 07-OCT-2020, entry version 60.
DE SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSECAP00000021627};
GN Name=CENPF {ECO:0000313|Ensembl:ENSECAP00000021627,
GN ECO:0000313|VGNC:VGNC:16394};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000021627, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000021627, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000021627,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000021627}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000021627};
RG Ensembl;
RL Submitted (JUL-2011) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR PaxDb; F6X517; -.
DR PRIDE; F6X517; -.
DR Ensembl; ENSECAT00000025966; ENSECAP00000021627; ENSECAG00000024127.
DR VGNC; VGNC:16394; CENPF.
DR GeneTree; ENSGT00730000111187; -.
DR HOGENOM; CLU_000551_0_0_1; -.
DR InParanoid; F6X517; -.
DR OrthoDB; 205405at2759; -.
DR TreeFam; TF101133; -.
DR Proteomes; UP000002281; Chromosome 5.
DR Bgee; ENSECAG00000024127; Expressed in inner cell mass and 16 other tissues.
DR ExpressionAtlas; F6X517; baseline.
DR GO; GO:0097539; C:ciliary transition fiber; IEA:Ensembl.
DR GO; GO:0000940; C:condensed chromosome outer kinetochore; IEA:Ensembl.
DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR GO; GO:0030496; C:midbody; IEA:Ensembl.
DR GO; GO:0005635; C:nuclear envelope; IEA:Ensembl.
DR GO; GO:0016363; C:nuclear matrix; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0000922; C:spindle pole; IEA:Ensembl.
DR GO; GO:0070840; F:dynein complex binding; IEA:Ensembl.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0008022; F:protein C-terminus binding; IEA:Ensembl.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR GO; GO:0008134; F:transcription factor binding; IEA:Ensembl.
DR GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR GO; GO:0051310; P:metaphase plate congression; IEA:Ensembl.
DR GO; GO:0000278; P:mitotic cell cycle; IEA:Ensembl.
DR GO; GO:0045892; P:negative regulation of transcription, DNA-templated; IEA:Ensembl.
DR GO; GO:0015031; P:protein transport; IEA:Ensembl.
DR GO; GO:0010389; P:regulation of G2/M transition of mitotic cell cycle; IEA:Ensembl.
DR GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874; PTHR18874; 2.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 3.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 1..305
FT /note="CENP-F_N"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1889..2031
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2127..2266
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2309..2448
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2967..3009
FT /note="CENP-F_C_Rb_bdg"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 208..262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 470..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1513..1532
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1664..1683
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2887..3067
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 13..131
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 164..191
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 278..333
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 341..404
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 416..464
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 503..613
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 621..697
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 716..771
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 837..864
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 897..987
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1027..1082
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1087..1107
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1116..1150
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1194..1242
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1281..1315
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1334..1354
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1392..1412
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1544..1614
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1619..1641
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1787..1849
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1886..1906
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1956..1990
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2005..2095
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2103..2281
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2292..2312
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2320..2463
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2471..2491
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2506..2543
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2551..2606
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2632..2680
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2685..2705
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2714..2821
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2832..2859
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2867..2887
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 208..260
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 473..491
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1513..1530
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2887..2928
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2935..2950
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2951..2980
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2995..3009
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3032..3067
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3100 AA; 355254 MW; 15180A479B35AF86 CRC64;
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKERQQRQFQ LETLEAALQK QKQKVENEKT
EGTNLKRENQ SLMEICENLE KTKQKIAHDL QVKESQVNFQ EGQLNSSKKQ IEKLEQELKR
CKSELERSQQ TAQSADVSLN PCNTPQKIFA TPLTPSHYYS GSKYEDLKEK YNKEVEERKR
LEAEVKALQA KKVSQAIPQS TMNHRDIARH QASSSVFSWQ QEQTPSRLSS SAQKTPIGRD
FSASHFSGEV TPSRSTLKMG KRDANGSFCE DSSNSYLLDQ LKAQNQELRS KISELELRLQ
GQEKEMKGQV NKFQELQLQL ERAKVELIEK EKVSNKSRDE LVRTTAQYDQ ASTKCTALEQ
KLKKLTEDLS CQRQNADSAK RSLEQKIREK EKEFQEELSR QQRSFHTLDQ ECTQMKAKLT
QELQQAKNMH NILQAELDKG TSAKHQLEKN LEEFKQKFSR TEQAFQASQI KENELRRSSE
EMKKENSLLK SQSEQRAREV CHLEEELKKA KQCLSQSQNL AEEMKAKTTS QETMLRDLQE
KINQQENSLT LEKLKLALAD LEKQRDCSQD LLKKREHHIE QLNDKLSKTE RESEALLTAL
ELKKKECEEL KEEKTLFSRC KSENEQLLSQ MKSEKESLQS KVNHLETCLK TQQIKSHEYN
ERVRTLEMER ENLNVEIRNL RNTIDSKTVE VETQKQAYVE LHQKAEFSDQ KHRKEMENMC
LKVSQLTGQV EDLEHKLQLL SSEIMDKDQR YQDLRAECES LRDQLKSKDS SLVTNEAHHR
SLLAFEQQSA MSNSFVNIIG EQESVPSERS ECPLKVDQSP KTSSVLQHRV VSLEFSLESQ
KQMNSDLQKQ CEELVQIRGE IEENLIKAEQ MHQSFVAETS QRISKLQEDT SVHQNVVAET
LVALEDKERE LQLLNEKLET EQAETQELKK SNHLLQEALK ELQLLSETLS SEKKEMSSVI
SLNKREIEEL TQENGTLKEI NAALNQEKIN LLQKSESFSN CIDERDRSIS ELANQYQQER
LILLQKCEET GNAFEDLNEK YKAAQEKNSK LESLLNECTS VCENKKNELE QLKEAFAREH
QAFVTKLALA EERNQNLILE LGAVQQDLRS EITDIQNNSK SEADGLKQEI VTLKEEQSKM
QQEVNALLQE NEHLIKLMKT EHEHQNLELE PARDSVKERE SEINRCDLQL PMDLEVEDTS
LDSYKAQLAQ LEAKIRNMEL KLQESEEEKE CLQRELQTIR GELQTGSLQQ VTQSQEVRGL
KDTEEQYISV LHELSTSQND NAHLQCSLQT AMNKLNELEK LCEVLRVEKF ELISELNDSR
SECITATSKM AGEVEKLVNE VKMLNDENDL LQGEFVKEMP EGEFGEQQNE QKSVAVNPLD
DGDFCEPLTL SNKEVQMHFA ELQEKFSSLQ SEHKILHDQH CQMSAKMSQL QSYVDMLKAE
NSVLSTSLRN SQGDLVKEEM PGPREGRFLS LSFSCVTDSS SITSLGESSF YKELLEHTGE
ASLLNNLEGD VSANRSPAEE ASSSSLEEEV LTKTDIPPAP ARTIEELETL CQMYRQSLQK
LEEKMESQGI MKNKEIEELE QLLSCERKEL DCLREQYLSE NEQWQQKLTS VTMEMESKLA
AEKKQTEQLS LELEVARLQL QGLDLSSRSL LGADVEDVIR DGNDSCDIKE SEENTSETRA
RTPKDDIHQI GDKADQQDLS LEVEKITETG AVNLTGEWSR EQSPDTSHET PVEDKTLGCS
ECVSELPPSG PNALVPMDVL ENQVTIQNLQ LQVKETSDEN LRLLHVIEER DKKVESLLNE
LKELDSKLDL QKVQLTTKVE TCLELEKTVE ELKKEKSDLS EKLESFSCDN QKLHQRVESL
EGVSSNLEVG TEKSSHEIIE DNVANVNNWR ERFFDAENEL KRIKSEKSSI ELHALSVEAD
LERVQTEKLY LEKDNENKQN VITCLEEELS VVTSERNRLH GELDTLSKEN KELDQASEKM
KEKVLELESR QGECLQHLDV VGAEVREKTQ LLQTLSSEVS ELLEDKGHLQ EQLQSLEKDS
QTLSLVKSEL ENQIGQLSEE KESLVKESES LQTKLNELEH EKLDVAKALE AALMEKGEVA
VRLSSTQEEV HQLRKGIEKL RVRIEADEKK QLHALEKLKE SERKNDSLQD KVENLERELQ
MSEENQELVI LDAENCRAEA ETLKTQIELM TESLKVLEGD LTTVRSEKEN LMKELQEKQG
QVSELDTLVS SFKNLLEEKE QEKIQMKEQS KAAVEMLQTE LKELNEEVAA LCDDQETWKA
KEPSPDSAAL GVQQLRTSIE KVKVLLEADE KKQLHTLGEL KESRHHVDLL KDRVENLERE
LEISKEKQER VRLEAENSKA EVVTLKVKIE EMAQSLRDLE LDLGNIRSEK ENLTKELQKE
QGRVSELEVL NCSFENLLQE KEQEKVEMKE ESKIAVEMLQ TQLKELNEKV AALCNDQEIF
KAKEQNLSSQ VDSLEHERSQ LLQGLDEAKS NYIILQSSVN GLVQEVEGGK QKLEKKDEEI
SILKNQLQDQ EQLLSKLSQV EGEQQLCKNQ KIELGNLVVE LEQKIQGLQS KNDTLQDTLD
MLQNSYKDLE KELELAKMEK MSFVEQVNTM TAKETELQRE MHEMVRERTE LKEGFTGEKQ
RLREELNLMS EEIKSSKGQL KELMLENSEL KKSLDCAHKD QMEKHEKMRE EIAGYHLRLQ
DAEKKHQALL LDTNKQYEME IQTYQEKLTS KEECLSSQKA EMDLLKSSKE ELSNSLKATT
EILEELKKAK MDNLKHANQL KKENERAQGK IKLLIKSCKQ LEAEKEMLQK ELSRLEAAQE
KQKTGTVVDA NVDELMTEMK ELKETLEEKN KEADEYLEKY CSLLISHEKL EKDKEMLETQ
VARLSSQQSK LNLPSSPLLN SVVPGSSPAP SVTEKKLPSG QNKASGKRQR SSGIRENGEG
TTPSTPETFS KKSRKAVKSG IHPAEDAEYA EFEPEGLPEV VKKGFADIPT GKTSPYVLRR
TTMATRTSPR LAAQKLAPSP LSLDKENHAE TSKPTAGGSR SQKVKAAQQS PADSGSASRE
PSTKSLSSCG LEYLDFVNWK RENGLNGDLA ELKVLPPECV
//