ID A0A3Q2KYR7_HORSE Unreviewed; 3029 AA.
AC A0A3Q2KYR7;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 07-OCT-2020, entry version 9.
DE SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSECAP00000029918};
GN Name=CENPF {ECO:0000313|Ensembl:ENSECAP00000029918,
GN ECO:0000313|VGNC:VGNC:16394};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000029918, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000029918, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000029918,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000029918}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000029918};
RG Ensembl;
RL Submitted (JAN-2019) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSECAT00000032495; ENSECAP00000029918; ENSECAG00000024127.
DR VGNC; VGNC:16394; CENPF.
DR GeneTree; ENSGT00730000111187; -.
DR Proteomes; UP000002281; Chromosome 5.
DR ExpressionAtlas; A0A3Q2KYR7; baseline.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR GO; GO:0008134; F:transcription factor binding; IEA:InterPro.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874; PTHR18874; 3.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 3.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 1..305
FT /note="CENP-F_N"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1818..1960
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2056..2195
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2238..2377
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2896..2938
FT /note="CENP-F_C_Rb_bdg"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 208..262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 470..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1442..1461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1593..1612
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2816..2996
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 13..131
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 164..191
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 278..333
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 341..404
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 416..464
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 503..613
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 645..700
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 766..793
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 826..916
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 956..1011
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1016..1036
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1045..1079
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1123..1171
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1210..1244
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1263..1283
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1321..1341
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1473..1543
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1548..1570
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1716..1778
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1815..1835
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1885..1919
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1934..2024
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2032..2210
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2221..2241
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2249..2392
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2400..2420
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2435..2472
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2480..2535
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2561..2609
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2614..2634
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2643..2750
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2761..2788
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2796..2816
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 208..260
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 473..491
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1442..1459
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2816..2857
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2864..2879
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2880..2909
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2924..2938
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2961..2996
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3029 AA; 346807 MW; 77A3E23093225A38 CRC64;
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKERQQRQFQ LETLEAALQK QKQKVENEKT
EGTNLKRENQ SLMEICENLE KTKQKIAHDL QVKESQVNFQ EGQLNSSKKQ IEKLEQELKR
CKSELERSQQ TAQSADVSLN PCNTPQKIFA TPLTPSHYYS GSKYEDLKEK YNKEVEERKR
LEAEVKALQA KKVSQAIPQS TMNHRDIARH QASSSVFSWQ QEQTPSRLSS SAQKTPIGRD
FSASHFSGEV TPSRSTLKMG KRDANGSFCE DSSNSYLLDQ LKAQNQELRS KISELELRLQ
GQEKEMKGQV NKFQELQLQL ERAKVELIEK EKVSNKSRDE LVRTTAQYDQ ASTKCTALEQ
KLKKLTEDLS CQRQNADSAK RSLEQKIREK EKEFQEELSR QQRSFHTLDQ ECTQMKAKLT
QELQQAKNMH NILQAELDKG TSAKHQLEKN LEEFKQKFSR TEQAFQASQI KENELRRSSE
EMKKENSLLK SQSEQRAREV CHLEEELKKA KQCLSQSQNL AEEMKAKTTS QETMLRDLQE
KINQQENSLT LEKLKLALAD LEKQRDCSQD LLKKREHHIE QLNDKLSKTE RESEALLTAL
ELKKKECEEL KEEKTLFSRC KSENEQLLKL HQKAEFSDQK HRKEMENMCL KVSQLTGQVE
DLEHKLQLLS SEIMDKDQRY QDLRAECESL RDQLKSKDSS LVTNEAHHRS LLAFEQQSAM
SNSFVNIIGE QESVPSERSE CPLKVDQSPK TSSVLQHRVV SLEFSLESQK QMNSDLQKQC
EELVQIRGEI EENLIKAEQM HQSFVAETSQ RISKLQEDTS VHQNVVAETL VALEDKEREL
QLLNEKLETE QAETQELKKS NHLLQEALKE LQLLSETLSS EKKEMSSVIS LNKREIEELT
QENGTLKEIN AALNQEKINL LQKSESFSNC IDERDRSISE LANQYQQERL ILLQKCEETG
NAFEDLNEKY KAAQEKNSKL ESLLNECTSV CENKKNELEQ LKEAFAREHQ AFVTKLALAE
ERNQNLILEL GAVQQDLRSE ITDIQNNSKS EADGLKQEIV TLKEEQSKMQ QEVNALLQEN
EHLIKLMKTE HEHQNLELEP ARDSVKERES EINRCDLQLP MDLEVEDTSL DSYKAQLAQL
EAKIRNMELK LQESEEEKEC LQRELQTIRG ELQTGSLQQV TQSQEVRGLK DTEEQYISVL
HELSTSQNDN AHLQCSLQTA MNKLNELEKL CEVLRVEKFE LISELNDSRS ECITATSKMA
GEVEKLVNEV KMLNDENDLL QGEFVKEMPE GEFGEQQNEQ KSVAVNPLDD GDFCEPLTLS
NKEVQMHFAE LQEKFSSLQS EHKILHDQHC QMSAKMSQLQ SYVDMLKAEN SVLSTSLRNS
QGDLVKEEMP GPREGRFLSL SFSCVTDSSS ITSLGESSFY KELLEHTGEA SLLNNLEGDV
SANRSPAEEA SSSSLEEEVL TKTDIPPAPA RTIEELETLC QMYRQSLQKL EEKMESQGIM
KNKEIEELEQ LLSCERKELD CLREQYLSEN EQWQQKLTSV TMEMESKLAA EKKQTEQLSL
ELEVARLQLQ GLDLSSRSLL GADVEDVIRD GNDSCDIKES EENTSETRAR TPKDDIHQIG
DKADQQDLSL EVEKITETGA VNLTGEWSRE QSPDTSHETP VEDKTLGCSE CVSELPPSGP
NALVPMDVLE NQVTIQNLQL QVKETSDENL RLLHVIEERD KKVESLLNEL KELDSKLDLQ
KVQLTTKVET CLELEKTVEE LKKEKSDLSE KLESFSCDNQ KLHQRVESLE GVSSNLEVGT
EKSSHEIIED NVANVNNWRE RFFDAENELK RIKSEKSSIE LHALSVEADL ERVQTEKLYL
EKDNENKQNV ITCLEEELSV VTSERNRLHG ELDTLSKENK ELDQASEKMK EKVLELESRQ
GECLQHLDVV GAEVREKTQL LQTLSSEVSE LLEDKGHLQE QLQSLEKDSQ TLSLVKSELE
NQIGQLSEEK ESLVKESESL QTKLNELEHE KLDVAKALEA ALMEKGEVAV RLSSTQEEVH
QLRKGIEKLR VRIEADEKKQ LHALEKLKES ERKNDSLQDK VENLERELQM SEENQELVIL
DAENCRAEAE TLKTQIELMT ESLKVLEGDL TTVRSEKENL MKELQEKQGQ VSELDTLVSS
FKNLLEEKEQ EKIQMKEQSK AAVEMLQTEL KELNEEVAAL CDDQETWKAK EPSPDSAALG
VQQLRTSIEK VKVLLEADEK KQLHTLGELK ESRHHVDLLK DRVENLEREL EISKEKQERV
RLEAENSKAE VVTLKVKIEE MAQSLRDLEL DLGNIRSEKE NLTKELQKEQ GRVSELEVLN
CSFENLLQEK EQEKVEMKEE SKIAVEMLQT QLKELNEKVA ALCNDQEIFK AKEQNLSSQV
DSLEHERSQL LQGLDEAKSN YIILQSSVNG LVQEVEGGKQ KLEKKDEEIS ILKNQLQDQE
QLLSKLSQVE GEQQLCKNQK IELGNLVVEL EQKIQGLQSK NDTLQDTLDM LQNSYKDLEK
ELELAKMEKM SFVEQVNTMT AKETELQREM HEMVRERTEL KEGFTGEKQR LREELNLMSE
EIKSSKGQLK ELMLENSELK KSLDCAHKDQ MEKHEKMREE IAGYHLRLQD AEKKHQALLL
DTNKQYEMEI QTYQEKLTSK EECLSSQKAE MDLLKSSKEE LSNSLKATTE ILEELKKAKM
DNLKHANQLK KENERAQGKI KLLIKSCKQL EAEKEMLQKE LSRLEAAQEK QKTGTVVDAN
VDELMTEMKE LKETLEEKNK EADEYLEKYC SLLISHEKLE KDKEMLETQV ARLSSQQSKL
NLPSSPLLNS VVPGSSPAPS VTEKKLPSGQ NKASGKRQRS SGIRENGEGT TPSTPETFSK
KSRKAVKSGI HPAEDAEYAE FEPEGLPEVV KKGFADIPTG KTSPYVLRRT TMATRTSPRL
AAQKLAPSPL SLDKENHAET SKPTAGGSRS QKVKAAQQSP ADSGSASREP STKSLSSCGL
EYLDFVNWKR ENGLNGDLAE LKVLPPECV
//