ID R0K4F5_ANAPL Unreviewed; 2644 AA.
AC R0K4F5;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE SubName: Full=Centromere protein F {ECO:0000313|EMBL:EOB04936.1};
DE Flags: Fragment;
GN ORFNames=Anapl_06836 {ECO:0000313|EMBL:EOB04936.1};
OS Anas platyrhynchos (Mallard) (Anas boschas).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Anas.
OX NCBI_TaxID=8839 {ECO:0000313|EMBL:EOB04936.1, ECO:0000313|Proteomes:UP000296049};
RN [1] {ECO:0000313|Proteomes:UP000296049}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23749191; DOI=10.1038/ng.2657;
RA Huang Y., Li Y., Burt D.W., Chen H., Zhang Y., Qian W., Kim H., Gan S.,
RA Zhao Y., Li J., Yi K., Feng H., Zhu P., Li B., Liu Q., Fairley S.,
RA Magor K.E., Du Z., Hu X., Goodman L., Tafer H., Vignal A., Lee T.,
RA Kim K.W., Sheng Z., An Y., Searle S., Herrero J., Groenen M.A.,
RA Crooijmans R.P., Faraut T., Cai Q., Webster R.G., Aldridge J.R.,
RA Warren W.C., Bartschat S., Kehr S., Marz M., Stadler P.F., Smith J.,
RA Kraus R.H., Zhao Y., Ren L., Fei J., Morisson M., Kaiser P., Griffin D.K.,
RA Rao M., Pitel F., Wang J., Li N.;
RT "The duck genome and transcriptome provide insight into an avian influenza
RT virus reservoir species.";
RL Nat. Genet. 45:776-783(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB742734; EOB04936.1; -; Genomic_DNA.
DR Proteomes; UP000296049; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 2.
DR Pfam; PF10481; CENP-F_N; 1.
DR SUPFAM; SSF57997; Tropomyosin; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000296049}.
FT DOMAIN 1..298
FT /note="Centromere protein Cenp-F N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1681..1818
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 1912..2051
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2564..2606
FT /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 50..69
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 217..238
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2495..2556
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2621..2644
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 162..189
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 271..450
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 479..623
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 668..878
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 972..1013
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1059..1086
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1320..1379
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1717..2094
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2126..2463
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 50..65
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2495..2514
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2515..2556
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 2644
FT /evidence="ECO:0000313|EMBL:EOB04936.1"
SQ SEQUENCE 2644 AA; 305197 MW; 82A3A27222DCB41A CRC64;
MSWVVEEWKE GLSPRVLQKI QELESQVDKL KKERQQRQFQ LESLEVALEK QKQKVENEKN
EAATQKRENQ SLMELCESLD KAKQKISHDL KVKESQINIQ SGQLNDSKKE IERLEQELKR
YKCELERSQQ ALITGDSSFS GTPQKNLTAP LTPVQSHNDA KFEELEEKYK KEVEERKKLE
LELRTIQVNK INQPYPQQSS LSHREIAWHQ ASSSVFSWQP EKTPSRNQET PAKRSSTASY
FPWEKETNSS IISEKKEFDN SFAENCNSSL ITPLRAQNQE LNSSVKDLEK QLQALEKEKK
SHMNKYQEAE LKLDRMKLEL TQKDKVITKT RDKITQMTMQ LNQATTQVQM MEQKMKRLSE
ELNCQRQNAE STRQSLEQKI KAKEKEYQEE LACQQRSLQK LDQQSNHVRN KLNQELQQAK
NDFNSLQAEF DKIMAAKQRL ERDNSDLTQK LCRAEQALLA AQAKESDLTR SFEEVKQEKN
LLDLKFEKKL QEIHQLEEEL ETVKQSLKQS QNFAEEMKNK NVFQEAELKL LEEKCIKQAS
SSSVEQLKLA LADMEKQRNS TQDLIEEKEN HIKELNCKIN KMEEESEALQ SLLAFKQREY
EELKKETNTV SQWNNENDNL LQIKGEIGES NIEQMHESFV TETKEHISNL QADISACQTF
VGQTSAILEE KDMQLQTLNE RLKNQEAELQ DLKISNKLLE DSVRQLKLMS ETWDSEKKGM
SSMICSYKKE IEEITQENAT IRDLSRALEQ NQITLQEANE NISNILKEKE EIISEMSRKH
KEEKQCIEAR TEEITRELKI LQEKYKVVEE ENVDIMSILR EQTVEFEEKK AKLEQKEKLV
LSENKDILHK LIASEEIKKD LIQELQQLQS EFSDIQHVPS REPDCSRQEI LNVEARLNAM
QEQQDIVFQG KEQLVKEIET KNELLVCDFS CKRRDCSEHL RKCMEEKDAE LNKHQFKLQL
LQMDFEDREL SLENCRLELI QLKTALREME TELEESVREK ERLQQELLSV NKLEASYSQR
TVLGEDCHSL EYSDDDVSQN CGKREMDESY SPVLLSSSLQ LTISKLSELE KIYEKLQNEN
IALTSGFEDL TSAIPSVFNK VAEEEENIMN SADTNLRAEK TTFPNEVMDP SDNSDLRMHC
DNKEISFKEC SAGPSSDYED LKLSSKEVKI HFAEVKEKIF SFQNEHIKLY EHHCGMISKI
SELQSCIEIL KAENAALSTS LSSAHTDYLS GPLSSTQNGT QSKLDETKST ISFSGLCFSE
VSEVDNSFNS GLCKWTEEIN QLKSSAEINS EGAANVLVEN CHNDTTLDSV KESRSITLST
SNLEGRIEEL EMLCQTYEKA LKVLEDQLQV QENMKNEEIQ ELKNIILSER KEIDHLKQQN
LSDREEWRQK LSNLTTEMEW KLAEERKQTE NLSLDFEAAQ LQLQVLDISS HSLLCTDTEN
NAQQENDSLY HLGSPFWKPF PTGSPEMRNN KPKLIPIEKS PVGDSSVCEN VTGTAEARLV
EDCSEEFSRE QKCRNTSGKI TSPSHHVSAL SFSNSGIFLG SEDFFENQIN TEALQEEAKQ
QTPENLKLIC ETDESHENVD LQTEVKKLNS HLHFQNAQLA LESSAFAELE ETTVAKEEDS
SIKEKLESLS VSDQQLSLEV LSLEKEPEKI KSEVEIYQAK WSNAADTLDD VEMAKGNCHE
QFLGAENEQR QPKSEKVNIK NHDFFIDNNT EVLQAKYQQL ERDKEIDLKT ISVLQEQLVS
VTAERNHIGQ ELCVLSDTKE ELDQKYQKLQ EKLKELESTK VDSTEIIRRL ENEVRMQTNL
LELAKSDINR LSNEKDNLLQ KLGEDAVSSS LEEKLQNQAA DVNKEKELLA REFEAMQNKL
SASEMENLKL SRSFEGLLIE KGELAARLSS AQKEVDQMRH GIEKLKVKIE SDERQKRRAA
EKLKEHERKV DFLVDKIERL ERELEMSGEN LEGVIIQMET AKSEAETLTV EMEEMTEKLK
SHQLQIDVLT SQNECLAKDV KEKQERILEL ESSNLTTAKL IEEKEKEKMQ LKEEFENSML
LLKSELKDVS EKLELSSQEE AVARAKEQVL INQVALLEQD KTILLQECQE IKNENIKLDH
TRELLAQEFT DCKQKLDEKV QENCALQQQV KETEELSSQL TRMEQECERW HQEKEALHNL
VAELKLKAQC FSNNESFPDI LNVLKVSYKG LEEELESTLC EKTALSKKVN ELTESCVELE
AMLSDTEQKI SKLQEEFTTE KNKLAEQIQL LQELSEKNKT QLHITMSEKN ELTKSLGMVQ
KELQEKESEM KREIAEYRDR LLQTEKALQD ALTEANRKNE MQIEACQDKM NSLGLFNSSQ
KLELEQLMSA NEELNNSLKL ANQTLGELLN LQINNGNIIV QLRKQNKLAA SKVQLWMKSC
KHMEKEKERL QQQLTKCEEM LKKKDLNVSE KEEEIKLKLE ELQESVEEKT REANENLEKY
SSLIVKYYKL EQVNEMLETQ VTLLSSQLKQ PMRDAVSSPL LSSGNLSTVS NQSDPRDEDS
AELSSKRQRS EDTWKENGEP RSPMPEPSSK KERKDSICQN LPCQENSDCE PDGLPEVVRK
GFADIPTGKV TPYILRRTTL NLGTSPRVGS PSEKFLLQTQ DLQKDQNLSE RSCSTPGGSK
SQKV
//