GenomeNet

Database: UniProt
Entry: R0K4F5_ANAPL
LinkDB: R0K4F5_ANAPL
Original site: R0K4F5_ANAPL 
ID   R0K4F5_ANAPL            Unreviewed;      2644 AA.
AC   R0K4F5;
DT   26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT   26-JUN-2013, sequence version 1.
DT   24-JAN-2024, entry version 30.
DE   SubName: Full=Centromere protein F {ECO:0000313|EMBL:EOB04936.1};
DE   Flags: Fragment;
GN   ORFNames=Anapl_06836 {ECO:0000313|EMBL:EOB04936.1};
OS   Anas platyrhynchos (Mallard) (Anas boschas).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC   Anatinae; Anas.
OX   NCBI_TaxID=8839 {ECO:0000313|EMBL:EOB04936.1, ECO:0000313|Proteomes:UP000296049};
RN   [1] {ECO:0000313|Proteomes:UP000296049}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23749191; DOI=10.1038/ng.2657;
RA   Huang Y., Li Y., Burt D.W., Chen H., Zhang Y., Qian W., Kim H., Gan S.,
RA   Zhao Y., Li J., Yi K., Feng H., Zhu P., Li B., Liu Q., Fairley S.,
RA   Magor K.E., Du Z., Hu X., Goodman L., Tafer H., Vignal A., Lee T.,
RA   Kim K.W., Sheng Z., An Y., Searle S., Herrero J., Groenen M.A.,
RA   Crooijmans R.P., Faraut T., Cai Q., Webster R.G., Aldridge J.R.,
RA   Warren W.C., Bartschat S., Kehr S., Marz M., Stadler P.F., Smith J.,
RA   Kraus R.H., Zhao Y., Ren L., Fei J., Morisson M., Kaiser P., Griffin D.K.,
RA   Rao M., Pitel F., Wang J., Li N.;
RT   "The duck genome and transcriptome provide insight into an avian influenza
RT   virus reservoir species.";
RL   Nat. Genet. 45:776-783(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KB742734; EOB04936.1; -; Genomic_DNA.
DR   Proteomes; UP000296049; Unassembled WGS sequence.
DR   GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR   GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR   GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR   InterPro; IPR043513; Cenp-F.
DR   InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR   InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR   InterPro; IPR018463; Centromere_CenpF_N.
DR   PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR   PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR   Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR   Pfam; PF10473; CENP-F_leu_zip; 2.
DR   Pfam; PF10481; CENP-F_N; 1.
DR   SUPFAM; SSF57997; Tropomyosin; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000296049}.
FT   DOMAIN          1..298
FT                   /note="Centromere protein Cenp-F N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   DOMAIN          1681..1818
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          1912..2051
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2564..2606
FT                   /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT                   /evidence="ECO:0000259|Pfam:PF10490"
FT   REGION          50..69
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          217..238
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2495..2556
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2621..2644
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          162..189
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          271..450
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          479..623
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          668..878
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          972..1013
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1059..1086
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1320..1379
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1717..2094
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2126..2463
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        50..65
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2495..2514
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2515..2556
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         2644
FT                   /evidence="ECO:0000313|EMBL:EOB04936.1"
SQ   SEQUENCE   2644 AA;  305197 MW;  82A3A27222DCB41A CRC64;
     MSWVVEEWKE GLSPRVLQKI QELESQVDKL KKERQQRQFQ LESLEVALEK QKQKVENEKN
     EAATQKRENQ SLMELCESLD KAKQKISHDL KVKESQINIQ SGQLNDSKKE IERLEQELKR
     YKCELERSQQ ALITGDSSFS GTPQKNLTAP LTPVQSHNDA KFEELEEKYK KEVEERKKLE
     LELRTIQVNK INQPYPQQSS LSHREIAWHQ ASSSVFSWQP EKTPSRNQET PAKRSSTASY
     FPWEKETNSS IISEKKEFDN SFAENCNSSL ITPLRAQNQE LNSSVKDLEK QLQALEKEKK
     SHMNKYQEAE LKLDRMKLEL TQKDKVITKT RDKITQMTMQ LNQATTQVQM MEQKMKRLSE
     ELNCQRQNAE STRQSLEQKI KAKEKEYQEE LACQQRSLQK LDQQSNHVRN KLNQELQQAK
     NDFNSLQAEF DKIMAAKQRL ERDNSDLTQK LCRAEQALLA AQAKESDLTR SFEEVKQEKN
     LLDLKFEKKL QEIHQLEEEL ETVKQSLKQS QNFAEEMKNK NVFQEAELKL LEEKCIKQAS
     SSSVEQLKLA LADMEKQRNS TQDLIEEKEN HIKELNCKIN KMEEESEALQ SLLAFKQREY
     EELKKETNTV SQWNNENDNL LQIKGEIGES NIEQMHESFV TETKEHISNL QADISACQTF
     VGQTSAILEE KDMQLQTLNE RLKNQEAELQ DLKISNKLLE DSVRQLKLMS ETWDSEKKGM
     SSMICSYKKE IEEITQENAT IRDLSRALEQ NQITLQEANE NISNILKEKE EIISEMSRKH
     KEEKQCIEAR TEEITRELKI LQEKYKVVEE ENVDIMSILR EQTVEFEEKK AKLEQKEKLV
     LSENKDILHK LIASEEIKKD LIQELQQLQS EFSDIQHVPS REPDCSRQEI LNVEARLNAM
     QEQQDIVFQG KEQLVKEIET KNELLVCDFS CKRRDCSEHL RKCMEEKDAE LNKHQFKLQL
     LQMDFEDREL SLENCRLELI QLKTALREME TELEESVREK ERLQQELLSV NKLEASYSQR
     TVLGEDCHSL EYSDDDVSQN CGKREMDESY SPVLLSSSLQ LTISKLSELE KIYEKLQNEN
     IALTSGFEDL TSAIPSVFNK VAEEEENIMN SADTNLRAEK TTFPNEVMDP SDNSDLRMHC
     DNKEISFKEC SAGPSSDYED LKLSSKEVKI HFAEVKEKIF SFQNEHIKLY EHHCGMISKI
     SELQSCIEIL KAENAALSTS LSSAHTDYLS GPLSSTQNGT QSKLDETKST ISFSGLCFSE
     VSEVDNSFNS GLCKWTEEIN QLKSSAEINS EGAANVLVEN CHNDTTLDSV KESRSITLST
     SNLEGRIEEL EMLCQTYEKA LKVLEDQLQV QENMKNEEIQ ELKNIILSER KEIDHLKQQN
     LSDREEWRQK LSNLTTEMEW KLAEERKQTE NLSLDFEAAQ LQLQVLDISS HSLLCTDTEN
     NAQQENDSLY HLGSPFWKPF PTGSPEMRNN KPKLIPIEKS PVGDSSVCEN VTGTAEARLV
     EDCSEEFSRE QKCRNTSGKI TSPSHHVSAL SFSNSGIFLG SEDFFENQIN TEALQEEAKQ
     QTPENLKLIC ETDESHENVD LQTEVKKLNS HLHFQNAQLA LESSAFAELE ETTVAKEEDS
     SIKEKLESLS VSDQQLSLEV LSLEKEPEKI KSEVEIYQAK WSNAADTLDD VEMAKGNCHE
     QFLGAENEQR QPKSEKVNIK NHDFFIDNNT EVLQAKYQQL ERDKEIDLKT ISVLQEQLVS
     VTAERNHIGQ ELCVLSDTKE ELDQKYQKLQ EKLKELESTK VDSTEIIRRL ENEVRMQTNL
     LELAKSDINR LSNEKDNLLQ KLGEDAVSSS LEEKLQNQAA DVNKEKELLA REFEAMQNKL
     SASEMENLKL SRSFEGLLIE KGELAARLSS AQKEVDQMRH GIEKLKVKIE SDERQKRRAA
     EKLKEHERKV DFLVDKIERL ERELEMSGEN LEGVIIQMET AKSEAETLTV EMEEMTEKLK
     SHQLQIDVLT SQNECLAKDV KEKQERILEL ESSNLTTAKL IEEKEKEKMQ LKEEFENSML
     LLKSELKDVS EKLELSSQEE AVARAKEQVL INQVALLEQD KTILLQECQE IKNENIKLDH
     TRELLAQEFT DCKQKLDEKV QENCALQQQV KETEELSSQL TRMEQECERW HQEKEALHNL
     VAELKLKAQC FSNNESFPDI LNVLKVSYKG LEEELESTLC EKTALSKKVN ELTESCVELE
     AMLSDTEQKI SKLQEEFTTE KNKLAEQIQL LQELSEKNKT QLHITMSEKN ELTKSLGMVQ
     KELQEKESEM KREIAEYRDR LLQTEKALQD ALTEANRKNE MQIEACQDKM NSLGLFNSSQ
     KLELEQLMSA NEELNNSLKL ANQTLGELLN LQINNGNIIV QLRKQNKLAA SKVQLWMKSC
     KHMEKEKERL QQQLTKCEEM LKKKDLNVSE KEEEIKLKLE ELQESVEEKT REANENLEKY
     SSLIVKYYKL EQVNEMLETQ VTLLSSQLKQ PMRDAVSSPL LSSGNLSTVS NQSDPRDEDS
     AELSSKRQRS EDTWKENGEP RSPMPEPSSK KERKDSICQN LPCQENSDCE PDGLPEVVRK
     GFADIPTGKV TPYILRRTTL NLGTSPRVGS PSEKFLLQTQ DLQKDQNLSE RSCSTPGGSK
     SQKV
//
DBGET integrated database retrieval system