ID A0A087Y7Z3_POEFO Unreviewed; 2824 AA.
AC A0A087Y7Z3;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 24-JAN-2024, entry version 44.
DE SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSPFOP00000014146.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000014146.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000014146.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01003644; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01003645; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01003646; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01003647; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 48698.ENSPFOP00000014146; -.
DR Ensembl; ENSPFOT00000014166.2; ENSPFOP00000014146.2; ENSPFOG00000014027.2.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR GeneTree; ENSGT00730000111187; -.
DR OMA; EQPNEQH; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR GO; GO:0060271; P:cilium assembly; IEA:Ensembl.
DR GO; GO:0001947; P:heart looping; IEA:Ensembl.
DR GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR Gene3D; 1.10.287.1490; -; 3.
DR Gene3D; 1.20.5.340; -; 1.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 2.
DR Pfam; PF10481; CENP-F_N; 1.
DR SUPFAM; SSF57997; Tropomyosin; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760}.
FT DOMAIN 1..306
FT /note="Centromere protein Cenp-F N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1851..1991
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2089..2234
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2757..2801
FT /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 124..146
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 229..290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 453..476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 558..587
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1451..1484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2216..2241
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2277..2302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2663..2786
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 162..196
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 882..994
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1047..1155
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1500..1534
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1585..1662
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1740..1781
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1855..1914
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 127..146
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 251..289
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 558..581
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1463..1484
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2277..2300
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2680..2703
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2704..2722
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2725..2741
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2755..2770
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2824 AA; 322609 MW; ED18C84CFE2BEEB0 CRC64;
MSWAVEEWKD GLPGKALQKI QEMEAQLDKL KKERTQKQFQ LDSLEAALQK QKQKVDSERT
EISALKRENQ SMIESCDALE KTRQKVVHDL GVKEQQVSYL EGQLNSCKKT IDRLEQELKK
YKTELDRTQP AGSSSLSTSS SDLQTFSTPQ KTFATPAPVA AHRQQDGKLE ELQEKYNHEV
EERRKLESEL KMLQVKLLNQ SSVSHKDIAA RQAGSSIFPW QQQDQVHSLR GQDVMETPLK
RRGASLWATQ EETPIKPSQR MSSSRTVQSP GGSSQQTEQL KSLNQGSELR GRVSELERNL
AAQEKEIRNQ ASKLQELQTH LNQTRKELAE RDRELAKAGH ELSQAADRHQ QLQAKCSLVE
QKLKQVSEEM SCQRHNAESC RRALEQKLKD QERNSQKELA QLQSSHQVLD QQLNQTRTKL
TQEVQQAKKD HNVLQADMEK LRLQKNQMER EIEEQKQKLL RSEQGLQATQ TKEQDLRKKM
EELQKEKSGA TIQLDRSSRL LTQLEEEKRI SEQSLKRTQG QLEDLKAKSE GQAEELKRLQ
SKLEQQTQAA ARELEQMKKT LSDAEAQSDK SQTELQKQKQ ESEKLSNTLT VVEKESEALK
SNLRQSQDEC QAVKLEHQAL LEWKKNKENL INQTEAVQQE LTDKIITLED KASLMHEANN
KLQDQISSME ADKASLSAHI DALKGELLMR STELEEKQHQ YQQLQLHVSE AEQKHGKELE
NAGKQLAQLQ GQVKELESRL QRETSRAELA EKTISELQEE HQAACDLLHS KDQLLELGLA
EVSQLKDSLA QSAAQQEAQS DRLDTDSLET FVMQKAVLLK QCEESVSAQA EETERVKLQA
EEVQQELLLS RHKICKVTKY YSCSPHSVLK YITSMEQILK VQEQLGAELQ KQIKTMSDKE
EELTKTCEEK SEELKRLEEE LKVQRSRTEE TEQQLRAAEA QILSVENQKA ELENRLQEMK
TEAEILQFSH TERINALQEQ ILCLEKQVAA NQEAAEEVPV LKNKLDVVNQ SLAHLNKSLE
ASEKCLSSAN EIKAGLEATW SEKVKLITAL EDQINNLTEK LRKESESHSS EVGDLANKER
SLKEQLEAIK QSAAAAKAES SSRREEIRTM KATLSAASRG LEERDNTIKS LKEKLDKAEA
EQAKASELLK EKTVAMNKIK IKVQLEMLQM DLEDNESAMT SVDSQVEELK GTIASLEAML
EDSKAQVSVL ETRLDEVQSQ NSLLETKYVT AKEELLERSC EITRLEEDAV RRQQLEESVS
ALEAKLSQIT EENSKAETEL RQIIQEKGSR LDQLNEEKND LQASLTLLMD EQKQAETEIT
RVKAGKEELE ERMSDLLKEI QQLHADVGEL SEQKQEAEAT IDQMAAETQQ TESALRRVSE
EKAQLDVALS SVSEEKVELQ ERLSALTSEN RELVCKVQQV EEEKLQLESK FSEISEENAK
LQSEMNRLTA EKQQLQTSAE RLQEQLDTRD ETSHTESRER PLLNRNQQFE VEQAALHQKL
SVLQMELAVL QQQCDLLLQQ VDQQQRIIQQ LNNIKITLKD TNKQIILKFL QQVVVLEEAL
EGEVRRLSQS LESSLLEKGE IASRLNSTQE EVRQMRTGIE KLQVRIESDE RKKKNMGELL
KETRLCVSAA AQRKSDSLQD RIDALEREKD EFEQNLEEAM LQCSLDLVNI LFVTSSHDKN
GLVKDLKNNV KLSDVILNEL EVLLTSMCPS GDESWPVKDV DAEMKQQGEP KTLWSEFDLL
TSELQLRAEL EVQAQKLEEK VQAAEEQLSS ALEEKKSLSD QVGGRNDVTQ LLEERDSLSL
QLETSRCQLT DAMEMLEGLE MAKEFLCNSS KISLYTLNKT ILTSLSCSSG WDEKFLQQES
ELKRVRSEKA NLEQHILGME SELESMQAES CRLTEELETQ RKTYSGRIQQ IEALLSETTQ
LRAELVSCSE DRDELSQSLG HWRDKVHGLE KTNLETRNLI SILEEDIRVG RKEYESLQSG
MERVKAEREQ FLQQVVVLEE AISKHNREKE GLLSHLHEME EDHTSTNQNT ESMAGKIQAL
EGEVRRLSQS LESSLLEKGE IASRLNSTQE EVRQMRTGIE KLQVRIESDE RKKKNMGELL
KAAQRKSDSL QDRIDALERE KDEFEQNLEE AVLQAEAAKA ELEEERSKVE EEKKELNEKL
TELSAALDTL RSEKAHLERE LDMKNVEIEE LKAATDKLEQ GLEKAEVEEL EKWTREEAER
QSVRVGDLQG QLTESERENN DLRTTIESLG EEIKDLKTLA QAKEEEILTL EKEKENKAES
FAAEKKQLEE RNEQLEQERD ELQSALVSVS QEKAKADQEK TELEEKRVKL QSKISLVETE
KQNLQQTVSS LEQEKLRLEA ETEELQSSLL MMEEEKNNLS SALSLLEEEK QQATEEKERL
TQEHEALQKT VASMEEELET RKRSMTQISE QHQAETCPVW CRGTKNVLSF YIFKVLNWSK
MNVLHMQYSL LCIQVSELTS SLARLTKERD SALSKINLWM KACKQLEQEK QNILSSSGVI
HQQLVLSYFK HTKIFALETG QKLLGKILYF SKGGRSSEEL QAEATQLRSE AEVRRKEVEE
LKTALEKKTA EAEQKEGEVK ERAAALEGVR AEKERELEEI RKELDEVNAL LEEKSTEADE
SMEKYCSLMV KVHKLEESND ALKTRLEQLA ASQPANEAKA PPETHRRSAR KSSSRHQEEK
MSENTENVVP TTTASSPQGS SPGKRGHKEI SDRDGAQEAL HNLTKKLKAN TATPRGRGEQ
DDEEFRPEGL PDLVQKGFAD IPLGEASPYI IRRTTGRRCS PWLAARQTQP AVKSDPFVAC
RSDS
//