Database: UniProt
Entry: M3XII8_LATCH
ID   M3XII8_LATCH            Unreviewed;      3070 AA.
AC   M3XII8;
DT   01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT   01-MAY-2013, sequence version 1.
DT   18-JUN-2025, entry version 69.
DE   SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSLACP00000022544.1};
GN   Name=CENPF {ECO:0000313|Ensembl:ENSLACP00000022544.1};
OS   Latimeria chalumnae (Coelacanth).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Coelacanthiformes; Coelacanthidae; Latimeria.
OX   NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000022544.1, ECO:0000313|Proteomes:UP000008672};
RN   [1] {ECO:0000313|Proteomes:UP000008672}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wild caught {ECO:0000313|Proteomes:UP000008672};
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E., Lindblad-Toh K.;
RT   "The draft genome of Latimeria chalumnae.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLACP00000022544.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (MAR-2025) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AFYH01094288; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094289; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094290; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094291; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094292; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094293; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094294; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AFYH01094295; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_014345462.1; XM_014489976.2.
DR   FunCoup; M3XII8; 952.
DR   STRING; 7897.ENSLACP00000022544; -.
DR   Ensembl; ENSLACT00000025563.1; ENSLACP00000022544.1; ENSLACG00000007875.2.
DR   GeneID; 102353984; -.
DR   KEGG; lcm:102353984; -.
DR   CTD; 1063; -.
DR   eggNOG; ENOG502QVMD; Eukaryota.
DR   GeneTree; ENSGT00730000111187; -.
DR   InParanoid; M3XII8; -.
DR   OMA; EQPNEQH; -.
DR   OrthoDB; 10255522at2759; -.
DR   Proteomes; UP000008672; Unassembled WGS sequence.
DR   Bgee; ENSLACG00000007875; Expressed in pelvic fin and 1 other cell type or tissue.
DR   GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR   GO; GO:0005634; C:nucleus; IEA:TreeGrafter.
DR   GO; GO:0000922; C:spindle pole; IEA:TreeGrafter.
DR   GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR   GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR   GO; GO:0060271; P:cilium assembly; IEA:Ensembl.
DR   GO; GO:0001947; P:heart looping; IEA:Ensembl.
DR   GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR   GO; GO:0051310; P:metaphase chromosome alignment; IEA:TreeGrafter.
DR   GO; GO:0000278; P:mitotic cell cycle; IEA:TreeGrafter.
DR   GO; GO:0010389; P:regulation of G2/M transition of mitotic cell cycle; IEA:TreeGrafter.
DR   GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR   Gene3D; 1.10.287.1490; -; 2.
DR   InterPro; IPR043513; Cenp-F.
DR   InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR   InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR   InterPro; IPR018463; Centromere_CenpF_N.
DR   PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR   PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR   Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR   Pfam; PF10473; CENP-F_leu_zip; 2.
DR   Pfam; PF10481; CENP-F_N; 1.
DR   SUPFAM; SSF57997; Tropomyosin; 2.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008672}.
FT   DOMAIN          1..304
FT                   /note="Centromere protein Cenp-F N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   DOMAIN          2017..2158
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2253..2392
FT                   /note="Centromere protein Cenp-F leucine-rich repeat-
FT                   containing"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2922..2964
FT                   /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT                   /evidence="ECO:0000259|Pfam:PF10490"
FT   REGION          205..278
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1813..1834
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2778..2803
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2837..2891
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3012..3070
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          20..131
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          162..189
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          653..740
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          780..1313
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1341..1438
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1595..1728
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1965..2167
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        206..241
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        263..274
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1813..1822
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2790..2803
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2837..2870
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3017..3032
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3060..3070
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3070 AA;  355088 MW;  85C0A931070117D1 CRC64;
     MSWAIEEWKE GLPSKALQKI QDFESQLEKL KKERQQKQFQ LESLEAAFQK QKQKVENEKS
     EVTALKRENQ SLVESCDNLE KSKQKISHDL QVKELQVNFL EGQLATCKKQ IEKLEQEMKR
     SKCEIERSQQ PLLLGDLQPC ATPEKNFAVP VAPSRNYNDS KVEELQEKYN KEVEERKRLE
     AELKIMQAKV VNQSQGNVNR RDIARQQASS SVFPWQQEQT PSHAASNSLE TPSRRGCTTS
     HFPWEREETP SMYCQRSAKK TASNSSFNES SSNSPQNDLL KVQNQELNSK VTELELRLQA
     QEKEMKTCAN KLQEVQAHFE KAKLELSEKD KALNKYRDEV TKMTTQLDQS SSKCEVVEQK
     LKQVSEELIC QRQNTDSARH TMEQRLKEKE KEYQQELCQQ LHSFQILDQQ FKQMKTELQQ
     AKNDRNTLQA EIDKLSTMKQ RAEKEVEDMK QTLFRTEQTL QAKEKDFKKT VEEVQKEKNN
     LHCQFEQSSR QVHQQEEELK MTQQHLKQSQ SLVEELKSKN IAREVQLLSF KEKLDKQEQS
     LNTDLENLRQ TVADLQKQQD SAQDILTKRE KEMEEMNNKI ITMEKETEEL QNALCLKYNE
     CAELKRETSL LSEWKNKTEN LKNQMLCEKE GMLKEIQELE KCLESHQYDN ERIKVLENEK
     QNLCLQIENY ERLLDCKSAD LESQKQIYEK LRKTAEQADQ KYSKEKENMC LQVIQLTAQA
     NGLEKKLQLE TNKILKMEQS YSELCAEYER ATNLAKSKES VIELKEAEML NLQNSLSETI
     IDFEKQLAKV NSEKSDLIKE HENAVLGKAV EAENMKLELE KCQNDIAFSK EQISSLECDL
     KLQKDLNSEL ESRCEELMKV KDELEEKLVE VAKNLEIVQA EAKEERELKI TVSAQQQRVD
     DLLATVQEKE MSIQKLTSEQ ERKESCLQSL QSSNQLLEVQ VQQLNIQSEA QRQEKEDILA
     SIFSNEKEVE NLAKENEKLK EVIDTLSQEK QVLLEKNSNF ANMVKEKEAE VSELSTRRAE
     EHQTILENNV KLESELANLQ KKYSCMEETK DELEVQIKEK TEKLEEQERK FNKQCAEYLS
     KMEHYEEANQ SLVKEVEKVQ SSLNNKLEET TQFKERLIVS EKETENLRKK LSDTTEGYKE
     LQEMLRRLQQ ENELLNQQVT EERKKLSCLR NAFNENETVF AKKNKDACLK IDQLEKELST
     AQNQQLKSSE LAKEKTQYGE ELREAMREKE SDLSKIQVQL EMLQMDLEDK EVCIESYSTQ
     IEQMEATVKK MESELRESEE QKTIVREEKN SLCKELEATK SKLSDALEKE QILKLCAEHK
     EQSEKELAAV SQEYKSCQLL KTKLETSLQE VSSKCEELEK MYERMQTEKS ELISELSNLR
     TQCTTVLDEN SGLADKIKHL ENEVNLSKDE STALHSKLMS LKDENEKLKE WAKQEECEKL
     KLCNKEIERH LFEVNEKLSC SQKEYDIVQE QYCCAMSKVS ELQSLVETLE EEKSVLVAKL
     EEVYSSKMDA PEFSDRIEPK QTCAIFEHES PIHIKEDNAH ENPVNIKEDP AVSKHHIAAE
     GTDHFNQSAE VDKADELQTQ LNMAERKLFD TEIILSKTNV EKAALEAEVN ILETSLESAQ
     LQLTEQKAQL QHLEQLILER ETEVMDLKEQ LNDFKGNLVS KENNASQTEG NLPKEIEELK
     ILSETYETGI KKLEEQLQMQ KDARNGEIQE LSQTLAATKN ELTCLQKQHS SEIDQWQQKL
     SSMTLEMETK LAAERQQTEI LSTELKGARI QIQHLDLSSH SLLCAETEEV QNGSCIQDQK
     EEINENLQYI NSFSKDPQSE PQSNERDGIE TENTIDISQE NISRLDVETE TDSVADNTVE
     CLRATVEFLC LDSNTSTHQE DFQETHVVPE SCTSQPENVS SKQEFLSQEL EEYKKDDILL
     KEEQHLYSRL DSQQLQSTSQ NSACTELQNV VYKLEEEKSV LSDRIKSTNL ENQKLSERIK
     DLEKELNSMT SEQEVYKARL SDVTEMLHSL EMVKGNWNEK YLEVENELKR TRSEKANLEK
     HILSMEADIE EMQIAKTSLE KEAANSSKCI SRLGEQLSVA TADKNQLSQE LESSREIQEE
     LEQISQNLKE KLEQLASDKV NYTEFIKVLE AENKKLTKQL EITKFDVEKL SKERNSILEQ
     LDCLEKNVLS DEKGELQKQF DQRTEEKEVL LNECETLQGK ICALEMENSR LSQSLESSLV
     EKGEIASRLN STQEEVVQMR HGIEKLKVRI ESDEKKRHHE AEKLKASERK ADSFQDKVEK
     LQRELQMSEE SLEGMVLEAE TVKEEMEKLV TVKQEISKKL QILETEVNTL SSERDLLDKE
     LQQKQLKISE LEGSCMDITN LLKKVEEEKL QVMQEHVVSQ KALKSELMEL NEKLKICSEE
     LENWRAKEQD WLGQISGLEC EKTELSQQLQ QKESSFAELH SANVSLTQDL LASKQELNEQ
     VKANNRLQQE VTDMQQWKQK ISEQLSNVEA EKVSLEKERN QLQTITTESE QRVQALEAKN
     FKMQGTIEGL EGSQQLLEDE LQSAKLQNSA LLEQIQKITE NDSRVRSDLN AANQKIEKMQ
     EECNLERSTL VAQINNAQQQ EESYKVQLDL VTSEKEEMKK RLQHLQNELQ ISEEKIKEER
     MEYQHQLQEA EEKQKTLLKE GREKEAEVHT EREKLITMRQ MLNDHILEIN NLKSAKEQLN
     AALRKAESKL EQLNEKKVED LKTTIVQLKK EKESAVSKLQ LWMRSCKQME QEKEMLLKDI
     EQQDALLNKL KKNEKIETDT NADGVSSELE ELRETVEEKT READESMEKY CNLIINYHKL
     EEANETLKNR IAFLSSQLKQ PGSQDSNSTA KTTPDNPPNS KTGNLKTGKT SPWDRSRPCN
     KRHRTVEPKE DGLELEKVEA TECVSKRIRN GKENGMSRHS AGGDNVEFKP EGLPEVVKKG
     FADIPAGTAS PFILRRTTLQ RRSPRLAAQK TSPITQVVQK VALENQAQNS KTPGGSKLQQ
     VKEVSSFSLG SVLGPVASTS GSPLSSVINS PKKTAFQIPS GSAPTRRSRR SPSTRKYPEQ
     EEEEENCNVQ
//