ID L9K1Y8_TUPCH Unreviewed; 3104 AA.
AC L9K1Y8;
DT 03-APR-2013, integrated into UniProtKB/TrEMBL.
DT 03-APR-2013, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE SubName: Full=Centromere protein F {ECO:0000313|EMBL:ELW55467.1};
GN ORFNames=TREES_T100017246 {ECO:0000313|EMBL:ELW55467.1};
OS Tupaia chinensis (Chinese tree shrew).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Scandentia; Tupaiidae; Tupaia.
OX NCBI_TaxID=246437 {ECO:0000313|EMBL:ELW55467.1, ECO:0000313|Proteomes:UP000011518};
RN [1] {ECO:0000313|Proteomes:UP000011518}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Zhang G., Fan Y., Yao Y., Huang Z.;
RT "Genome of the Chinese tree shrew, a rising model animal genetically
RT related to primates.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000011518}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23385571; DOI=10.1038/ncomms2416;
RA Fan Y., Huang Z.Y., Cao C.C., Chen C.S., Chen Y.X., Fan D.D., He J.,
RA Hou H.L., Hu L., Hu X.T., Jiang X.T., Lai R., Lang Y.S., Liang B.,
RA Liao S.G., Mu D., Ma Y.Y., Niu Y.Y., Sun X.Q., Xia J.Q., Xiao J.,
RA Xiong Z.Q., Xu L., Yang L., Zhang Y., Zhao W., Zhao X.D., Zheng Y.T.,
RA Zhou J.M., Zhu Y.B., Zhang G.J., Wang J., Yao Y.G.;
RT "Genome of the Chinese tree shrew.";
RL Nat. Commun. 4:1426-1426(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB320929; ELW55467.1; -; Genomic_DNA.
DR RefSeq; XP_006155889.1; XM_006155827.2.
DR RefSeq; XP_006155890.1; XM_006155828.2.
DR RefSeq; XP_006155891.1; XM_006155829.2.
DR STRING; 246437.L9K1Y8; -.
DR GeneID; 102487040; -.
DR KEGG; tup:102487040; -.
DR CTD; 1063; -.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR InParanoid; L9K1Y8; -.
DR OrthoDB; 5363462at2759; -.
DR Proteomes; UP000011518; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR Gene3D; 1.10.287.1490; -; 1.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874:SF10; CENTROMERE PROTEIN F; 1.
DR PANTHER; PTHR18874; CMF/LEK/CENP CELL DIVISION-RELATED; 1.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 2.
DR Pfam; PF10481; CENP-F_N; 1.
DR SUPFAM; SSF57997; Tropomyosin; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000011518}.
FT DOMAIN 1..305
FT /note="Centromere protein Cenp-F N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 2126..2265
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2308..2447
FT /note="Centromere protein Cenp-F leucine-rich repeat-
FT containing"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2964..3007
FT /note="Kinetochore protein Cenp-F/LEK1 Rb protein-binding"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 212..235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 250..276
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1230..1260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1520..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2886..2976
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3021..3104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 13..131
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 164..198
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 278..495
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 521..628
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 895..988
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1024..1079
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1120..1150
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1281..1354
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1544..1596
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1786..1844
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1892..2280
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2319..2885
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 2886..2948
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2949..2976
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3023..3070
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3104 AA; 356826 MW; 8E7E532F68E27808 CRC64;
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LETLEAALQK QKQKVENEKT
EGTNLKRENQ RLMEICENLD KSKQKISHEL QIKESQVNFQ EGQLNSSKKQ IEKLEQELKR
CKSELERSQQ MAQSADVSLN PCSTPQKMFT TPLTPRQCYS GSKYEDLKEK YNKEVEERKR
LEAEVKALQA KRTSQTISQS TMNHRDIARH QASSSVFWQQ ETPGRLSSDS HRTPAGRDFS
ASHFLRDHEV TPSKLTSKIG KRDANNAACD NSSSPHLLDQ LKAQNKELRS KINELELHLQ
GQEKEIKGQA NKFQEVQLQL EKAKVDLIEK EKVLNKNRDE LVRTTAQYDQ ASTKCTTLEQ
KLKKLTEDLN CQRQNAESAR CSLEQKIKEK EKEFHEELSR QQRSLQTLDQ ECTQVKAKLT
QELQQAKNTQ NILQAELDKA TLVKQQLERS LEEFKQKCCR TEQALQARQV TEDELRRSSE
EMKKENGLLK RQSEQRAREV CHLEEELKKV KQCLNQSQNF VEEMRAKNLS QETMLRELEE
KVNLQENSLT LEKLKVSTAD LEKQQDQDLL KKREYHIEQL NYKLSKTEEE YKALLSTLEL
MKKEYKKLEE EKALFSSWKS ENEKLLNQMA LEKESSQSKI NHLETLLNTH QIKSHEYDEK
VRTLQMERES LNVEIRNLHS VIDSKSVEIE TQKQAFVELQ QKAEVSDEKH NKEIENMCLK
TSQLTGQVED LGHKLQLLSS EIMDKDQQYQ DLHAEYEGLR DLLKSKDSFL VTNEDHQRSL
LAFKQQSDKN DSFAKIRGEQ ECMPSEKSEC NLEVDQSPKN SAILQNRVVS LEFLLESQKQ
MNSDLQKQCE ELVQIKGEIE GNLIKAEQMH QSFVAETSQR ISKLQEDTSL HQNVVAETLV
ALENKEKEFQ LLNEKLETEQ AEVQELKKSN YLLENSVKEL QLLCETLSSE KKELSSVISL
HKKQIEELTQ ENGALKEINE TLNQEKANFV QKSDSFSNSI DEKERISEFS DQFKHERLTL
LQRCEETRNA FEDLSQKYKA AQEKNSKLEC LLNECTSICE NKKMELEQLK EIFAKEHKEF
LTKLALAEER NQVLILELGT VQQDRQSEIA HIQNNFKSET DGLNQEIMIL KEQQNKMQKE
DNGLLQENEE LKKLMHTKHE HQNLEVDPIR DSMKDSKNEI NKHNCQRQVD LEVKDLFLDS
YNTQLVHLEA MVRNLEVKLL ESENEKECLQ QELQRTRGES ETKSSQDTQS QEISNLKDCE
KDTEEKYISV LHELSTSQND NAHLQCSLQT AMNKLNELEK MYEILQIEKL ELVSELNDSR
SEYITVTNKM TEEVEKLVNE VKILNDKNSH LQGDLVTEMA EGEFGEQQNE KSVSLNPMED
SNSYERLTLS NKEVQTHFAE LQEKFLSLQG EHKILHDQHC QMSSKMSELQ TYVDKLKAEN
SVLSVNLRNV QGDLVKEMKP GPEEEQILSP SFSCVTDSPN LTRFGEGSLD KDLLEQTGET
SLWSDLEGNV SANQSNVGEV SCSSLEEEEN LTKKETSSAP VRSVEELEIL CQMYLQSLKN
LEEKMENQGI LKNKEIQELE QLLSSERKEL DCLRKQFLSE NEQWQQKLTS VTMEMESKLA
AEKKQTEHLS MELEVARLQL QGLDLSSRSF IGTDTDVVRG QNESCDITES DEYTSETTER
IPKQDICQIC DENNQQDLSL ETREITETET VKLRGECCKE QSPETNCDVP VEDKPQGCPE
CISQLSLSGP SALVPVNALG DQVTIQNLQL QVEETLSENL RLLHVLEDRD KKVESLLNEK
RELSAKLDLQ EVQLTNKIEA CIALEKILEE LKEKSDLSEK LESVSCDNQE LYQRVDTSDS
MNPHLEMGPN KLSDEVIDDS VAKVDDNCKK SLLDMENELN RIKSEKASIE CHALSMEADL
EIVQTEKLCL EKDNESKQKV IVSLEEELLV VTRERNRLHG ELDTVSKDNK ELVGVSEKLK
ERVQELESHQ GACVDRIQVV EAELKDKTEL LQTLSSDVSG LLKDKTHLQE QLQSLEEDSQ
ALSLVKRELE NQIGQLNKEK ESLLRKFESL QARLSELEHD KLNVSKALEA ALIEKGEFAG
KLSSTQEEVH QLRRGIEKLR VRIEADEKNQ LHVVEKLKER ERENDSLKDK VESLERELQM
SEENQELVIL DAENSKAEVE TLKTQVEEMA KSLKVLELDH GKVRSEKENL TEQLQEKQDQ
VSELDKMVSS FKSLLKEKEQ AEIQMKEEAK TIVEMLQTQL KELNEEVKAL YNDQEGWKAE
EPSPDTPVED VDKKINSIEK LKALIETDGK KQRLVIEKLK ESQHSADLLK DRVGNLEREL
DISGKNQESA VLEAENSKAM VETLKARIEK TDQNLSDLEL ALTDIRLEKE NLMKEKQKEQ
ERVSELETLS SSLENLLREK EQQNVQLREE STVAIEMVQA KFQELSEKVS DLCNDQEIYK
AKVQNLSNQV NSLETEKAKL LQDLNKAKNN DSVLQSTVKD LIREVEDGKQ KVEKKDEEIS
RLNSQIQDQE QLLSKLSQVE GEQELCKKQN VELRNLTVEL EQKIQVLQSQ NATLQNTVEA
LQNSYKDIEN QLELTKTEKM SFAEKVNTMT AKESELQREI REMVQKRTDL EEEFSGEKNK
LTKELKLLLE EIQSNKGQLK ELMLENSELK KSLDCVHKDQ VEKEEKVREV TEYQLQLQEA
EKRHQTLLLD TNKQYQIEIQ TYQEKLTSKE ECLKSQKLEI DNLKSTKEEL NNSLKVTTQL
VEELKKAKVD NLKYVSQLKK ENERAQGKIK LLMKSCKQLE EEKGMLQKEL SQLEAALEKQ
KTGTIVNPNL DELMTEMKEL KETLEEKSKE ADEYLDKYCS LLISHEKSEK AREMLETQVA
RLSSQLSKHD LRSSPSLNSV VPGPSPVSSV PEKKLSSGQN KTSGKRQRPS GISDNDGGSV
PSTPETFSKK SRKIVKSVPH SASDTEDTEF EPEGLPEVVK KGFADIPVGK TSPYILRRTT
MATRTSPRLA AQKLALSPLS LGNKILVDSS KRTAGGSRSQ KVKIAQQSPV DSGTAFREPT
TRSLAVNNLS GDSPREGLRV KRGQLTPSPE AGPEPKGSEN CRVQ
//