GenomeNet

Database: UniProt
Entry: H2Q149_PANTR
LinkDB: H2Q149_PANTR
Original site: H2Q149_PANTR 
ID   H2Q149_PANTR            Unreviewed;      3114 AA.
AC   H2Q149;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 2.
DT   02-DEC-2020, entry version 53.
DE   SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSPTRP00000003323};
GN   Name=CENPF {ECO:0000313|Ensembl:ENSPTRP00000003323,
GN   ECO:0000313|VGNC:VGNC:490};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000003323, ECO:0000313|Proteomes:UP000002277};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000003323, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the human
RT   genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000003323}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2012) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACZ04071460; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_001171564.2; XM_001171564.3.
DR   STRING; 9598.ENSPTRP00000003323; -.
DR   PaxDb; H2Q149; -.
DR   Ensembl; ENSPTRT00000003607; ENSPTRP00000003323; ENSPTRG00000001977.
DR   GeneID; 457733; -.
DR   KEGG; ptr:457733; -.
DR   CTD; 1063; -.
DR   VGNC; VGNC:490; CENPF.
DR   eggNOG; ENOG502QVMD; Eukaryota.
DR   GeneTree; ENSGT00730000111187; -.
DR   HOGENOM; CLU_000551_0_0_1; -.
DR   InParanoid; H2Q149; -.
DR   OMA; YNAQLVQ; -.
DR   OrthoDB; 205405at2759; -.
DR   TreeFam; TF101133; -.
DR   Proteomes; UP000002277; Chromosome 1.
DR   Bgee; ENSPTRG00000001977; Expressed in testis and 7 other tissues.
DR   GO; GO:0005930; C:axoneme; IEA:Ensembl.
DR   GO; GO:0005813; C:centrosome; IEA:Ensembl.
DR   GO; GO:0036064; C:ciliary basal body; IEA:Ensembl.
DR   GO; GO:0097539; C:ciliary transition fiber; IEA:Ensembl.
DR   GO; GO:0000940; C:condensed chromosome outer kinetochore; IEA:Ensembl.
DR   GO; GO:0030496; C:midbody; IEA:Ensembl.
DR   GO; GO:0005635; C:nuclear envelope; IEA:Ensembl.
DR   GO; GO:0016363; C:nuclear matrix; IEA:Ensembl.
DR   GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR   GO; GO:0045120; C:pronucleus; IEA:Ensembl.
DR   GO; GO:0000922; C:spindle pole; IEA:Ensembl.
DR   GO; GO:0070840; F:dynein complex binding; IEA:Ensembl.
DR   GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR   GO; GO:0008022; F:protein C-terminus binding; IEA:Ensembl.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:Ensembl.
DR   GO; GO:0008134; F:transcription factor binding; IEA:Ensembl.
DR   GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR   GO; GO:0051310; P:metaphase plate congression; IEA:Ensembl.
DR   GO; GO:0000278; P:mitotic cell cycle; IEA:Ensembl.
DR   GO; GO:0045892; P:negative regulation of transcription, DNA-templated; IEA:Ensembl.
DR   GO; GO:0015031; P:protein transport; IEA:Ensembl.
DR   GO; GO:0010389; P:regulation of G2/M transition of mitotic cell cycle; IEA:Ensembl.
DR   GO; GO:0016202; P:regulation of striated muscle tissue development; IEA:Ensembl.
DR   GO; GO:0021591; P:ventricular system development; IEA:Ensembl.
DR   InterPro; IPR043513; Cenp-F.
DR   InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR   InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR   InterPro; IPR018463; Centromere_CenpF_N.
DR   PANTHER; PTHR18874; PTHR18874; 2.
DR   Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR   Pfam; PF10473; CENP-F_leu_zip; 3.
DR   Pfam; PF10481; CENP-F_N; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277}.
FT   DOMAIN          1..307
FT                   /note="CENP-F_N"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   DOMAIN          1893..2035
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2131..2270
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2313..2452
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2970..3013
FT                   /note="CENP-F_C_Rb_bdg"
FT                   /evidence="ECO:0000259|Pfam:PF10490"
FT   REGION          211..236
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1667..1690
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1710..1747
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2891..3000
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3022..3114
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1722..1747
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2916..2953
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2962..2984
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3029..3080
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3114 AA;  357717 MW;  615A3A9CB5383EC4 CRC64;
     MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LDSLEAALQK QKQKVENEKT
     EGTNLKRENQ RLMEICESLE KTKQKISHEL QVKESQVNFQ EGQLNSGKKQ IEKLEQELKR
     CKSELERSQQ AAQSADVSLN PCNTPQKIFT TPLTPSQYYS GSKYEDLKEK YNKEVEERKR
     LEAEVKALQA KKASQTLPQA TMNHRDIARH QASSSVFSWQ QEKTPSHLSS NSQRTPIRRD
     FSASYFSGEQ EVTPSRSTLQ IGKRDANSSF FDNSSSPHLL DQLKAQNQEL RNKINELELR
     LQGHEKEMKG QVNKFQELQL QLEKAKVELI EKEKVLNKCR DELVRTTAQY DQASTKYTAL
     EQKLKKLTED LSCQRQNAES ARCSLEQKIK EKEKEFQEEL SRQQRSFQTL DQECIQMKAR
     LTQELQQAKN MHNVLQAELD KLTSVKQQLE NNLEEFKQKL CRAEQAFQAS QIKENELRRS
     MEEMKKENNL LKSQSEQKAR EVCHLEAELK NIKQCLNQSQ NFAEEMKAKN TSQETMLRDL
     QEKINQQENS LTLEKLKLAV ADLEKQRDCS QDLLKKREHH IEQLNDKLSK TEKESKALLS
     ALELKKKEYE ELKEEKTLFS CWKSENEKLL TQMESEKENL QSKINHLETC LKTQQIKSHE
     YNERVRTLEM DRENLSVEIR NLHNVLDSKS VEVETQKLAY VELQQKAEFS DQKHQKEIEN
     MCLKTSQLTG QVEDLEHKLQ LLSNEIMDKD RCYQDLHAEY ESLRDLLKSK DASLVTNEDH
     QRSLLAFDQQ PAMHHSFANI IGEQGSMPSE RSECHLEADQ SPKNSAILQN RVDSLEFSLE
     SQKQMNSDLQ KQCEELVQIK GEIEENLMKA EQMHQSFVAE TSQRISKLQE DTSAHQNVVA
     ETLSALENKE KELQLLNDKL ETEQAEIQEL KKSNHLLEDS LKELQLLSET LSLEKKEMSS
     IISLNKREIE ELTQENGTLK EMNASLNQEK MNLIQKSESF ANYIDEREKS ISELSDQYKQ
     EKLILLQRCE ETGNAYEDLS QKYKAAQEKN SKLECLLNEC TSLCENRKNE LEQLKEAFAK
     EHQEFLTKLA FAEERNQNLM LELETVQQAL RSEMTDNQNN SKSEPGGLKQ EIMTLKEEQN
     KMQKEVNDLL QENEQLMKVM KTKHECQNLE SEPIRNSVKE RESERNQCNF KPQMDLEVKE
     ISLDSYNAQL VQLEAMLRSK ELKLQESEKE KECLQHELQT IRGDLESSNL QDMQSQEISG
     LKDCEIDAEE KYISGPHELS TSQNDNAHLQ CSLQTTMNKL NELEKICEIL QAEKYELVTE
     LNDSRSECIT ATRKMAEEVG KLLNEVKILN DDSGLLHGEL VEDIPGGEFG EQPNEQHPVS
     LAPLDESNSY EHLTLSDKEV QMHFAELQEK FSSLQSEHKI LHDQHCQMSS KMSELQTYVD
     SLKAENLVLS TNLRNFQGDL VKEMQLGLEE GLVPSLSSSC VPDSSSLSSL GDSSFYRALL
     EQTGDMSLLS NLEGTVSANQ CSVDEVFCSS LQEENLTRKE TPSAPAKGVE ELESLCEVYR
     QSLEKLEEKM ESQGIMKNKE IQELEQLLSS ERQELDCLRK QYLSENEQWQ QKLTSVTLEM
     ESKLAAEKKQ TEQLSLELEV ARLQLQGLDL SSRSLLGIDT EDAIQGRNES CDISKEHTSE
     TTERTPKHDV HQICDKDAQQ DLNLDIEKIT ETGAVKPTGE CSGEQSPDTN YEPPGEDKTQ
     GSSECISELS FSGPNASVPM DFLGNQENIQ NLQLRVKETS NENLRLLHVI EDRDRKVESL
     LNEMKELDSK LHLQEVQLMT KIEACIELEK IVGELKKENS DLSEKLEYFS CDNQELLQRV
     ETSEGLNSDL EMHADKSSRE DIEDNVAKVN DSWKERFLDV ENELSRIRSE KANIEHQALY
     LEADLEIVQT EKLCLEKDNE NKQKVIVCLE EELSVVTSER NQLRGELDTM SKKTMELDQL
     SEKMKEKTRE LESHQSECLH CIQVAEAEVK EKTELLQTLS SDVSELLKDK THLQEKLQSL
     EKDSQALSFT KCELENQIAQ LNKEKELLVK ESESLQARLS ESDYEKLNVS KALEAALVEK
     GEFALRLSST QEEVHQLRRG IEKLRVRIEA DEKKQLHVAE ELKEREREND SLKDKVENLE
     RELQMSEENQ ELVILDAENS KAEVETLKTQ IEEMARSLKV FELDLVTLRS EKENLTKQIQ
     EKQGQLSELD KLLSSFKSLL EEKEQAEIQI KEESKTAVEM LQNQLKELNE AVAALCGDQE
     IMKATEQSLD PPIEEEHQLR NSIEKLRARL EADEKKQLCV LQQLKESEHH ADLLKGRVEN
     LERELEIART NQEHAAHEAE NSKGEVEALK AKIEGMTQSL RDLELDLVTI RSEKENLTNE
     LQKEQERISE LEIINASFEN ILQEKEQEKV EMKEKSSTAM EMLQTQLKEL SERVAALHND
     QEACKAKEQN LSSQVECLEL EKAQLLQGLD EAKNNYIVLQ SSVNGLIQEV EDGKQKLEKK
     DEEISRLKNQ IQDEEQLVSK LSQVEGEHQL WKEQNLELRN LTVELEQKIQ VLQSKNASLQ
     DTLEVLQSSY KNLENELELT KMDKMSFVEK VNKMTAKETE LQREMHEMAQ KTAELQEELS
     GEKNRLAGEL QLLLEEIKSS KDQLKELTLE NSELKKSLDC MHKDQVEKEG KVREEIAEYQ
     LRLHEAEKKH QALLLDTNKQ YEVEIQTYRE KLTSKEECLS SQKLEIDLLK SSKEELNNSL
     KATTQILEEL KKTKMDNLKY VNQLKKENER AQGKMKLLIK SCKQLEEEKE ILQKELSQLQ
     AAQEKQKTGT VMDTKVDELT TEIKELKETL EEKTKEADEY LDKYCSLLIS HEKLEKAKEM
     LETQVAHLCS QQSKQDSRGS PLLDPVVPGP SPIPSVTEKR LSSGQNKASG KRQRSSGIWE
     NGRGPTPATP ESFSKKSKKA VMSGIHPAED MEGTESEPEG LPEVVKKGFA DIPTGKTSPY
     ILRRTTMATR TSPRLAAQKL ALSPLSLGKE NLAESSKPTA GGSRSQKVKV AQQSPVDSGT
     ILREPTTKSV PVNNLPERSP TDSPREGLRV KRGRLVPSPK AGLESKGSEN CKVQ
//
DBGET integrated database retrieval system