ID G3X012_SARHA Unreviewed; 3107 AA.
AC G3X012;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 10-FEB-2021, entry version 55.
DE SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSSHAP00000021017};
GN Name=CENPF {ECO:0000313|Ensembl:ENSSHAP00000021017};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021017, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021017, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021017}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2011) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEFK01158560; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AEFK01158561; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AEFK01158562; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 9305.ENSSHAP00000021017; -.
DR Ensembl; ENSSHAT00000021187; ENSSHAP00000021017; ENSSHAG00000017822.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR GeneTree; ENSGT00730000111187; -.
DR HOGENOM; CLU_000551_0_0_1; -.
DR InParanoid; G3X012; -.
DR OMA; YNAQLVQ; -.
DR TreeFam; TF101133; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR GO; GO:0008134; F:transcription factor binding; IEA:InterPro.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874; PTHR18874; 2.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 2.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 1..301
FT /note="CENP-F_N"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 2118..2249
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2362..2442
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2962..3004
FT /note="CENP-F_C_Rb_bdg"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 49..71
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 132..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..469
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2883..2941
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3004..3107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 274..308
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 316..371
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 517..544
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 547..588
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 659..732
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 906..954
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 987..1014
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1091..1118
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1181..1215
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1263..1297
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1420..1440
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1548..1604
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1611..1631
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1755..1796
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1804..1838
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1877..1939
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1954..1988
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2003..2079
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2094..2121
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2129..2219
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2241..2311
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2323..2457
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2465..2634
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2639..2659
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2685..2762
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2770..2818
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2825..2852
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2860..2880
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 49..64
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..162
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2883..2921
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3013..3035
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3047..3074
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3075..3107
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3107 AA; 360952 MW; 241545B7A70FE90F CRC64;
MSWALEEWKD GLSTRALQKI QELEGQLDKL KKDRQQRQFQ LESLEDALQK QKKKVEDQKN
EGATLKRENQ SLMETCENLE KMRQKISHEL QVKESQVNFQ EGQLNSSKKQ IEKLEQELKR
YKSELERNQL AVLPGDVSTL NSTPQKSSAP PVQSSQSASR YEELQEKYSK EVEERKRLEE
EVKALQIKKA SKTVPPNNMS HREIARHQSS SSVFPWQQEK TPTRHSANAQ ETPFRRGFTA
LHVPWEQETT PNRMPSWQDT NCSFHENPSD PHLLDQIKAQ NQELRFKVNE LEHRLQGQEK
DIKGHMNKWQ ETQLHLEKTK LELVEKEKVL NKTRDELMRL TTQFDQATAK CTTLEQKLKK
ISEDLSCQRQ NAESTRCALE QKIKDKEKDY QEELSHQQRT FQMLDQESTQ IKTRLNQELQ
QAKSTQNILQ AELDKVVAVK QQLERNSDEF KQKLSRTEQA LQTSETKENE LRKNLEEMMQ
EKQLFSHQLD KRTREVYLLE EELKKTKYCL KQNQNLAEEM KEKNATQGET LKALQEKIDQ
QEKSFILEKL KLAVADLEKQ RDCSQDLLKK REHHIEQLND KLSATQKETY ELLSTLELKE
KECEDLKKET IIFSQWKSEK EPLLNQLLLE KEGLEGKINH LELCLKTQQM KNHDANESFK
VMETEKENLG LEIRKLQNAI EVKSIELETQ KRAYDELQEK SECSEQKYKK EIENLSLKIL
QVTEEDEHLK QKLQLISSEI IEKDQRYEEL CIQYKRINSL VKSQNICQMT SDDHCGDLLT
FEEPVTNNSF TNILEFQGSL PLERGSQKIK LCSGDESLKS AALLHHEVSS FQFSVESEKQ
MNTDSQKHCE ELVPIEGKIE ENIFKAEHMH ECFVNKQFQM LSEQAESGQV EDEDLKTNKL
SFEKSVKELQ LMSETLNSEK KEINSDLFQN KKEIEELTQE NRNLKEINAI LNIEKMNLIQ
KNVDFSNCLI QKENNISELS DRNMEEKLLL VKRCEEAEKE LELLKEKYKS LEKKNTEMGC
ILSDHSLTLF GDRNNELKDL EGAFAREREY YIGKLALAEE KNEKLIFEME TIQCGLRTEN
AAIQNSSKIE ADCLRQEISN FKHEQNKIQE QYHDLLQENG RLMKLIKAKD VQMNALVLTG
SSASEQMSES ENQSENETDN FKMIKDLDAK DSSINTCNPQ VVRLEEVIKH MELRLKESEK
EKEFLQKELE MIRKELEIRD SKVMEAKQYD QSFETNSYED CKREMDEKYI SVLHELSTSQ
NDNAQLMTSL QGAVNKLNEL EKMCDILQIE KLKLTSELND SKSECILATT KMAEKMEKLV
TDIKTLNNKN SNLPGDFNER NDKDKFGEQS NEQMSVSLKL LESNIEAGGD YEHLKLSNKE
VQMHFFELQE KFSSLEVEHR ILHEQHCGMN SKLSELCSYI DTLKTENSVL SMNLKNLQTD
LMKQCSPDNE EFALEEGGSL SSSCMNEIPN LTSFVESSFY KDLFEEPRET SSNSLEETIL
NSQSSTNIHE TSLSSSVVEC ISKKIARSDP SLNIEEIKTL CQTYQISLKN LVEKYQRQEN
IKDKEIQELK QLIGSERKEL QSLREQYLAE NEQWQQKLTN VTVEMESKLA AEKKQTEYLS
LQLEVARLQL QGLDLSSRSF LIAESEDAIT DDQGNSVSDN SEEHNSFTDE KIIKSDSLHI
CEENVQQILH LESEKIAENE GVSSAAICSR KLNLENEYRN PLDKTLNNSE CIHELSFSSN
DTTVAMDFLE NQVLIQNLQT KLKETSNENL KLIRGIEESN KKVDSLLSRI KELDYELDLQ
KTELTAKICK CSELEKAIQE LQKEKVDLSD KLESLSYDNH QLLQRVTSFE NLNSCLAPDK
VIFTDATDIE ENIAVMSKDW KEQFLEIENE LKRVKSEKTN IENHAISIEG DLEELQRKNL
SLEKDNDNKQ KMVSNLDEQL SLVIAEKNHL SGELEIVLKA KMQIEEICEK LKEKIEVLES
NQRESLQHVE IMESEVRDKT QLLQSMTCHV NQLLKDKEDL QEQLQNLEKD KQKLSLIKDN
IKHEFGQLTN EKELLIRELE NMKSKLNESE AENVNLSKVL ESSLIEKGEI AARLNSTQEE
VHQLRTGIEK LKIRIEADEK KKHYVLEKLK ESERNADSFK DKVEALEREL QMSEENQEAV
ILDAESTKSE LETLKTQMEE LTKGLKCSEL ELFALRAEKE NILKELQEKQ DQVSQLEQLA
YNLKFVLSLS LEKERMSIEL QENSDHELEE LAKNLKSLEM EVVVLREEKE NMSKNQQENH
LIELEKLTQT IKYLESELAA LREEKESISN EVLKEKQYRV CELEELTKSL KCLEIEVVSL
RSEKESILKE LQKKQSEVSE LEELTKTLRC VETEVVSLRS EKDKENILKE LQEKQDQIAE
LKMLNVSFES LLEKKEEEKR KLKEESNHTV ELLQKQLNDL NEKIETLHEE NNICKVNEHD
LNCQVDCLKN EKIHLMQQLE ENKNNNALLK SSMNDLIQET ENNKQKLEKE TEENRILQKQ
VEDLRKLSSM LTQMEAQQQL WEKEKLQMKS LIVELQEKIK ELSKNETLYD SLEALQCSYA
NLEKELESTK RENSSLLEQI NKMTHNEVLL KKEMNEMMQK MTVLQEEYAG EKNRFTDHVK
ITEAEAEKNK MQLDELILEK CELKKSLDCL QKELEEREGK SRDEISDYQC RLENANKKYE
ALLKEANKKH EMEIEAYQEK LICKEQDLSL HKSEMEILKS NKEELTKSLK STSKILEELK
KTKMDNMKYV TQLKKENENA KGKIQLLIKT CKQLENEKEA LQKEITGFEA LQENQKQNNA
PGVNVEELMS EVKELKETIE EKNKETDEYL DKYCNLLVSY EKLEKTKDML EAQVTFLSSL
QGKVTSQSSP MLNSADSIHS PNYPVPEKKT SASQSKVSNK RPRSCGIKEN DGESMSSTPE
TFTKRIKKGV TPKGITSLSR GLENIEYEPE GLPEVVKKGF ADIPKGKTSP YVFRRTTLAT
RASPRLAAQK LSPLNLQKGQ SENLAETSKP TAGGSRSQKA KDDHQHQVQS LVTIIEPTTR
SPLSENNLSK KTSADIPKES MRTKRVKHSP SKHTVPEQNK EDNCRVQ
//