GenomeNet

Database: UniProt
Entry: G3X012_SARHA
LinkDB: G3X012_SARHA
Original site: G3X012_SARHA 
ID   G3X012_SARHA            Unreviewed;      3107 AA.
AC   G3X012;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   10-FEB-2021, entry version 55.
DE   SubName: Full=Centromere protein F {ECO:0000313|Ensembl:ENSSHAP00000021017};
GN   Name=CENPF {ECO:0000313|Ensembl:ENSSHAP00000021017};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021017, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000021017, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000021017}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2011) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AEFK01158560; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AEFK01158561; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AEFK01158562; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 9305.ENSSHAP00000021017; -.
DR   Ensembl; ENSSHAT00000021187; ENSSHAP00000021017; ENSSHAG00000017822.
DR   eggNOG; ENOG502QVMD; Eukaryota.
DR   GeneTree; ENSGT00730000111187; -.
DR   HOGENOM; CLU_000551_0_0_1; -.
DR   InParanoid; G3X012; -.
DR   OMA; YNAQLVQ; -.
DR   TreeFam; TF101133; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR   GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR   GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR   GO; GO:0008134; F:transcription factor binding; IEA:InterPro.
DR   InterPro; IPR043513; Cenp-F.
DR   InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR   InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR   InterPro; IPR018463; Centromere_CenpF_N.
DR   PANTHER; PTHR18874; PTHR18874; 2.
DR   Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR   Pfam; PF10473; CENP-F_leu_zip; 2.
DR   Pfam; PF10481; CENP-F_N; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT   DOMAIN          1..301
FT                   /note="CENP-F_N"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   DOMAIN          2118..2249
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2362..2442
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2962..3004
FT                   /note="CENP-F_C_Rb_bdg"
FT                   /evidence="ECO:0000259|Pfam:PF10490"
FT   REGION          49..71
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          132..166
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          449..469
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2883..2941
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3004..3107
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          274..308
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          316..371
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          517..544
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          547..588
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          659..732
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          906..954
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          987..1014
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1091..1118
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1181..1215
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1263..1297
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1420..1440
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1548..1604
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1611..1631
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1755..1796
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1804..1838
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1877..1939
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1954..1988
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2003..2079
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2094..2121
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2129..2219
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2241..2311
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2323..2457
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2465..2634
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2639..2659
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2685..2762
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2770..2818
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2825..2852
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2860..2880
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        49..64
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        134..162
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2883..2921
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3013..3035
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3047..3074
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3075..3107
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3107 AA;  360952 MW;  241545B7A70FE90F CRC64;
     MSWALEEWKD GLSTRALQKI QELEGQLDKL KKDRQQRQFQ LESLEDALQK QKKKVEDQKN
     EGATLKRENQ SLMETCENLE KMRQKISHEL QVKESQVNFQ EGQLNSSKKQ IEKLEQELKR
     YKSELERNQL AVLPGDVSTL NSTPQKSSAP PVQSSQSASR YEELQEKYSK EVEERKRLEE
     EVKALQIKKA SKTVPPNNMS HREIARHQSS SSVFPWQQEK TPTRHSANAQ ETPFRRGFTA
     LHVPWEQETT PNRMPSWQDT NCSFHENPSD PHLLDQIKAQ NQELRFKVNE LEHRLQGQEK
     DIKGHMNKWQ ETQLHLEKTK LELVEKEKVL NKTRDELMRL TTQFDQATAK CTTLEQKLKK
     ISEDLSCQRQ NAESTRCALE QKIKDKEKDY QEELSHQQRT FQMLDQESTQ IKTRLNQELQ
     QAKSTQNILQ AELDKVVAVK QQLERNSDEF KQKLSRTEQA LQTSETKENE LRKNLEEMMQ
     EKQLFSHQLD KRTREVYLLE EELKKTKYCL KQNQNLAEEM KEKNATQGET LKALQEKIDQ
     QEKSFILEKL KLAVADLEKQ RDCSQDLLKK REHHIEQLND KLSATQKETY ELLSTLELKE
     KECEDLKKET IIFSQWKSEK EPLLNQLLLE KEGLEGKINH LELCLKTQQM KNHDANESFK
     VMETEKENLG LEIRKLQNAI EVKSIELETQ KRAYDELQEK SECSEQKYKK EIENLSLKIL
     QVTEEDEHLK QKLQLISSEI IEKDQRYEEL CIQYKRINSL VKSQNICQMT SDDHCGDLLT
     FEEPVTNNSF TNILEFQGSL PLERGSQKIK LCSGDESLKS AALLHHEVSS FQFSVESEKQ
     MNTDSQKHCE ELVPIEGKIE ENIFKAEHMH ECFVNKQFQM LSEQAESGQV EDEDLKTNKL
     SFEKSVKELQ LMSETLNSEK KEINSDLFQN KKEIEELTQE NRNLKEINAI LNIEKMNLIQ
     KNVDFSNCLI QKENNISELS DRNMEEKLLL VKRCEEAEKE LELLKEKYKS LEKKNTEMGC
     ILSDHSLTLF GDRNNELKDL EGAFAREREY YIGKLALAEE KNEKLIFEME TIQCGLRTEN
     AAIQNSSKIE ADCLRQEISN FKHEQNKIQE QYHDLLQENG RLMKLIKAKD VQMNALVLTG
     SSASEQMSES ENQSENETDN FKMIKDLDAK DSSINTCNPQ VVRLEEVIKH MELRLKESEK
     EKEFLQKELE MIRKELEIRD SKVMEAKQYD QSFETNSYED CKREMDEKYI SVLHELSTSQ
     NDNAQLMTSL QGAVNKLNEL EKMCDILQIE KLKLTSELND SKSECILATT KMAEKMEKLV
     TDIKTLNNKN SNLPGDFNER NDKDKFGEQS NEQMSVSLKL LESNIEAGGD YEHLKLSNKE
     VQMHFFELQE KFSSLEVEHR ILHEQHCGMN SKLSELCSYI DTLKTENSVL SMNLKNLQTD
     LMKQCSPDNE EFALEEGGSL SSSCMNEIPN LTSFVESSFY KDLFEEPRET SSNSLEETIL
     NSQSSTNIHE TSLSSSVVEC ISKKIARSDP SLNIEEIKTL CQTYQISLKN LVEKYQRQEN
     IKDKEIQELK QLIGSERKEL QSLREQYLAE NEQWQQKLTN VTVEMESKLA AEKKQTEYLS
     LQLEVARLQL QGLDLSSRSF LIAESEDAIT DDQGNSVSDN SEEHNSFTDE KIIKSDSLHI
     CEENVQQILH LESEKIAENE GVSSAAICSR KLNLENEYRN PLDKTLNNSE CIHELSFSSN
     DTTVAMDFLE NQVLIQNLQT KLKETSNENL KLIRGIEESN KKVDSLLSRI KELDYELDLQ
     KTELTAKICK CSELEKAIQE LQKEKVDLSD KLESLSYDNH QLLQRVTSFE NLNSCLAPDK
     VIFTDATDIE ENIAVMSKDW KEQFLEIENE LKRVKSEKTN IENHAISIEG DLEELQRKNL
     SLEKDNDNKQ KMVSNLDEQL SLVIAEKNHL SGELEIVLKA KMQIEEICEK LKEKIEVLES
     NQRESLQHVE IMESEVRDKT QLLQSMTCHV NQLLKDKEDL QEQLQNLEKD KQKLSLIKDN
     IKHEFGQLTN EKELLIRELE NMKSKLNESE AENVNLSKVL ESSLIEKGEI AARLNSTQEE
     VHQLRTGIEK LKIRIEADEK KKHYVLEKLK ESERNADSFK DKVEALEREL QMSEENQEAV
     ILDAESTKSE LETLKTQMEE LTKGLKCSEL ELFALRAEKE NILKELQEKQ DQVSQLEQLA
     YNLKFVLSLS LEKERMSIEL QENSDHELEE LAKNLKSLEM EVVVLREEKE NMSKNQQENH
     LIELEKLTQT IKYLESELAA LREEKESISN EVLKEKQYRV CELEELTKSL KCLEIEVVSL
     RSEKESILKE LQKKQSEVSE LEELTKTLRC VETEVVSLRS EKDKENILKE LQEKQDQIAE
     LKMLNVSFES LLEKKEEEKR KLKEESNHTV ELLQKQLNDL NEKIETLHEE NNICKVNEHD
     LNCQVDCLKN EKIHLMQQLE ENKNNNALLK SSMNDLIQET ENNKQKLEKE TEENRILQKQ
     VEDLRKLSSM LTQMEAQQQL WEKEKLQMKS LIVELQEKIK ELSKNETLYD SLEALQCSYA
     NLEKELESTK RENSSLLEQI NKMTHNEVLL KKEMNEMMQK MTVLQEEYAG EKNRFTDHVK
     ITEAEAEKNK MQLDELILEK CELKKSLDCL QKELEEREGK SRDEISDYQC RLENANKKYE
     ALLKEANKKH EMEIEAYQEK LICKEQDLSL HKSEMEILKS NKEELTKSLK STSKILEELK
     KTKMDNMKYV TQLKKENENA KGKIQLLIKT CKQLENEKEA LQKEITGFEA LQENQKQNNA
     PGVNVEELMS EVKELKETIE EKNKETDEYL DKYCNLLVSY EKLEKTKDML EAQVTFLSSL
     QGKVTSQSSP MLNSADSIHS PNYPVPEKKT SASQSKVSNK RPRSCGIKEN DGESMSSTPE
     TFTKRIKKGV TPKGITSLSR GLENIEYEPE GLPEVVKKGF ADIPKGKTSP YVFRRTTLAT
     RASPRLAAQK LSPLNLQKGQ SENLAETSKP TAGGSRSQKA KDDHQHQVQS LVTIIEPTTR
     SPLSENNLSK KTSADIPKES MRTKRVKHSP SKHTVPEQNK EDNCRVQ
//
DBGET integrated database retrieval system