ID D2UXU1_NAEGR Unreviewed; 1733 AA.
AC D2UXU1;
DT 02-MAR-2010, integrated into UniProtKB/TrEMBL.
DT 02-MAR-2010, sequence version 1.
DT 29-MAY-2013, entry version 25.
DE SubName: Full=Predicted protein;
GN ORFNames=NAEGRDRAFT_61241;
OS Naegleria gruberi (Amoeba).
OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; Naegleria.
OX NCBI_TaxID=5762;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NEG-M;
RX PubMed=20211133; DOI=10.1016/j.cell.2010.01.032;
RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B.,
RA Carpenter M.L., Field M.C., Kuo A., Paredez A., Chapman J., Pham J.,
RA Shu S., Neupane R., Cipriano M., Mancuso J., Tu H., Salamov A.,
RA Lindquist E., Shapiro H., Lucas S., Grigoriev I.V., Cande W.Z.,
RA Fulton C., Rokhsar D.S., Dawson S.C.;
RT "The genome of Naegleria gruberi illuminates early eukaryotic
RT versatility.";
RL Cell 140:631-642(2010).
CC -!- SUBCELLULAR LOCATION: Nucleus (By similarity).
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; GG738845; EFC50347.1; -; Genomic_DNA.
DR RefSeq; XP_002683091.1; XM_002683045.1.
DR GeneID; 8863907; -.
DR KEGG; ngr:NAEGRDRAFT_61241; -.
DR KO; K15199; -.
DR OMA; HATIENS; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
DR Gene3D; 1.10.10.60; -; 2.
DR InterPro; IPR001356; Homeodomain.
DR InterPro; IPR009057; Homeodomain-like.
DR InterPro; IPR017877; Myb-like_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR007309; TFIIIC_Bblock-bd.
DR Pfam; PF04182; B-block_TFIIIC; 1.
DR Pfam; PF00046; Homeobox; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain_like; 2.
DR PROSITE; PS50090; MYB_LIKE; 1.
PE 3: Inferred from homology;
KW Complete proteome; DNA-binding; Homeobox; Nucleus.
SQ SEQUENCE 1733 AA; 200679 MW; 7E33899D4A2DAA7E CRC64;
MDKVLEASTL EIALEGFRGC SFEKLVKLLQ EYHEYDIKED EDMQICLWSY LLRNSKLSFF
KIVPNNGDSS TTNNTPPPKQ KSNKKAPQKN DYQYTTEGFQ EYAETRNNFD SRFSSKRLLV
DNLSELGTLP TSDICIVAQF EYRKFALGLD IRSDVEISEI QYCMLEEVGR SRYSGRFQND
LTKYLCIDSR STLYHLKRLY GLNLLLKKSS YDPRSKTTSN LIHLFRFTDT KKQKEFIKKT
DDEVNAEKIF ERYSQLKENI VDTVVGMLTK AKGNVLIDYE VRKKLLDFVK DDTQQKHKWE
RLKLILQASG KVEYFKAIFE NRLTYCVRLL NQDNNLSSSN PIATNNAIED GLLNPGFDME
RPQLYQMYDY IEKFGGKSGL NHMDIYQHFE MTSKVSGNLC NDLRKDFGVT SVAEKGEKQA
KSTSYRFFVP YIKEVENDKA NEVSNNEQES PQKSKSRKKK KSQPPKQSTN NSQEQVQQEV
QSVLPSINLE DSSDSLYDSD TGSDDESDEN KPALTPSQYL RTEGKRKEEG SRKEVITKNF
SERCKLALSF LKEKKCIARF QLREHFKQNN VSVESKAISR LIQYLESNNL AKKMTIAVPS
LFGTTSKMHI CLIDCSLDMK SKTVQDFITS LYSHEVQVNS KEKVDRTILE KVDSIDHLET
KRSQKRFSYL PLFHSLLNGY INAKMLRVRV FHQYLWNKVN EGSVNPERIC LYDIYSNMSV
SLFCKLVGCI AEIRLGSDDK EILQTVTISN APENIKKAVD ASVVGRRAVA LSRVETFIDI
MEKLGLLVVQ AIRDKESKSK VFSLLKTVEF PFPQQFPATE YRQVTFNSIN DVEEYWDDLE
MCSVNMETIR CEPIGDDQED SPQIFSKIQD LGKRPSWTEY RHFSLRQLRI LEASYKENQT
PNFEECDNIC KKLRIPMEQV VLYFYRYRNS ILSKRISDIG TIPEGQSKSN TKRKKDDDFI
VGDDEIEYES NDENVEVRAK KTKRLPSVSL LLIGNRKTRK GSTRKGNKKQ PTDTNENTEK
QKEKTIPTRG YKWTNEQDLL LLEGLSRLRE LDPSTGTFLT KPIKWQEIAD SIGDGITANR
CKLRWNNVLR KLPWCKRALE LAIGHRDLNK NNPNIPPIEI RTLVQTCRDQ YRIGPSDSNN
QLMSVDLPRD LINIVKYFKV IRFEPLVEDN ADENINITTE NRLRLSSLET LFKIILLTPD
EHYDVKLANK LVKRFKSKEI EECFNSLRKR SIISKSKRGL RMKGYHLNIN FTTKTTVNFY
PPEIFEDAPK FRKRALMINE EDDSDEEDDM EVKPLFNRHG ACFNPQSSGG VVAALMSMVM
LEEIRLIPRV APLSEEEKNQ VGRTRSIQDM EEDDGTGVSN HLLRVGAVRP GENDHIHAEG
TNIKSSHGSL QLPDWTIAIE SITEVNLFSE MGAENEHQTH ISRIVSEYGH KLSSIEFSIE
LPDQENENKR ERDEFNQDNT YDPPQKKKKQ HHITNSEIEA YFEMYAAESS SITTFTNDQI
DFVENITYPE IKPLAQYIIE SGVEEYSFWI ERIILVYSFI GNTKFEGANF DTISQKFSDF
STDYDLSIKE ILQQLVNFNC IVEVNSENET RFLRFEEAKL HTIPYSENGA TQYRPISVFR
QLSGELNDNL LQNFKKCIIS RIMRYPGILE KKLINSIGVI SAQDIRKVLG FLEDDQIIQS
SYHRATSNEK LKCKNILFMH QDEMDDLENS DCYPTEFHRS LFVTPKYIFF STQ
//