ID D2W4N9_NAEGR Unreviewed; 920 AA.
AC D2W4N9;
DT 02-MAR-2010, integrated into UniProtKB/TrEMBL.
DT 02-MAR-2010, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EFC35964.1};
GN ORFNames=NAEGRDRAFT_54650 {ECO:0000313|EMBL:EFC35964.1};
OS Naegleria gruberi (Amoeba).
OC Eukaryota; Discoba; Heterolobosea; Tetramitia; Eutetramitia;
OC Vahlkampfiidae; Naegleria.
OX NCBI_TaxID=5762 {ECO:0000313|Proteomes:UP000006671};
RN [1] {ECO:0000313|EMBL:EFC35964.1, ECO:0000313|Proteomes:UP000006671}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NEG-M {ECO:0000313|EMBL:EFC35964.1,
RC ECO:0000313|Proteomes:UP000006671};
RX PubMed=20211133; DOI=10.1016/j.cell.2010.01.032;
RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., Carpenter M.L.,
RA Field M.C., Kuo A., Paredez A., Chapman J., Pham J., Shu S., Neupane R.,
RA Cipriano M., Mancuso J., Tu H., Salamov A., Lindquist E., Shapiro H.,
RA Lucas S., Grigoriev I.V., Cande W.Z., Fulton C., Rokhsar D.S., Dawson S.C.;
RT "The genome of Naegleria gruberi illuminates early eukaryotic
RT versatility.";
RL Cell 140:631-642(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG738971; EFC35964.1; -; Genomic_DNA.
DR RefSeq; XP_002668708.1; XM_002668662.1.
DR AlphaFoldDB; D2W4N9; -.
DR EnsemblProtists; EFC35964; EFC35964; NAEGRDRAFT_54650.
DR GeneID; 8860811; -.
DR VEuPathDB; AmoebaDB:NAEGRDRAFT_54650; -.
DR InParanoid; D2W4N9; -.
DR OrthoDB; 5532629at2759; -.
DR Proteomes; UP000006671; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000006671}.
FT REGION 33..73
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 44..59
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 920 AA; 106326 MW; F3A250302C5F322A CRC64;
MSNKQSSSYD TSPQSFSSYS NLIDLLPSVS QVPSWTCTQD QPDNTPSKKK DEKTPGKKKE
EKPKKLKKGT KRKFENVEEA IRDQLTMKNV KFLKEFLKSE FGETGGNLRK DALVAKLVTR
ICSSVEEETD ERDESFEDNI KKKALLIVER NQETYNESQA KKAKLVNLEE TEKKENPDDK
IVLIGFKNKR KVGYVQKTRE EQIKLVEKYK EQYSTPDRGS QQEFAKQNNI SPSQFSRYLS
ACKRNELTLK VERGHPPPLL LKDQILQVID ITRKERSEKK QVSNIRLGEI MKEVAGLKKT
PSVDYISKFS KKFLYKRKIK TTITKRERIE TAYEQLYRTY YYYCFYAAIM FRNLKSIDRS
KIVVFDETGT NTKATERTNT PIKDDDAVVL QLADDIKDTF LVAVTAEGGV LPVSIVESVP
GKKSTINGNK VTVEEKIAGV HIEHIEQWVE KVYIPNSKEG DILLWDNLSH HKCKSVQKIL
KQHNRINILL PVGGHHQSPL DNMCFRYAKR ELADWKKNNS NSDRKSRNAA FEQIISNMNP
DVIVNSFRKC LMDIWNYDSE EEYLKYVRAS LKMNNDLYDL YIREFNPMCI GQFYVELPTP
SVKVETDEKL DQDPILDDVN NPWIDSDSMK ENFDQKQVEQ ISKIILHIQN KTEFNYNIQQ
NAANLQLYVL STETIACSIT LSTEALETIK DDTVFLPGNT FDIFGICFSK ISSQYLQYIP
SDFSQRVTNQ SKKFIETNLP KSMKFTSKGF IIFPVARSLH WYLLIYDCQS TDWFLFDSFA
KDFSVIESET SELFEMLPSF IVKPKTVRKT LQKKIQVDGN HCGDYCVLLM DFLSAGFSLS
DAVDIMTKPF SISNYRAILV ARLQHLHVVE FEPQSPLSES EGFQSEPKKS SKIELGDYDE
ELDELFNESW EDSIFGKKIQ
//