GenomeNet

Database: UniProt
Entry: D2W4N9_NAEGR
ID   D2W4N9_NAEGR            Unreviewed;       920 AA.
AC   D2W4N9;
DT   02-MAR-2010, integrated into UniProtKB/TrEMBL.
DT   02-MAR-2010, sequence version 1.
DT   27-MAR-2024, entry version 34.
DE   SubName: Full=Predicted protein {ECO:0000313|EMBL:EFC35964.1};
GN   ORFNames=NAEGRDRAFT_54650 {ECO:0000313|EMBL:EFC35964.1};
OS   Naegleria gruberi (Amoeba).
OC   Eukaryota; Discoba; Heterolobosea; Tetramitia; Eutetramitia;
OC   Vahlkampfiidae; Naegleria.
OX   NCBI_TaxID=5762 {ECO:0000313|Proteomes:UP000006671};
RN   [1] {ECO:0000313|EMBL:EFC35964.1, ECO:0000313|Proteomes:UP000006671}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=NEG-M {ECO:0000313|EMBL:EFC35964.1,
RC   ECO:0000313|Proteomes:UP000006671};
RX   PubMed=20211133; DOI=10.1016/j.cell.2010.01.032;
RA   Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., Carpenter M.L.,
RA   Field M.C., Kuo A., Paredez A., Chapman J., Pham J., Shu S., Neupane R.,
RA   Cipriano M., Mancuso J., Tu H., Salamov A., Lindquist E., Shapiro H.,
RA   Lucas S., Grigoriev I.V., Cande W.Z., Fulton C., Rokhsar D.S., Dawson S.C.;
RT   "The genome of Naegleria gruberi illuminates early eukaryotic
RT   versatility.";
RL   Cell 140:631-642(2010).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GG738971; EFC35964.1; -; Genomic_DNA.
DR   RefSeq; XP_002668708.1; XM_002668662.1.
DR   AlphaFoldDB; D2W4N9; -.
DR   EnsemblProtists; EFC35964; EFC35964; NAEGRDRAFT_54650.
DR   GeneID; 8860811; -.
DR   VEuPathDB; AmoebaDB:NAEGRDRAFT_54650; -.
DR   InParanoid; D2W4N9; -.
DR   OrthoDB; 5532629at2759; -.
DR   Proteomes; UP000006671; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000006671}.
FT   REGION          33..73
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        44..59
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   920 AA;  106326 MW;  F3A250302C5F322A CRC64;
     MSNKQSSSYD TSPQSFSSYS NLIDLLPSVS QVPSWTCTQD QPDNTPSKKK DEKTPGKKKE
     EKPKKLKKGT KRKFENVEEA IRDQLTMKNV KFLKEFLKSE FGETGGNLRK DALVAKLVTR
     ICSSVEEETD ERDESFEDNI KKKALLIVER NQETYNESQA KKAKLVNLEE TEKKENPDDK
     IVLIGFKNKR KVGYVQKTRE EQIKLVEKYK EQYSTPDRGS QQEFAKQNNI SPSQFSRYLS
     ACKRNELTLK VERGHPPPLL LKDQILQVID ITRKERSEKK QVSNIRLGEI MKEVAGLKKT
     PSVDYISKFS KKFLYKRKIK TTITKRERIE TAYEQLYRTY YYYCFYAAIM FRNLKSIDRS
     KIVVFDETGT NTKATERTNT PIKDDDAVVL QLADDIKDTF LVAVTAEGGV LPVSIVESVP
     GKKSTINGNK VTVEEKIAGV HIEHIEQWVE KVYIPNSKEG DILLWDNLSH HKCKSVQKIL
     KQHNRINILL PVGGHHQSPL DNMCFRYAKR ELADWKKNNS NSDRKSRNAA FEQIISNMNP
     DVIVNSFRKC LMDIWNYDSE EEYLKYVRAS LKMNNDLYDL YIREFNPMCI GQFYVELPTP
     SVKVETDEKL DQDPILDDVN NPWIDSDSMK ENFDQKQVEQ ISKIILHIQN KTEFNYNIQQ
     NAANLQLYVL STETIACSIT LSTEALETIK DDTVFLPGNT FDIFGICFSK ISSQYLQYIP
     SDFSQRVTNQ SKKFIETNLP KSMKFTSKGF IIFPVARSLH WYLLIYDCQS TDWFLFDSFA
     KDFSVIESET SELFEMLPSF IVKPKTVRKT LQKKIQVDGN HCGDYCVLLM DFLSAGFSLS
     DAVDIMTKPF SISNYRAILV ARLQHLHVVE FEPQSPLSES EGFQSEPKKS SKIELGDYDE
     ELDELFNESW EDSIFGKKIQ
//