GenomeNet

Database: UniProt/TrEMBL
Entry: C4M9X4_ENTHI
ID   C4M9X4_ENTHI            Unreviewed;       384 AA.
AC   C4M9X4;
DT   07-JUL-2009, integrated into UniProtKB/TrEMBL.
DT   07-JUL-2009, sequence version 1.
DT   15-FEB-2017, entry version 43.
DE   SubName: Full=HMG box protein {ECO:0000313|EMBL:EAL45607.1};
DE   SubName: Full=Hmg box protein {ECO:0000313|EMBL:GAT98534.1};
GN   ORFNames=CL6EHI_179340 {ECO:0000313|EMBL:GAT98534.1}, EHI_179340
GN   {ECO:0000313|EMBL:EAL45607.1};
OS   Entamoeba histolytica.
OC   Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba.
OX   NCBI_TaxID=5759 {ECO:0000313|EMBL:EAL45607.1, ECO:0000313|Proteomes:UP000001926};
RN   [1] {ECO:0000313|EMBL:EAL45607.1, ECO:0000313|Proteomes:UP000001926}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 30459 / HM-1:IMSS {ECO:0000313|Proteomes:UP000001926}, and
RC   HM-1:IMSS {ECO:0000313|EMBL:EAL45607.1};
RX   PubMed=15729342; DOI=10.1038/nature03291;
RA   Loftus B.J., Anderson I., Davies R., Alsmark U.C., Samuelson J.,
RA   Amedeo P., Roncaglia P., Berriman M., Hirt R.P., Mann B.J., Nozaki T.,
RA   Suh B., Pop M., Duchene M., Ackers J., Tannich E., Leippe M.,
RA   Hofer M., Bruchhaus I., Willhoeft U., Bhattacharya A.,
RA   Chillingworth T., Churcher C.M., Hance Z., Harris B., Harris D.,
RA   Jagels K., Moule S., Mungall K.L., Ormond D., Squares R.,
RA   Whitehead S., Quail M.A., Rabbinowitsch E., Norbertczak H., Price C.,
RA   Wang Z., Guillen N., Gilchrist C., Stroup S.E., Bhattacharya S.,
RA   Lohia A., Foster P.G., Sicheritz-Ponten T., Weber C., Singh U.,
RA   Mukherjee C., El-Sayed N.M.A., Petri W.A., Clark C.G., Embley T.M.,
RA   Barrell B.G., Fraser C.M., Hall N.;
RT   "The genome of the protist parasite Entamoeba histolytica.";
RL   Nature 433:865-868(2005).
RN   [2] {ECO:0000313|EMBL:EAL45607.1, ECO:0000313|Proteomes:UP000001926}
RP   GENOME REANNOTATION.
RC   STRAIN=ATCC 30459 / HM-1:IMSS {ECO:0000313|Proteomes:UP000001926}, and
RC   HM-1:IMSS {ECO:0000313|EMBL:EAL45607.1};
RA   Lorenzi H., Amedeo P., Inman J., Schobel S., Caler E.;
RL   Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:GAT98534.1, ECO:0000313|Proteomes:UP000078387}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=HM1:IMSS clone 6 {ECO:0000313|EMBL:GAT98534.1,
RC   ECO:0000313|Proteomes:UP000078387};
RA   Mukherjee Avik.K., Izumyama S., Nakada-Tsukui K., Nozaki T.;
RT   "First whole genome sequencing of Entamoeba histolytica HM1:IMSS-
RT   clone-6.";
RL   Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; DS571431; EAL45607.1; -; Genomic_DNA.
DR   EMBL; BDEQ01000001; GAT98534.1; -; Genomic_DNA.
DR   RefSeq; XP_650993.1; XM_645901.1.
DR   STRING; 5759.rna_EHI_179340-1; -.
DR   EnsemblProtists; rna_EHI_179340-1; rna_EHI_179340-1; EHI_179340.
DR   GeneID; 3405295; -.
DR   KEGG; ehi:EHI_179340; -.
DR   EuPathDB; AmoebaDB:EHI_179340; -.
DR   eggNOG; ENOG410Y9B3; LUCA.
DR   InParanoid; C4M9X4; -.
DR   OMA; CIECISA; -.
DR   Proteomes; UP000001926; Partially assembled WGS sequence.
DR   Proteomes; UP000078387; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   Gene3D; 1.10.30.10; -; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR006594; LisH.
DR   Pfam; PF00505; HMG_box; 1.
DR   Pfam; PF08513; LisH; 1.
DR   SMART; SM00398; HMG; 1.
DR   SMART; SM00667; LisH; 1.
DR   SUPFAM; SSF47095; SSF47095; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
DR   PROSITE; PS50896; LISH; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Complete proteome {ECO:0000313|Proteomes:UP000001926,
KW   ECO:0000313|Proteomes:UP000078387};
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001926}.
FT   DOMAIN        2     34       LisH. {ECO:0000259|PROSITE:PS50896}.
FT   DOMAIN      107    175       HMG box DNA-binding.
FT                                {ECO:0000259|PROSITE:PS50118}.
FT   DNA_BIND    107    175       HMG box. {ECO:0000256|PROSITE-ProRule:
FT                                PRU00267}.
FT   COILED      292    316       {ECO:0000256|SAM:Coils}.
SQ   SEQUENCE   384 AA;  45464 MW;  207789F65D72B019 CRC64;
     MSQQSTNAHI YIYMLEQGYR DAAKVFKKEA KVDYDAEASC IECISACSSQ SLEKKEGIVR
     KSSLLQDTKK PVIKTEKEKK KDKKVSKKED EKKASEKSTP KKDENKPKKP KNAYLLFSSE
     KYPQYKKQFP DLKISEIGKK IGVEWKELPE EQKKKYIDQY YASKAEYNDK LKEYDAQTLS
     TEDKKEKKSK KVTKKEDEKA TEEKKPKKVE IKKEDEKPKK VESKKEEKTK KVEIKKEDDE
     KTKKVEIKKE DEKKEKKHSK KEDKKKEEMK KNEGKKESDK KEDTKKDKKV KKSEKKDEIK
     KEDEKKHEKK EEKTEEKKPK KPESEKEESK KEKKHSKKED KKKDEEKSKK VEDKKSKKQK
     KDESSSSDED MNTVLLKIQK KIKS
//