ID A0A1E5R5D8_9ASCO Unreviewed; 670 AA.
AC A0A1E5R5D8;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=AWRI3579_g3653 {ECO:0000313|EMBL:OEJ82109.1};
OS Hanseniaspora osmophila.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycodaceae; Hanseniaspora.
OX NCBI_TaxID=56408 {ECO:0000313|EMBL:OEJ82109.1, ECO:0000313|Proteomes:UP000095728};
RN [1] {ECO:0000313|Proteomes:UP000095728}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AWRI3579 {ECO:0000313|Proteomes:UP000095728};
RX PubMed=27856586; DOI=10.1128/genomea.01287-16;
RA Sternes P.R., Lee D., Kutyna D.R., Borneman A.R.;
RT "Genome sequences of three species of Hanseniaspora isolated from
RT spontaneous wine fermentations.";
RL Genome Announc. 4:E01287-E01287(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OEJ82109.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LPNM01000010; OEJ82109.1; -; Genomic_DNA.
DR STRING; 56408.A0A1E5R5D8; -.
DR InParanoid; A0A1E5R5D8; -.
DR OrthoDB; 3090452at2759; -.
DR Proteomes; UP000095728; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd01389; HMG-box_ROX1-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 2.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR10270:SF161; SOX DOMAIN-CONTAINING PROTEIN DICHAETE-RELATED; 1.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000095728}.
FT DOMAIN 492..624
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 492..624
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 216..315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 415..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 535..581
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 627..670
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 13..79
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 354..370
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..455
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 464..491
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 542..576
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 631..659
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 670 AA; 75148 MW; B5B50086BE38C149 CRC64;
MPFRLPPINS LIQSIDDPVN GSTTQPSNNI DTNGNRSSSN NNNNNNNNNN NNAESNAKDR
PSMAQTGSPT RMAYAHSNLQ HQYQQQTQVP QVYRPVSHIP PDINTLQKHS FNMQNNSGSL
ITPAAITNDN HIAYNYSGDS VGNNXMLLSS SAPASTNEML NNGKTSMFTL AAVASNMNNP
DDNNTTNQQK QYTFLNNNNX XTNNVHNTMI GQNNVYSRRN DSVPLSSSPT NSSVLSLTAF
RQPSNTEYYS LNSNPINRNP FTPSFLSTSN NGVNLNTMQN TSNNNNNNNN NNNNYGNSSN
NTSKNNSNSS LNTMLWNHPS TYSPIPVISS TTPIPVSYSH LDAPAQYHQN YHYQHPHHQP
HHHPHHHPHL PLPQFPHLQT SQLVSQNQNL SVNTGMLPKH GSSLQISPVV DTKNRSIAFQ
NTHPATSTTN LTSNSYTPIP SISQSATSNP TDPKPKEGCS CTKKPESQNR SSRSSSPKQT
NNSSTTAQGK RIPRPRNAFI LFRQHYHSQM FAPEQLLLEQ QQQQQQQQQQ QQQQQQQQQQ
QDQEQKEKKK ENEEVDDTND SDSVHFVEKK RSSSGGEDSF KLNSKVSQTI GLKWRNLDAQ
EKEYWLELAR KEKTEHQLKY PEYKYVPRRR KGTTNNASVS PIKNNTNTNS QSQKNSTELH
GKGCIHHVPN
//