ID A0A074XEQ4_AURPU Unreviewed; 707 AA.
AC A0A074XEQ4;
DT 01-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2014, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=M438DRAFT_346080 {ECO:0000313|EMBL:KEQ83995.1};
OS Aureobasidium pullulans EXF-150.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetidae; Dothideales; Saccotheciaceae; Aureobasidium.
OX NCBI_TaxID=1043002 {ECO:0000313|EMBL:KEQ83995.1, ECO:0000313|Proteomes:UP000030706};
RN [1] {ECO:0000313|EMBL:KEQ83995.1, ECO:0000313|Proteomes:UP000030706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=EXF-150 {ECO:0000313|EMBL:KEQ83995.1,
RC ECO:0000313|Proteomes:UP000030706};
RX PubMed=24984952;
RA Gostin Ar C., Ohm R.A., Kogej T., Sonjak S., Turk M., Zajc J., Zalar P.,
RA Grube M., Sun H., Han J., Sharma A., Chiniquy J., Ngan C.Y., Lipzen A.,
RA Barry K., Grigoriev I.V., Gunde-Cimerman N.;
RT "Genome sequencing of four Aureobasidium pullulans varieties:
RT biotechnological potential, stress tolerance, and description of new
RT species.";
RL BMC Genomics 15:549-549(2014).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL584983; KEQ83995.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A074XEQ4; -.
DR STRING; 1043002.A0A074XEQ4; -.
DR HOGENOM; CLU_010453_0_1_1; -.
DR OrthoDB; 3090452at2759; -.
DR Proteomes; UP000030706; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd01389; HMG-box_ROX1-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR10270:SF320; HMG BOX DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000030706}.
FT DOMAIN 102..170
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 102..170
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..87
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 158..450
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 557..585
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..50
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..79
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 177..197
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..235
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 250..292
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 416..442
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 707 AA; 77425 MW; 9B8D86AAC946461F CRC64;
MPSAEHDDLS TSSQGIASDG GVSSRRASTN VPIAPQLQPS PKSRLSDTPL TPEESPPFRD
HRKRSAEALD QDEVNDVSPS HTRDGSTESL ANFCLCQPEP KVPRPRNAFI LYRQNRQAAV
VAQNPGLANP EISKIIGEQW RNEPEGEKNR WKAYAEEEKL QHQQRYPSYR YQPKRSGRRG
SVSSESGFSQ SDQRKCQKCG GRTIIHPTPS TPSSAREKPH SLDTSPSSAY HQPSRLNAFQ
LPPPTPSSTT TPSTRYLPML NNLSLSSPHP RNMSNGRPPQ PFMGQRGQDE HSSILSPDSK
RRRFESPQGT YVMSTRTMPP RHNAGPGTPF PFGPQGQPAM QQGQQMGPPH QFQPPPGAQH
LRRESLPRPA DLLRGPVPPN MNMMGPPPRP GPGYSQHRAS QGQRPGGPEL SLTLPPLQAG
NMSGPPTSTR SGPSQPPSGN RGSDSKEPDR RSVGEIIMSM SFMAKVQILG RVAPPFPAND
NKSRGTIIAI EGDDPEAARN VTDWLEEFLG REGDMVVKVV DGPKLPEGND GQEVQFQELL
RTVGEWHEKS KKIEELVRGD TERKDSDTSL LSSSGPDSEA MEVDSSHASG GKLVILLRTY
TVTASNVFAS RVAIRDAYKA ADHWQWTATL WRGIVGPDMT VYVKEADKAE DGGKGGVEIK
EENRIMAVRR FKGSEENGGV EGIEAGTLRR LGFEVGEWVR SGGGKKE
//