ID A0A2C5YGL5_9HYPO Unreviewed; 673 AA.
AC A0A2C5YGL5;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=CDD81_5978 {ECO:0000313|EMBL:PHH66846.1};
OS Ophiocordyceps australis.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Ophiocordycipitaceae; Ophiocordyceps.
OX NCBI_TaxID=1399860 {ECO:0000313|EMBL:PHH66846.1, ECO:0000313|Proteomes:UP000226192};
RN [1] {ECO:0000313|EMBL:PHH66846.1, ECO:0000313|Proteomes:UP000226192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Map64 {ECO:0000313|EMBL:PHH66846.1,
RC ECO:0000313|Proteomes:UP000226192};
RA De Bekker C., Evans H.C., Brachmann A., Hughes D.P.;
RT "Ant-infecting Ophiocordyceps genomes reveal a high diversity of potential
RT behavioral manipulation genes and a possible major role for enterotoxins.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PHH66846.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NJET01000005; PHH66846.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2C5YGL5; -.
DR STRING; 1399860.A0A2C5YGL5; -.
DR OrthoDB; 3090452at2759; -.
DR Proteomes; UP000226192; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd01389; HMG-box_ROX1-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR10270:SF161; SOX DOMAIN-CONTAINING PROTEIN DICHAETE-RELATED; 1.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000226192}.
FT DOMAIN 95..163
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 95..163
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 25..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 149..350
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 31..51
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 159..180
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 196..210
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..248
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..279
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..339
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 673 AA; 73687 MW; 0C40CD10453A72A2 CRC64;
MSPVLALAAR QMDAVRVPLH IGIDHHRHSP LTPLSTPSST PVGSHRSQPH RSPSLAAGPG
PGLDAMPAPS NPSAAAHKRQ GSQGLVCLCA PAPKIPRPRN AFILYRQHHQ AQVTADNPRL
SNPDISKIIG GQWKMESDEV KQTWKRLADE EKQRHQNQYP HYRYQPRRST KPQGSWPGAS
PSDDQARCPK CNGKSVAGPQ TPSTPVSEPS SVKLAAAAQG PSLPRLDSAL SRRTSFDLSP
SSSLPSVPRL LPSVRDMHLN EPSSPPDMKR RRADDSGNYH AVDWRSSSYP GKPHALASRE
MPSSSGYART PLPELRNLAR PQSGHMPPPL HPPPSGWLDK DPPDNRRADF DESLRLPPLQ
ASVPAFPSRG ILADARQNSV CLPPPPACGS LREGSGRERQ PQAKTLKDAI MSIPLQKKIA
VLAGICQPIP PLAPDSNSAG DTRGAFISIE GADPRQLREV GDALEAGLAA CGDYLVRVWR
DDKDERLTPD DDDMNPLSQP GRHDARRYGD MFEPYINAIS AWQKKSRQIG RHVTGKTQEQ
KLAKELRQGA SSKTPVALAR DGFSLTIADR YACAMPIVDM YAPADHWQWM ATLWRGTVSP
DLIIYVMSRD EADAADGTGV ELSARQGLIT VKLPPDTALD EATSRRLTFE VLEWMRRGSF
RHEVPQGWRA GAW
//