ID A0A1Y2F9T1_9FUNG Unreviewed; 828 AA.
AC A0A1Y2F9T1;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=LY90DRAFT_698076 {ECO:0000313|EMBL:ORY79635.1};
OS Neocallimastix californiae.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Neocallimastix.
OX NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY79635.1, ECO:0000313|Proteomes:UP000193920};
RN [1] {ECO:0000313|EMBL:ORY79635.1, ECO:0000313|Proteomes:UP000193920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G1 {ECO:0000313|EMBL:ORY79635.1,
RC ECO:0000313|Proteomes:UP000193920};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY79635.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCOG01000014; ORY79635.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2F9T1; -.
DR STRING; 1754190.A0A1Y2F9T1; -.
DR OrthoDB; 1339449at2759; -.
DR Proteomes; UP000193920; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00084; HMG-box_SF; 2.
DR Gene3D; 1.10.30.10; High mobility group box domain; 2.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 2.
DR SUPFAM; SSF47095; HMG-box; 2.
DR PROSITE; PS50118; HMG_BOX_2; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000193920}.
FT DOMAIN 123..190
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 303..371
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 123..190
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT DNA_BIND 303..371
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 59..89
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 386..408
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 433..457
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 616..658
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 670..698
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..82
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..408
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..447
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 616..634
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 644..658
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 828 AA; 97183 MW; B81927FC541A3E24 CRC64;
MKSNEQNNTI THLKKEELKR LIDQEWITLE EEIKNKYIKD SVQKQTKIVK KILKDKKGRQ
TNLPKNSFKN VNNNQNGNVD KRSENDNNSE MIKQELKDYT EINLLTEMYN SAKQLPIPHF
SVIKHPPNPF SIFTKDNINM IIKKNPTMTR AQALRIVAKK WKNVDSEEKT RYIEKSKEYK
EDYENLVKYA YLQRALINNP NIARLCLENQ LKEKVNDKFD KKEENINFKE LNTDDEIVKA
FKYIVRKFPQ GYPLASPVLR FPERSITQKF KKSKNKSIED LNEHETLSIP KKTKKKLNIN
VNIKPKSQAF DFFVKDRYKL LLKATPTLSL SDLFDELEKE WKNLSYGDKL PYILLEIKDI
KRYYIDQCEI SKEKRGRKHN IITEMLKENE ETINDEEEEE EAEEEEEEVN TMVIDNILLG
NNNQNKKIKI IKKVKRRKGK HQQHQQHQQH QQHQQHFNSN EYSTLLEDSE NEQDSNYNLL
CIKKNRILGR NENYEIMEKH NLKQNDKINW SPSNDMNKNY ISSDTSEEYL TFNNFDGKYI
FENSNGNEIE NLAKENINDH FDIQTQINNS NININDRNNS NISRDCTFIP VTDSININDS
TNYSLSQDTV VYNIPEDKLN NNSNTTFSGN QYNDSKDDES NEQRSNTQMS ILSQDSKDEI
SLNKVLQKSF GHSLSESPNN KENNKIIENK NSEKNNSNLF RENKQTIFET PSSNIENSVK
SITSSLGINP SLLKENNLET TKVSQKSESL LKPLRSIRKS GIWKFGNVIG LDQQISKEKL
YYKSNKNKND YYDGDDDDDY FYDSDDDDTY ISLDEISSFS SSNDDNDE
//