ID A0A2A4KAB3_HELVI Unreviewed; 416 AA.
AC A0A2A4KAB3;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE RecName: Full=HMG box domain-containing protein {ECO:0000259|PROSITE:PS50118};
GN ORFNames=B5V51_6805 {ECO:0000313|EMBL:PCG81019.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG81019.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG81019.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG81019.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG81019.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG81019.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000004; PCG81019.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4KAB3; -.
DR STRING; 7102.A0A2A4KAB3; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR10270:SF317; TRANSCRIPTION FACTOR SOX-15-RELATED; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220}.
FT DOMAIN 1..36
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 1..36
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 27..149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 267..301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 328..362
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 94..122
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 285..301
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 334..354
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 416 AA; 46629 MW; 3ACA0781AEDEA622 CRC64;
MARKKWRSLT PQDRRPFVEE AERLRVIHMT EHPNYKYRPR RRKQNKTRPN QPSQAASAPP
AAAALGSPYA AGTDSPDARF SHNPGAGFSP YRTSPLASDF QNNQQFNGHV QTPESSPARS
PEPQGRRSAP AEAPLPTPDA SPVENEKENF QYEERRRAIN ASSMSDSYSP YKTYRTPGSF
SPAPVAAMGM ANGMYVMCTQ RTLTEQPPLV TGTFFPPVAT SQDQQALGTS TPRVSNAPSG
PMPTEYTIQY HPFEQYEQMY KTEDSYVSHY PDQPKTEYDT GGSYFSEEPP SNQQQEGEQN
FMNQRSEIRT GSPESDVDAR EFDKYLDYGA EGPMEQHQQQ QQYRYEQQQN YRPPYCPDQA
PGPDYCPERM YASPYASVIA GAPPANYAPS AAGSDVRPED EFSVILAGVR QTCYSN
//