ID A0A195BW04_9HYME Unreviewed; 429 AA.
AC A0A195BW04;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Transcription factor SOX-8 {ECO:0000313|EMBL:KYM92148.1};
GN ORFNames=ALC53_01211 {ECO:0000313|EMBL:KYM92148.1};
OS Atta colombica.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=520822 {ECO:0000313|EMBL:KYM92148.1, ECO:0000313|Proteomes:UP000078540};
RN [1] {ECO:0000313|EMBL:KYM92148.1, ECO:0000313|Proteomes:UP000078540}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Treedump-2 {ECO:0000313|EMBL:KYM92148.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KYM92148.1};
RA Nygaard S., Hu H., Boomsma J., Zhang G.;
RT "Atta colombica WGS genome.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ976403; KYM92148.1; -; Genomic_DNA.
DR RefSeq; XP_018056191.1; XM_018200702.1.
DR AlphaFoldDB; A0A195BW04; -.
DR STRING; 520822.A0A195BW04; -.
DR GeneID; 108692453; -.
DR OrthoDB; 2902801at2759; -.
DR Proteomes; UP000078540; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22031; HMG-box_SoxE; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022151; Sox_N.
DR PANTHER; PTHR45803; SOX100B; 1.
DR PANTHER; PTHR45803:SF5; SOX100B; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12444; Sox_N; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000078540}.
FT DOMAIN 87..155
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 87..155
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..30
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 148..243
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..30
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..166
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..216
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 429 AA; 47182 MW; D97D0E61E1AF2C57 CRC64;
MKMNDVKPES SGASISPAGG NNNNNNINGN GKTSATAAAA AAAAAATVAA NGGEGISAAV
AKVLQGYDWT LVPVATKGSG DKRAAHVKRP MNAFMVWAQA ARRKLADQYP QLHNAELSKT
LGKLWRLLSD NDKKPFIEEA DRLRVIHKRE HPDYKYQPRR RKQNGPTSGR ESSPTRSQSN
VTFSVTRSLK QEDMSPRGVQ GPNSPQSGVS SSPPTTPSQG LSPPTPPTTP RGQHYINQSN
QLPQNNTVYY QDLVNGTPSS ESPHQQPAVD LRYIEVGDGL SGEENQLNGL GTLDGLDLNL
PVNFQECESN ELDLYLPPQT APIHQYPPVQ VTTASQWLLN RYEEEIERPI KRHCSEQAIA
EVSSWEDRTQ EMVRYHELQP PLPPVQYISA HNAHHSHAST QMGHPHVSTP YAQYAHRYVS
GIETWPNYM
//