ID E0VVF8_PEDHC Unreviewed; 448 AA.
AC E0VVF8;
DT 02-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Transcription factor Sox-2, putative {ECO:0000313|EMBL:EEB17364.1, ECO:0000313|EnsemblMetazoa:PHUM462920-PA};
GN Name=8238469 {ECO:0000313|EnsemblMetazoa:PHUM462920-PA};
GN ORFNames=Phum_PHUM462920 {ECO:0000313|EMBL:EEB17364.1};
OS Pediculus humanus subsp. corporis (Body louse).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Psocodea; Phthiraptera; Anoplura; Pediculidae;
OC Pediculus.
OX NCBI_TaxID=121224;
RN [1] {ECO:0000313|EMBL:EEB17364.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB17364.1};
RA Kirkness E., Hannick L., Hass B., Bruggner R., Lawson D., Bidwell S.,
RA Joardar V., Caler E., Walenz B., Inman J., Schobel S., Galinsky K.,
RA Amedeo P., Strausberg R.;
RT "Annotation of Pediculus humanus corporis strain USDA.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEB17364.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB17364.1};
RG The Human Body Louse Genome Consortium;
RA Kirkness E., Walenz B., Hass B., Bruggner R., Strausberg R.;
RT "The genome of the human body louse.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblMetazoa:PHUM462920-PA}
RP IDENTIFICATION.
RC STRAIN=USDA {ECO:0000313|EnsemblMetazoa:PHUM462920-PA};
RG EnsemblMetazoa;
RL Submitted (FEB-2021) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAZO01005636; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; DS235811; EEB17364.1; -; Genomic_DNA.
DR RefSeq; XP_002430102.1; XM_002430057.1.
DR AlphaFoldDB; E0VVF8; -.
DR EnsemblMetazoa; PHUM462920-RA; PHUM462920-PA; PHUM462920.
DR GeneID; 8238469; -.
DR KEGG; phu:Phum_PHUM462920; -.
DR CTD; 8238469; -.
DR VEuPathDB; VectorBase:PHUM462920; -.
DR eggNOG; KOG0527; Eukaryota.
DR HOGENOM; CLU_021123_0_1_1; -.
DR InParanoid; E0VVF8; -.
DR OMA; GMMALPQ; -.
DR OrthoDB; 2902801at2759; -.
DR Proteomes; UP000009046; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd01388; HMG-box_SoxB; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022097; SOX_fam.
DR PANTHER; PTHR10270:SF161; SOX DOMAIN-CONTAINING PROTEIN DICHAETE-RELATED; 1.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12336; SOXp; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000009046}.
FT DOMAIN 179..247
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 179..247
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 111..179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 261..284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 349..413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..142
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 160..178
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 350..391
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 448 AA; 49129 MW; 63C7350EF26517FF CRC64;
MSGGIQRYCL GENSSSIERG PTAKRLIVID FGCVKTIAGF SQRENATKAF LGSASRLPKS
NKLYAHREDE MLTMEADLKG GSLHGNMAPH PGLHQVSSYG TLGGLVHGSS LSPSSMALGQ
PHLQHMSQHQ PLQHSQHTPH HQSHQPPQHH QPPPSSLHQH APQPNNNNSS NKNQNIDRVK
RPMNAFMVWS RGQRRKMAQE NPKMHNSEIS KRLGAEWKLL SESEKRPFID EAKRLRAVHM
KEHPDYKYRP RRKTKTLLKK DKYPLGGTTP LIPGSGGESA RSPVSQHQTL GRDVYQMPNG
YMPNGYMVHD AAYQQHYGGY RYDVGQMQHA GYVNGSSYGM YGGTVPGGAP SPYLQQSSHS
PSGSSIKSEP VSPSSGGLHT PTPTTGAPVS IKREYSGTPG AVPAGSNGST GDLRQMISMY
LPGEPGAEQR LHQMQYHPSD QLQPLAHI
//