ID E0VH23_PEDHC Unreviewed; 455 AA.
AC E0VH23;
DT 02-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2010, sequence version 1.
DT 24-JAN-2024, entry version 55.
DE SubName: Full=Transcription factor Sox-2, putative {ECO:0000313|EMBL:EEB12679.1, ECO:0000313|EnsemblMetazoa:PHUM199330-PA};
GN Name=8240339 {ECO:0000313|EnsemblMetazoa:PHUM199330-PA};
GN ORFNames=Phum_PHUM199330 {ECO:0000313|EMBL:EEB12679.1};
OS Pediculus humanus subsp. corporis (Body louse).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Psocodea; Phthiraptera; Anoplura; Pediculidae;
OC Pediculus.
OX NCBI_TaxID=121224;
RN [1] {ECO:0000313|EMBL:EEB12679.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB12679.1};
RA Kirkness E., Hannick L., Hass B., Bruggner R., Lawson D., Bidwell S.,
RA Joardar V., Caler E., Walenz B., Inman J., Schobel S., Galinsky K.,
RA Amedeo P., Strausberg R.;
RT "Annotation of Pediculus humanus corporis strain USDA.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEB12679.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB12679.1};
RG The Human Body Louse Genome Consortium;
RA Kirkness E., Walenz B., Hass B., Bruggner R., Strausberg R.;
RT "The genome of the human body louse.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblMetazoa:PHUM199330-PA}
RP IDENTIFICATION.
RC STRAIN=USDA {ECO:0000313|EnsemblMetazoa:PHUM199330-PA};
RG EnsemblMetazoa;
RL Submitted (FEB-2021) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAZO01002308; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; DS235157; EEB12679.1; -; Genomic_DNA.
DR RefSeq; XP_002425417.1; XM_002425372.1.
DR AlphaFoldDB; E0VH23; -.
DR STRING; 121224.E0VH23; -.
DR EnsemblMetazoa; PHUM199330-RA; PHUM199330-PA; PHUM199330.
DR GeneID; 8240339; -.
DR KEGG; phu:Phum_PHUM199330; -.
DR CTD; 8240339; -.
DR VEuPathDB; VectorBase:PHUM199330; -.
DR eggNOG; KOG0527; Eukaryota.
DR HOGENOM; CLU_584249_0_0_1; -.
DR InParanoid; E0VH23; -.
DR OMA; MVEYPDY; -.
DR OrthoDB; 2902801at2759; -.
DR Proteomes; UP000009046; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22029; HMG-box_SoxC; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR10270:SF161; SOX DOMAIN-CONTAINING PROTEIN DICHAETE-RELATED; 1.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000009046}.
FT DOMAIN 14..82
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 14..82
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 83..203
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 237..280
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 93..129
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 136..203
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 455 AA; 49975 MW; E78DB47649C008FE CRC64;
MAASGTKKHN PNHIKRPMNA FMVWSQIERR KICEVQPDMH NAEISKRLGK RWKNLTDDER
QPFIDEAEKL RILHMQEYPD YKYRPRKKVV KPASKTTPST TSKKQKKFSQ SQSNLNHDSN
NNNSLAHKGN SLKDKALSQR QRTLAIHRQS SSSTSKTSSS SSGGCSSRNS AERSSHCTSP
TTPATNSPST PARHVPSILS RPQLTLPVTS TCLQQLPGTE PPTLASTQSR LKSRLEMDKA
KENTVSSTRY VSPPPQQRYS LPMPSPASAK VPSSPSCEIP GSPESATFYD DTSILLDAKL
TSLNSPSSVS STVSFTLVAT SPLKPGTIRI LPFKPEPMDV TADDVGRIDI KEELVIKEEP
IELVNDSISV NKVGDNYDLA DLDCLTDLIQ IPADLKVELD TLTSDLDPTF DSASSSSGSH
FEFSCDSDVS DVLKGLGFNT TEWVDYTSFP SSVNC
//