ID E0VMY8_PEDHC Unreviewed; 1028 AA.
AC E0VMY8;
DT 02-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 83.
DE RecName: Full=DNA-binding protein SATB {ECO:0000256|RuleBase:RU361129};
DE AltName: Full=Special AT-rich sequence-binding protein {ECO:0000256|RuleBase:RU361129};
GN Name=8236120 {ECO:0000313|EnsemblMetazoa:PHUM322030-PA};
GN ORFNames=Phum_PHUM322030 {ECO:0000313|EMBL:EEB14744.1};
OS Pediculus humanus subsp. corporis (Body louse).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Psocodea; Phthiraptera; Anoplura; Pediculidae;
OC Pediculus.
OX NCBI_TaxID=121224;
RN [1] {ECO:0000313|EMBL:EEB14744.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB14744.1};
RA Kirkness E., Hannick L., Hass B., Bruggner R., Lawson D., Bidwell S.,
RA Joardar V., Caler E., Walenz B., Inman J., Schobel S., Galinsky K.,
RA Amedeo P., Strausberg R.;
RT "Annotation of Pediculus humanus corporis strain USDA.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEB14744.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB14744.1};
RG The Human Body Louse Genome Consortium;
RA Kirkness E., Walenz B., Hass B., Bruggner R., Strausberg R.;
RT "The genome of the human body louse.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblMetazoa:PHUM322030-PA}
RP IDENTIFICATION.
RC STRAIN=USDA {ECO:0000313|EnsemblMetazoa:PHUM322030-PA};
RG EnsemblMetazoa;
RL Submitted (FEB-2021) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family.
CC {ECO:0000256|ARBA:ARBA00008190, ECO:0000256|RuleBase:RU361129}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAZO01003739; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; DS235331; EEB14744.1; -; Genomic_DNA.
DR RefSeq; XP_002427482.1; XM_002427437.1.
DR AlphaFoldDB; E0VMY8; -.
DR STRING; 121224.E0VMY8; -.
DR EnsemblMetazoa; PHUM322030-RA; PHUM322030-PA; PHUM322030.
DR GeneID; 8236120; -.
DR KEGG; phu:Phum_PHUM322030; -.
DR CTD; 8236120; -.
DR VEuPathDB; VectorBase:PHUM322030; -.
DR eggNOG; KOG2252; Eukaryota.
DR HOGENOM; CLU_001394_0_0_1; -.
DR InParanoid; E0VMY8; -.
DR OMA; LHNLPGM; -.
DR OrthoDB; 74668at2759; -.
DR Proteomes; UP000009046; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0048468; P:cell development; IEA:UniProt.
DR GO; GO:0048699; P:generation of neurons; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 2.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR PANTHER; PTHR14043; CCAAT DISPLACEMENT PROTEIN-RELATED; 1.
DR PANTHER; PTHR14043:SF2; HOMEOBOX PROTEIN CUT; 1.
DR Pfam; PF02376; CUT; 2.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 2.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 2.
DR PROSITE; PS51042; CUT; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000009046};
KW Transcription {ECO:0000256|RuleBase:RU361129};
KW Transcription regulation {ECO:0000256|RuleBase:RU361129}.
FT DOMAIN 381..468
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 610..697
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 737..797
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 739..798
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 23..59
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 78..120
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 205..272
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 302..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 482..538
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 699..741
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 891..955
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 977..1028
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 23..43
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..94
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..222
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 235..251
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..520
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 521..538
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 712..728
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 914..933
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 978..1028
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1028 AA; 114971 MW; AEE4AC99B7969C97 CRC64;
MYLILNTQTV SKQKISFDFF QKEKKKRKKI DNKKVENDER GAGKEGSLPP FGREGEGQNL
TDERIAHILS EASHMMVKNS MSSHHEDQNS HEEDSKSPMN CSSPMSTKEN SLSRRLKKYE
NDDIPQEKVV RIYQEELAKL MGRGPGGPHP FPSLLFPHFF GGPPGSMDRS QEDMRMALDA
YHRELAKLNT CPSPGLPSLL ALQQQAMHQT TPSQQQNGGA QDLSLPKERK SETPGPPSTP
GSTSDRPGTA QSHKMVNGVV EPSENAEKDR DRDLLKKEGN IMESAVAEAM RHAGSAFSLV
RPKTESEWQQ PSTGSTASSP LSNSILPPVL NPTEDLVTSS AASPLQRMAS ITNSLISQPT
IPSHHATNQR PLKAVLPPIT QQQFDMFNNL NTEEIVKRVK EQLSQYSISQ RLFGESVLGL
SQGSVSDLLA RPKSWHMLTQ KGREPFIRMK MFLEDENAVH KLVASQYKIS PEKLMRTGGY
GGATINPSPI PPKHPLPPTS KHGDQLSNMA ANILQAQQQH QLEQRDREQR ERERKNIERE
REIMRETILP PSLNIIPPSS NSMLGSPPSL GGLMVANDLK KGLAPASMPP QPHQLTNMRA
LHQHISPSVY EMAALTQDLD TQIITTKIKE ALLANNIGQK IFGEAVLGLS QGSVSELLSK
PKPWHMLSIK GREPFIRMQL WLSDAHNIDR LQALKNERRE LNKRRRSSGH GGDNSSDTSS
NDTSEFYHSA SPGPGPPSAK KQRVLFNDEQ KEALRLAFAL DSYPNVATIE FLANELGLSP
RTITNWFHNH RMRLKQQVPL PQETNPSVQR DPQQPFDPVQ FRVLLNQRLL EVQKERLGLG
SVPLPYPPYF ANNNLATLIS RGLISPECEN ILANVAKEQF GGLDLSMSGL KREPYDDEDM
GSELGSEDSN LSSASMRDVK KEEGDDVEEE RTTASTGQNR SSRRKPAAPQ WVNPEWQEEK
KNLENEVIIN GVCVMQTDEF SREKEEETVR VEPTAVMDRY EHSDESSDEQ NNEREAETNT
VENAEKDQ
//