ID G3WBZ2_SARHA Unreviewed; 674 AA.
AC G3WBZ2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 78.
DE RecName: Full=DNA-binding protein SATB {ECO:0000256|RuleBase:RU361129};
DE AltName: Full=Special AT-rich sequence-binding protein {ECO:0000256|RuleBase:RU361129};
GN Name=SATB2 {ECO:0000313|Ensembl:ENSSHAP00000012947.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000012947.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000012947.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000012947.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family.
CC {ECO:0000256|ARBA:ARBA00008190, ECO:0000256|RuleBase:RU361129}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WBZ2; -.
DR STRING; 9305.ENSSHAP00000012947; -.
DR Ensembl; ENSSHAT00000013053.2; ENSSHAP00000012947.2; ENSSHAG00000011077.2.
DR eggNOG; KOG3755; Eukaryota.
DR GeneTree; ENSGT00390000008096; -.
DR HOGENOM; CLU_012559_1_0_1; -.
DR OrthoDB; 2969903at2759; -.
DR TreeFam; TF332714; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:UniProt.
DR GO; GO:0006338; P:chromatin remodeling; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 2.
DR Gene3D; 1.10.260.70; SATB, CULT domain; 1.
DR Gene3D; 3.10.20.710; SATB, ubiquitin-like oligomerisation domain; 1.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR032355; CUTL.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR InterPro; IPR039673; SATB1/SATB2.
DR InterPro; IPR038216; SATB_CUTL_sf.
DR InterPro; IPR038224; SATB_ULD_sf.
DR InterPro; IPR032392; ULD.
DR PANTHER; PTHR15116; DNA-BINDING PROTEIN SATB FAMILY MEMBER; 1.
DR PANTHER; PTHR15116:SF15; DNA-BINDING PROTEIN SATB2; 1.
DR Pfam; PF02376; CUT; 2.
DR Pfam; PF16557; CUTL; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF16534; ULD; 1.
DR SMART; SM01109; CUT; 2.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 2.
DR PROSITE; PS51042; CUT; 2.
DR PROSITE; PS51983; CUTL; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51982; ULD; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|RuleBase:RU361129};
KW Transcription regulation {ECO:0000256|RuleBase:RU361129};
KW Ubl conjugation {ECO:0000256|ARBA:ARBA00022843}.
FT DOMAIN 1..99
FT /note="ULD"
FT /evidence="ECO:0000259|PROSITE:PS51982"
FT DOMAIN 102..175
FT /note="CUTL"
FT /evidence="ECO:0000259|PROSITE:PS51983"
FT DOMAIN 291..378
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 414..501
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 553..614
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 555..615
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..51
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 376..413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 522..558
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 631..674
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..19
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..413
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 536..551
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 633..653
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 654..674
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 674 AA; 76266 MW; 855B5D09676138FE CRC64;
MERRSESPCL RDSPDRRSGS PDVKGPPPVK VARLEQNGSP MGARGRPNGS VTKSVGGIIK
LGRWNPLPLS YVTDAPDATV ADMLQDVYHV VTLKIQLQSC SKLEDLPAEQ WNHATVRNAL
KELLKEMNQS TLAKECPLSQ SMISSIVNST YYANVSATKC QEFGRWYKKY KKIKVERVER
ENLTDYCVLG QRPMHLPNMN QLATLGKTNE QSPHSQIHHS TPIRNQVPTL QPIMSPGLLS
PQLSPQLVRQ QIAMAHLINQ QIAVSRLLAH QHPQAINQQF LNHPPIPRAV KPEPTNSSVE
VSPDIYQQVR DELKRASVSQ AVFARVAFNR TQGLLSEILR KEEDPRTASQ SLLVNLRAMQ
NFLNLPEVER DRIYQDERER SMNPNVSMVS SASSSPSSSR TPQAKTSTPT TDLPIKVEGA
NVNITAAIYD EIQQEMKRAK VSQALFAKVA ANKSQGWLCE LLRWKENPSP ENRTLWENLC
TIRRFLNLPQ HERDVIYEEE SRHHHSERMQ HVVQLTPEPV QVLHRQQSQP AKETSPPREE
APPPPPPAED SCTKKPRSRT KISLEALGIL QSFIHDVGLY PDQEAIHTLS AQLDLPKHTI
IKFFQNQRYH VKHHGKLKEH LGTGVDVAEY KDEELLTESE ENESEEGSEE MYKVEAEEEN
ADKNKPAPPE IDQR
//