ID G3WPH5_SARHA Unreviewed; 733 AA.
AC G3WPH5;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=MYB proto-oncogene, transcription factor {ECO:0000313|Ensembl:ENSSHAP00000017330.2};
GN Name=MYB {ECO:0000313|Ensembl:ENSSHAP00000017330.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000017330.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000017330.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000017330.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WPH5; -.
DR STRING; 9305.ENSSHAP00000017330; -.
DR Ensembl; ENSSHAT00000017475.2; ENSSHAP00000017330.2; ENSSHAG00000014728.2.
DR eggNOG; KOG0048; Eukaryota.
DR GeneTree; ENSGT00940000156248; -.
DR TreeFam; TF326257; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00167; SANT; 3.
DR Gene3D; 1.10.10.60; Homeodomain-like; 3.
DR InterPro; IPR015395; C-myb_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR012642; Tscrpt_reg_Wos2-domain.
DR PANTHER; PTHR45614; MYB PROTEIN-RELATED; 1.
DR PANTHER; PTHR45614:SF5; TRANSCRIPTIONAL ACTIVATOR MYB; 1.
DR Pfam; PF09316; Cmyb_C; 1.
DR Pfam; PF07988; LMSTEN; 1.
DR Pfam; PF00249; Myb_DNA-binding; 3.
DR SMART; SM00717; SANT; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51294; HTH_MYB; 3.
DR PROSITE; PS50090; MYB_LIKE; 3.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 35..86
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 35..86
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 87..142
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 87..138
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 139..189
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 143..193
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 286..313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..303
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 733 AA; 83362 MW; 27D964A4247B4979 CRC64;
MARRPRHSIY SSDEDEEDID IYDHDYDGLL PKTGKRHLGK TRWTREEDEK LKKLVEQNGT
DDWKIIANFL PNRTDVQCQH RWQKVLNPEL IKGPWTKEED QRVIELVQKY GPKRWSVIAK
HLKGRIGKQC RERWHNHLNP EVKKTSWTEE EDRIIYQAHK RLGNRWAEIA KLLPGRTDNA
IKNHWNSTMR RKVEQEGYLQ ESPKANQPTV ATSFQKSNHL MGFAHTPPSA QLPSTAQTPV
SNDYSYYHIS ETQNRHYNDE DPEKEKRIKE LELLLKSTEN ELKGQQALPT QNHTSNYPGW
HSTTIVDHTR PHGDSAPVSC LGEHHSTPSL PVDHGCLPEE SASPARCMIV HQGNILDNVK
NLLEFAETLQ FIDSDSSSWC DLNSFEFFEE ADISPSQHHS NKAIQLQQRE GSVYRPEGYI
STNLSKCMLS QGLLDSSKSL PTTARHSTIP LVILRKKRGH AGSLSTGDYS SFIFTDVSNS
TPKRSPDKSL PFSPSQFLNT SNNHENVDMD MPTLTSTPLN GHKLTVTTPF HRDQTVKPQK
ENNIFRTPAI KRSILESSPR TPTPFKHALA AQEIKYGPLK MLPQTPSHLV EDLQDVIKQE
SDETGIVSGF HGNGPPLLKK IKQEVESPTT KAGNFFCSNH WEGENLNTQL FTQASPMEDM
PNLLTSSVLM MPVSEDDVLK TFTVPRNRSL ASPLQHLSNA WESVSCGKTE DQMMASDQGR
KYINAFSTRT LVM
//