ID G3WFB1_SARHA Unreviewed; 1389 AA.
AC G3WFB1;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=ASXL transcriptional regulator 2 {ECO:0000313|Ensembl:ENSSHAP00000014116.2};
GN Name=ASXL2 {ECO:0000313|Ensembl:ENSSHAP00000014116.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000014116.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000014116.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000014116.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the Asx family. {ECO:0000256|ARBA:ARBA00006391}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9305.ENSSHAP00000014116; -.
DR Ensembl; ENSSHAT00000014233.2; ENSSHAP00000014116.2; ENSSHAG00000012066.2.
DR eggNOG; ENOG502QWPH; Eukaryota.
DR GeneTree; ENSGT00520000055578; -.
DR TreeFam; TF328464; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR InterPro; IPR026905; ASX-like_PHD.
DR InterPro; IPR024811; ASX/ASX-like.
DR InterPro; IPR028020; ASX_DEUBAD_dom.
DR InterPro; IPR044867; DEUBAD_dom.
DR PANTHER; PTHR13578; ADDITIONAL SEX COMBS LIKE PROTEIN ASXL; 1.
DR PANTHER; PTHR13578:SF11; POLYCOMB GROUP PROTEIN ASXL2-RELATED; 1.
DR Pfam; PF13919; ASXH; 1.
DR Pfam; PF13922; PHD_3; 1.
DR PROSITE; PS51916; DEUBAD; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 224..333
FT /note="DEUBAD"
FT /evidence="ECO:0000259|PROSITE:PS51916"
FT REGION 1..142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 160..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 335..559
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 620..835
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 863..900
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 933..965
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1113..1157
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..64
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 78..142
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..189
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 356..370
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 409..432
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 503..521
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 693..749
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 762..800
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 818..835
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 945..960
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1126..1144
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1389 AA; 148577 MW; B5A6F5D71EC5C2E6 CRC64;
MLHTNSRGEE GIFYKVPGRM GVYTLKKDVP DGVKELSEGS EESSDGQSDS PSSENSSSSS
DGGSNKEGRK SRWKRKVSSR LSQPSSPQSS CPSPSISAGK VISPSQKHSK KALKQALKQQ
QQKKQQQQQQ QCRTSMPISS NQHLLLKTVK TSTDPVATKP AVWEGKQSDG QSSSPQNSNS
SSSSSVKVEN PLPGLGKKPF QRSDRLHTRQ LKRTKCAEID VETPDSILVN TNLRALINKH
TFSVLPGDCQ QRLLLLLPEV DRQVGADGLM KLNGSALNNE FFTSAAQGWK ERLSEGEFTP
EMQVRIRQEI EKEKKVEPWK EQFFESYYGQ SSGLSLEDSR KLTSATSDPK MRKPPIEQPK
PTPPSDDPPI KSVMAAPEVE SKPVPLFSPI RKESEESTEE VLPKPSKSPE PLVSSTSSVI
EQNKTGPVTV PENEDSVGQR KPIPSAEPKP ENENHLTTAT PIIKENKGTA ILTPSKPKSP
GIGKPIIRPI IEESPQENTV KDASPSEHSE LNPEGLKRKS SLTQEENPGS WEKRPRVLEN
HQHQQSLQAS PQPFPLRGER VPVRKVPPLK IPVSRISHVP FPTGQVSPRV RFPASITSPN
RKGARTLADI KAKAQLAKAQ RAAAAAAAAA AAAASVGGTI PGPGPGGGGE GAPRKGGDPG
SGGASETGKG NTLELAGTGS RGGARELLPD GSETESQAET KVSGSASPHS VSRAQLQQAP
PLPSRSAVSG ACPSSPSPTL TNPSSPTLEK LKGENPDCSL GTDRTVPCSP SPVPNNFQQE
KAPSPSAGSA LTSGTPPAQV TAECTVEPRA SSRKDTASVE AVVETSPSAP MEVASSSLTT
LPVAVTLEKP LPLQTSVTAV PTVSAPPSST LPISSLKAQG ASSNTSGPAA RPSSSIPANN
PLVTQLLQGK DVPLEQILPK PLTKVEMKTV PLVTKEEKGP GAPRDTSITG TSTRGEGGER
QSHLSTPHLE KLHQNKQLPQ VPRTLHLFSG KDLRDSNVEQ YPCQEGLNKA TQEQLLQTLI
KRVQRQNLLS VLQPAQFSLT HSGFQLEDIS TSQRFMLGFA GRRTSKPAMS GHYLLNISTY
GRGSESFKRA QSMNSEDRFC LNSPPEAFKM EHADYKGPTA DGSSSKDEDE TDEESTDEEE
EQILVKEEPL ATQVSTKCEG GLGAYSRESL STYDTPANKN LKVEAPGMGH AIASKENFHL
FPAGQVFDGN TLARDFIQAA QERMAHAMRG KMVNSSPELY GTLPLPTDSP THQPLLLPPL
QTPKLYGSPP RQLGPSYRGM INVSTSSDVD HSSAVSGIPD CNQVSSNMGD VMSFSVTVTT
IPASQAMNPS SHGQTLPVQA FPEENSVEDS PSKCYCRLKA MIMCKGCGAF CHDDCIGPSK
LCVSCLVVR
//