ID G3X0R9_SARHA Unreviewed; 1308 AA.
AC G3X0R9;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=AT-rich interaction domain 4A {ECO:0000313|Ensembl:ENSSHAP00000021274.2};
GN Name=ARID4A {ECO:0000313|Ensembl:ENSSHAP00000021274.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021274.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021274.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021274.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9305.ENSSHAP00000021274; -.
DR Ensembl; ENSSHAT00000021446.2; ENSSHAP00000021274.2; ENSSHAG00000018032.2.
DR eggNOG; KOG2744; Eukaryota.
DR eggNOG; KOG3001; Eukaryota.
DR GeneTree; ENSGT00940000156159; -.
DR HOGENOM; CLU_007419_1_0_1; -.
DR InParanoid; G3X0R9; -.
DR TreeFam; TF106427; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0005886; C:plasma membrane; IEA:Ensembl.
DR GO; GO:0017053; C:transcription repressor complex; IEA:Ensembl.
DR GO; GO:0003690; F:double-stranded DNA binding; IEA:InterPro.
DR GO; GO:0006306; P:DNA methylation; IEA:Ensembl.
DR GO; GO:0048821; P:erythrocyte development; IEA:Ensembl.
DR GO; GO:0097368; P:establishment of Sertoli cell barrier; IEA:Ensembl.
DR GO; GO:0071514; P:genomic imprinting; IEA:Ensembl.
DR GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:Ensembl.
DR CDD; cd16882; ARID_ARID4A; 1.
DR CDD; cd18641; CBD_RBP1_like; 1.
DR CDD; cd20390; Tudor_ARID4_rpt2; 1.
DR CDD; cd20459; Tudor_ARID4A_rpt1; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR012603; ARID4A/B_PWWP.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR047473; CBD_RBP1-like.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR002999; Tudor.
DR InterPro; IPR025995; Tudor-knot.
DR InterPro; IPR047472; Tudor_ARID4A_rpt1.
DR PANTHER; PTHR13964:SF26; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 4A; 1.
DR PANTHER; PTHR13964; RBP-RELATED; 1.
DR Pfam; PF01388; ARID; 1.
DR Pfam; PF08169; RBB1NT; 1.
DR Pfam; PF11717; Tudor-knot; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SMART; SM00298; CHROMO; 1.
DR SMART; SM00333; TUDOR; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS51011; ARID; 1.
PE 4: Predicted;
KW Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Ubl conjugation {ECO:0000256|ARBA:ARBA00022843}.
FT DOMAIN 309..401
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 123..169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 271..311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 432..582
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 631..738
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 831..931
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 982..1191
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1236..1276
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 273..311
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..557
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 645..667
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 671..716
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 721..738
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 859..931
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 982..1001
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1008..1022
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1061..1085
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1120..1141
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1169..1188
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1252..1275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1308 AA; 148023 MW; 474A33C5063C1CA2 CRC64;
MKAADEPAYL TVGTDVSAKY RGAFCEAKIK TVKRLVKVKV ILKQDNSTQL VQDDQVKGPL
RVGAIVETKT SDGSFQEAII SKLTDASWYT VVFDDGDERT LRRTSLCLKG ERHFAESETL
DQLPLTNPEH FGTPVIGKKS NRGRRSSLPV TEDEKEEESS EEEDEDKRRL NDELLGKVVS
VSSSSEKTEW YPALVISPSC NDDIMVKKDQ CLVRSFTDSK FYSIARKDIK EVDILSLPES
ELSTKPGLQK ASTFLKTKVV PDNWKMDISE ILESSSSDDE DGANEESDEE KDKENAKEEE
ETPEEELDPE ERDNFLQQLY KFMEDRGTPI NKPPVLGYKD LNLFKLFRLV YHQGGCDNID
SGAVWKQIYM DLGIPILNSA ASYNVKTAYR KYLYGFEEYC RSANIQFRTI HHNEPKVIEE
IKKLEESMEE TVKVEQEMPL VEVKSEPEEN IDSNSESEKE EIELKSPRGR RRIARDLTPA
KKENEEEKTE DKLRDNNKEN KDIEEESDII EKKENEALLG KKNTPKPKEK KIRKQEESDK
DSDEEEEKRR EREEIENKEE SEGEEDEEDI EPCLTGTKVK VKYGRGKTQK IYEASIKSTE
IDDGEVLYLV HYYGWNVRYD EWVKADRIIW PLDKGGPKRK QKKKTKNKED SEKDDKKDEE
RQKSKRGRPP LKSTLPSNMS YSVSKSPNIE GKASARTARS SLPDSSPLAN GMEDSASSDS
EIEDTSDKNL VNEEFSPEIL EELEKSEKFI DDKIEEENPK FPCVLKENDR TQVQSLETLK
LEVEESEQIV QIFGNKTEQV EEIKKETEKS PKAKGRKNKI KDLSLETIKI SPLSQDEARS
ESYIEPLSLE VSSLESKDFS STTEDEMDSC SKEKKLKRKI PGQSSPEKKI RIETEMDVPA
LVSEERTSDS IISEEFKELN SEEQPEIEHE EMPSLIAESE PPVQDLATEN FDFPNAKESE
NVAVKEEEED IMPLIGPETL VCHEVDLDDL DEKDKNSADE AGTDIPDPVS PVPNPPALPP
AAPSGFSAAS PLALSQEESR SIKSESDITI EVDSVAEESQ EGLCESESAN GFEASTTSST
CSVTVQEREI RDKGQKRQSD GNGGSLAKKQ KRPPKRTSAT IKNEKNGAGQ SSDSEDLPVP
DSTSKCTPVK HIHVAKPPKL ARSPARMMSP HIKDGDKDKH RDKHQNSSPR AYKWSFQLNE
LDNMNSTERI SFLQEKLQEI RKYYMSLKSE VATIDRRRKR LKKKDREVSH AGASMSSASS
DTGMSPSSSS PPQNVLAVEC RKLLQIRETG KQGTLVQNRG LSWGDMTL
//