ID F6SSS5_HORSE Unreviewed; 1315 AA.
AC F6SSS5;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 4.
DT 27-MAR-2024, entry version 80.
DE SubName: Full=AT-rich interaction domain 4B {ECO:0000313|Ensembl:ENSECAP00000015807.4};
GN Name=ARID4B {ECO:0000313|Ensembl:ENSECAP00000015807.4,
GN ECO:0000313|VGNC:VGNC:15503};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000015807.4, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000015807.4, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015807.4,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000015807.4}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015807.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000015807; -.
DR PaxDb; 9796-ENSECAP00000015807; -.
DR Ensembl; ENSECAT00000019320.4; ENSECAP00000015807.4; ENSECAG00000017836.4.
DR VGNC; VGNC:15503; ARID4B.
DR GeneTree; ENSGT00940000158149; -.
DR HOGENOM; CLU_007419_0_0_1; -.
DR InParanoid; F6SSS5; -.
DR OMA; XPEEESS; -.
DR OrthoDB; 445024at2759; -.
DR TreeFam; TF106427; -.
DR Proteomes; UP000002281; Chromosome 1.
DR Bgee; ENSECAG00000017836; Expressed in brainstem and 23 other cell types or tissues.
DR ExpressionAtlas; F6SSS5; baseline.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0005739; C:mitochondrion; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000976; F:transcription cis-regulatory region binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd16883; ARID_ARID4B; 1.
DR CDD; cd20460; Tudor_ARID4B_rpt1; 1.
DR CDD; cd20462; Tudor_ARID4B_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR012603; ARID4A/B_PWWP.
DR InterPro; IPR028853; ARID4B_ARID/BRIGHT.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR002999; Tudor.
DR InterPro; IPR025995; Tudor-knot.
DR InterPro; IPR047476; Tudor_ARID4B_rpt1.
DR InterPro; IPR047474; Tudor_ARID4B_rpt2.
DR PANTHER; PTHR13964:SF24; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 4B; 1.
DR PANTHER; PTHR13964; RBP-RELATED; 1.
DR Pfam; PF01388; ARID; 1.
DR Pfam; PF08169; RBB1NT; 1.
DR Pfam; PF11717; Tudor-knot; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SMART; SM00333; TUDOR; 2.
DR SUPFAM; SSF46774; ARID-like; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS51011; ARID; 1.
PE 4: Predicted;
KW Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Ubl conjugation {ECO:0000256|ARBA:ARBA00022843}.
FT DOMAIN 312..404
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 124..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 274..312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 453..580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 638..683
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 711..889
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 912..1215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1255..1291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..166
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..312
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 453..474
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 492..536
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 537..569
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 648..663
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 728..752
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 779..853
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 865..889
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 912..930
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1013..1061
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1082..1110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1147..1194
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1197..1213
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1276..1291
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1315 AA; 147525 MW; 26FA53BE1271DBE1 CRC64;
MKALDEPPYL TVGTDVSAKY RGAFCEAKIK TAKRLVKVKV TFRHDSSTVE VQDDHIKGPL
KVGAIVEVKN LDGAYQEAVI NKLTDASWYT VVFDDGDEKT LRRSSLCLKG ERHFAESETL
DQLPLTNPEH FGTPVIGKKT NRGRRSNHIP EEESSSSSSD EDEDDRKQID ELLGKVVCVD
YVSLDKKKAL WFPALVVCPD CSDEIAVKKD NILVRSFKDG KFTSVPRKDV HEITSDTAPK
PDAVLKQVYG LFLAFDQALE FHKSRTIPAN WKTELKEDSS SSEAEEEEEE EEDEKEKEDN
SSEEEEEIEP FPEERENFLQ QLYKFMEDRG TPINKRPVLG YRNLNLFKLF RLVHKLGGFD
NIESGAVWKQ VYQDLGIPVL NSAAGYNVKC AYKKYLYGFE EYCRSANIEF QMALPEKVVN
KPCKECENVK EIKVKEENES EIKEVKIEEE ENIIPKEEKP TEDDIERKEN IKPSLGSKKN
LLESIPTQSD QEKEVNVKRT EENENLEDKD ETTGVDESLS IKVEAEEEKA KSGDETNKEE
DEDDEEAEEE EEEEEEDEDD DDNNEEEEFE CYPPGMKVQV RYGRGKNQKM YEASIKDSDV
EGGEVLYLVH YCGWNVRYDE WIKADKIVRP ADKNVPKIKH RKKIKNKLDK EKDKDEKYSP
KNCKLRRLSK PPFQTNPSPE MVSKLDLTDA KNSDTAHIKS IEITSILNGL QASESSEDSE
QEDETSAQDI DNGGKEESKV DHLTHTRNDL ISKEEQNSSS LLEENKVHAD LVISKPVSKS
PERIRKDIEG LSEDTDYEED EVTKKRKDIK KDTTDKSSKP QVKRGKRRYC NTEECLKTGS
PGKKEEKAKN KESLCIENSS NSSSDEDEEE KSKAKMTPTK KYNGLEEKRK SLRTTGFYSG
FSEVAEKRIK LLNNSDERLQ NSRAKDRKDV WSSIQGQWPK KTLKELFSDS DTEAAASPPH
PAPEEGPAEA SLQTVAEEES CSPSAELETP LLATADSKPT DEKPVEVSDK KAEFPSSGSN
SVLNTPPTTP ESPSSVTVTE ASRQQSSVTV SETLAPNQEE VRSIKSETDS TIEVDSVAGE
LQDLQSEGNS SPAGFDASVS SSSSNQPEPE HPEKACTGQK RVKEAQGGGS SSKKQKRSHK
ATVVNNKKKG KGTNSSDSEE LSAGESVTKT QPVKSVSAGM KSHSTKSPAR TQSPGKCGKN
GDKDPDLKEP SNRLPKVYKW SFQMSDLENM TSAERITILQ EKLQEIRKHY LSLKSEVASI
DRRRKRLKKK ERESAATSSS SSSPSSSSIT AAVMLTLAEP SMSSASQNGM SVECR
//