ID A0A158NGB5_ATTCE Unreviewed; 1787 AA.
AC A0A158NGB5;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=Bromo domain-containing protein {ECO:0000259|PROSITE:PS50014};
GN Name=105619664 {ECO:0000313|EnsemblMetazoa:XP_012056573.1};
OS Atta cephalotes (Leafcutter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=12957 {ECO:0000313|EnsemblMetazoa:XP_012056573.1, ECO:0000313|Proteomes:UP000005205};
RN [1] {ECO:0000313|Proteomes:UP000005205}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21347285; DOI=10.1371/journal.pgen.1002007;
RA Suen G., Teiling C., Li L., Holt C., Abouheif E., Bornberg-Bauer E.,
RA Bouffard P., Caldera E.J., Cash E., Cavanaugh A., Denas O., Elhaik E.,
RA Fave M.J., Gadau J., Gibson J.D., Graur D., Grubbs K.J., Hagen D.E.,
RA Harkins T.T., Helmkampf M., Hu H., Johnson B.R., Kim J., Marsh S.E.,
RA Moeller J.A., Munoz-Torres M.C., Murphy M.C., Naughton M.C., Nigam S.,
RA Overson R., Rajakumar R., Reese J.T., Scott J.J., Smith C.R., Tao S.,
RA Tsutsui N.D., Viljakainen L., Wissler L., Yandell M.D., Zimmer F.,
RA Taylor J., Slater S.C., Clifton S.W., Warren W.C., Elsik C.G., Smith C.D.,
RA Weinstock G.M., Gerardo N.M., Currie C.R.;
RT "The genome sequence of the leaf-cutter ant Atta cephalotes reveals
RT insights into its obligate symbiotic lifestyle.";
RL PLoS Genet. 7:e1002007-e1002007(2011).
RN [2] {ECO:0000313|EnsemblMetazoa:XP_012056573.1}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (APR-2016) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADTU01015125; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01015126; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_012056573.1; XM_012201183.1.
DR STRING; 12957.A0A158NGB5; -.
DR EnsemblMetazoa; XM_012201183.1; XP_012056573.1; LOC105619664.
DR GeneID; 105619664; -.
DR KEGG; acep:105619664; -.
DR eggNOG; KOG0644; Eukaryota.
DR InParanoid; A0A158NGB5; -.
DR OrthoDB; 24713at2759; -.
DR Proteomes; UP000005205; Unassembled WGS sequence.
DR CDD; cd05529; Bromo_WDR9_I_like; 1.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 2.30.30.1040; -; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 2.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR018359; Bromodomain_CS.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR16266:SF17; BRWD3; 1.
DR PANTHER; PTHR16266; WD REPEAT DOMAIN 9; 1.
DR Pfam; PF00439; Bromodomain; 2.
DR Pfam; PF00400; WD40; 6.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00297; BROMO; 2.
DR SMART; SM00320; WD40; 8.
DR SUPFAM; SSF47370; Bromodomain; 2.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS00633; BROMODOMAIN_1; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 2.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 6.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|PROSITE-ProRule:PRU00035};
KW Reference proteome {ECO:0000313|Proteomes:UP000005205};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 180..221
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 222..263
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 264..300
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 367..408
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 425..466
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 467..501
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 1138..1208
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 1287..1357
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT REGION 703..726
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 756..887
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1381..1787
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..720
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 761..778
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 807..825
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1389..1407
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1455..1469
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1499..1513
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1541..1555
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1567..1583
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1598..1614
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1615..1632
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1635..1667
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1668..1692
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1699..1716
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1723..1756
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1787 AA; 203389 MW; 0B8C0869CC115E25 CRC64;
MEGSASRIES GIAPELYFLI VKFLESGICP EAATILKREL ERTEVLPARI DWEGNVHSQS
FEELEKKYPH IGSTYLLQIC ARIGPVLEKE VPPCVRGVIS LLGAGRQSLL RTHEDSTRHT
YNIFDYSGRL GGKPFLELAG SLSVPNIVHV LQGRENSGPL SRTQAIPTKF YSKMKLYRHT
LGHLSAVYCV LFDRTGTYII TGADDLLVKV WSSIDGRLLA TFRGASAEIM DIAVNFDNTL
LAAGSLDRII RVWCFQSMSP VAVLTGHSGM ITSINFCPIV CNDVYYLVST STDGSVAFWS
HTKKSNERAV FHTKPIQYHE KMRPGQAQMI CASFSPGGAF MAAGSADHNV RVYAMLGDEG
PRRVLEIEAH SDTVDSIQWA HSGLKFISGS KDGTANVWHF EQQQWIHKQL LMTAKLPGEP
EMEDDTNKKA RVTMVCWDVS DEFVITAVND YTLKVWGAKS GELMKVLRGH KDEVFVLESH
PIDPRMILSA GHDGQLIIWD VLNTEPMVCF QNFVEGQGNC AVFDAKWSPD GTKLAATDSN
GHLLMYGFGC GVEKLKIVPK ELFFHTDYRP LIRDGNNYVL DEQTQTAPHL MPPPFLVDVD
GNPYPPALQR LVPGRENCRG EQLVPNIAVG AGGMQEVIEG LPEQEPRSNI DRLIEALAQR
QNINGEAAGA GDDRENNVAE QPIRQIASPR GSRAGLRREG NVEGVRQSSG NWQRDNTTPW
NKPMLARPMN PAVKETQFKM IQAMADMELD NWRREMRRRP QPTSSTATNQ GNIGNKLLNR
KRNRTRHGYR TRATRDEEDD DEYENLDNAG TSASSASSNS NDSTAHEEDM CSDSTTESST
EYSDWIADHG LNLEPPKRSK RKPVKKRSVT PPSETDRRRR RRPKKKGIQI ANGIREVPEI
YRPSEWLTEV IPRKAPYYPQ MGDEIVYFRQ GHKFYLDAIR NKKVYELSPR CEPWTKINIR
AQEFVKVVGI KYEIRPPRLC CLKLALMDED GRLSGQNFTI KYHDMADVLD FLVLRQTFDM
ALARSWSEGD KFRCMIDDGW WMGQIVGMEP LDEEFSESLF MCFRVRWDNG EYERMSPWDL
EPVDEDIRLG VPVEIGGAVP VLPEERQAIL YQPHAEEWPM GDREATCRRI IRGLDQVMSL
AIAEPFVVPV DLSLYPTYAY IVEYPIDLST IKARFENHFY RRVTSAQFDV RYLATNAEQF
NEPHSHIVKK ARIVTDLCLR IIKETTDVDV PAVYHQLNDT YHSSESEEID VERPSTSKVR
SKRKCSLLQE ISKDWKLASR YLLETLWQCE DSIPFREPVD TLEHPEYHQI IDTPMDLRTV
KEDLLGGNYE TPAEFAKDMR LIFSNSRNYN TNKRSKIYSM TIRLSAMFEE HMRRILTSWK
SAHKLSETKK TKPKGKKKQK KKKKKTNLSN GTGPSKSKST LSDEDDIDSE DDYDDEESGK
EQKKKDFDIS APGPSRMTNG HTKRSNSGPS RPRLKLRICR STVSKKLKDS DSDISDSEVT
TDTDSSNSAP VRRTRARTAI PSVSADTDSG DDYTPGGRSK QVRKNRGKNI KNGKQSKSKV
KLNVVANDDD DDDDEEYVAE SKMIEDNEDE LALEFSEESE EEKIIKRDTR SRKLVSENVQ
NVQKYTTNNG KDDNSDSESE EDDNDEEQEE PEEEEEEEEE EEEEEEEQPQ LQRKTNSQDS
EDSDYSDKKY KSSSRSSRRK PRSSMESEDT RQHYGSSRRS SARKRPYYNE DSDDSTTMPV
RNRRKIKRRS YAEDSDESVP EPGISISSRG RIRKMTPRAR AYLLESP
//