ID E2AP52_CAMFO Unreviewed; 1546 AA.
AC E2AP52;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Arginine-glutamic acid dipeptide repeats protein {ECO:0000313|EMBL:EFN64788.1};
GN ORFNames=EAG_03905 {ECO:0000313|EMBL:EFN64788.1};
OS Camponotus floridanus (Florida carpenter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Formicinae; Camponotus.
OX NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311};
RN [1] {ECO:0000313|EMBL:EFN64788.1, ECO:0000313|Proteomes:UP000000311}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C129 {ECO:0000313|Proteomes:UP000000311};
RX PubMed=20798317; DOI=10.1126/science.1192428;
RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G.,
RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D.,
RA Wang J., Liebig J.;
RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos
RT saltator.";
RL Science 329:1068-1071(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL441452; EFN64788.1; -; Genomic_DNA.
DR RefSeq; XP_011261494.1; XM_011263192.2.
DR STRING; 104421.E2AP52; -.
DR EnsemblMetazoa; XM_011263192.3; XP_011261494.1; LOC105254490.
DR GeneID; 105254490; -.
DR InParanoid; E2AP52; -.
DR OMA; VMYLRAA; -.
DR OrthoDB; 3349425at2759; -.
DR Proteomes; UP000000311; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR002951; Atrophin-like.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR PANTHER; PTHR13859:SF11; GRUNGE, ISOFORM J; 1.
DR Pfam; PF03154; Atrophin-1; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000000311};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 7..122
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 126..178
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 177..275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 610..645
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 719..738
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 744..827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 850..934
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 978..1012
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1041..1127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1156..1214
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1330..1356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1420..1451
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1267..1297
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 180..196
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 332..348
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 367..416
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 610..632
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 746..786
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 897..921
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 984..1011
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1041..1093
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1156..1185
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1430..1448
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1546 AA; 168830 MW; 8E92737DBEC899EA CRC64;
MSAGTQGEIR VGPSHQELVS CQARLPEYRP GIPPGELPPD PEFSKEREEL RWIPAMALDG
DLLMYLRAAR SMAAFAGMCD GGSPDDGCVA AARDDTTINA LDILHDSGYD PSRALQALVK
CPAPKGIDKK WSEEETKRFV KGLRQFGKNF SRIRKDLLPH KDTPELVEFY YLWKKTPGAN
NNRPHRRRRQ SSLRRIRNTR NSRAGTPKEE IPTPTKDTPP AVNLSTKEIA SEVETVPVGT
PASHNPGGEI SSVTEDDNSE EDSDSRDTNT NGATHSCQHC FSAGSKDYQV AGKDRLLLCA
ECRAHLKKTG ELPPLQPYLF RPVPAESPES PGRMRTRNKA KETPRPARPR RTGGGTDTPD
QEKQQQQQTP DKNKKKSSGK ADTPKKGHKR SGQTDDTCND EDKEAQKRKR GSGERPDSPS
ESLTTDSNSL MDEPERETEG DANENQPTPP TVAGVTGGEE PVSSPAVTTP EEPSEPTPAS
TPVPAVIQSL PISVPVIHNL EKKPSVLLDQ EPIDQNKITD QTEDVPLVMN QALKLEPLPI
ESTMSPTMSN EDVGKEPETQ LNLSTSAQSV GTGPNDTASP AVTVRNLSQT IPSIITSVGS
QATISSSQIP VISPGQQQTG TGPGGLPTSL SMQYVQPPPP PPPPPVQNVP QNLSQSMAPQ
ITSSSLPPNV PNMGPTNLRG HVTESLPTSV QTLPQSMSGP PPLNLNISQS VSAGIGGPIG
QMPQMPPMSS SPQPLGLTVM SSESRNAERM VDDRLSVNDR VRDNRAPDSR MMSERVSERT
NENNESERSE PSNLFQPIQS GGMLQMEKPA GIYNLPASTP PMEPQNLKIK QEIIPPEPDP
LQSLKEVKVP GFQSSNFPGP SLDNIKKDPD GASKPPTPSK HSMPPVSQAV PSIQPVAASP
TPTPNLPPPP SSIPQPVMHP AQQPSPHMAH PFHPHHPLMH HSLFTAMHTY HPHAYPGYAP
VGGYPSFPPY AYGPVPHAIP PPSPQRSQES GSAMMTAHHA SASSSVASRE EGENLIASHH
HSSSMHQQPT NLHHDKLLTI SSHSSHSHSS SHSSHSSQRK PSLVSATCLT SSASSVHHHH
RASQPQQPQP IVQEPKIEQD METEPEEPIS PRGPSPEPRI EDSECHRSQS AIFLRHWNRG
ENNSCTRTDL MFKPVPDSKL ARKREERSRK QAEREREERD RAAAAAQQAR KMTTPEKQPE
VCKPPSRGPL EPVVSPYDRY TARPGSYADT PALRQLSEYA RPHAAFSPAR HPAPPDPMLH
YIYGREAAAQ RLELEHLERE KREREIRELR ERELNDRLKE ELFKGTPRPM PAPPVDPHWL
EIHRRYAAAG LTPGPSGPPQ ALHQFGLYGA PPGPSQLERE RLERLGIPTA AGGGPAGGAA
GHPVAAHHHG QLEERLALAA DPMVRLQMAG ISPEYHAHTH AHTHAHTHLH LHPGQQQAQQ
QAQQQQEAAA AAAGFPLPAA AGANYPRPGL MPRDPALALH PAELLGRPYA DMAAHHEQLQ
RHLMIERERF PAHASLVAHH EEYLRQQRER ELKVRALEDA ARGSRQ
//