ID A0A158NH45_ATTCE Unreviewed; 739 AA.
AC A0A158NH45;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=PWWP domain-containing protein {ECO:0000259|PROSITE:PS50812};
GN Name=105619950 {ECO:0000313|EnsemblMetazoa:XP_012056853.1};
OS Atta cephalotes (Leafcutter ant).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=12957 {ECO:0000313|EnsemblMetazoa:XP_012056853.1, ECO:0000313|Proteomes:UP000005205};
RN [1] {ECO:0000313|Proteomes:UP000005205}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21347285; DOI=10.1371/journal.pgen.1002007;
RA Suen G., Teiling C., Li L., Holt C., Abouheif E., Bornberg-Bauer E.,
RA Bouffard P., Caldera E.J., Cash E., Cavanaugh A., Denas O., Elhaik E.,
RA Fave M.J., Gadau J., Gibson J.D., Graur D., Grubbs K.J., Hagen D.E.,
RA Harkins T.T., Helmkampf M., Hu H., Johnson B.R., Kim J., Marsh S.E.,
RA Moeller J.A., Munoz-Torres M.C., Murphy M.C., Naughton M.C., Nigam S.,
RA Overson R., Rajakumar R., Reese J.T., Scott J.J., Smith C.R., Tao S.,
RA Tsutsui N.D., Viljakainen L., Wissler L., Yandell M.D., Zimmer F.,
RA Taylor J., Slater S.C., Clifton S.W., Warren W.C., Elsik C.G., Smith C.D.,
RA Weinstock G.M., Gerardo N.M., Currie C.R.;
RT "The genome sequence of the leaf-cutter ant Atta cephalotes reveals
RT insights into its obligate symbiotic lifestyle.";
RL PLoS Genet. 7:e1002007-e1002007(2011).
RN [2] {ECO:0000313|EnsemblMetazoa:XP_012056853.1}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (APR-2016) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADTU01015568; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADTU01015569; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_012056853.1; XM_012201463.1.
DR AlphaFoldDB; A0A158NH45; -.
DR EnsemblMetazoa; XM_012201463.1; XP_012056853.1; LOC105619950.
DR GeneID; 105619950; -.
DR KEGG; acep:105619950; -.
DR eggNOG; ENOG502QU6V; Eukaryota.
DR InParanoid; A0A158NH45; -.
DR OrthoDB; 2904336at2759; -.
DR Proteomes; UP000005205; Unassembled WGS sequence.
DR CDD; cd20140; PWWP_PWWP2; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR InterPro; IPR000313; PWWP_dom.
DR PANTHER; PTHR16112; METHYL-CPG BINDING PROTEIN, DROSOPHILA; 1.
DR PANTHER; PTHR16112:SF22; PWWP DOMAIN-CONTAINING PROTEIN 2B; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005205}.
FT DOMAIN 629..689
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 141..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 367..534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 571..601
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 177..209
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 394..409
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 477..494
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 502..531
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 739 AA; 81601 MW; D98576A57F63B7CC CRC64;
MAAAAAETPS AHLALATGDK ILVTVESALP DILVVSFVHG AKCFQGALLD ATKRGLPCGV
QPPESVPDPD GDKLATIAAR FSYFQEKRNV LSSSAATVVA KVDLRRSINP PARYKNARPT
VRLRPRQVLC SKCRSICNEN SENVDVSRKR KHEDAQQQSQ QLIPGSTRRS DRRCGQPQKT
QRTLLSECRT ESTIQTTTKT SSSTDSKLGA SLIPKLSRLQ PNEINNVIQS PPDQLSDKMK
VAASVQSSHW PVTNEEEMSL RNVLTSQERV ESTAECDNNP STAYCATPRR LSSSSVESAK
IPEEEDKKTF AAADSTMRLK TGRVPRKKRS VGSMEDLWDE SVFEDPTRIA RTTPVIKISF
GAQGEGTVLK IPAKPQDPDA YEQETDDADD TQQERDPLEL SRSFGEYYEG EEDGQEQVDP
QKQGGVVKDA SAKAAKRALK KAKKEARRKM LGGVSPARSP CNGSPRYNPT YDPLLYHRRK
HKVKHKKKHK EGRKHKNQQQ QEQQSHSEQS CLDSTDPQQQ SQQQPQNWNV LPGGGESYRA
IKEQCLKQKL SISLKRLNTN AYARCDYPVS NASSGCKSSG GSSDELSEHE PEVETAPDFP
PHQSHPLVMR LAATSVAHCL TTNGRRMDVG DVVWGKIHGF PWWPGKVLSI TVSCKEDGTS
SGPQAHVAWY GSSTSSLMSC DQLSPFLETF KTRYNKKKRG PYKEAIRQAQ NEARSQITPN
STSNVLNVCG SPREVNVLS
//