ID A0A195AWH9_9HYME Unreviewed; 2536 AA.
AC A0A195AWH9;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE SubName: Full=Protein winged eye {ECO:0000313|EMBL:KYM76603.1};
GN ORFNames=ALC53_13047 {ECO:0000313|EMBL:KYM76603.1};
OS Atta colombica.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=520822 {ECO:0000313|EMBL:KYM76603.1, ECO:0000313|Proteomes:UP000078540};
RN [1] {ECO:0000313|EMBL:KYM76603.1, ECO:0000313|Proteomes:UP000078540}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Treedump-2 {ECO:0000313|EMBL:KYM76603.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KYM76603.1};
RA Nygaard S., Hu H., Boomsma J., Zhang G.;
RT "Atta colombica WGS genome.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ976725; KYM76603.1; -; Genomic_DNA.
DR STRING; 520822.A0A195AWH9; -.
DR Proteomes; UP000078540; Unassembled WGS sequence.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR Gene3D; 2.30.30.490; -; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR048924; BAHCC1-like_Tudor.
DR PANTHER; PTHR12505; PHD FINGER TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR12505:SF24; PROTEIN WINGED EYE; 1.
DR Pfam; PF21744; BAHCC1-like_Tudor; 1.
DR SMART; SM00439; BAH; 1.
DR PROSITE; PS51038; BAH; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000078540}.
FT DOMAIN 2406..2531
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 966..990
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1130..1158
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1501..1521
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1564..1636
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1681..1725
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1853..1924
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2087..2378
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1130..1150
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1564..1582
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1596..1636
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1696..1716
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1904..1924
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2098..2130
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2154..2185
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2244..2265
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2279..2305
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2306..2324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2331..2351
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2536 AA; 279396 MW; 03B45ADB5A7FA99A CRC64;
MLVPPVAQSG TGDRPVGDTV ARAQQTTGPA AATTTLWPLA PAQPNQVPNQ QSQGIPPLAA
TPAPAHNQGA NRYGLYSLFP GGGGGSYVAA TPATPEQPMP ARAYHATHKE SVFSAAVGVG
VGGYTWGAPT PPPGSPYSPV PVTQLELLAK NLANLAPAAA TGQQALSLQG VGVNVAGSLH
HAQHTQHTQH TLNVTGIHHL HHGGLHQNTF SPQLSLVTGS APTLTINNFS SPNSTNGIAP
TTGVLLQEYQ PPGATAAATG CTVTVNGSSC LTNTSSTTSS MMMHGGVGSG SNQAISNQIA
VQAIATKKRE AFLSSQGQPV KLENKSTRTS CLCRNSNGKT KVVHSDVGCS RTLPVVSSWN
GQDSSASTNN NSASLTVKRE PLASVPCQVA EVSTSLHATT TSVKIEPVPP KSENGIAVSN
AGASGIPVGI AVARQRLQHQ ETSTSMRNVS LSHHTSHHAT SYHHFQPDSL GSSTAMAVGG
TTLVHCGGPG GEDRAAHLAA IPSGALAPSL SLNSSLNSSL NSTLGSISGG PAATADTAAA
AWPPTLWQYP ATAAAAMPTE PVGFPQMGSG LQGGLQLVRD PTTGHILLIH AAEQMQQALV
WPNYSNHNNN VAPPPLLLPP PPPSIQLLGD IGGARLVLTE NKRKQQSALP IVKIETECSN
SPTTIITSTE SSKTALQTMT SVTGALVPDT ALVTTLHYYP QAPALVQISQ AEPTTTHCRS
QATSPVSYLT PPPEISATVH AIEPSFGVQD ASNQTDAPED TEDKQETLVK QEQNLSPRQV
ASQSTGTNLT AANFEIVPSP TERICNVVRI TTTTTTVNPT VCGIVTKVDK AENTIINMAD
CAKYSAKDGR GLVKAIEQDV DEEVCKDDKI TPICSRGSRL GARIIEITEE NCDSFHENLE
FFARRRDVNI DQRDDYSKDE ENIVQDPNIH EKEVESKIIV SENIDIHHQM CSKTSNASVI
NETKSDIADS TESPRVNCES CDKNRNTSNA ETRPQITVKS FDMPNIDCDD QEVQQVAIKQ
ETVDSCYEVC YESSQCEPTS TSDRSRFNCE KSPQEITKRH SMEKPHHPGI ENVVEKLKKN
AAALQEAAIP NVQRINESME RSNVEIKQQR YSETIPKKLH LLRSCQNSLE SGTTDTEISS
SQTAISEHQS GRHSPRNDRH LIGECLFNEN NNDSRSNNNN IKAFEDKKLL TKNEKNVSHD
ITAYVKPKTV WRCEVQPFSR PNVTHKSDHV EDVSSEIKSR IQGQKPAIDV SGLELLSNSI
EQLEQRIGQP EHLQLDTDVE KSPVRSKLVS QQSENNNNNV GSPLGLLCAL AEQIMEVGDK
VPRKLNLESS EEISHAGRLL LNLGRSGSLE KDENKRKYIE IDDYRSKRFK LNDSEEETKN
ASLNYHEEDI KEQKMEINER KQVLKIKESI DNFSEEEVNK INDETTIGSE EQSFNRDDDD
CYSNTKVVKN LQVLAETSNN LQSTDALYDK FAKEDNLLDS ESETFESKMS EQLMSCEIRN
NKRKTSVEER DARDNHDYKN ARTKLEAKKF IAKKGNRDNE DDWPSMNATE LDMRVRMADI
QRQYREKQKE LSKLIPKKDD KKTPGRPRKK SHSSISSDHG TLSSPPAQDF MTSPSRKSPS
RSSRSPEPGA SSPLTSAVLT MPKCNVNLVK LGEPRSHIKL LDNIPSIPIS VPPVTSPIPV
LPGKATSEDD EKSTGTVGYD TSSPAPTVAS STSASKKRKV GRPRKLTCTS GSVRHLTETI
VAKKPKSKSS LVGYVLSSKN RHLQTKQCIN NKIGYTPLPF KSGVSSLKPQ AKIQKVTKVK
TKPAKQTPLH NKNVISSIIA EKAKLSQEVK LEKHSSKIKP KLKAEVKVKN WEDDENDTIQ
PENVLPKEPI VQNPVKETES AEQSEEPEKL RHEKSKKKKR KSSSSQSPNR DKKESKRRKS
LECKQCAKAA RTSESIVNRC KLTSAHLAID QLRVLMAIGG LFYAGRLSAV QAPDVYAITL
DGERGNRPHI HSREEILKDA IVEVCPSSTK ELPPGTRLCA YWSQQYRCLY PGTSVKPMVP
DPQHDEKFVT VEFDDGDSGL IVLDDIRLLQ PNYPVVVYDP NPLLSLGKRR RQTSTSEDKR
STSITSNKLT NANECAEAET ETNVETSSTS GTEYRKKKHM KKIKKNRKLL EAQEGKKKHR
KHRSCKEHRK HKHRKHRKHK HRHHNGIGDA NSVSNQENSW EQRNEEEAIT DLPNTSFSAI
EETRLSEEDD DTEEPLVTTQ KDNEVPTKLE EPSESVQKDD NVQMESKSSE PIQQTPLKPE
EEEEEEEEEE EKEEEEEEEE ALVQQTQVDS EETEPNQVCV EPQTKSEELS EDVQADDEVP
PEPEEPLEPE EPPEPVEKKL KIQQKTSTNR SSNPRMLPNK RHWKWASSSR VSGGNQYFTA
IRRGRETINI GDSVLFYSYR KPHEKPYIGK IVSLWLNQKL EMRVRSQWFY RPEELQPPCS
LNPPGGLFES KHTDSNDVQT ISHKVMVLPL ENYNKVLQAS QRHQKGYEDN DPYYYYAGFY
THPTVTYAPN VTVLDE
//