ID A0A8S3WG14_PARAO Unreviewed; 2610 AA.
AC A0A8S3WG14;
DT 12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT 12-OCT-2022, sequence version 1.
DT 18-JUN-2025, entry version 10.
DE SubName: Full=(apollo) hypothetical protein {ECO:0000313|EMBL:CAG4958108.1};
GN ORFNames=PAPOLLO_LOCUS5900 {ECO:0000313|EMBL:CAG4958108.1};
OS Parnassius apollo (Apollo butterfly) (Papilio apollo).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Papilionidae; Parnassiinae; Parnassini; Parnassius; Parnassius.
OX NCBI_TaxID=110799 {ECO:0000313|EMBL:CAG4958108.1, ECO:0000313|Proteomes:UP000691718};
RN [1] {ECO:0000313|EMBL:CAG4958108.1}
RP NUCLEOTIDE SEQUENCE.
RA Tunstrom K.;
RL Submitted (APR-2021) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the WAL family. {ECO:0000256|ARBA:ARBA00007444}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG4958108.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAJQZP010000359; CAG4958108.1; -; Genomic_DNA.
DR OrthoDB; 784962at2759; -.
DR Proteomes; UP000691718; Unassembled WGS sequence.
DR GO; GO:0000785; C:chromatin; IEA:TreeGrafter.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-KW.
DR CDD; cd05503; Bromo_BAZ2A_B_like; 1.
DR CDD; cd15545; PHD_BAZ2A_like; 1.
DR FunFam; 3.30.40.10:FF:000199; Bromodomain adjacent to zinc finger domain 2B; 1.
DR InterPro; IPR037374; BAZ2A/B_Bromo.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR018359; Bromodomain_CS.
DR InterPro; IPR018501; DDT_dom.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR PANTHER; PTHR45915:SF2; TOUTATIS, ISOFORM E; 1.
DR PANTHER; PTHR45915; TRANSCRIPTION INTERMEDIARY FACTOR; 1.
DR Pfam; PF00439; Bromodomain; 1.
DR Pfam; PF02791; DDT; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF00628; PHD; 2.
DR Pfam; PF15612; WHIM1; 1.
DR SMART; SM00297; BROMO; 1.
DR SMART; SM00571; DDT; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00249; PHD; 2.
DR PROSITE; PS00633; BROMODOMAIN_1; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 1.
DR PROSITE; PS50827; DDT; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50016; ZF_PHD_2; 2.
PE 3: Inferred from homology;
KW Bromodomain {ECO:0000256|ARBA:ARBA00023117, ECO:0000256|PROSITE-
KW ProRule:PRU00035};
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000691718};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 646..722
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 1185..1248
FT /note="DDT"
FT /evidence="ECO:0000259|PROSITE:PS50827"
FT DOMAIN 2272..2322
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 2326..2375
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 2523..2593
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 136..255
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 291..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 341..417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 450..585
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 622..647
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1439..1503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1512..1531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1641..1674
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1949..1968
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2097..2119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2377..2465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1066..1154
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1557..1584
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1777..1829
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 158..169
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 206..229
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..239
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 456..470
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 506..523
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..536
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 553..564
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 627..639
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1444..1475
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1483..1498
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1653..1674
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1951..1962
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2377..2389
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2404..2423
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2427..2437
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2610 AA; 289390 MW; CBC601E66039FBE2 CRC64;
MLFGRFRRGM DKENGEGGDA AGKPPPDGLL DAAGLFGAYW GREGSGGGAA AAQAQAQAQA
QAALFGFGSR YPPPSTLGVA ANQAASLGIH PAASAAWWSM ASHLAAQDYL ARLQASGLNF
PPLGDPYSAL SALSGASAGM KQPPKHPKQP IRNDSRSGRS SSGTSSSASM KDKSPSTSGA
NSQPTMSEWG TSYGFPKTSS PSTMHSQASL SSLASLNSLV SAPHQSKPSS KPPPAPTPPR
KSSSSSSGKS KDRDLALLRG DLMLAQAAAH GAYHAAVAAA TKGKNMSPLG YPFGMPGSDK
DRHSSIDSLT GLPHTILSDP SSVLGGVRLP PDTEIIKYTS SLAGPKVPPG TTNRGRKKTI
SLDPPQVSVH PSAGDRSTPL PNKRPKVEDY GSSRSSVEVI RLPGKADRAG GPLAGTPPPN
LADYAGISRE LLQTIASQSG VSLAALERQL AGAQPASDSG LNLSTKSASS EDAPLDLGVK
TAADEDAPLN LSLKPTPPST QASDALSRLT SLSSSLNSSS NNDRISRRKP GAKPRRVAPE
LNSQVAESPR PKSSGSEDSE SVSGWPSREG RPRNLGRGVS KPKKNTVASL LAQSRALGLR
PALAHQLLAE TDLEKLRSLL GEGASTDSEC PSDSGPSDSD ASDPGRRHDA QLRLPLALGW
KRVTLIKGLS RNCNIKGDVS YTPPEPHTGL AIKSTQELTT FLESNPCPPL TTDNFSFSAR
SLLGEYVQPA PDLGEPLIFN EAEITKRLEE ARALAAVSAP RPTPPPVERR IELARRQQAA
RDARRDNAPR RDHARLVREL ERSERAELAK REKEVRSQQL LEHINKNRTQ LTIEPLPSVQ
KMNTDSNEWK IPEIVMGPAV KTNKDRTMPP ISLSLIPVSG PKDNQNVPIK TLNFDQSGWK
EEIEEYNALD KMKDFAEKAA FDLPKRPKDK YFDFTLMKAA EKKTKVPDKE NKLEKLIESY
SRISEFINGC DWSKLEGDSK TEDPKSLEQK YLDAKNEFMS QNLMLMPKDT QKSVLKEVID
LSCDDENILN DIITKKISQG TFNVGKDGAL SISVHPFGSK DMYLAKKRKQ EELEKQKIED
QAKRQQEREL KRQQAILLKE QERERRRQHT TFVRQLDARR RWAERERRKH QNLLDRLLAK
ERKLQQRRRE MELLAELRRP QEDSTLSDQK PLPTLNRIPG LKLPGQAMAD LLQVYEFINN
FGQTLGFDVE SLPSLQTLQC ALLAECSADA EDELLQVLTQ LLVCAIEDPG IPHPGRHTTL
LGQAIRMGDI TPANLSEVLR IYLYANATGE IKALTGLTAE RERERRVADH HQNDAEMQHT
CSNTKNAAYY EHLHNNSTYK LSETLRDKPF LALNPTTKAR MLAYLCDELL QSKAVLRQID
ASLDHLNQLR KERYLMDMKI RKVRVLHQRK LRAEQAEKQQ ALALERMQRL VEESTAINTV
LPHLPEEDHS MSDKSETPKK LDKVESPHLE HETNKQPELP SPYKDHAPIE ESDKELSPLK
DLSNSIKTKE ILNNNKDDCS PPDNSKIDKD SVIGDCDAIL SDLESEGTQP EEDEDKNLSS
EELARKLEKL LRQSEQQLQQ LSAGSHALRA TCYGQDRYWR RYWSLGKSGG IYVEAMESAQ
PEILAYQEAL EAAALKGQPL SDTAKRKKKG SKDKKEVPDE KQEGESEAQR QEEALQSELE
LRNIKSELNL SDHTQIIKYE PNVKHGTCKV EGSSIKMEEK YIQQKQNKEE EDLLDIEDSI
PTAFLVQKPT HKPMFATQAE PMDKPQEIVK VENVVKTEST EKEVEESKDE LVNNLEELRK
MAEAVSSQLD AAKKAEEKLK EEAEVKTKKE LLDAESAHAQ LYMKMLEGKW FSILRHESSY
LTNINDEKSH EQKIPDFCDN EHTCSEVVMC QGHKWDVSNN LHLLNDPSLF TLNSMVTSVQ
VPTNNIYTDS SLTMSGLDQE MMDASINRED NEQEEEMEED NKEQDNDLEK ELQADSAKLE
AAALKAKASG LTSLGLLNFN ALSTYVTCDS PPPIHMSPEE MEQLEHCKVH GLPKKLEGGF
VPRELRHGWW RLAASELGGV LRALHTRGVR ERELHAALTH HPPTFPEHAH IDKTDTTNME
VTPSTGEKPD QDFPPPDSPN SFCATTARRV DMQLLAMVEG LEERVAAASI QVKGWRPSRL
PLPEEATGAE IVARARHKLA SVEAHIERRY LKPPLVQSTT EATLGAVLQG EYGNASVTSP
QNSDANEGKD RGIARGLATW REAVARCNTS AQLAMLLQAL EAAIAWDKSI MKANCQFCLS
GDNEDQLLLC DSCDKGYHTY CFKPRMEKIP EGDWYCWECV NKARGERVCI VCGGVAAGRT
IPCALCVRAY HQDCHYPPLT KNPRGKWYCS QCISRAPPKK PRNTKKRESK HKDNSIDLDQ
SMVPSPAASQ ASTSTTAEEC AASVLHTPEK HDDRDEPLAE PENGLNHHPV GELTEDGQPP
EKRRALQFGG NGALQHDESD HMDVEDVNSE NMPLVPRGKK EKTSAKKQLK ELQFCKNLLC
EMECHEHAWP FLVPVNTKQF PQYRKVIKCP MDLSTIRRKL QDGFYKCKEE FASDVRLIFS
NCEVFNEDDS PVGRAGHSMR QFFDARWPHA
//