ID A0A212FGV4_DANPL Unreviewed; 957 AA.
AC A0A212FGV4;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=Mab-21-like HhH/H2TH-like domain-containing protein {ECO:0000259|Pfam:PF20266};
GN ORFNames=KGM_205074 {ECO:0000313|EMBL:OWR52954.1};
OS Danaus plexippus plexippus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR52954.1, ECO:0000313|Proteomes:UP000007151};
RN [1] {ECO:0000313|EMBL:OWR52954.1, ECO:0000313|Proteomes:UP000007151}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR52954.1};
RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT "The monarch butterfly genome yields insights into long-distance
RT migration.";
RL Cell 147:1171-1185(2011).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OWR52954.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGBW02008596; OWR52954.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A212FGV4; -.
DR KEGG; dpl:KGM_205074; -.
DR eggNOG; KOG3963; Eukaryota.
DR InParanoid; A0A212FGV4; -.
DR Proteomes; UP000007151; Unassembled WGS sequence.
DR Gene3D; 1.10.1410.40; -; 1.
DR InterPro; IPR024810; Mab-21-like.
DR InterPro; IPR046906; Mab-21_HhH/H2TH-like.
DR PANTHER; PTHR10656; CELL FATE DETERMINING PROTEIN MAB21-RELATED; 1.
DR PANTHER; PTHR10656:SF69; MAB-21 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF20266; Mab-21_C; 1.
DR SMART; SM01265; Mab-21; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007151}.
FT DOMAIN 606..678
FT /note="Mab-21-like HhH/H2TH-like"
FT /evidence="ECO:0000259|Pfam:PF20266"
FT REGION 1..51
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 212..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 331..377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 762..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7..36
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..250
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 957 AA; 111150 MW; 7F07FA621550BC15 CRC64;
MGNSKSKKVQ REPDKFFERE RWKEQRREEK KKKNLNPQNK RTAVVNPPIN PTMALLPPSQ
GAMPRPPPSQ NLSYDDEALD RMRYQLNNDC DAFLLNNILL SVQFFENYER EMSHIKSNPI
NESQHTMDEA MVQCHKHVFL ADKIQECVQE HVLFQVKKDQ IEPLVAPRLF IIYDNVEASE
PGYVKLKKLE ILESSLSKET AYKIQQRVSS DDSIYTDSAK ESNASDDIPY SDKREKNLQK
TKNKVSDTNI DSKQKNDVEF YDSRSRTSVN DVEMINKLYD TSHLLPNGNS SVMEVTFSKQ
NESDEYDIKA RKSILKNTRR YSGPTKEIVN KSFSNNRRHT SLDNILSAED TETSGYRSNA
SSRQAESSET ESDYGYATIT ESTTPKKIGL NKRTNYPLAS GVLPDESWIS VNVKSIDWSD
DEEGDTASKE SLNRNLYETH NYLGSIAFMD DFVNNFILSL GSGLGFSQDT IKNSMTQGAS
IYCNAIQNGL TTGYEVFPAL IAAWPNSANR WIIRERKIIQ NPRTNFSYQW PTKYMVNKTV
GFGCLLVPIG FRPKRGLNPE QQVQWKVIFP AAERYLESCL AHSHTRCYLF TLTLYRAFLE
NSTSKIGINE SHIKNHLFWQ CEDNYAKWPE DRLGESLRLF LGSFYAHFGQ SRFPNYFIES
CNEFKYIPKP LLLKLQRKLA DILESPVMHV LNAIHKLKYT KRDFYPKFNC LRLYEILTCK
NPLRILNPHL PIVAPYNETT DSEDEQIKNI WDRAKAHDKH YQWKKERQRQ MREKRQSNNV
YKKQRGSGKA EPEINKNIIL PSKLPFERRR LVLEFFIPHF IAMARSSEKF EALRQAVIYL
EQAQRLCYLL MEEASGELSA KEFLDIIRDK LSDCQQKLVN QEGYKFSIVE KNNAERKMAS
DIIRKRRPRC EHILNLASPM EGSSNTPVTF AEVHEQHKSR RDIKTYIDYD GSEESKL
//