ID A0A212FCH5_DANPL Unreviewed; 1029 AA.
AC A0A212FCH5;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Cap-n-collar {ECO:0000313|EMBL:OWR51441.1};
GN ORFNames=KGM_214612 {ECO:0000313|EMBL:OWR51441.1};
OS Danaus plexippus plexippus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR51441.1, ECO:0000313|Proteomes:UP000007151};
RN [1] {ECO:0000313|EMBL:OWR51441.1, ECO:0000313|Proteomes:UP000007151}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR51441.1};
RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT "The monarch butterfly genome yields insights into long-distance
RT migration.";
RL Cell 147:1171-1185(2011).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OWR51441.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGBW02009189; OWR51441.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A212FCH5; -.
DR STRING; 278856.A0A212FCH5; -.
DR KEGG; dpl:KGM_214612; -.
DR eggNOG; ENOG502S6YH; Eukaryota.
DR InParanoid; A0A212FCH5; -.
DR Proteomes; UP000007151; Unassembled WGS sequence.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd14698; bZIP_CNC; 1.
DR Gene3D; 1.10.880.10; Transcription factor, Skn-1-like, DNA-binding domain; 1.
DR InterPro; IPR004827; bZIP.
DR InterPro; IPR004826; bZIP_Maf.
DR InterPro; IPR047167; NFE2-like.
DR InterPro; IPR008917; TF_DNA-bd_sf.
DR PANTHER; PTHR24411; NUCLEAR FACTOR ERYTHROID 2-RELATED FACTOR; 1.
DR PANTHER; PTHR24411:SF55; SEGMENTATION PROTEIN CAP'N'COLLAR; 1.
DR Pfam; PF03131; bZIP_Maf; 1.
DR SMART; SM00338; BRLZ; 1.
DR SUPFAM; SSF47454; A DNA-binding domain in eukaryotic transcription factors; 1.
DR PROSITE; PS50217; BZIP; 1.
DR PROSITE; PS00036; BZIP_BASIC; 1.
PE 4: Predicted;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000007151};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 883..946
FT /note="BZIP"
FT /evidence="ECO:0000259|PROSITE:PS50217"
FT REGION 309..343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 665..689
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 705..733
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 764..846
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 382..409
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 322..343
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 765..790
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 815..829
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1029 AA; 116865 MW; CF3DF0AD08250064 CRC64;
MIALKKLYGD ELLRLALVLS LLKANPEEYH ELETQQLAGL HITNGTDWSL EHEARTLIRP
RHVHPKSLDH ILMNYERQLF EELNSLGRYN YLETNDRPWY NQPVYTYLLN DVATDTIAPA
LGQEESQQVE DNVSADENVA QEVKVEEEKE ETAVVTLSAD FLQNHTESDI FAEIASRSFD
VNEFLNPEMQ IKKEEDLMIE VKKEKEIEDV FSDNAIDDFV PYFTARSEKI ELGEANSVFN
QNIADLQDYD EFDVKELNNL EHLDVKQQQE NLEQYLLENT PLDSEVYELA PMFEEEMIVK
RERASTSFTS ASSSGVSEMD SSSKDLDVKL EPDEGHHSGE ELTQEDMNLI EVLWKQDVDM
GFSLEDPLQM SNYIKDGPQA TRVNVSEELK NKQEQIEKVK ATLLDEKEDD PWAGLSYTVD
TETGEYVIQG DLPGELVNEE RFNLLEETLR LVELGDEGDA KDEQPQAVEG SSSGSMLHPA
MRHVPHHPLA HYHNQQQVSD TRERCRRVIN EVSRAADLTR TATDRQTRWG WAPNALLTND
MSTVAASSAH GGAGYAPNYH APIPPIPEKH HEAYGAPAPL DGAYKVEAAH HPQQHDGLYY
QTPTEPQQDG FLQSILNDED LQLMDMAMNE GMYTMRMLDG APTVHQTHAH MPVAAERDSA
SDSAVSSMGS ERVPSLSDGE WCDGSDSAQE FHSSKFRPYE AAYGRERSHA PQKKHHMFGK
RSFQEQPSQE TRPVVKYECE QTYHEMHMHA DYTPRQHIPP QLGVQPTLDI NSPHSSHALQ
HTTLPSPNPP RFGFSSGDRV RHNHTYSAAL PPTEERLPTR DKRVRRLTDG STSDSGSGHL
SRDEKRAKAL GIPLEVQDII NLPMDEFNER LSKHDLSEAQ LSLIRDIRRR GKNKVAAQNC
RKRKLDQITS LADEVRTVRD RKARTQRDRH NLLADRQKLK ERFAALYRHV FQHLRDPEGR
PLSSSQYSLQ QAADGSVVLV PRMGGATHSF TFTARPLHEP DGGGPRAEEQ LRALVRHRPR
TARGWVAAF
//