ID A0A194Q3N4_PAPXU Unreviewed; 517 AA.
AC A0A194Q3N4;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Homeotic protein proboscipedia {ECO:0000313|EMBL:KPI99968.1};
GN ORFNames=RR46_03338 {ECO:0000313|EMBL:KPI99968.1};
OS Papilio xuthus (Asian swallowtail butterfly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Papilionidae; Papilioninae; Papilio.
OX NCBI_TaxID=66420 {ECO:0000313|EMBL:KPI99968.1, ECO:0000313|Proteomes:UP000053268};
RN [1] {ECO:0000313|EMBL:KPI99968.1, ECO:0000313|Proteomes:UP000053268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Ya'a_city_454_Px {ECO:0000313|EMBL:KPI99968.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KPI99968.1};
RX PubMed=26354079; DOI=10.1038/ncomms9212;
RA Li X., Fan D., Zhang W., Liu G., Zhang L., Zhao L., Fang X., Chen L.,
RA Dong Y., Chen Y., Ding Y., Zhao R., Feng M., Zhu Y., Feng Y., Jiang X.,
RA Zhu D., Xiang H., Feng X., Li S., Wang J., Zhang G., Kronforst M.R.,
RA Wang W.;
RT "Outbred genome sequencing and CRISPR/Cas9 gene editing in butterflies.";
RL Nat. Commun. 6:8212-8212(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ459551; KPI99968.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A194Q3N4; -.
DR STRING; 66420.A0A194Q3N4; -.
DR Proteomes; UP000053268; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR PANTHER; PTHR45664:SF2; HOMEOTIC PROTEIN PROBOSCIPEDIA; 1.
DR PANTHER; PTHR45664; PROTEIN ZERKNUELLT 1-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000053268}.
FT DOMAIN 9..69
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 11..70
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 64..103
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 124..277
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 330..356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..103
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 127..163
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..246
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 253..274
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 330..353
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 517 AA; 56382 MW; 133F16F4FC2C0484 CRC64;
MPRRALENGL PRRLRTAYTN TQLLELEKEF HFNKYLCRPR RIEIAASLDL TERQVKVWFQ
NRRMKHKRQT LSKSEDGDDK DSTTSEGGKS SKARDKFLDD DGPLSGKKSC QGCELPPGAL
CSPTEDLPEL TSRTRNNNTP SATNNNSFAS DGASSVASSS SLDKLAEDDS RDAHPTATLA
PVPRNLAKRI KQESRKRSPS LDATGCKVSP SSSKDGLVVL GGMPDGGKFS SVSLTPSSTP
GTPSGIHQSP LGHYPRPSPP NVPPGAPHPQ AVPNAMPPYV IRGNVPPGQF GPHADFRMDS
KQFGKLAQYP QGNRSFDAYG PALQGVDQHT YTRSQQHTRP QEPSPSTRPS NGVGRQAYPH
EMYQNYGYTA YGKEQATYGH PGYEQPQGYP GDHRAYSNSH YGYHYHESGQ HEHTHGYYGA
EGQKSASNEY GRWNEASNYS QQAAVTPAAY GPAGSQPPAP TEAYPSGAEC ADGYGSFQQF
YEATHGSQAA GENSNSSSDF HFLSNLANDF APEYYTI
//