ID A0A182PFU9_9DIPT Unreviewed; 497 AA.
AC A0A182PFU9;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=Paired domain-containing protein {ECO:0000259|PROSITE:PS51057};
OS Anopheles epiroticus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=199890 {ECO:0000313|EnsemblMetazoa:AEPI005808-PA, ECO:0000313|Proteomes:UP000075885};
RN [1] {ECO:0000313|Proteomes:UP000075885}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Epiroticus2 {ECO:0000313|Proteomes:UP000075885};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles epiroticus epiroticus2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AEPI005808-PA}
RP IDENTIFICATION.
RC STRAIN=Epiroticus2 {ECO:0000313|EnsemblMetazoa:AEPI005808-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182PFU9; -.
DR STRING; 199890.A0A182PFU9; -.
DR EnsemblMetazoa; AEPI005808-RA; AEPI005808-PA; AEPI005808.
DR VEuPathDB; VectorBase:AEPI005808; -.
DR OrthoDB; 3685030at2759; -.
DR Proteomes; UP000075885; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR043182; PAIRED_DNA-bd_dom.
DR InterPro; IPR001523; Paired_dom.
DR InterPro; IPR043565; PAX_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR45636:SF43; PAIRED BOX POX-NEURO PROTEIN-RELATED; 1.
DR PANTHER; PTHR45636; PAIRED BOX PROTEIN PAX-6-RELATED-RELATED; 1.
DR Pfam; PF00292; PAX; 1.
DR PRINTS; PR00027; PAIREDBOX.
DR SMART; SM00351; PAX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00034; PAIRED_1; 1.
DR PROSITE; PS51057; PAIRED_2; 1.
PE 4: Predicted;
KW Paired box {ECO:0000256|ARBA:ARBA00022724};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 1..118
FT /note="Paired"
FT /evidence="ECO:0000259|PROSITE:PS51057"
FT REGION 120..180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 195..262
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 279..312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 346..368
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 126..180
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..237
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 245..261
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..368
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 497 AA; 54136 MW; 112971E58B187E9F CRC64;
PSAGTEAHKL SGTAWRIVEL AHNGIRPCDI SRQLRVSHGC VSKILSRYYE TGSFKAGVIG
GSKPKVATPP VVEAIAAYKL QNPTMFAWEI RDKLLADGIC VHDNVPSVSS INRIVRNKAA
EKAKYSSQRS DNSGMDSPRI LSRNNQQQQQ QQSQQQQQDC INQQMQADQE SMKSPQQVEN
VASQSYSING LLGLSQKSLS GSSSKRRRIK EDPDMKGSML SCIKRDKEAK DMSESMLHES
IGKGGGQDQP QQQQPPQQQQ EKGHDLFVGG VDFVSSLAMA QDDGNNPDNQ QPGFSSMRTG
PRPGTMSSSP IRSRENALFL LAGGDKVHKY HRADEPMPTG VIIEDDSSAR KHQQAQQQQQ
QQAHQANDVK TTYDKILAGE LVPGSAVKRT TTTEPTALTQ SIEFLSDMNN NISQNQHQGF
AAAAYEGAYS AADAADSAVQ LNPHLAACSS NYSAFLQNTD QFAAANPELI FPTAAYTQYA
APGMEEVGVM ENCSIIH
//