GenomeNet

Database: UniProt
Entry: A0A182PK98_9DIPT
LinkDB: A0A182PK98_9DIPT
Original site: A0A182PK98_9DIPT 
ID   A0A182PK98_9DIPT        Unreviewed;       509 AA.
AC   A0A182PK98;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   27-MAR-2024, entry version 33.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0000259|PROSITE:PS50071};
OS   Anopheles epiroticus.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=199890 {ECO:0000313|EnsemblMetazoa:AEPI007364-PA, ECO:0000313|Proteomes:UP000075885};
RN   [1] {ECO:0000313|Proteomes:UP000075885}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Epiroticus2 {ECO:0000313|Proteomes:UP000075885};
RG   The Broad Institute Genomics Platform;
RA   Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Anopheles epiroticus epiroticus2.";
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:AEPI007364-PA}
RP   IDENTIFICATION.
RC   STRAIN=Epiroticus2 {ECO:0000313|EnsemblMetazoa:AEPI007364-PA};
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A182PK98; -.
DR   STRING; 199890.A0A182PK98; -.
DR   EnsemblMetazoa; AEPI007364-RA; AEPI007364-PA; AEPI007364.
DR   VEuPathDB; VectorBase:AEPI007364; -.
DR   OrthoDB; 461623at2759; -.
DR   Proteomes; UP000075885; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   PANTHER; PTHR24340; HOMEOBOX PROTEIN NKX; 1.
DR   PANTHER; PTHR24340:SF82; HOMEOBOX PROTEIN VND-RELATED; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}.
FT   DOMAIN          363..423
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        365..424
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          1..21
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          33..157
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          191..213
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          284..329
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          344..367
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          428..450
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        50..68
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        87..109
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        112..135
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        191..209
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        305..324
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   509 AA;  55434 MW;  FA4599500202FD34 CRC64;
     MGFLDSMKLP QSQRTTSAGA GFRICNLLEL DNDKKQEQQR QQQQHHHAKK RKNSIDEPPA
     QRSRDREDPD EADDEDDSGA PCPDGSLTRR SGTTLSQEDT SSNPDLHGCS IINSDDDSRP
     EGAEGRIEGR GTASRGQDDR TSPSAASGDE GGKPTFGLRT AQSLAEDLHA RFGYPALHPG
     AHLPHLPAHH HPHPHHLHHA THLHHPHHTP PPSHGLFGGR TWPYESCNTL NACHQQAAHQ
     QAHQRLFAQQ VSPADSTSPV HSERSYLGSA AGLATLAAPA GDLSVPHHLA GGRTGSGMRT
     PSPSDSERGE AHHHHLHGDT HTENSDDVDI EEGCDDEMID MIEDTDDGSM PTERGHQTNG
     MGHKKRKRRV LFSKAQTYEL ERRFRQQRYL SAPEREHLAS LIRLTPTQVK IWFQNHRYKT
     KRAAHEKGAL DHHGGGGGHG GSGAGGLPSP RRVAVPVLVR DGKPCLGGSK QPHDLLTAVP
     GAAHLQLPPG FQHASLLHHA AAAAGRWWS
//
DBGET integrated database retrieval system