ID A0A182TJ38_9DIPT Unreviewed; 281 AA.
AC A0A182TJ38;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=LIM/homeobox protein Awh {ECO:0008006|Google:ProtNLM};
OS Anopheles melas.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=34690 {ECO:0000313|EnsemblMetazoa:AMEC003225-PA, ECO:0000313|Proteomes:UP000075902};
RN [1] {ECO:0000313|Proteomes:UP000075902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CM1001059 {ECO:0000313|Proteomes:UP000075902};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles melas CM1001059_A (V2).";
RL Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMEC003225-PA}
RP IDENTIFICATION.
RC STRAIN=CM1001059 {ECO:0000313|EnsemblMetazoa:AMEC003225-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182TJ38; -.
DR STRING; 34690.A0A182TJ38; -.
DR EnsemblMetazoa; AMEC003225-RA; AMEC003225-PA; AMEC003225.
DR VEuPathDB; VectorBase:AMEC003225; -.
DR Proteomes; UP000075902; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd09379; LIM2_AWH; 1.
DR Gene3D; 2.10.110.10; Cysteine Rich Protein; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR001781; Znf_LIM.
DR PANTHER; PTHR24208; LIM/HOMEOBOX PROTEIN LHX; 1.
DR PANTHER; PTHR24208:SF127; LIM_HOMEOBOX PROTEIN AWH; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00412; LIM; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00132; LIM; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00478; LIM_DOMAIN_1; 1.
DR PROSITE; PS50023; LIM_DOMAIN_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW LIM domain {ECO:0000256|ARBA:ARBA00023038, ECO:0000256|PROSITE-
KW ProRule:PRU00125}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00125}.
FT DOMAIN 34..96
FT /note="LIM zinc-binding"
FT /evidence="ECO:0000259|PROSITE:PS50023"
FT DOMAIN 113..173
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 115..174
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 219..281
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 225..263
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 281 AA; 30871 MW; F3BBF013D5A6E8F2 CRC64;
MGKLRLRVCS PGTSVLVGGV QSRSTAGDHR RFGTKCARCS RTISATDWVR RARDLIFHLA
CFACDSCGRQ LSTGEQFALV DDKVLCKTHY SEMFDCGTSS DDGCEADGYQ KNNKTKRVRT
TFTEEQLQIL QANFNIDSNP DGQDLERIAS VTGLSKRVTQ VWFQNSRARQ KKHVQVPREG
EMNPFARHIN LQLSYTFQQT GGAVHNSLHL GPGGGMGVLS GSPFGPNGSF GSHHHNNNNN
NNNSTNINNN HSSKSSAYST HDSSLDELSE DSAIHCMQSE A
//