GenomeNet

Database: UniProt
Entry: A7US45_ANOGA
ID   A7US45_ANOGA            Unreviewed;       401 AA.
AC   A7US45;
DT   23-OCT-2007, integrated into UniProtKB/TrEMBL.
DT   23-OCT-2007, sequence version 1.
DT   27-MAR-2024, entry version 101.
DE   SubName: Full=AGAP011253-PA {ECO:0000313|EMBL:EDO64462.1};
DE   Flags: Fragment;
GN   ORFNames=AgaP_AGAP011253 {ECO:0000313|EMBL:EDO64462.1};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EDO64462.1};
RN   [1] {ECO:0000313|EMBL:EDO64462.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64462.1};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EDO64462.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64462.1};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EDO64462.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64462.1};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EDO64462.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64462.1};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EDO64462.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64462.1};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EDO64462.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008816; EDO64462.1; -; Genomic_DNA.
DR   RefSeq; XP_001688185.1; XM_001688133.1.
DR   AlphaFoldDB; A7US45; -.
DR   STRING; 7165.A7US45; -.
DR   PaxDb; 7165-AGAP011253-PA; -.
DR   GeneID; 5667880; -.
DR   KEGG; aga:AgaP_AGAP011253; -.
DR   VEuPathDB; VectorBase:AGAP011253; -.
DR   eggNOG; KOG0842; Eukaryota.
DR   HOGENOM; CLU_688067_0_0_1; -.
DR   OMA; MSQGANG; -.
DR   OrthoDB; 461623at2759; -.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   PANTHER; PTHR24340; HOMEOBOX PROTEIN NKX; 1.
DR   PANTHER; PTHR24340:SF82; HOMEOBOX PROTEIN VND-RELATED; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}.
FT   DOMAIN          305..365
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        307..366
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          79..131
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          249..313
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          366..401
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        79..124
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        257..271
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         401
FT                   /evidence="ECO:0000313|EMBL:EDO64462.1"
SQ   SEQUENCE   401 AA;  43549 MW;  E3B76EC99FF88D6D CRC64;
     MSQGANGLPA GQPSATPYYP YTGLHAPDEG YDANFHIFPP DYYAPECDYR SDILDSMRMA
     PSQRSQRFHI SNILELNNQQ HQHQDQPATK DQPSVASLSS STLESQQHAT QPSLDVTAPT
     SYPDDQSAAL PYGTMSSQTA SCLPLQPVLN PSAIGTGIPY DPPPYYSYSH HAHHLFGGGS
     AALGSSGVVS EPPRSSYSYG GVNLDYANSS NQQLSPDSTS PGPELYSLAS YSNIPRVVNE
     ASDRLVALES DGPSLDPECS VDTNNNNNNH HTPDDTPDDL DSPGPRASRS GGHRGESGCG
     TGGGHKKRKR RILFSKSQTF ELERRFKQAR YLSAPEREHL ASMINLTPTQ VKIWFQNHRY
     KTKRAQTEKS SCYGAPPPNV LGGSPPKKIN PPVLVRDGKP C
//