GenomeNet

Database: UniProt
Entry: Q5TRW4_ANOGA
LinkDB: Q5TRW4_ANOGA
Original site: Q5TRW4_ANOGA 
ID   Q5TRW4_ANOGA            Unreviewed;       660 AA.
AC   Q5TRW4;
DT   07-DEC-2004, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 4.
DT   16-OCT-2019, entry version 110.
DE   SubName: Full=AGAP004619-PA {ECO:0000313|EMBL:EAL40172.4};
DE   Flags: Fragment;
GN   ORFNames=AgaP_AGAP004619 {ECO:0000313|EMBL:EAL40172.4};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
OC   Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea;
OC   Culicidae; Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EAL40172.4, ECO:0000313|Proteomes:UP000007062};
RN   [1] {ECO:0000313|EMBL:EAL40172.4, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EAL40172.4,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B.,
RA   Lai Z., Kraft C.L., Abril J.F., Anthouard V., Arensburger P.,
RA   Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V.,
RA   Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S.,
RA   Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M.,
RA   Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I.,
RA   Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z.,
RA   Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R.,
RA   Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E.,
RA   Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F.,
RA   Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R.,
RA   Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C.,
RA   Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V.,
RA   Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D.,
RA   Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H.,
RA   Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A.,
RA   Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F.,
RA   Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S.,
RA   Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C.,
RA   Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU004019,
CC       ECO:0000256|SAAS:SAAS00594387}.
CC   -!- SIMILARITY: Belongs to the ETS family.
CC       {ECO:0000256|RuleBase:RU004019, ECO:0000256|SAAS:SAAS00594391}.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:EAL40172.4}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   EMBL; AAAB01008952; EAL40172.4; -; Genomic_DNA.
DR   RefSeq; XP_557464.4; XM_557464.4.
DR   STRING; 7165.AGAP004619-PA; -.
DR   PaxDb; Q5TRW4; -.
DR   GeneID; 1275912; -.
DR   KEGG; aga:AgaP_AGAP004619; -.
DR   eggNOG; KOG3806; Eukaryota.
DR   eggNOG; ENOG410Z0ZF; LUCA.
DR   InParanoid; Q5TRW4; -.
DR   KO; K02678; -.
DR   OrthoDB; 990808at2759; -.
DR   PhylomeDB; Q5TRW4; -.
DR   Proteomes; UP000007062; Chromosome 2R.
DR   GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR   GO; GO:0005829; C:cytosol; IBA:GO_Central.
DR   GO; GO:0005739; C:mitochondrion; IBA:GO_Central.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   GO; GO:0004326; F:tetrahydrofolylpolyglutamate synthase activity; IBA:GO_Central.
DR   GO; GO:0009396; P:folic acid-containing compound biosynthetic process; IBA:GO_Central.
DR   GO; GO:0046901; P:tetrahydrofolylpolyglutamate biosynthetic process; IBA:GO_Central.
DR   Gene3D; 1.10.10.10; -; 1.
DR   Gene3D; 1.10.150.50; -; 1.
DR   InterPro; IPR000418; Ets_dom.
DR   InterPro; IPR003118; Pointed_dom.
DR   InterPro; IPR013761; SAM/pointed_sf.
DR   InterPro; IPR036388; WH-like_DNA-bd_sf.
DR   InterPro; IPR036390; WH_DNA-bd_sf.
DR   Pfam; PF00178; Ets; 1.
DR   Pfam; PF02198; SAM_PNT; 1.
DR   PRINTS; PR00454; ETSDOMAIN.
DR   SMART; SM00413; ETS; 1.
DR   SMART; SM00251; SAM_PNT; 1.
DR   SUPFAM; SSF46785; SSF46785; 1.
DR   SUPFAM; SSF47769; SSF47769; 1.
DR   PROSITE; PS00345; ETS_DOMAIN_1; 1.
DR   PROSITE; PS00346; ETS_DOMAIN_2; 1.
DR   PROSITE; PS50061; ETS_DOMAIN_3; 1.
DR   PROSITE; PS51433; PNT; 1.
PE   3: Inferred from homology;
KW   Complete proteome {ECO:0000313|Proteomes:UP000007062};
KW   DNA-binding {ECO:0000256|RuleBase:RU004019,
KW   ECO:0000256|SAAS:SAAS00594397};
KW   Nucleus {ECO:0000256|RuleBase:RU004019,
KW   ECO:0000256|SAAS:SAAS00594400};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062}.
FT   DOMAIN       15    101       PNT. {ECO:0000259|PROSITE:PS51433}.
FT   DOMAIN      552    632       ETS. {ECO:0000259|PROSITE:PS50061}.
FT   REGION      144    315       Disordered. {ECO:0000256|SAM:MobiDB-
FT                                lite}.
FT   REGION      334    512       Disordered. {ECO:0000256|SAM:MobiDB-
FT                                lite}.
FT   COMPBIAS    144    270       Polar. {ECO:0000256|SAM:MobiDB-lite}.
FT   COMPBIAS    371    404       Polar. {ECO:0000256|SAM:MobiDB-lite}.
FT   COMPBIAS    405    426       Basic. {ECO:0000256|SAM:MobiDB-lite}.
FT   COMPBIAS    446    467       Polar. {ECO:0000256|SAM:MobiDB-lite}.
FT   NON_TER       1      1       {ECO:0000313|EMBL:EAL40172.4}.
SQ   SEQUENCE   660 AA;  71757 MW;  CF6E8C9469293AC0 CRC64;
     LTPGTNKKLT EVLYASFASW EKEVQTFKIT KDPRQWTAEH VLIWLNWSIK EFSLEGVNKE
     PFQKMSGRDI VGLGREGFLA IAPPFTGDIL WEHLEILQKD CEKALLEHSS SVNNGGYDAS
     CGQAAGSVGV NELNEYNSTL QRLSGQQQEF GRGNGSSSNS NSSNTPASTS STTPGTTTPN
     SSSTTNTSSG SNGAYGSVHD RSTNDRSPPP LITSSGGNVA NGSTNGSSST SSMASYLSSL
     GSRGPLNGGN SSGTGGSVYH QQSGSGAAMK EEQEPNSFIN IDDLPNYGVV PGHYDDQDQY
     HSLPAPETQT TSHGYLENSP EFYSAIQMEQ KYMPPHYKSG PYTSRGGRYH QPHHHSSNAS
     GGGGGGGGGG GSGNQHTQQQ HHQQHQQQQQ QQQQQQQQQQ QQQQHQHHHP HHHPHHHAHH
     HPASHHQTAH HPDSGYPEYG SPYDTPPFQT VPSNTTSPQS SATSGNGGPG GPGGANGSGN
     SNNNNGDLLG GHVVDPWSLH HPGQHPHAGA PHPLLDFQHP AYMGTTMGFD KTMLGTYGAQ
     GGAPCFTGSG PIQLWQFLLE LLTDKTCQSF ISWTGDEWEF KLTDPDEVAR RWGVRKNKPK
     MNYEKLSRGL RYYYDKNIIH KTAGKRYVYR FVCDLQTLLG YSAKQVHEMV DLKPDKKDDE
//
DBGET integrated database retrieval system