GenomeNet

Database: UniProt
Entry: A7URT8_ANOGA
LinkDB: A7URT8_ANOGA
Original site: A7URT8_ANOGA 
ID   A7URT8_ANOGA            Unreviewed;      2108 AA.
AC   A7URT8;
DT   23-OCT-2007, integrated into UniProtKB/TrEMBL.
DT   23-OCT-2007, sequence version 1.
DT   24-JAN-2024, entry version 102.
DE   SubName: Full=AGAP006990-PA {ECO:0000313|EMBL:EDO64730.1};
GN   ORFNames=AgaP_AGAP006990 {ECO:0000313|EMBL:EDO64730.1};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EMBL:EDO64730.1};
RN   [1] {ECO:0000313|EMBL:EDO64730.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA   Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA   Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA   Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EMBL:EDO64730.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RG   The Anopheles Genome Sequencing Consortium;
RL   Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:EDO64730.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [4] {ECO:0000313|EMBL:EDO64730.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RX   PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA   Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA   Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT   "Update of the Anopheles gambiae PEST genome assembly.";
RL   Genome Biol. 8:R5.1-R5.13(2007).
RN   [5] {ECO:0000313|EMBL:EDO64730.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RG   VectorBase;
RL   Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EDO64730.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008807; EDO64730.1; -; Genomic_DNA.
DR   RefSeq; XP_001688081.1; XM_001688029.1.
DR   STRING; 7165.A7URT8; -.
DR   PaxDb; 7165-AGAP006990-PA; -.
DR   GeneID; 1270109; -.
DR   KEGG; aga:AgaP_AGAP006990; -.
DR   VEuPathDB; VectorBase:AGAP006990; -.
DR   eggNOG; KOG2312; Eukaryota.
DR   OMA; IPAKVQP; -.
DR   OrthoDB; 5001776at2759; -.
DR   PhylomeDB; A7URT8; -.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR   Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR   Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR   Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR   InterPro; IPR001606; ARID_dom.
DR   InterPro; IPR036431; ARID_dom_sf.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR003150; DNA-bd_RFX.
DR   InterPro; IPR036388; WH-like_DNA-bd_sf.
DR   PANTHER; PTHR22970; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR   PANTHER; PTHR22970:SF14; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR   Pfam; PF01388; ARID; 1.
DR   SMART; SM01014; ARID; 1.
DR   SMART; SM00501; BRIGHT; 1.
DR   SUPFAM; SSF46774; ARID-like; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
DR   PROSITE; PS51011; ARID; 1.
DR   PROSITE; PS51526; RFX_DBD; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils}.
FT   DOMAIN          75..167
FT                   /note="ARID"
FT                   /evidence="ECO:0000259|PROSITE:PS51011"
FT   DOMAIN          669..750
FT                   /note="RFX-type winged-helix"
FT                   /evidence="ECO:0000259|PROSITE:PS51526"
FT   REGION          314..333
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1597..1673
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1685..1721
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1782..1817
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1965..2019
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          1112..1166
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        1597..1643
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1644..1669
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1782..1800
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1801..1817
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1987..2015
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2108 AA;  225603 MW;  2021D3216E919ABF CRC64;
     MTTDVVKIER TGVPMAIDES TNDASGASFS SVSGGGGGAA GVGGGGTVAG SAAGTPRESA
     KSQNLRRTLA KPAMEKDKCS FLNDLQTFHE KHGTPYLKLP KISGKDVDLH KLYSIVIGRG
     GWMKVNARED WDEVIEELDL PTRCVNNEIA LKQIYIRYLD RYERVNFHGE DKDPAEDEDD
     EKRLHRRWSV RMLHSTPTVY NHHQHNVPEG LRSSLKLSDD LYRASEYDKL ILSLLSPLPN
     EQDFAINVCT LMSNENKHTL KVDKCPRLVY VLLAHAGVFN HFSLRDTFDE YYSNIRKNSL
     QRFWKECLFE KPQPDEEMDG KSGGNDGGEA GKRADSDFEI MKGRLSTATL RTFLSLGTGL
     GTNDYIGQRV LQIAAIFRNL SFNEENLPVL GHNRTFIRFL IMCANSRWNN LHHLGLDMLG
     NIANEIDIND PQSDEVTRCL VSTLSEGLEG ADRGVIISCL EVLSKIAQKE SNEENLNRCL
     NQQLYDQICL FLCLNDIMLL LYTLECIYSL TSMGEKPCNA IMHIRGVIDT LVSLVTVEAQ
     SYGPDACILM RVVETIPGNM ATHHAQGAYY ANAAGQPGQA NPPPPTMTQL TVHKDAAQVQ
     SGGQGQVGNN PHVVYPTPEV PKLPSLTVTS KPVPSSPHVP QQPVQQQQIV LQAGTIQTQQ
     QAQESEQFAY AWLRASFEAA PAVRMDQQEL YKLYLATNGK LGRKVVLPQT HFPRCVRAVF
     GGTVGPVQLK IEQKGIETVG YYYEGLRLRA KPLPIVHKGT VLVSMSNVNV LHLFSSNSVP
     TQSRCVSSTT VSTTIAKPAT VISNAGTASG RTIVSTGQET YRSLEVIQTS LSQATPTTSS
     PGTFRSSSVI AMAPPSSNAS TPATMKHQPQ TSGQTVSVLH KTIPTPLQAV KSSQQQTQPR
     AVTIGKQLIV TQVGGKNVLI NKAAAGVVQR QQMQKQKQLI EQKLLSNSPI NPSVVVTSNN
     PNNLTNIKVG NSTISIKPGT LPTNTVITAT NPHETSFNAP PPLAPLSQTG IQQGTIIKTI
     AAAPHDPSQH HMGSPGGSKL LPNKVLSELL EKKPGEMIIA GETTIKRKYD VTDTSMEPPV
     KRMEGGTKTA DLYAELAGSI TEGEDVEEQT TATAAAAAAQ EAEQQAAKLK LQQIAQQQQQ
     QQQHLVQQQH QQQQQQQQQQ QQQQQQMITM PVSMQRQIIV TPNSAQPMII STPTANVGSN
     PVQQQQQQQL TSQTTATIKT DSGYQTVPII LQHNAMGGGG TLQLQKATAA PGGALMQSSV
     LAAPQHPQPT QYILATNPQG QTYVVAQQPQ PQPQLQQTVL LAQGQQHGGT GQKTIIIVQQ
     QPQQQQTTAT SLPQMTMQSA MGPNQGGQTT QKIFLNQQGQ QILMTQLPRQ MIVSHAGPSV
     SGGATIITSS APTMMPGSNT GTTMIATGGG GGAVGGGGTI IEKKPIYITT NAQGNTISFE
     GPPQQLQMQT IQAGSAVGGG GGGTYQIKQY GNVQQQQQQQ QQHIIQQANL STGGPVGQQT
     IQFIQQPGTV VQTQGHLASA AQSMQPQQIV FQTHGGGTGS GTAQIIQQQI IQAAPSGQHA
     GQHIVQKVLH QQQPTQQQQQ QHHLIHTIKG NQSQMITVQQ QKQPMATITS QQAHPQPSTS
     QLPSTLSIQR QTVGGSSLSQ TLVPPKPVPV PNVGPTVVSS APPAPTPGLP GGKQQYFQVV
     QQKISPAPSQ GAGGGNTAQQ GTQQHLRHMA GPSGATVTSQ GGQQSLVPSI TVTAIPAHSA
     TTPTTTSTIT SGQFAASLMA QAAGLNVPVT TTTSATVSAA TASSISTSSV PAAASAQPTV
     TAQPPPQPPP PPPPIANTIT IPVSVATANG GTSTSIQMIP AMDPQKIVEE DVDPSWPWVC
     DWRGCPRKKF QSATEVFRHA CTVHCPDTVD VSADIYCQWG PGPNLCDNLP RKRFSLMTHL
     LDRHCTADSF KTAVQRRLAG GPQQPVQPYP VTLIRQPAPA NAAIANSSSA SAAPSSAGDR
     GSPAPASCTK VESTGGTSEG SSGSGSSSTN GPGQLSAAGP AAMHAIKRYS MDYVNSKEFQ
     DEMEGPVTKS IRLTSALILR NLVVYTNSAK RSLRMYEAHL AGVALSNVES SRTVAQLLFE
     MNDTGPNY
//
DBGET integrated database retrieval system