ID A7URT8_ANOGA Unreviewed; 2108 AA.
AC A7URT8;
DT 23-OCT-2007, integrated into UniProtKB/TrEMBL.
DT 23-OCT-2007, sequence version 1.
DT 24-JAN-2024, entry version 102.
DE SubName: Full=AGAP006990-PA {ECO:0000313|EMBL:EDO64730.1};
GN ORFNames=AgaP_AGAP006990 {ECO:0000313|EMBL:EDO64730.1};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EDO64730.1};
RN [1] {ECO:0000313|EMBL:EDO64730.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EDO64730.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EDO64730.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EDO64730.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EDO64730.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64730.1};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO64730.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008807; EDO64730.1; -; Genomic_DNA.
DR RefSeq; XP_001688081.1; XM_001688029.1.
DR STRING; 7165.A7URT8; -.
DR PaxDb; 7165-AGAP006990-PA; -.
DR GeneID; 1270109; -.
DR KEGG; aga:AgaP_AGAP006990; -.
DR VEuPathDB; VectorBase:AGAP006990; -.
DR eggNOG; KOG2312; Eukaryota.
DR OMA; IPAKVQP; -.
DR OrthoDB; 5001776at2759; -.
DR PhylomeDB; A7URT8; -.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003150; DNA-bd_RFX.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR22970; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR PANTHER; PTHR22970:SF14; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2; 1.
DR Pfam; PF01388; ARID; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51011; ARID; 1.
DR PROSITE; PS51526; RFX_DBD; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils}.
FT DOMAIN 75..167
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT DOMAIN 669..750
FT /note="RFX-type winged-helix"
FT /evidence="ECO:0000259|PROSITE:PS51526"
FT REGION 314..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1597..1673
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1685..1721
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1782..1817
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1965..2019
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1112..1166
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1597..1643
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1644..1669
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1782..1800
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1801..1817
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1987..2015
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2108 AA; 225603 MW; 2021D3216E919ABF CRC64;
MTTDVVKIER TGVPMAIDES TNDASGASFS SVSGGGGGAA GVGGGGTVAG SAAGTPRESA
KSQNLRRTLA KPAMEKDKCS FLNDLQTFHE KHGTPYLKLP KISGKDVDLH KLYSIVIGRG
GWMKVNARED WDEVIEELDL PTRCVNNEIA LKQIYIRYLD RYERVNFHGE DKDPAEDEDD
EKRLHRRWSV RMLHSTPTVY NHHQHNVPEG LRSSLKLSDD LYRASEYDKL ILSLLSPLPN
EQDFAINVCT LMSNENKHTL KVDKCPRLVY VLLAHAGVFN HFSLRDTFDE YYSNIRKNSL
QRFWKECLFE KPQPDEEMDG KSGGNDGGEA GKRADSDFEI MKGRLSTATL RTFLSLGTGL
GTNDYIGQRV LQIAAIFRNL SFNEENLPVL GHNRTFIRFL IMCANSRWNN LHHLGLDMLG
NIANEIDIND PQSDEVTRCL VSTLSEGLEG ADRGVIISCL EVLSKIAQKE SNEENLNRCL
NQQLYDQICL FLCLNDIMLL LYTLECIYSL TSMGEKPCNA IMHIRGVIDT LVSLVTVEAQ
SYGPDACILM RVVETIPGNM ATHHAQGAYY ANAAGQPGQA NPPPPTMTQL TVHKDAAQVQ
SGGQGQVGNN PHVVYPTPEV PKLPSLTVTS KPVPSSPHVP QQPVQQQQIV LQAGTIQTQQ
QAQESEQFAY AWLRASFEAA PAVRMDQQEL YKLYLATNGK LGRKVVLPQT HFPRCVRAVF
GGTVGPVQLK IEQKGIETVG YYYEGLRLRA KPLPIVHKGT VLVSMSNVNV LHLFSSNSVP
TQSRCVSSTT VSTTIAKPAT VISNAGTASG RTIVSTGQET YRSLEVIQTS LSQATPTTSS
PGTFRSSSVI AMAPPSSNAS TPATMKHQPQ TSGQTVSVLH KTIPTPLQAV KSSQQQTQPR
AVTIGKQLIV TQVGGKNVLI NKAAAGVVQR QQMQKQKQLI EQKLLSNSPI NPSVVVTSNN
PNNLTNIKVG NSTISIKPGT LPTNTVITAT NPHETSFNAP PPLAPLSQTG IQQGTIIKTI
AAAPHDPSQH HMGSPGGSKL LPNKVLSELL EKKPGEMIIA GETTIKRKYD VTDTSMEPPV
KRMEGGTKTA DLYAELAGSI TEGEDVEEQT TATAAAAAAQ EAEQQAAKLK LQQIAQQQQQ
QQQHLVQQQH QQQQQQQQQQ QQQQQQMITM PVSMQRQIIV TPNSAQPMII STPTANVGSN
PVQQQQQQQL TSQTTATIKT DSGYQTVPII LQHNAMGGGG TLQLQKATAA PGGALMQSSV
LAAPQHPQPT QYILATNPQG QTYVVAQQPQ PQPQLQQTVL LAQGQQHGGT GQKTIIIVQQ
QPQQQQTTAT SLPQMTMQSA MGPNQGGQTT QKIFLNQQGQ QILMTQLPRQ MIVSHAGPSV
SGGATIITSS APTMMPGSNT GTTMIATGGG GGAVGGGGTI IEKKPIYITT NAQGNTISFE
GPPQQLQMQT IQAGSAVGGG GGGTYQIKQY GNVQQQQQQQ QQHIIQQANL STGGPVGQQT
IQFIQQPGTV VQTQGHLASA AQSMQPQQIV FQTHGGGTGS GTAQIIQQQI IQAAPSGQHA
GQHIVQKVLH QQQPTQQQQQ QHHLIHTIKG NQSQMITVQQ QKQPMATITS QQAHPQPSTS
QLPSTLSIQR QTVGGSSLSQ TLVPPKPVPV PNVGPTVVSS APPAPTPGLP GGKQQYFQVV
QQKISPAPSQ GAGGGNTAQQ GTQQHLRHMA GPSGATVTSQ GGQQSLVPSI TVTAIPAHSA
TTPTTTSTIT SGQFAASLMA QAAGLNVPVT TTTSATVSAA TASSISTSSV PAAASAQPTV
TAQPPPQPPP PPPPIANTIT IPVSVATANG GTSTSIQMIP AMDPQKIVEE DVDPSWPWVC
DWRGCPRKKF QSATEVFRHA CTVHCPDTVD VSADIYCQWG PGPNLCDNLP RKRFSLMTHL
LDRHCTADSF KTAVQRRLAG GPQQPVQPYP VTLIRQPAPA NAAIANSSSA SAAPSSAGDR
GSPAPASCTK VESTGGTSEG SSGSGSSSTN GPGQLSAAGP AAMHAIKRYS MDYVNSKEFQ
DEMEGPVTKS IRLTSALILR NLVVYTNSAK RSLRMYEAHL AGVALSNVES SRTVAQLLFE
MNDTGPNY
//