ID Q7PQW2_ANOGA Unreviewed; 2915 AA.
AC Q7PQW2;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 5.
DT 24-JAN-2024, entry version 133.
DE SubName: Full=AGAP002648-PA {ECO:0000313|EMBL:EAA08203.6};
GN Name=1273312 {ECO:0000313|EnsemblMetazoa:AGAP002648-PA};
GN ORFNames=AgaP_AGAP002648 {ECO:0000313|EMBL:EAA08203.6};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA08203.6};
RN [1] {ECO:0000313|EMBL:EAA08203.6, ECO:0000313|Proteomes:UP000007062}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA08203.6,
RC ECO:0000313|Proteomes:UP000007062};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA08203.6}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA08203.6};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA08203.6}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA08203.6};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA08203.6}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA08203.6};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA08203.6}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA08203.6};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [6] {ECO:0000313|EnsemblMetazoa:AGAP002648-PA}
RP IDENTIFICATION.
RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP002648-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008859; EAA08203.6; -; Genomic_DNA.
DR RefSeq; XP_312278.5; XM_312278.5.
DR STRING; 7165.Q7PQW2; -.
DR PaxDb; 7165-AGAP002648-PA; -.
DR EnsemblMetazoa; AGAP002648-RA; AGAP002648-PA; AGAP002648.
DR GeneID; 1273312; -.
DR KEGG; aga:AgaP_AGAP002648; -.
DR VEuPathDB; VectorBase:AGAP002648; -.
DR eggNOG; KOG0548; Eukaryota.
DR HOGENOM; CLU_000553_1_0_1; -.
DR InParanoid; Q7PQW2; -.
DR OMA; LGCYPPQ; -.
DR OrthoDB; 2963601at2759; -.
DR Proteomes; UP000007062; Chromosome 2R.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 7.
DR InterPro; IPR024983; CHAT_dom.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR10098; RAPSYN-RELATED; 1.
DR PANTHER; PTHR10098:SF106; TETRATRICOPEPTIDE REPEAT PROTEIN 28; 1.
DR Pfam; PF12770; CHAT; 1.
DR Pfam; PF13424; TPR_12; 8.
DR Pfam; PF13432; TPR_16; 1.
DR Pfam; PF13176; TPR_7; 2.
DR SMART; SM00028; TPR; 26.
DR SUPFAM; SSF48452; TPR-like; 7.
DR PROSITE; PS50005; TPR; 6.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT REPEAT 154..187
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 342..375
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 422..455
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 742..775
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 1026..1059
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 1066..1099
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT DOMAIN 1451..1770
FT /note="CHAT"
FT /evidence="ECO:0000259|Pfam:PF12770"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1993..2100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2118..2151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2235..2290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2298..2317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2345..2386
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2406..2461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2554..2584
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2630..2650
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2776..2915
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 835..862
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1..15
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2015..2094
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2124..2151
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2345..2370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2418..2461
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2632..2646
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2776..2801
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2818..2915
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2915 AA; 312691 MW; 3AF14F5878ACE9AD CRC64;
MHTERRHRHR GRGTVAGHHQ TGVGSSGTVV TATAAISAHH QQQQQQQQQQ QPAIHPTVWE
ILSAELILSN EPEGTPELPA ANRALFLEKV RQSNTACQNG DFSTAVQLYT DALGLDPGNH
ILYSNRSAAR LKQGQFALAL QDATRARELC PQWPKAYFRQ GVALQCLGRY GEALAAFSAG
LAQDPNSKQL LAGLVEASIK SPLRHALEPT FQQLKAMKLD QSPFVVISVV GQELLGAGQY
HAAVTVLESA LRIGSCSLKL RGSVFSALSS AHWALNQLDK AIAYMQQDLA VAKSLGDTAG
ECRAHGNLGS AYFSQGSYKE ALTSHRYQLV LAMKCKDTQA AAAALTSLGH VYTAIGDYPN
ALASHKQCVQ LVKQMGDRLQ EAREIGNVGA VYLAMGEFDS AVDCHTQHLR LARKLGNQVE
EARAYSNLGS SYHYKRNFTQ AITYHESVLR IAQQLGDRAI EARAYAGLGH AARCGHDFVQ
AKRWHEKQLE MALAARDKVG EGRACSNLGI VYQLLGEHDA ALKLHQAHLT IARQLQDKAG
MGRAYGNIGN AYSAAGYYES AIKYHKQELI ISKEVHDRSA EASTHGNLAV AYQALGAHDM
ALMHYRAHLN IARELKDTAG EACALLNLGN CLSSRQEFAQ AVPYYEQYLM LSQELGDVAA
EGKACHFLGY AHYCIGNYRE AVRYYDQDLA LAKDLQNKMN MGRAYCNLGL AHLALGNTGG
ALECQKYFLA IAHMTNHLPG KFRALGNIGD VLIRMGDVDE AIKMYQRQLA LARQTRERGM
EAAACGALGL AHRLLKKLDK ALGYHTQELT LRQEMSDLPG ECRAHGHLGA VHMALGNYTH
AVKCYQEQLE RAQELQDSAV EAQAFGNLGI ARLNMGHYED AIGYLEQQLG TLEQVSSPTA
QHDRARALGH LGDCYDALGD YHTEAIKCHE RHLQLAIALQ SPRDQERAYR GLGNCYKSVG
NLQEALVCLE KRLVVSHELG SAEAKAAAYG DLGSIHSALG NYEQAINCLE HQRDIARELG
DRVLTSDAIS GLGAVFQQMG DYDESLRLHK QDLELGESVN HATLQARASG NLGSVYDALR
NYAESARYYE KQLTLTADRQ TKAHACLALG RVYHAMEQVP QAVGFLRQGL AIAQSLNKLE
DEAKLRYRLG LALVASGEDD AARQQMESAA QILESIRSDQ VTPEGRTQLY DLQTACYQTL
QRVLVGLGRT EEALVAAERC RSRLGADSNQ SAENSLNNRK TLLTCSEYIF DTVNRSKTSI
IYYSLAGADL YAWFLQPQKR IVRFHATKLD DQTLPMLKQK ALAGPAGAST ATLDTKGGGG
GKLVVQDGAG GDDGSMGVEQ SLLARYINYV RDCLGVNSGS VLQEGDGSGW KSSSENLIDD
FTNERAGFLR MVNRNHLLNS SNYSLSSLFS LGSVGGSVAS LQGSTRSIGS LQGSTRSRRS
NMLPPWQGPS CLHVLYNLLL APFEDLLPDI STTARIGRRE LILVLEKELY LVPFAILRSG
DEDGEYLSER CSLLTVPSLH TLRQKSRIKT REPAEGLNSA LVIGGPKIPT SLSDTWGWSD
SPASLQEAAM VSDMLNTKPL VSTSATKEAI VSELPAAECV HFAANVSWKL GAVVLSPGEV
LDSPSTGKRF YPSAAGELLG ADNDEEPTDL STTNMEIPPL SDFILSAADL LSMRLTAKLV
VLSSYHSVEP ITGSGVANLA SSWLFAGTGA VLVSLWPVPE TAAKILLRAF YSALLQGTRA
ARALAEAMQT VQHTKHFAHP ANWAGFILIG GNVRLSNKVA LIGQALCELM RTPDKCRDAL
RVCLHLVEKS LQRIHRGQKN AMYTTQKSID NKAGPVSGWK DLLMAVGFRF EPAANGIPSS
VFFPQSDPED RLSQCSASLQ ALLGLSPTTL HALSKLVHGA EIADEIIGVM RNVVAQFPSK
ATDNESAIDV PLSVRLWRVS GCHELLASLG FDLMEVGQDQ VTLRTGKQAN RRNCQFVLQA
LLALFDTQEA PKSLGIESSS SSESLNEEDE QDEQGAAAAQ QSTGQQSQQS QSQSAQQQQQ
QQQSSQQQVT QQQQQQQMKV SSSVSPTPTS GDSQQRSISP AVTVKSQSSY NFSRPPLPLR
RVPFLSTRSA FISYVRRRGE PDGGQTDSAQ TAPAANGPAL DTSLANTTDS ELSDGYTTQQ
ILLKSDHLAK GLGYSSLRGT IKVSRPGGGG ESDAAFTPSP PVTIQTVDQN VSLALAHQTR
IKNLYTNNGN GLLHGTMNGG HGQTAAPSGL ASGYAGLPDG VHHHPNPVHH HRRPDSSSSA
SSATDWEGSG HATVLRRAAG HQGHHLPPLP PPRQTLPMVE SLRPLAPLAP VYNNINGAVG
PSSVGAASGK SGGQKSLSVL ESTSSDSEFE RSFDLPPGGP SNAASISSKL TSLAHSLQSM
RTRNKLKMGP MPHIPTAAGQ QHLQQHHQQQ QQQHQQQQQQ QQLHQQQLHG HPLHSTTSTI
SRSKLPVDQF GFLDRLSCRT EISSAATLGS LAAPPRKPLS TLPDDERTLN LNANKLYFSP
TDAEMLPLAE ASPADLGLGG KVAASHAASA HAPPLGGVGA GAPGPGAASS KDVQPGAKSS
QKTIQDSILR HMSREMTPTI SEVYHERNIG LGLAPSLSKL LLSKNYDESP DLGKGSGSNS
SNGGGAGAGG ALSAGGASSA AAAAASMLNK PSAALTVGNL AEAMNEIEMN ATTSSKLDEG
ACAICHSPSD LLCGCSATST VAAVAAMTAA LGASTMAKKS SSNKPWLSNV SPNIVKASDL
TTADILEQQK QLKSSVSSGL TSNLSSSTEN SLSTVVKRSG SPFSDLSRRD EGDGRSVADS
QCSGSFRTDI TGSTVATSKS QQQQQQIVSS SSQQQQQQQQ QKQQSTQQQQ LQTQQSTGGG
GDPNGSSAAQ QMASSVTMPT GTTVTQQRGK YIIDT
//