ID Q7PH90_ANOGA Unreviewed; 2257 AA.
AC Q7PH90;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 5.
DT 27-MAR-2024, entry version 128.
DE SubName: Full=AGAP003546-PA {ECO:0000313|EMBL:EAA44639.5};
GN ORFNames=AgaP_AGAP003546 {ECO:0000313|EMBL:EAA44639.5};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA44639.5};
RN [1] {ECO:0000313|EMBL:EAA44639.5}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44639.5};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA44639.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44639.5};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA44639.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44639.5};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA44639.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44639.5};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA44639.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA44639.5};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cell junction, tight junction
CC {ECO:0000256|ARBA:ARBA00004435}. Cell membrane
CC {ECO:0000256|ARBA:ARBA00004413}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004413}; Cytoplasmic side
CC {ECO:0000256|ARBA:ARBA00004413}. Membrane
CC {ECO:0000256|ARBA:ARBA00004287}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004287}; Cytoplasmic side
CC {ECO:0000256|ARBA:ARBA00004287}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAA44639.5}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008888; EAA44639.5; -; Genomic_DNA.
DR RefSeq; XP_313293.5; XM_313293.5.
DR GeneID; 1274206; -.
DR VEuPathDB; VectorBase:AGAP003546; -.
DR OMA; MEMGPPP; -.
DR GO; GO:0005923; C:bicellular tight junction; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR CDD; cd00992; PDZ_signaling; 2.
DR CDD; cd11859; SH3_ZO; 1.
DR Gene3D; 2.30.42.10; -; 3.
DR Gene3D; 2.60.220.30; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 2.30.30.40; SH3 Domains; 1.
DR InterPro; IPR008145; GK/Ca_channel_bsu.
DR InterPro; IPR008144; Guanylate_kin-like_dom.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR001478; PDZ.
DR InterPro; IPR036034; PDZ_sf.
DR InterPro; IPR036028; SH3-like_dom_sf.
DR InterPro; IPR001452; SH3_domain.
DR InterPro; IPR000906; ZU5_dom.
DR PANTHER; PTHR13865:SF28; POLYCHAETOID, ISOFORM O; 1.
DR PANTHER; PTHR13865; TIGHT JUNCTION PROTEIN; 1.
DR Pfam; PF00625; Guanylate_kin; 1.
DR Pfam; PF00595; PDZ; 2.
DR Pfam; PF07653; SH3_2; 1.
DR Pfam; PF00791; ZU5; 1.
DR SMART; SM00072; GuKc; 1.
DR SMART; SM00228; PDZ; 3.
DR SMART; SM00218; ZU5; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF50156; PDZ domain-like; 3.
DR SUPFAM; SSF50044; SH3-domain; 1.
DR PROSITE; PS50052; GUANYLATE_KINASE_2; 1.
DR PROSITE; PS50106; PDZ; 3.
DR PROSITE; PS50002; SH3; 1.
DR PROSITE; PS51145; ZU5; 1.
PE 4: Predicted;
KW Cell junction {ECO:0000256|ARBA:ARBA00022949};
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Membrane {ECO:0000256|ARBA:ARBA00022475};
KW SH3 domain {ECO:0000256|ARBA:ARBA00022443, ECO:0000256|PROSITE-
KW ProRule:PRU00192}; Tight junction {ECO:0000256|ARBA:ARBA00022427}.
FT DOMAIN 23..110
FT /note="PDZ"
FT /evidence="ECO:0000259|PROSITE:PS50106"
FT DOMAIN 166..248
FT /note="PDZ"
FT /evidence="ECO:0000259|PROSITE:PS50106"
FT DOMAIN 393..466
FT /note="PDZ"
FT /evidence="ECO:0000259|PROSITE:PS50106"
FT DOMAIN 478..546
FT /note="SH3"
FT /evidence="ECO:0000259|PROSITE:PS50002"
FT DOMAIN 654..755
FT /note="Guanylate kinase-like"
FT /evidence="ECO:0000259|PROSITE:PS50052"
FT DOMAIN 2124..2257
FT /note="ZU5"
FT /evidence="ECO:0000259|PROSITE:PS51145"
FT REGION 321..369
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 791..1094
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1138..1157
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1168..1295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1439..1476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1498..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1558..1580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1775..1875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1919..2028
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2041..2074
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..366
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..826
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 860..874
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 882..896
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 924..981
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1004..1024
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1039..1076
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1138..1156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1173..1190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1275..1289
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1440..1471
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1500..1529
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1558..1577
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1775..1809
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1817..1848
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1849..1867
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1933..1959
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1966..1982
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2011..2028
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2043..2060
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2257 AA; 244236 MW; 549DDAD53EC8CD69 CRC64;
MCFFFSFFPF QLQLERTSWD YSTVTLSRVT GYGFGIAVSG GRDNPHFANG DPSIAVSDVL
KNGPAEGQLQ VNDRIISVNG VSLENVEYAT AVQVLRDSGN TVTLVVKRRV PNHSLMHPLP
GGPSAGAMGV GMPAGVVGMN SHQHQHSISS TGLGLGANNG SQQQIKVIVT KANKKDDFGI
VLGCRLFIRE ISSKTKDQLA ANGYSLQEGD LVTRIHNTNC NDSMSLKEAK KIIDGCKERL
TLAVLREPNG IAGGYGGGAA GGGGTASGMQ SPVYSHTAQV SNCSNMDENY LNGTGGGSYS
GQNLYVQPPT RPSAMSTLLA DDKSNLTPRG RSRGPLTDIS LQQLDRPSTP PGTVVDEPPR
PPPPRGEDFY ATRRQLMDEK PPTTEPRYIT FQKEGSVGIR LTGGNEVGIF VTAVQQNSPA
SAQGLVPGDK ILKVNDMDMN GVTREEAVLF LLSLQDRIEL IVQYCKEEFD SITAQQRGDS
FHIKTHFHCD APTKGELSFK AGDVFRVIDT LYNGVVGAWQ VLRIGRGHQE LQRGVIPNKA
RAEELATVQF NASKKELNAS ESRVSFFRRR RSTHRRSKSL SRENWDDVVF SDAISKFPAY
ERVVLRHPGF VRPVVIFGPV ADIARERLMK DFADKYTAPL QDDDKGSSKC GGIVRLSNIR
DIMDRGKHAL LDVTPNAVDR LNYAQFYPIV IFLKADTKHT IKQMRQGLPK SAHKSSKKLF
EQCQKLERVW SHVFSTTINL NDPDTWYRKL RDSIDQQQSG AVWMSETKPV ESLSDDFLFP
MTTSRLSYAS SPESDLELSP GPSASLSLGN LPQLVKSSSD PSIATNQDNL DRDRELGDGM
PPPYTNPYEH GPHSRRMTVD NKYGFSSSKN GIGGGPSMNP EDAIYGSSTS RPAPPLQGTA
NGPHFGTVPD LPPRIDRASK PANAPGAPST SGSSLPSRNS SSGGTLGRSA QERLFGNGKQ
SSTDALHDPT SSAQDEYSSR NQLLGLGAGD KRPTLPPMVN GGTTAPGAAS SLERNPNANT
SLDRSGGGGG GGRSAGHHHP GNNPQNTSGK ANGSYDSVSS YDSYNTSSQL TAQNMRLGPN
APDDLKSVPN RNGAHAMGAN VSGLSDYGRN PALNNAPSDM LLTTARSNYN VHETLTQRNS
GDRTAGGMNM PQRPTNLVLD SPRKHIIETK TDYGKYSRNN SASQADYSKP NKGPSMIGSP
GPPPGAMGAN MGNGAPFKPV PPPKPKNYRP PVGGSGSNGT GMHHSGQWDN GEPISPRSPD
GFYYPPMASS HYHQGMAHNV PSSPNNGGSH APGMHPYNPY TVGNGANGGN GGPMYNGNAN
GSHSYMSNGV RDMGAMGASN GYNNGNAQYP YGNTYMHRGN GAGTHGIALH PSDRHALDLA
GSREQRGSAF ELYRKPQLGT MVGHHHNIRD MEPMMSIHEF NQHQQHLMHQ ERMRQLQQQP
LPPIPRPPPP TSRGPYPGHD PSLPPELPAK PPKKNILKSP LKAIKNAFIK STRPLRRQVS
LAGDSDKKSL RPILKRQHSM MEPRSARMRM PDQQHQQLYD QQRSMYAQEM QQQAYYNDPR
YGSSYQGPYS PQPNRSYQRH EPYYPKDGNS TYQNLEMESM YGNSQGRGGY YDQQSEGPGY
YPTDENLYAN RALIELERSR APLGPGATVS TLGRRIVRRH SMADRTAPSP SFNQLNRRRM
GSERAPHHAS IVDHHRAVMA NANVSRGSSA HDESIYQSKS GSFLYNEARE HNRRAFEDPV
YQSRREMHRD HLYQSKQQMQ DRIQQSRMVE MVAPAQTQPQ PAHGSSPASG NGSSSMGGTT
TSAASDRFLR SQGLSGGGGG GTASGSNSSS PNSPSSSSCK SGYSSSRNGG EMRERRTQMR
DQIYQSRKEA MESMAEPVYV SRRELKHEPI YESKNENESA SVASAGGAVD APVNEMQGLR
LNMPEPFAGE NSGQKAKTEQ PEASSKHTTP STPSTGRKPQ DLSAPLAAEE DDDVEDEEEQ
HEQSDERTLT VSNQNDSVFE KLAMGVVPSP GSPVATSSPV PSTLGTPKSI RATHISNFLK
RTAPPVMPPP PPPPTALNRP SLPPSEEAGP ATAAVYASRT SIETQYTSTA TGSQMSLPIG
PPNATSTPFA SELSLGLPPP RQPITRRGLF DASGGSLADP IWNVSLQIPP GAIPPGVQQE
IYFTVTDPRL SESVGGPPLD MENGETMLSP LVMCGPQGTE FLKPVTLNIP HCAGRTASLG
LSLKATDSEK NLQTDWEDID LPSNTAAHTV SVKVDHF
//