ID F5HLA3_ANOGA Unreviewed; 1409 AA.
AC F5HLA3;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 1.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=AGAP003546-PB {ECO:0000313|EMBL:EGK97062.1};
GN ORFNames=AgaP_AGAP003546 {ECO:0000313|EMBL:EGK97062.1};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EGK97062.1};
RN [1] {ECO:0000313|EMBL:EGK97062.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97062.1};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EGK97062.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97062.1};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EGK97062.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97062.1};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EGK97062.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97062.1};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EGK97062.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97062.1};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cell junction, tight junction
CC {ECO:0000256|ARBA:ARBA00004435}. Cell membrane
CC {ECO:0000256|ARBA:ARBA00004413}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004413}; Cytoplasmic side
CC {ECO:0000256|ARBA:ARBA00004413}. Membrane
CC {ECO:0000256|ARBA:ARBA00004287}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004287}; Cytoplasmic side
CC {ECO:0000256|ARBA:ARBA00004287}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGK97062.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008888; EGK97062.1; -; Genomic_DNA.
DR RefSeq; XP_003436522.1; XM_003436474.1.
DR GeneID; 1274206; -.
DR VEuPathDB; VectorBase:AGAP003546; -.
DR HOGENOM; CLU_001538_2_0_1; -.
DR OMA; MEMGPPP; -.
DR GO; GO:0005923; C:bicellular tight junction; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR CDD; cd00992; PDZ_signaling; 2.
DR CDD; cd11859; SH3_ZO; 1.
DR Gene3D; 2.30.42.10; -; 3.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 2.30.30.40; SH3 Domains; 1.
DR InterPro; IPR008145; GK/Ca_channel_bsu.
DR InterPro; IPR008144; Guanylate_kin-like_dom.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR001478; PDZ.
DR InterPro; IPR036034; PDZ_sf.
DR InterPro; IPR036028; SH3-like_dom_sf.
DR InterPro; IPR001452; SH3_domain.
DR PANTHER; PTHR13865:SF28; POLYCHAETOID, ISOFORM O; 1.
DR PANTHER; PTHR13865; TIGHT JUNCTION PROTEIN; 1.
DR Pfam; PF00625; Guanylate_kin; 1.
DR Pfam; PF00595; PDZ; 2.
DR Pfam; PF07653; SH3_2; 1.
DR SMART; SM00072; GuKc; 1.
DR SMART; SM00228; PDZ; 3.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF50156; PDZ domain-like; 3.
DR SUPFAM; SSF50044; SH3-domain; 1.
DR PROSITE; PS50052; GUANYLATE_KINASE_2; 1.
DR PROSITE; PS50106; PDZ; 3.
DR PROSITE; PS50002; SH3; 1.
PE 4: Predicted;
KW Cell junction {ECO:0000256|ARBA:ARBA00022949};
KW Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Membrane {ECO:0000256|ARBA:ARBA00022475};
KW SH3 domain {ECO:0000256|ARBA:ARBA00022443, ECO:0000256|PROSITE-
KW ProRule:PRU00192}; Tight junction {ECO:0000256|ARBA:ARBA00022427}.
FT DOMAIN 23..110
FT /note="PDZ"
FT /evidence="ECO:0000259|PROSITE:PS50106"
FT DOMAIN 166..248
FT /note="PDZ"
FT /evidence="ECO:0000259|PROSITE:PS50106"
FT DOMAIN 393..466
FT /note="PDZ"
FT /evidence="ECO:0000259|PROSITE:PS50106"
FT DOMAIN 478..546
FT /note="SH3"
FT /evidence="ECO:0000259|PROSITE:PS50002"
FT DOMAIN 654..755
FT /note="Guanylate kinase-like"
FT /evidence="ECO:0000259|PROSITE:PS50052"
FT REGION 321..369
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 791..1094
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1138..1157
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1168..1295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..366
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..826
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 860..874
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 882..896
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 924..981
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1004..1024
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1039..1076
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1138..1156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1173..1190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1275..1289
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1409 AA; 151247 MW; 5E566B271F4855FE CRC64;
MCFFFSFFPF QLQLERTSWD YSTVTLSRVT GYGFGIAVSG GRDNPHFANG DPSIAVSDVL
KNGPAEGQLQ VNDRIISVNG VSLENVEYAT AVQVLRDSGN TVTLVVKRRV PNHSLMHPLP
GGPSAGAMGV GMPAGVVGMN SHQHQHSISS TGLGLGANNG SQQQIKVIVT KANKKDDFGI
VLGCRLFIRE ISSKTKDQLA ANGYSLQEGD LVTRIHNTNC NDSMSLKEAK KIIDGCKERL
TLAVLREPNG IAGGYGGGAA GGGGTASGMQ SPVYSHTAQV SNCSNMDENY LNGTGGGSYS
GQNLYVQPPT RPSAMSTLLA DDKSNLTPRG RSRGPLTDIS LQQLDRPSTP PGTVVDEPPR
PPPPRGEDFY ATRRQLMDEK PPTTEPRYIT FQKEGSVGIR LTGGNEVGIF VTAVQQNSPA
SAQGLVPGDK ILKVNDMDMN GVTREEAVLF LLSLQDRIEL IVQYCKEEFD SITAQQRGDS
FHIKTHFHCD APTKGELSFK AGDVFRVIDT LYNGVVGAWQ VLRIGRGHQE LQRGVIPNKA
RAEELATVQF NASKKELNAS ESRVSFFRRR RSTHRRSKSL SRENWDDVVF SDAISKFPAY
ERVVLRHPGF VRPVVIFGPV ADIARERLMK DFADKYTAPL QDDDKGSSKC GGIVRLSNIR
DIMDRGKHAL LDVTPNAVDR LNYAQFYPIV IFLKADTKHT IKQMRQGLPK SAHKSSKKLF
EQCQKLERVW SHVFSTTINL NDPDTWYRKL RDSIDQQQSG AVWMSETKPV ESLSDDFLFP
MTTSRLSYAS SPESDLELSP GPSASLSLGN LPQLVKSSSD PSIATNQDNL DRDRELGDGM
PPPYTNPYEH GPHSRRMTVD NKYGFSSSKN GIGGGPSMNP EDAIYGSSTS RPAPPLQGTA
NGPHFGTVPD LPPRIDRASK PANAPGAPST SGSSLPSRNS SSGGTLGRSA QERLFGNGKQ
SSTDALHDPT SSAQDEYSSR NQLLGLGAGD KRPTLPPMVN GGTTAPGAAS SLERNPNANT
SLDRSGGGGG GGRSAGHHHP GNNPQNTSGK ANGSYDSVSS YDSYNTSSQL TAQNMRLGPN
APDDLKSVPN RNGAHAMGAN VSGLSDYGRN PALNNAPSDM LLTTARSNYN VHETLTQRNS
GDRTAGGMNM PQRPTNLVLD SPRKHIIETK TDYGKYSRNN SASQADYSKP NKGPSMIGSP
GPPPGAMGAN MGNGAPFKPV PPPKPKNYRP PVGGSGSNGT GMHHSGQWDN GEPISPRSPD
GFYYPPMASS HYHQGMAHNV PSSPNNGGSH APGMHPYNPY TVGNGANGGN GGPMYNGNAN
GSHSYMSNGV RDMGAMGASN GYNNGNAQYP YGNTYMHRGN GAGTHGIALH PSDRHALDLA
GSREQRGSAF ELYRKPQLGT MVGHHHNIR
//