ID Q7Q127_ANOGA Unreviewed; 596 AA.
AC Q7Q127;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 23-OCT-2007, sequence version 4.
DT 27-MAR-2024, entry version 119.
DE SubName: Full=AGAP010003-PA {ECO:0000313|EMBL:EAA13911.4};
GN ORFNames=AgaP_AGAP010003 {ECO:0000313|EMBL:EAA13911.4};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA13911.4};
RN [1] {ECO:0000313|EMBL:EAA13911.4}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA13911.4};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA13911.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA13911.4};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA13911.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA13911.4};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA13911.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA13911.4};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA13911.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA13911.4};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAA13911.4}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008980; EAA13911.4; -; Genomic_DNA.
DR RefSeq; XP_319147.4; XM_319147.4.
DR AlphaFoldDB; Q7Q127; -.
DR PaxDb; 7165-AGAP010003-PA; -.
DR GeneID; 1279430; -.
DR KEGG; aga:AgaP_AGAP010003; -.
DR VEuPathDB; VectorBase:AGAP010003; -.
DR eggNOG; KOG4597; Eukaryota.
DR HOGENOM; CLU_000660_6_1_1; -.
DR OMA; CPANKPE; -.
DR OrthoDB; 2910701at2759; -.
DR PhylomeDB; Q7Q127; -.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.830; -; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 4.
DR InterPro; IPR010294; ADAMTS_spacer1.
DR InterPro; IPR010909; PLAC.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR13723; ADAMTS A DISINTEGRIN AND METALLOPROTEASE WITH THROMBOSPONDIN MOTIFS PROTEASE; 1.
DR PANTHER; PTHR13723:SF314; LONELY HEART, ISOFORM A; 1.
DR Pfam; PF05986; ADAMTS_spacer1; 1.
DR Pfam; PF08686; PLAC; 1.
DR Pfam; PF19030; TSP1_ADAMTS; 4.
DR SMART; SM00209; TSP1; 4.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 4.
DR PROSITE; PS50900; PLAC; 1.
DR PROSITE; PS50092; TSP1; 4.
PE 4: Predicted;
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..596
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014588051"
FT DOMAIN 555..592
FT /note="PLAC"
FT /evidence="ECO:0000259|PROSITE:PS50900"
FT REGION 175..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 175..202
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 596 AA; 66448 MW; 794588C3CC76CAB6 CRC64;
MFFKNSLLRI VAVALLVAVH NSGVISGTFM NNGAVRCGTT VCRPISGIYT KTNPNNGYVH
VATIPAGASN ITITELQNSQ NYLALKTADQ RFFINGDYTI SLSGHYIAAG TVFDYRRVDG
LNNGSNSSFR HVEGITEWVT ALGPTNEPVQ VFLLSQSPNP GIKYEYLLPV SVSPTEPSSL
EEVPSAEGTS PNNNNTGTIP GTGRVGQRKR KYLWKVIGFS ACSKSCGGGI QQPIIKCVRE
SPTRVFSAKR CAHLQQPVLN ENLLRCNTQP CPAYWKLGDW DQCNCEEKYD EEVFRTREVK
CVQELLSGIV IQVNNGACMD ELPPSTERCE CSRNDAENGP LANGGGFAVP HSRQRTDINH
INYRAEAKKT GVWLTANWST RCSTTCGIGM QTRSIFCDRS SSINSERCDL RMMPETSREC
TSNRACEFGE WFAGPWTLCS GDCFNLTRSR TVLCIRNDHF APESECDADL RPSSIEPCDT
ESFEECKPRW HFSEWSECTK PCGGGNQRRV VKCLEFNLRD KLLQESGGCR YADRPTSYRV
CNEESCPARS DMMSNDENCR DDFPNCTIVV KAKLCNYAYY SQACCQMCRT RQNELY
//