ID A7USK9_ANOGA Unreviewed; 600 AA.
AC A7USK9;
DT 23-OCT-2007, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 2.
DT 24-JAN-2024, entry version 77.
DE SubName: Full=AGAP000745-PA {ECO:0000313|EMBL:EDO64303.2};
DE Flags: Fragment;
GN ORFNames=AgaP_AGAP000745 {ECO:0000313|EMBL:EDO64303.2};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EDO64303.2};
RN [1] {ECO:0000313|EMBL:EDO64303.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64303.2};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EDO64303.2}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64303.2};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EDO64303.2}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64303.2};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EDO64303.2}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64303.2};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EDO64303.2}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EDO64303.2};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO64303.2}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008847; EDO64303.2; -; Genomic_DNA.
DR RefSeq; XP_001689397.2; XM_001689345.2.
DR AlphaFoldDB; A7USK9; -.
DR PaxDb; 7165-AGAP000745-PA; -.
DR GeneID; 5666781; -.
DR KEGG; aga:AgaP_AGAP000745; -.
DR VEuPathDB; VectorBase:AGAP000745; -.
DR eggNOG; ENOG502QW2N; Eukaryota.
DR HOGENOM; CLU_035681_0_0_1; -.
DR InParanoid; A7USK9; -.
DR PhylomeDB; A7USK9; -.
DR GO; GO:0062129; C:chitin-based extracellular matrix; IBA:GO_Central.
DR GO; GO:0008010; F:structural constituent of chitin-based larval cuticle; IBA:GO_Central.
DR InterPro; IPR000618; Insect_cuticle.
DR Pfam; PF00379; Chitin_bind_4; 1.
DR PROSITE; PS51155; CHIT_BIND_RR_2; 1.
PE 4: Predicted;
KW Cuticle {ECO:0000256|PROSITE-ProRule:PRU00497};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..600
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002716531"
FT REGION 85..127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 163..310
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 342..367
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 380..416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 460..480
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 221..240
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 242..259
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..310
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..365
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EDO64303.2"
SQ SEQUENCE 600 AA; 65597 MW; 768489AC84F099DF CRC64;
WLTLAFGLVV LVASGDAQTR RLRVRPRVLA APSSAEYVDS AEDAQQDNRQ YYAAPQRAQE
RLGDVVLVAS SDEDYGGGQY GAPVAAARPR ADQQQYQRPA AKSTTAAPVA ARQKAPASES
RAPPVQTIRN YSKVNDDGSF TFGYEAADGS FKEETRGTDC VVRGKYGYID PDGNKREFTY
VSGNPCDPNN PDGSEEEESD RAEGGQEDSN ENVPQNYPVR RPVPVARPTP AAPVRHHSTP
APAPRPTTTV FQNDYQDRQR QEQQSADAEE EVQIGQRGSP PRPAARPFAG AVTTTQRPRV
QIVSTTPSPT PTIFHSPAAP AAPQTVLPVN ITPKPVYRVS PLPTQPTLAP TTYRPTSSPS
TRGPTGSIDF EAEFKRFQAD NKLPSPPTPS TAPSGGAAPK PTGSPFGRPG PQLAAGNPIY
QSQLIFDPAS GQYDTALYQQ LPQSDGDFQL NHRIQPYVAG PQQHQHHQPQ PQPQQQQHPG
AGQLVTLEQL QQQSPLYRAQ PSPRPATVQI PQQLYQKQQN ELQFINSQQL FAQQLELQQS
QLRADRLEAA KKVTVGGPPM HRFQPQPQPQ QQYYFIQPQG PPQGAPGQID AFLRGHNIEY
//