ID Q7Q1J5_ANOGA Unreviewed; 920 AA.
AC Q7Q1J5;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 23-OCT-2007, sequence version 4.
DT 01-MAY-2013, entry version 61.
DE SubName: Full=AGAP009763-PA;
DE Flags: Fragment;
GN ORFNames=AGAP009763, AgaP_AGAP009763;
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea;
OC Culicidae; Anophelinae; Anopheles.
OX NCBI_TaxID=7165;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST;
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B.,
RA Lai Z., Kraft C.L., Abril J.F., Anthouard V., Arensburger P.,
RA Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V.,
RA Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S.,
RA Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M.,
RA Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I.,
RA Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z.,
RA Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R.,
RA Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E.,
RA Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F.,
RA Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R.,
RA Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C.,
RA Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V.,
RA Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D.,
RA Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H.,
RA Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A.,
RA Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F.,
RA Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S.,
RA Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C.,
RA Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
CC -!- CAUTION: The sequence shown here is derived from an
CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC preliminary data.
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; AAAB01008980; EAA14477.4; -; Genomic_DNA.
DR RefSeq; XP_318851.4; XM_318851.4.
DR ProteinModelPortal; Q7Q1J5; -.
DR EnsemblMetazoa; AGAP009763-RA; AGAP009763-PA; AGAP009763.
DR GeneID; 1279171; -.
DR KEGG; aga:AgaP_AGAP009763; -.
DR VectorBase; AGAP009763; Anopheles gambiae.
DR eggNOG; NOG12793; -.
DR HOGENOM; HOG000202351; -.
DR OMA; CWADNDG; -.
DR OrthoDB; EOG4HQC0P; -.
DR PhylomeDB; Q7Q1J5; -.
DR InterPro; IPR003341; Cys_rich_tripleX.
DR InterPro; IPR000742; EG-like_dom.
DR InterPro; IPR009030; Growth_fac_rcpt.
DR Pfam; PF02363; C_tripleX; 17.
DR SMART; SM00181; EGF; 20.
DR SUPFAM; SSF57184; Grow_fac_recept; 3.
PE 4: Predicted;
KW Complete proteome; Reference proteome; Repeat.
FT NON_TER 1 1
FT NON_TER 920 920
SQ SEQUENCE 920 AA; 98309 MW; E760EAD2254B61A1 CRC64;
DYDVRRKRTC CEGYHTVGDE CLPSCRLKCI FGECVAPDTC ECYSGYQKVN DHRCEPICEV
ACENGQCVAP NVCLCDEGYE RDGTSGSCRR KCDRACRFGR CVDDVCQCDE GYRPDATDGN
VCVPQCTEEC QNGKCVLPNV CQCDPGYTLS TQSNVTCEPV CSEGCANGNC IGPDQCACQE
GYELDDSNQC VPVCLKPCQG GQCIAPGRCS CGEGYSPAED DNSLCLPSCS EPCVNGDCVA
PNVCVCHANY RPRDDRTPHV CVPDCPKGCA HGECFGPGVC VCGKGFVYNE TLGGCEPFCT
EPCRNGVCIG GNQCQCHEGY QLDPKTSSKC VPHCSKPCVH GICVAPDVCD CKKGYVKSEN
SKNVCEPKCS KGCSNGRCVA PDHCECHKGY IATSSFPFVS YSLSSISSAA NSKSSICTPY
CKNKCVNAYC IRPNVCQCLA GHRFADNSTN VCEPICEDAL VDCSNGRCTQ PNVCECNEGY
TLAIRNGRML CEPIACREHC VNGYCVEEGR CVCHPGYQPS QHFHSICEPM CEGGCENGVC
IAPNTCVCNP GYARLPGGDS CEASCDPNVV DCYNGVCRGA NQCQCLEGYY LRASPNGGPY
ECAPVCDGGC ANGRCLAPNE CLCNDGYEYN GEMDRCDPVC VPSCVNGQCV APNVCQCLEG
YVPVEDPDSL EESPFECAPY CDANVVNCTF GICGGPNVCR CFDDFYSTTD QLGRQTCELL
PEPVDCVKEP VKERSDCPKL SCPEVVLPPP PVCNVTCPPP PPPPTPCPKV VCPTVPPTKP
TTRAPPIVPV TCPPVQMCPD VICPTVPPPE CPTTPTPEPL CDELYVNCAH GDCIEQNVCS
CHPGYQLAVG PESIVHCQPV CSGDCTNGIC TEPDVCVCLP GYVETLEGQC EPYCEEACGE
GAYCAQPNVC ACPKGLSIDE
//