ID A0A182WXI7_ANOQN Unreviewed; 942 AA.
AC A0A182WXI7;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
OS Anopheles quadriannulatus (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=34691 {ECO:0000313|EnsemblMetazoa:AQUA002247-PA, ECO:0000313|Proteomes:UP000076407};
RN [1] {ECO:0000313|Proteomes:UP000076407}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SANGQUA {ECO:0000313|Proteomes:UP000076407};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles quadriannulatus QUAD4_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AQUA002247-PA}
RP IDENTIFICATION.
RC STRAIN=SANGQUA {ECO:0000313|EnsemblMetazoa:AQUA002247-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182WXI7; -.
DR STRING; 34691.A0A182WXI7; -.
DR EnsemblMetazoa; AQUA002247-RA; AQUA002247-PA; AQUA002247.
DR VEuPathDB; VectorBase:AQUA002247; -.
DR Proteomes; UP000076407; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.10.25.10; Laminin; 13.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR PANTHER; PTHR24034; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24034:SF198; RE68558P; 1.
DR Pfam; PF12662; cEGF; 1.
DR Pfam; PF07645; EGF_CA; 11.
DR SMART; SM00181; EGF; 11.
DR SMART; SM00179; EGF_CA; 13.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF57184; Growth factor receptor domain; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 371..409
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 597..638
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 639..682
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 724..766
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 942 AA; 104081 MW; 24614ED66EFA7CC4 CRC64;
MAGAIAIEQI LSLCCQEGEE WGLQSRTCSS FNKSLELVPA GLHGLCLSTI EICCSKQHKI
YQCTAGQIAA RQGLSCSLKG DHSGSEFYTD CCEACKIGLV VGSSANKCSV EPFAFGSPWD
EIYDACCNDI KKEGEIILLS GNDEQSLCEQ FPSICSQVCE NVEGGSYVCK CHPGYELLDD
RKTCALVSDE DNETVERKGC DAGFKHNKRT DKCEDVNECE TGEATCNPDS QVCRNTRGSF
LCVDVVLPDI ACDPGYTPKN GKCEDVNECL EQLDACDRER QHCLNGRGNY SCLPKAVTMC
QPGFAYNVSL GVCEDVNECE EDGACDEGYR CVNIEGSYEC IAVVKNYPTP ARKKVETCTP
GFRRHNDQCV DIDECAADRN ACDSNQVCTN EIGGFRCDCK IGFNLDTVTN ACVDINECQV
NAHECLETQR CDNTIGSYTC IRLQSCGTGY TLNAETGHCD DDDECALGRH NCRPPFECFN
TKGSFRCRQL SRYHTLTSTS TSTTTSTTTV RPSYNPAGYG HYPASRYSSY QPQLPPCGIG
FERNSAGACV DIDECARGAS CHRHQQCINT NGSYRCRDLL TCPVGYRVND DITECLDIDE
CATGEALCGP DQTCKNKKGG YVCVCPPGHM IGRNKRCEDI DECAMHGSKV CQQNSNCVNT
IGSYRCDCKE GFKNGPNEMI CTDVDECKEI PGLCHQRCLN YWGSYRCGCH PGYRISYNNR
TCDDVDECEE YKSANLCVGI CENTPGSYSC RCPHGYKLGA DGRSCIDIDE CETGDVCRGR
QDICTNIGGS YRCTTIDCPY GYKHDPDRRN RCERSNKYCN TGDMECLRRP HSYSYNFLTI
VSNILIPPEG RGLFTLTGPS HFQMIDFDLK LVSVDAAPHV KPVDIHYFGL EKRTNDAQLN
LKKSIEGPQD IELELSMSVF HNGELYGTNV AKLFLMISAY EY
//