ID A0A182TKK9_9DIPT Unreviewed; 3530 AA.
AC A0A182TKK9;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
OS Anopheles melas.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=34690 {ECO:0000313|EnsemblMetazoa:AMEC004010-PA, ECO:0000313|Proteomes:UP000075902};
RN [1] {ECO:0000313|Proteomes:UP000075902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CM1001059 {ECO:0000313|Proteomes:UP000075902};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles melas CM1001059_A (V2).";
RL Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMEC004010-PA}
RP IDENTIFICATION.
RC STRAIN=CM1001059 {ECO:0000313|EnsemblMetazoa:AMEC004010-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 34690.A0A182TKK9; -.
DR EnsemblMetazoa; AMEC004010-RA; AMEC004010-PA; AMEC004010.
DR VEuPathDB; VectorBase:AMEC004010; -.
DR Proteomes; UP000075902; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 12.
DR Gene3D; 2.10.25.10; Laminin; 19.
DR InterPro; IPR006150; Cys_repeat_1.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR PANTHER; PTHR22963:SF39; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR22963; ENDOGLIN-RELATED; 1.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 14.
DR SMART; SM00181; EGF; 56.
DR SMART; SM00179; EGF_CA; 19.
DR SMART; SM00274; FOLN; 19.
DR SMART; SM00286; PTI; 20.
DR SMART; SM00289; WR1; 13.
DR SUPFAM; SSF57196; EGF/Laminin; 6.
DR SUPFAM; SSF57184; Growth factor receptor domain; 6.
DR PROSITE; PS00010; ASX_HYDROXYL; 14.
DR PROSITE; PS01186; EGF_2; 14.
DR PROSITE; PS50026; EGF_3; 23.
DR PROSITE; PS01187; EGF_CA; 10.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..39
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 40..80
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 81..120
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 121..164
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 204..248
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 294..338
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 339..381
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 455..492
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 498..540
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 541..584
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 605..646
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 647..689
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 768..801
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 976..1014
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1017..1056
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1057..1097
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1516..1555
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1851..1888
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2020..2063
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2382..2418
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2535..2571
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3098..3137
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3140..3179
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 508..525
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 980..990
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 3530 AA; 370341 MW; 96C17F6A5DED274D CRC64;
VDECSGNPCA EGAICINTPG GYRCKCPPGL VASDDGQCTD VNECAKAHAC GENAKCINFP
GSYKCLCPQG YEGRGELFCK NVNECLDNPC GENALCTDTV GSFICSCKPE YTGDPFRGCV
DIDECSAYEK PCGEHAICEN ASPGYNCLCP QGYVGRPNAK VACEQADVNV LCTTAFDCTN
NAECIEGQCF CQDGFEPQGS VCVDVDECRS GAGGLRKEPA CGPSAVCVNM PGSYRCECEA
GFIGTPPRVP CKPPCADVKC GKNAYCKAEG QEAFCICEEG WTFNPADIGA GCVDIDECDP
TQGPNGRCGL NAVCTNHPGS YSCTCPPGYT GDATRQCQDV DECARPGACG ANALCKNLDG
SHQCSCPTGS IADPDPSVRC ISVVACAKDA DCPGNAVCDQ QKRCLCPEPN VGNDCRHPCE
KVTCGPNAHC MLVPGGGAQC LCSEGFTGQP GQCVDINECG ANPCPSGAVC TNLPGGYTCQ
CPGGSSGDPY SGGCSKSALN ACGESNPCPA GEKCVQDAYS GNSVCICGQG YKRDSKGRCR
DVDECADDSG KTACGVNAFC KNLPGSYECR CPAGFNGNPY QSCDECHSAE CRCAAPYKLM
EGNCVLDSCS PDGKCPGGAE CITITGGVSY CACPKGFRTL ANGHCEDIDE CGEGQQVCGY
DAICLNTIGG FECKCPLGYS GDPYHGLCTL AQKRCAADRE CGANERCVQP GECVCPPPYY
MDAYDGNRCK SPCERFPCGM NARCTPSDPP QCMCEVGFKG DPLTGCIDED ECANSPCAYG
AQCVNQRGGY KCVCPAGMVG DAYKGGCILE QGAVKSQCRR NEDCADTLAC ERGTCVSPCA
SLLCGPNAFC EPEKHAAWCR CRAGFVEGPG GDCVSQCEGY MCGQGAMCIV SNTGPTCKCP
PGEMGNPFPG GACTTDQCST SKPCADPQVC INGRCKHKCD GVICGIGATC DAASGKCICE
PYFVGNPELI CMPPVSSPAC EPSCGQNAHC EYGVVQNVCV CNPGTTGNPY GLCEPQQRNM
CSRMRCGTNA ECRESLASAE CVCPGGFSGN PYVACRDVDE CSAVGVCGEG AICINSEGSF
DCRCRPGYGG NPFVMCSAIE KTVCTNPRQC QCGANMQCPP GYGCVRGVCN NLCANTTCGP
RAACDAGRCV CPPGYTGNAA DRKTGCVPDG QCYSDADCEA SKICFQTSKG VRRCVDACSK
VQCGPNALCV SNDHRSTCIC APSYVGNPGD LTVGCQQEAK LVAECKRDGD CKPGHVCTAV
TETGHQACVN PCSAVECGVH EVCTVNEVNQ PVCHCQTGYR WNPVTSGCVK PSIPDCTTDA
DCHQVAACRP DAVGVLKCIA VCTEFTCPPN SVCVSSNHRG SCQCLPGYTG NPNDRNGCRP
EQQNTCLTSA ECAESDACVA HDGAALSCRP ACESVQCGPY ALCITNNHAA QCQCPPGSYA
GDPYDLAKGC QSVPCVYNRD CPSNQLCNRM THSCVDVCQE DTCGENAVCI AENHRSVCQC
PPGYRANPIA EVECALVRSC DPNPCHPSAS CEPGPDGYVC KCPVGQIGNP LTGCRQEGAC
PGGDRDCPDG AACVNGKCTD PCLGACGINS QCTVVNREPV CSCKAKYVPG ATGSARDGCV
RSSSGCMSDL DCNGDVCHGG QCAVACRNTN DCSPGERCLS NVCAVPCSDH SQCGQGQACA
GGICTIGCRS SRDCSGTTAC IDFKCTDPCT ADRNACGPNA LCSAVDHVPQ CTCPAGFEGN
PSPDQGCVRM PTSCETSAQC AAGHMCIGNL CSLPCTETSP SCAIGERCAN SVCAKVCYTN
NNCLPGEVCS EAGVCVPGCG TDADCPSQRV CQAGKCKCMK GFIGTPFGCA DIDECSERPC
HASAVCENIP GSYRCQCPEG AVGDAYASPG CRKSSQCRRD VDCADELACI GGKCRSPCST
KQCSRNAECQ VVGHRAECFC PAGYLGDATD GEIGGIGCFK VECVHNEDCA VERACSEESN
RCVNPCEQLN CGRGTCQIQN HEAVCVCYQG YTFANGKCED IDECARESPG PCHETALCEN
LPGNYLCSCP AGLVGDPVTA GCKRADECLS SEDCPSGAVC VDAHCQNPCA EANVCGENAL
CTVVGERAQC ECPPATRGNP KLACKRLECT TADECTADRT CIGNRCIDPC SLKSACGSSA
NCVSKNHLAV CSCQPGTTGN PLLGCVPLQY CNDDPQCPTG TKCSAGVCCS LCGTNRDCLD
DQLCIQGVCQ PTCRSNTTCP DFQYCHNGIC TKEFKCRADD DCGADEMCIT DANGRSECRN
ACNSGRVLCG RNADCSARNH VAECECKQGF YRDAGGICRK VECERDDDCS SDKVCENHAC
KIACLTGEAP CGANALCSAE NHRQVCYCQP GFTGDPKRGC SLIDFCKERP CAPNAKCRNS
RGSYRCSCPA GLVGDPYQGG CKRAAECERN SDCPEVAECV QENGESKCRD VCAGVQCGPN
AECAARKHNA ECKCRPRFEG DPKDLTNGCK PKPMSCKRNN DCPENSYCHG QICKPACSET
DECNQDEVCS NGQCINPCHE VNACGMNAEC LMGAHAKQCS CPAGFTGDAA VECVRVPISC
ASNTDCTDGS ICKESMCLPR CRNDQECALN EKCLQGSCML TCRLDNDCFL GHICLAGRCV
YGCHADSDCS ASETCRDNRC VNPCQENPCG PNAACTVVNH RASCSCFSGM VPSPTAKVGC
VRAPALQCTE NRDCADGTSC IDRLCRPVCA NDQSCLNNER CDGGSCKPIC RKDDDCRTGE
ICQGQTCMIG CRSDGGCADH LMCSGQQCVD PCQEPTACGT NAECVVVGHR KQCSCPAGLV
GDPFSLGCRQ ETRLCQARTD CPKGQACYGG TCMQTCRNDQ NCLADERCVR GTCRTVCNSD
AACGDGLICE GRICQAGCRS DNQCSNTQAC INKKCTDPCA TLGQCGSCSE CKVIDHGVQC
SCPQGYLGNP LLSCSPPAEK CIAQCTCDED GMYCVKVCRQ QKDCGCGQTC HRGKCRSKCN
PGNCPAGLLC QNGACVAGCR TNADCPSERS CTNGKCVDPC AGGRACGKNA LCQVSDHRSL
CLCPDGFQGV PSVGCVPYEC QTNDDCELDK KCASGKCINP CLSPGACGLN AQCRVVNRQA
QCSCTPGFFG NARHECQPVQ KNGCAQNPCG ENTVCREDEN GYECSCQPGC VGDPRQGCLC
EEKLKKDDCD QYACGTNAVC RMTEWGAPSC VCLPTHPHGD PYMSCTQDDT ATDCRTTGCA
EGECVRDGAK FICRKDESCA NDLQCANDKA CIGGKCSDPC SLRGACGDNA LCQTVLHRPR
CSCPNCYIGR PNVECKPDPK CEEVTTPRPN DPKIVSVACE TNGDCHESLR CDASGQCSDP
CTVPAPFVCD SNKKCVSRRH RPSCVCAHGF IVNDNGELVC APEKRECFGD DGCASNMACL
DGRCVNPCFA NGRRSAPCPD DKACVVVDHR PTCVCMKDCS PSLSICLRDS GCPDELACRN
YQCVNPCETT TCAEDTPCYV EDHKPICKFC PPGFVKDYAN VKKHQTVQED
//