ID A0A016WZ62_9BILA Unreviewed; 3139 AA.
AC A0A016WZ62;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE RecName: Full=EGF-like domain protein {ECO:0008006|Google:ProtNLM};
GN Name=Acey_s0458.g1821 {ECO:0000313|EMBL:EYC44527.1};
GN ORFNames=Y032_0458g1821 {ECO:0000313|EMBL:EYC44527.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC44527.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC44527.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01000058; EYC44527.1; -; Genomic_DNA.
DR STRING; 53326.A0A016WZ62; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 12.
DR Gene3D; 2.10.25.10; Laminin; 30.
DR Gene3D; 2.90.20.10; Plasmodium vivax P25 domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24039; FIBRILLIN-RELATED; 1.
DR PANTHER; PTHR24039:SF40; TRANSMEMBRANE MATRIX RECEPTOR MUP-4; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF07645; EGF_CA; 21.
DR Pfam; PF12661; hEGF; 4.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00181; EGF; 45.
DR SMART; SM00179; EGF_CA; 37.
DR SMART; SM00200; SEA; 2.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 10.
DR SUPFAM; SSF57184; Growth factor receptor domain; 4.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 29.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 36.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS50024; SEA; 2.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..3139
FT /note="EGF-like domain protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001495060"
FT TRANSMEM 2822..2843
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 70..108
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 166..204
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 265..304
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 314..355
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 367..406
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 432..475
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 476..513
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 525..563
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 575..614
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 635..811
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1015..1053
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1065..1105
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1115..1162
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1166..1205
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1269..1308
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1320..1358
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1370..1408
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1421..1459
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1520..1565
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1567..1606
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1615..1654
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1665..1704
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1715..1753
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1765..1803
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1814..1852
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1861..1900
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1918..1958
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1969..2010
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2022..2061
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2072..2111
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2120..2159
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2167..2206
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2278..2404
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 2454..2579
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 2581..2618
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2629..2668
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2677..2714
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2733..2771
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2778..2814
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 3082..3139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3082..3123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3124..3139
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 443..460
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1981..1998
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2785..2802
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2804..2813
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 3139 AA; 346823 MW; 0786D334BA7ED0BD CRC64;
MIRLYLGAAV LVGTVLALGS NPCQDYSLHD CDPVAECYSE QPGYFQCRCP KGFIDVSPDN
RTPGRKCVRV VDECSLGTHT CDPNADCVDT PEGYTCRCRA GWQDASRDLR NPGRICRKAN
VCANMDCAQE AECRETPIGP TCQCVSGFVD ISRQHGRPAG RICRPVVNEC AEGKHDCSTH
ASCIDTADGF TCRCHDNYRD ESPSPATLPG RVCIRAFVPD PPECEVSDPM SCDQRKSEVC
VFVNGTYKCR CASGYSRLPD GRCLAINECE HKRLNTCGEN AECIDLAEGY TCQCRSGFAD
ISRTGQPGRL CRARVNECSN KEKYHVDCDE NAICVDTDDS FTCQCRPGFA DISAAFNRLP
GRRCIEAINE CSSKQLNDCS EFALCEDAKE GYVCSCRSGY VDASPNATHY PGRVCRKPVE
KFTVAEFTSS FSHDQCDPTR PQCGANEVCT DRGQRGHYAC QCADNAFRYE DGTCRVFSAC
SKATECDKNA VCLNTFDSYS CQCRPGFIDM SPDPERRPGR RCKELVNECA TSNNECSPFA
KCVDLTEGYA CQCLEGYVDV SSKHKLPPGR RCSQSNNECA FKHLNTCDEN ADCIDSPDGY
TCQCYPGFVD VSSNANLPPG RVCTVQTTCP KQKTDLVFLI DGSGSIGSYV FKNEVLRFVR
EFVELFEIGL DRTRVALIQY SDQIRHEFDL DQYTDKASVL RAITETQYLT GLTRTGAAIQ
HMVQEGFSER RGARPQSSDI ARVAIVLTDG RSQDNVSGPA EAARKLSITT FSIGVTDHVL
SSELEAIAGS PNRWFYVDKF KDLDTRLRSM IQKAACPSPV KAESPPQGTC NPRTQTGCDR
SLNEYCAEEN GRTHCVCPAG FHRHPTTRVC GGALCNPQLI TSCVYPEECL VTPYNNYRCA
CPEGYSRDHR SGFCVSVKEI HIFPQHDADC HNGGQRCGQN EYCASDKAGH WFCECLAGFE
RSQSTGQCSY PGSCLPDKPN SCDIRKREKC LPHGAFYTCQ CDKNERRHPV TGICLKNECL
TGEHDCDRSA RCIDTDDGYL CACPSGFIDR SPDPVARPGR LCVAEQNECL DGTHKCSPHA
LCTDTQSGYV CRCKPGFVDY SPNAQSFPGL VCKELVNECS SPSLNNCDRN AICIDTVEAY
TCICRAGYID QDEFRNPGRN CQKLKTNDRC SAGKNDCDRN ARCSQIGDDD YSCTCPPGFK
DKSPSSSRPG RVCIPVIPEC DNPTLNDCDS PDRAICTDTD DGYMCRCRQG FLDISPNIAT
KPGRLCKPLQ NECALGTDDC ARDGGICEDT PDSFLCRCAM NYLDVSFDRQ NRPGRKCKRL
VDECATGQND CSREAICTDT EDSYVCACPA THIDLSADPV NRPGRKCLLR INECTSGRHD
CSPNADCMDT PESYNCRCRD DFVDESPDIA RRPGRICRPA LVDECRLGKH DCHSDAFCQD
LPQGYTCRCK PEFLDQSPHR ATHPGRLCVP RPTPPPPECR IDGPNQCKAH LNEVCRLVSG
DPKCACPINY QRDSSGSCSV INECEFPQLV DCHPSAECID QLVGYTCRCR PGFKDIGNKP
GRMCKPLVNE CQFPHLNDCH QHAQCIDQEE GYECRCNQGF MDRSHGRPGR ICKQLINECA
VPGMNSCDRN ARCIDEEEGY RCECRDGYLD VSPLPQLKGR SCRKLVDECR DPKLNDCDRN
AKCRDTMDSY ECECPPNSKD ISPSPAFPGR VCLMFENECM TGKHDCDPSA ICHDNEQSFS
CECPAGFIDR SPNKLHRPGR VCVKLVDECA TGRHTCSAQA DCRDLEEGYT CECREGYVDR
SPNLASQPGR VCSAPEVCPS NHQCSSAAVC EPLGGNKYQC SCIQGYVDQS PNGQKGRICV
RNNACRDPKL NTCSRNAICY DEARGYRCEC ARGFIDRSPD PALRGRVCEP PPPPTPPPRH
PCQDPTLNDC HPAGSCRATG AQSYTCECLQ GYADKSPDPR KPGRICVLTE PVCLDKSQND
CHSAAICSEV SGPEKYTCQC RDGYIDQSPN RNTRPGRICV EMVNECLDRS LNDCHSLAIC
EDKREGYTCR CPVNTMDKSP DRNRPGRLCV KQINECRNPS LNTCSRFAEC IDKENGYECR
CKPGYHDNDP SHPGTQCSYI INECDSPNLN DCDRNAICMD SEGGYDCKCK PPYRDESPSG
HPGRVCRLNE CLDVNLNNCD KNAECQDMDD GYICSCREGY YDQSPNPQEP GRVCLEFQVD
HKVEQVTITP VQSHPLNEGL PCGRDFCKVT MGEVCISGSY CGCRPGESRS VATGRCERVE
ETPLQIRVVS RDSTPLLYSS EYGSTKSPPY VEIVDLFQKD MARTFGGTIY APRYVNTKVE
YITHPKTVNS SWPDGLLFKY DVQTTPSKQQ PVDKCEVWKQ MMASLQRTNG VIGGGTLRIA
DDSELLNPCR AEEPMGECGG HDCKTELGEI CIAGSVCGCP VGMRRAASTD VCRAVESWNV
PLWVIRKDYK NLVYNDSFAN PMDSIYKTYV QDYEKGIAGC YPHTTLRNAF VAADVNEIVN
PKMMNASWES GLLFNTTVHF RKGAVRIPSD VYYELVRYII ERNGYEVGDS GLYLNEYQPN
PYKACFKNDC HPKGICIDVS NRSYRCECGA GFRDLDPSDP GKKCIPTYGF NECEKKEDNE
CSENARCIDL EHLYKCECLP SYSDASPPGA VPGSICVLDY CSDVNFCPTN TTCKNMEQQA
ECRCDPGFTD IRKSDRRNAL GLGDDTFCMH VRDVNECALG LTNCSGVAEC IDRPIGYTCK
CPDGYIDGNP DEPGRVCGAL LCDLCNSHGD CVHNARTNNI TCVCTEGWTG EFCQVAPSNA
SLVLLILLAL LFLLLTLCCL LYFCTKCHCF KGRGIAGGAP FVYRRGGAWP WSTLEGSSSS
ESGAEFSALS AAGHDYYPDI GIPRAKLKAG AAALDTTAKS MDVARLDQYL SEGAVRIPRA
HLVGGAGRLN DSCDSMSSAS SEYTIKEEVE RKVITDVTTK EIKTTTTTDS AGNVVTTRAE
SYVYPSEHTV SHGESAAAHS SSFVGESAYA ARNAEYSNSA AFNERSFHHA GEERERGESV
AEFSIGRVKS KDYAARDREL VEYSSEHEAA HSDLEEHESG DIRTRVTHSH HYEPIRNGES
ERLRTEVVTT QSSTSVSKH
//