ID A0A016STI6_9BILA Unreviewed; 3484 AA.
AC A0A016STI6;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS01186};
GN Name=Acey_s0180.g824 {ECO:0000313|EMBL:EYB93682.1};
GN ORFNames=Y032_0180g824 {ECO:0000313|EMBL:EYB93682.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB93682.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYB93682.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001516; EYB93682.1; -; Genomic_DNA.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0006897; P:endocytosis; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 18.
DR Gene3D; 4.10.1220.10; EGF-type module; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 19.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 7.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR PANTHER; PTHR22722; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2-RELATED; 1.
DR PANTHER; PTHR22722:SF14; MEGALIN, ISOFORM A; 1.
DR Pfam; PF00057; Ldl_recept_a; 16.
DR Pfam; PF00058; Ldl_recept_b; 2.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00181; EGF; 14.
DR SMART; SM00192; LDLa; 20.
DR SMART; SM00135; LY; 17.
DR SUPFAM; SSF57424; LDL receptor-like module; 20.
DR SUPFAM; SSF63825; YWTD domain; 7.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS01209; LDLRA_1; 7.
DR PROSITE; PS50068; LDLRA_2; 20.
DR PROSITE; PS51120; LDLRB; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Endocytosis {ECO:0000256|ARBA:ARBA00022583};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..3484
FT /note="EGF-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001486792"
FT TRANSMEM 3407..3425
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REPEAT 1364..1409
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1410..1452
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1502..1544
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DOMAIN 1929..1942
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS01186"
FT REGION 3452..3484
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 75..87
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 82..100
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 935..947
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 942..960
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 998..1013
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1018..1030
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1025..1043
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1037..1052
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1077..1092
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1122..1137
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1211..1226
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2625..2637
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2632..2650
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2644..2659
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2661..2673
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2668..2686
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2680..2695
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2727..2742
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2776..2791
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2803..2821
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2815..2830
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2838..2850
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2845..2863
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2932..2944
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2939..2957
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 3484 AA; 387688 MW; 359A80D979ED29FA CRC64;
MLDTGVSLVL CWLCAALLSN GSGVVTAQIE WSSSEVGCGI GQFMCDGRCV PLKWKCDGEP
DCADGSDELS FCARCGSDEF RCQSGRCLPK AYICDGTADC GSNGIDDDSD EDPSICLHKA
VCPPNFYSCR HEARCIHLSQ FCNGIVDCND HSDEHHKCDD WHNITCHYGV GLTIDGPMCY
CPTDQVSIGD RCVDEDKCGR RLRGHAPICS QMCQNLPSGV TECDCHEDFK PNGSYCDPRE
TLGNYVAVLG NMFVMTPSVK WEEFPSRFQR TVNQNRRMIA AAADTKRNWI CYVMAAVKAD
PLTYSFMECS RITKKSSKIM KTDITTSFPL DEVRIIRHDP NGDNWLFLVG RTRLVICKNE
QPKMEKCRIV VDDVDIEDFV VDHFAGLIFY STIGRNAGVW QVTYENRTKI SMSDRQLQMP
GGIALDPFSR TFYYNDRYFE RIFALDYGKN GSVRSVLHDK RLKYSTSLTY FSDFLYVPSR
GDLLRVDTMT SQSELVDFHT NVDGFFVNHN LTTRKGAARC SGCSEICVAN TNSSTLCLFS
DGMIMEGGKC TVSGSVENLI LYSRQSPMWI HGVRLSTNDK SGQITATPSI PPIFSLRGGA
VFTVDPIRAE IYFYDDMQHA IYRRSIHGGN STLVTNKGVH HVTCMAYDSS SGNLYYGTRP
NVEPAGITVF RPDAPDFRAQ IVKAVGGIHS IFVDSKSRYL FYSSTTQLRQ HNIVRARLGG
NFLDPTILVS FFSFNPLIMT IDVGARKVYW LDPFQYSLYR IDFEGGEKEK LAISTRSLHS
IAVLSGTVIW ADDFGVKRGV SKGEGDFDVV SIDSLPITAQ LYFVENKSTS VSKCLENNGG
CEQLCYPETC PNVKSCDEGS RKCGCADGFI VDKNNPEKCV AARVGADAKC DEKTQFMCKH
TEICIAKERV CDGKWDCYDG SDEDLRGICV GNFSCAQDEF RCDSMTCIPD YLVCDGRADC
EDRSDEKWTV CRDRVQNCAN GTFACSVNKQ CLPDSWRCDG RVDCPDRSDE KNCEARDCNA
DEFQCLTGQC VPLHWVCDGR ANCRDGSDEL HCHEGCRTGR EFRCDASSAC LDLSLKCDGV
VDCENGFDEM NCANISSTRF CRKLNEYLCK REQRCIRRSA VCDGVEDCSD GQDERKCKGK
SCGVGLFACR SGDDCVAGHL ECDGVADCAD GSDEHEHCSF DAKVVEARCR APDITCRTFT
GIVCLPAPKI CDGIPDCFDA KDEEFCGYEQ QQSCMDLECQ DACYVQPRDG TYTTVCGCAA
NRTLKDDGRT CEATERRPCD FGACSQHCVI HSNRTSHCFC ERGYQMMPDG FTCRAVDSRR
PFLLYSDRHT LMYLPSDAPR AVPLLPQLEN AVSFDYLYHV NGTISIFWAD VTLDTIFRVE
VTGKTASNPR PIVSTGLSTV EGIAVDWISE VIYWTDSHHD HIQVAKIDGS MRATVVKGEI
HNPRDIVVDP SHGLMFWTDW QEENPRIERA TMGGNNRVVI FKVSSIVNAG WPNGLVCDII
AKRIYWVDAK SDTVHTVTYD GRDHVEVLRD HVFSTHPFSV DLFENYVYWT DWRINAIVRA
NKWNGSSIAA IFHTPIRPFY VKVVHRSKQP RTVRNPCAKS DCSHLCLIDG PGEYSCECPQ
FMRHKSGSSS ICEEVKSAVL LSTKSSIHGV NVASDNDTIF NAAGFQDIRA IAASNDQIFL
YDAFDDILWK YSTADREKRT VLTGDLSDCY GIAVDKVSGA MYYTSFSEDR AAIYVTNDGV
RRSIFDSSIN KELKKPKFLV FLENSAALLW LDVGYAPPAF FSAKGDGSKL SRISTDSFDE
LLEDVTSMAY DKQGNRVLWV TSRKSFVVQM NVKTWKVTPF VYANGSSIDA LTVDQWTGDV
FLMIDNVLVK KSYRSNSTDI WASGTNMTTL TSRDPFRVAT VMDRTAASKC GLNSCNYLCV
RSAKERYECL CPQGYTMKRG KCEVSTETLL LAGDKLLTAT NGSGSVFLLH PAVAYKQWRK
VAVEYSNELV YLISDFELWV AHLNGSYADR LLISDDTLTA VTVDPVTGNV ILGAEIGRRA
GEIVILDPKR VKENIRVRLL TDNEGAIRHL EMDPVKGFIF WSRGCIKKAN YDGTNVTCLV
NVTTNQFSLD QLTSRLCYLE KSGEVHCVSY DGTNNQVVAF FSVVDTVQSE MVIGNEKLYL
AQRKRSASNF LLVTEYKREA NGNFTEVMSY NTTTTLRFRA TAVYRQKSAD LTNSPCSKNN
GGCSHLCIST PQLDSRCLCA YSLLQPNGSC TANPSFLSYS YGGVVDFVSI SPNTTVPRGT
LRFPDIPRGI SVMEADPDRN QLILVDRASN RIICFRFTTN DWYSVADEVG EVEGISLDAT
NRELYYTRLS PPSIWRLSLS ADDPASYPVI PTRVAFLGQG NKPKDIAVHP CRMLIFFTNS
GTIPSVERMY YSGYRRERII EDEIIGESRV SIDFTAEKLY ISEITSSKIY RVDFDGKHKE
VVIPGAQNRT TNTRRPFALA VYNDWLIYSN IGSYVSTLEL AMVDKVDGLG ERVVADTPSP
VQSITVSAKN IQKCGTNACA SLKCGDECRL SARGEPHCAC RGERKLEADN VTCSGSEFAT
KTCAENEFLC KLDDKCIPYE ETCDRYPDCA HAEDENVDMC SQRTCRPGYF NCGSGLCVAL
SKKCDRNNDC LNFADEIDCE CSENEFRCES GICIAGNLTC DLKPDCNDAS DEKNCPPRDC
TNTTEFDFPG LVNCEGTTQC ILPQWRCDGS NDCWDNSDEK DCSEIVLPVL PGLRPCSSDE
FTCGRTRSCL PRGWVCDGQK DCADGSDEMD CVNACRVGVE YTCHSGDCVH IDKKCDGKKD
CPDGDDEVDC DTLTHDDCIG ASFQCRNGRC ISAEWVCDGA DDCADATAGG VSSDESNCTD
LTTTCTSDEF LCRLTGTSFR TCLSTVHQCD GFTDCVGGSD ENRTECGRPN RCRETSFRCR
SGQCIPKGWI CNGLEDCTDG SDEDKQMCST KSSECALDEV ACVKGERTTC ISQEIICNQS
HDCENLRYFA ETMCGVNECK FDLCEEQCID LPFAYRCECQ PPKIVDPKNP ASCIMGDQCS
TSNCSQFCME KGNGNYECAC GSGYILEADK HGCKLKSRLI PPMLVTIASD TIRLNSLRDS
YQTLAINTLS GRVLAYSSRT SSIYWIDESE VLLRSVALYD PDGVAVDEFS GNIYWTSKSR
NAIMMSDSEN LYIKTVYRRG PGVLPYALAI DSAHRTIFWS DVGKKPSLNR MSIIDDDRGV
DVVLDSSLVR PTALAVDPYA KRLYWIDQAL NYLGVCNYDG ANRQILGRKM GRGLYGLDVF
GDFLFFSDYT KGTVEKMHKL TAKNRTTVVS GLSHPKGIQV VHPEKWPRKN ENNPCENNQT
CVQICVPTST RQGYQCLCRD GMRYDEGACV NLVQSAMKTR EIDYATFRNF IIALLITTAL
VMLFFHKNRL YVMATPASTS WVGGQERYPL NTVTPDGGVE NPVFESNEDT PMEPDPVVVD
ESNG
//