Database: UniProt
Entry: A0A016STI6_9BILA
ID   A0A016STI6_9BILA        Unreviewed;      3484 AA.
AC   A0A016STI6;
DT   11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT   11-JUN-2014, sequence version 1.
DT   27-MAR-2024, entry version 39.
DE   RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS01186};
GN   Name=Acey_s0180.g824 {ECO:0000313|EMBL:EYB93682.1};
GN   ORFNames=Y032_0180g824 {ECO:0000313|EMBL:EYB93682.1};
OS   Ancylostoma ceylanicum.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC   Ancylostomatinae; Ancylostoma.
OX   NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB93682.1, ECO:0000313|Proteomes:UP000024635};
RN   [1] {ECO:0000313|Proteomes:UP000024635}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX   PubMed=25730766; DOI=10.1038/ng.3237;
RA   Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA   Aroian R.V.;
RT   "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT   ceylanicum identify infection-specific gene families.";
RL   Nat. Genet. 47:416-422(2015).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EYB93682.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JARK01001516; EYB93682.1; -; Genomic_DNA.
DR   Proteomes; UP000024635; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0006897; P:endocytosis; IEA:UniProtKB-KW.
DR   CDD; cd00112; LDLa; 18.
DR   Gene3D; 4.10.1220.10; EGF-type module; 1.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 19.
DR   Gene3D; 2.120.10.30; TolB, C-terminal domain; 7.
DR   InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR036055; LDL_receptor-like_sf.
DR   InterPro; IPR023415; LDLR_class-A_CS.
DR   InterPro; IPR000033; LDLR_classB_rpt.
DR   InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR   PANTHER; PTHR22722; LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2-RELATED; 1.
DR   PANTHER; PTHR22722:SF14; MEGALIN, ISOFORM A; 1.
DR   Pfam; PF00057; Ldl_recept_a; 16.
DR   Pfam; PF00058; Ldl_recept_b; 2.
DR   PRINTS; PR00261; LDLRECEPTOR.
DR   SMART; SM00181; EGF; 14.
DR   SMART; SM00192; LDLa; 20.
DR   SMART; SM00135; LY; 17.
DR   SUPFAM; SSF57424; LDL receptor-like module; 20.
DR   SUPFAM; SSF63825; YWTD domain; 7.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS01209; LDLRA_1; 7.
DR   PROSITE; PS50068; LDLRA_2; 20.
DR   PROSITE; PS51120; LDLRB; 3.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00124}; EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW   Endocytosis {ECO:0000256|ARBA:ARBA00022583};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Receptor {ECO:0000256|ARBA:ARBA00023170};
KW   Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..3484
FT                   /note="EGF-like domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001486792"
FT   TRANSMEM        3407..3425
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   REPEAT          1364..1409
FT                   /note="LDL-receptor class B"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT   REPEAT          1410..1452
FT                   /note="LDL-receptor class B"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT   REPEAT          1502..1544
FT                   /note="LDL-receptor class B"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT   DOMAIN          1929..1942
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS01186"
FT   REGION          3452..3484
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        75..87
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        82..100
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        935..947
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        942..960
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        998..1013
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1018..1030
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1025..1043
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1037..1052
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1077..1092
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1122..1137
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        1211..1226
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2625..2637
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2632..2650
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2644..2659
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2661..2673
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2668..2686
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2680..2695
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2727..2742
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2776..2791
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2803..2821
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2815..2830
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2838..2850
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2845..2863
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2932..2944
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT   DISULFID        2939..2957
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ   SEQUENCE   3484 AA;  387688 MW;  359A80D979ED29FA CRC64;
     MLDTGVSLVL CWLCAALLSN GSGVVTAQIE WSSSEVGCGI GQFMCDGRCV PLKWKCDGEP
     DCADGSDELS FCARCGSDEF RCQSGRCLPK AYICDGTADC GSNGIDDDSD EDPSICLHKA
     VCPPNFYSCR HEARCIHLSQ FCNGIVDCND HSDEHHKCDD WHNITCHYGV GLTIDGPMCY
     CPTDQVSIGD RCVDEDKCGR RLRGHAPICS QMCQNLPSGV TECDCHEDFK PNGSYCDPRE
     TLGNYVAVLG NMFVMTPSVK WEEFPSRFQR TVNQNRRMIA AAADTKRNWI CYVMAAVKAD
     PLTYSFMECS RITKKSSKIM KTDITTSFPL DEVRIIRHDP NGDNWLFLVG RTRLVICKNE
     QPKMEKCRIV VDDVDIEDFV VDHFAGLIFY STIGRNAGVW QVTYENRTKI SMSDRQLQMP
     GGIALDPFSR TFYYNDRYFE RIFALDYGKN GSVRSVLHDK RLKYSTSLTY FSDFLYVPSR
     GDLLRVDTMT SQSELVDFHT NVDGFFVNHN LTTRKGAARC SGCSEICVAN TNSSTLCLFS
     DGMIMEGGKC TVSGSVENLI LYSRQSPMWI HGVRLSTNDK SGQITATPSI PPIFSLRGGA
     VFTVDPIRAE IYFYDDMQHA IYRRSIHGGN STLVTNKGVH HVTCMAYDSS SGNLYYGTRP
     NVEPAGITVF RPDAPDFRAQ IVKAVGGIHS IFVDSKSRYL FYSSTTQLRQ HNIVRARLGG
     NFLDPTILVS FFSFNPLIMT IDVGARKVYW LDPFQYSLYR IDFEGGEKEK LAISTRSLHS
     IAVLSGTVIW ADDFGVKRGV SKGEGDFDVV SIDSLPITAQ LYFVENKSTS VSKCLENNGG
     CEQLCYPETC PNVKSCDEGS RKCGCADGFI VDKNNPEKCV AARVGADAKC DEKTQFMCKH
     TEICIAKERV CDGKWDCYDG SDEDLRGICV GNFSCAQDEF RCDSMTCIPD YLVCDGRADC
     EDRSDEKWTV CRDRVQNCAN GTFACSVNKQ CLPDSWRCDG RVDCPDRSDE KNCEARDCNA
     DEFQCLTGQC VPLHWVCDGR ANCRDGSDEL HCHEGCRTGR EFRCDASSAC LDLSLKCDGV
     VDCENGFDEM NCANISSTRF CRKLNEYLCK REQRCIRRSA VCDGVEDCSD GQDERKCKGK
     SCGVGLFACR SGDDCVAGHL ECDGVADCAD GSDEHEHCSF DAKVVEARCR APDITCRTFT
     GIVCLPAPKI CDGIPDCFDA KDEEFCGYEQ QQSCMDLECQ DACYVQPRDG TYTTVCGCAA
     NRTLKDDGRT CEATERRPCD FGACSQHCVI HSNRTSHCFC ERGYQMMPDG FTCRAVDSRR
     PFLLYSDRHT LMYLPSDAPR AVPLLPQLEN AVSFDYLYHV NGTISIFWAD VTLDTIFRVE
     VTGKTASNPR PIVSTGLSTV EGIAVDWISE VIYWTDSHHD HIQVAKIDGS MRATVVKGEI
     HNPRDIVVDP SHGLMFWTDW QEENPRIERA TMGGNNRVVI FKVSSIVNAG WPNGLVCDII
     AKRIYWVDAK SDTVHTVTYD GRDHVEVLRD HVFSTHPFSV DLFENYVYWT DWRINAIVRA
     NKWNGSSIAA IFHTPIRPFY VKVVHRSKQP RTVRNPCAKS DCSHLCLIDG PGEYSCECPQ
     FMRHKSGSSS ICEEVKSAVL LSTKSSIHGV NVASDNDTIF NAAGFQDIRA IAASNDQIFL
     YDAFDDILWK YSTADREKRT VLTGDLSDCY GIAVDKVSGA MYYTSFSEDR AAIYVTNDGV
     RRSIFDSSIN KELKKPKFLV FLENSAALLW LDVGYAPPAF FSAKGDGSKL SRISTDSFDE
     LLEDVTSMAY DKQGNRVLWV TSRKSFVVQM NVKTWKVTPF VYANGSSIDA LTVDQWTGDV
     FLMIDNVLVK KSYRSNSTDI WASGTNMTTL TSRDPFRVAT VMDRTAASKC GLNSCNYLCV
     RSAKERYECL CPQGYTMKRG KCEVSTETLL LAGDKLLTAT NGSGSVFLLH PAVAYKQWRK
     VAVEYSNELV YLISDFELWV AHLNGSYADR LLISDDTLTA VTVDPVTGNV ILGAEIGRRA
     GEIVILDPKR VKENIRVRLL TDNEGAIRHL EMDPVKGFIF WSRGCIKKAN YDGTNVTCLV
     NVTTNQFSLD QLTSRLCYLE KSGEVHCVSY DGTNNQVVAF FSVVDTVQSE MVIGNEKLYL
     AQRKRSASNF LLVTEYKREA NGNFTEVMSY NTTTTLRFRA TAVYRQKSAD LTNSPCSKNN
     GGCSHLCIST PQLDSRCLCA YSLLQPNGSC TANPSFLSYS YGGVVDFVSI SPNTTVPRGT
     LRFPDIPRGI SVMEADPDRN QLILVDRASN RIICFRFTTN DWYSVADEVG EVEGISLDAT
     NRELYYTRLS PPSIWRLSLS ADDPASYPVI PTRVAFLGQG NKPKDIAVHP CRMLIFFTNS
     GTIPSVERMY YSGYRRERII EDEIIGESRV SIDFTAEKLY ISEITSSKIY RVDFDGKHKE
     VVIPGAQNRT TNTRRPFALA VYNDWLIYSN IGSYVSTLEL AMVDKVDGLG ERVVADTPSP
     VQSITVSAKN IQKCGTNACA SLKCGDECRL SARGEPHCAC RGERKLEADN VTCSGSEFAT
     KTCAENEFLC KLDDKCIPYE ETCDRYPDCA HAEDENVDMC SQRTCRPGYF NCGSGLCVAL
     SKKCDRNNDC LNFADEIDCE CSENEFRCES GICIAGNLTC DLKPDCNDAS DEKNCPPRDC
     TNTTEFDFPG LVNCEGTTQC ILPQWRCDGS NDCWDNSDEK DCSEIVLPVL PGLRPCSSDE
     FTCGRTRSCL PRGWVCDGQK DCADGSDEMD CVNACRVGVE YTCHSGDCVH IDKKCDGKKD
     CPDGDDEVDC DTLTHDDCIG ASFQCRNGRC ISAEWVCDGA DDCADATAGG VSSDESNCTD
     LTTTCTSDEF LCRLTGTSFR TCLSTVHQCD GFTDCVGGSD ENRTECGRPN RCRETSFRCR
     SGQCIPKGWI CNGLEDCTDG SDEDKQMCST KSSECALDEV ACVKGERTTC ISQEIICNQS
     HDCENLRYFA ETMCGVNECK FDLCEEQCID LPFAYRCECQ PPKIVDPKNP ASCIMGDQCS
     TSNCSQFCME KGNGNYECAC GSGYILEADK HGCKLKSRLI PPMLVTIASD TIRLNSLRDS
     YQTLAINTLS GRVLAYSSRT SSIYWIDESE VLLRSVALYD PDGVAVDEFS GNIYWTSKSR
     NAIMMSDSEN LYIKTVYRRG PGVLPYALAI DSAHRTIFWS DVGKKPSLNR MSIIDDDRGV
     DVVLDSSLVR PTALAVDPYA KRLYWIDQAL NYLGVCNYDG ANRQILGRKM GRGLYGLDVF
     GDFLFFSDYT KGTVEKMHKL TAKNRTTVVS GLSHPKGIQV VHPEKWPRKN ENNPCENNQT
     CVQICVPTST RQGYQCLCRD GMRYDEGACV NLVQSAMKTR EIDYATFRNF IIALLITTAL
     VMLFFHKNRL YVMATPASTS WVGGQERYPL NTVTPDGGVE NPVFESNEDT PMEPDPVVVD
     ESNG
//