ID A0A016RXQ5_9BILA Unreviewed; 559 AA.
AC A0A016RXQ5;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN Name=Acey_s0342.g3033 {ECO:0000313|EMBL:EYB83123.1};
GN Synonyms=Acey-nid-1 {ECO:0000313|EMBL:EYB83123.1};
GN ORFNames=Y032_0342g3033 {ECO:0000313|EMBL:EYB83123.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB83123.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYB83123.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001678; EYB83123.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016RXQ5; -.
DR STRING; 53326.A0A016RXQ5; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR PANTHER; PTHR46513:SF34; NIDOGEN (ENTACTIN); 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12947; EGF_3; 2.
DR Pfam; PF00058; Ldl_recept_b; 3.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00135; LY; 4.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS50026; EGF_3; 5.
DR PROSITE; PS51120; LDLRB; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 40..81
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 82..125
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 129..172
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 175..212
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 213..254
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 304..346
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 347..389
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 390..434
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DISULFID 94..111
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 140..157
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 559 AA; 61111 MW; 1EC536532E932DA0 CRC64;
MQCVPFTGFR VGTRAHSTAE CPQSSCLSSA PSTHGDSDSA HHQCRDSTDC HQNGHCVVAA
TSGYVCECLP GYRGDGVRQC MTADQCNPVD HSSCHQNAEC VYGETERAYV CKCIRGFTGD
GVRCVPHARP QTCREEPRLC HANAQCVYKH DENTFVCICK PGSVGDGYHK CETQEASRCS
NCSSHAHCSQ TPSGGWQCRC NAGYHGNGHV CAAMTSCLDD RSICDSHAEC VPGEGGHYVC
NCHYGYQGNG RTCIPDFQSK DDTLLISRGM AIFHRGVNPE TPGKQLIVIP HHIAVGLDYD
CKEGRIIWSD ISGHSIRSAS LNGTDHKSFY ANELSSPEGI AVDWSSRNVY YADSLNDEIG
VASLDGKYQK ALVTEGLVNP RALALDMHNR HLYYTDWHRE NPVIGRVDMD GQNNRIFLND
DIHLPNGITI LPNRRELCWV DAGNHRLSCI GLDGNNRRVV FAPLQYPFGL THSNEARFYW
TDWKDTRVHS VGIYGNGYTS FPISLGGSGK VYGILSVPKH CSAPHTGCSV ENGGCSYLCL
PGQKGVRCEC PSNVAVKGC
//