GenomeNet

Database: UniProt
Entry: W6UFV4_ECHGR
LinkDB: W6UFV4_ECHGR
Original site: W6UFV4_ECHGR 
ID   W6UFV4_ECHGR            Unreviewed;      1540 AA.
AC   W6UFV4;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 44.
DE   SubName: Full=Papilin {ECO:0000313|EMBL:EUB59841.1};
GN   ORFNames=EGR_05317 {ECO:0000313|EMBL:EUB59841.1};
OS   Echinococcus granulosus (Hydatid tapeworm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC   Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC   Echinococcus granulosus group.
OX   NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB59841.1, ECO:0000313|Proteomes:UP000019149};
RN   [1] {ECO:0000313|EMBL:EUB59841.1, ECO:0000313|Proteomes:UP000019149}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24013640; DOI=10.1038/ng.2757;
RA   Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA   Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA   Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA   Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT   "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL   Nat. Genet. 45:1168-1175(2013).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EUB59841.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APAU02000038; EUB59841.1; -; Genomic_DNA.
DR   SMR; W6UFV4; -.
DR   STRING; 6210.W6UFV4; -.
DR   EnsemblMetazoa; XM_024494566.1; XP_024351037.1; GeneID_36341032.
DR   OrthoDB; 25347at2759; -.
DR   Proteomes; UP000019149; Unassembled WGS sequence.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   CDD; cd00096; Ig; 1.
DR   CDD; cd00109; Kunitz-type; 4.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 4.
DR   Gene3D; 2.90.20.10; Plasmodium vivax P25 domain; 1.
DR   Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR   InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR003645; Fol_N.
DR   InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR   InterPro; IPR007110; Ig-like_dom.
DR   InterPro; IPR036179; Ig-like_dom_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR003599; Ig_sub.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR   PANTHER; PTHR22963:SF39; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR22963; ENDOGLIN-RELATED; 1.
DR   Pfam; PF00014; Kunitz_BPTI; 4.
DR   PRINTS; PR00759; BASICPTASE.
DR   SMART; SM00181; EGF; 20.
DR   SMART; SM00274; FOLN; 6.
DR   SMART; SM00409; IG; 1.
DR   SMART; SM00131; KU; 4.
DR   SUPFAM; SSF57362; BPTI-like; 4.
DR   SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR   SUPFAM; SSF48726; Immunoglobulin; 1.
DR   SUPFAM; SSF63825; YWTD domain; 1.
DR   PROSITE; PS00280; BPTI_KUNITZ_1; 2.
DR   PROSITE; PS50279; BPTI_KUNITZ_2; 4.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 1.
DR   PROSITE; PS50835; IG_LIKE; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   Reference proteome {ECO:0000313|Proteomes:UP000019149}.
FT   DOMAIN          42..92
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   DOMAIN          185..235
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   DOMAIN          252..302
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   DOMAIN          311..361
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   DOMAIN          1015..1051
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1086..1177
FT                   /note="Ig-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50835"
FT   DISULFID        1019..1029
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   1540 AA;  168626 MW;  1DD86BC22C682940 CRC64;
     MKIGAEVFNV ATMRIAATVD ASATRATAAT PTANVSQSED VCSLPTEVGP CLAAIQRYAY
     NRSSGRCEVF YYGGCQGNAN NFDNLVECQQ RCESLSQRDP CANVRCDRNA YCVNGYCYCN
     PGYEGDGNTY CRPASSGGDL CESVRCGENA ECLGGQCKCM EGYYGDPFHS CQSIGGDKSP
     SNPICRFPID AGQCYDRMEK YGYDSRTGRC EKFFYTGCLG NENRFDTFEE CERECGAAQT
     RVPFSNQVPD KCFLPLITGT CGNAQTRFGF YPPTQRCEEY SYSGCGGNEN NFETRLDCET
     VCNALKRSGI CYLPLRTGPC HNRSTRYGYF PPLKRCVRYT YGGCRGTQNN FLTEAECEQL
     CKDPCDGVRC GYNARCVEGQ CVCEPGFGGD PNYECKAEKM DKCFGVRCSV NARCQDGYCV
     CEAGHRGDPY RECRPADACR GVRCGYYARC VEGRCECEPG YTGDAYSECR PEAKLDLCEV
     LGCDENAVCL NGECKCRQGF TGNPYEGCVA IQYLKPATGT LPDPCHRVAC GKGAFCDLGT
     CRCPEGFQGN PYIKCQFVSI FLGKLHAPEV PAPRASPPVE ESCRGHRCGP NAYCRDEVCH
     CETGYQGNPY TGCVQISDRC DGVQCASNAY CANGYCRCNP GYAGDGFIEC HYQGSCANVQ
     CGVNAHCVDG RCVCWPGYTG DPDSRCDPAA AHKPERCGNT YCHERARCHS DTCYCEYGYA
     GDGVSVCEKV EEDLCKRVHC AENAECEAGL CQCKPGFKGD GFSECNPVEV DPSSCNGRYC
     GANAECRDNV CVCVSGHTGD PYDICTRERP LSDSCHGIEC GSNAYCQNGG CVCYEGFEGD
     PSLACKPIYD PSCMGIRCGA NAYCRGGRCI CPQGYMGDAN QVCYPVWGSS TVDVCNNLAC
     HPNASCSEGQ CHCNYGFEGD GFIDCWSKDP ANLCDCRGVP ASMAGCSNGK CRCMPGFRMT
     RDNYCEECRG NGGCAANAQC IYDRQIQHYR CACDPDYLGD GAIACIPGVV ANRTEAAQCR
     APCHRFATCD EYDGRCKCRP GFIGNGYTYC NFDCNQCLSE AQCVPESSQC VCPPGYIGDG
     VRVCRPATSQ GLFTLRIVKE SETIRVREDS GALTLRCVLS GDVRNVQARW LTPGDVGRTD
     EQYTHEGREL WLTISQPSPK DSGLYVCQAS RVSDTINVVV EPQQKIQTKQ LFLTSDNGIL
     TVQTQSESTT AAQIWHIAEN NKHRPVALAL DCKTDRLVYT SDAGRALRFG NASAARLNQP
     PELIFQDNSA KFTWIAVDPA SGNIFAIDED NSRIVVVNSD RPNQVHTYKK LTDRRSDRDF
     VAGGIAVHPG LSLVYWAQVA NSDSEKRESV IKVASMADPE RVSEITRVTG ALISLSLAVT
     DDVTGGGNTA GRLCWLQRRQ LMPYSRTEIH CAQLETSGRT IHSKRLHKSF DTNEEPSCGL
     IQDDDTILWT SLYRKIYRSL NPSTSIYVKG VCCSNGFQSM AIHNICKRSM TNACSYENGR
     CRYFCLPGGR EMAHICRCPD DQPNCIAEHA KSRFLGYAYS
//
DBGET integrated database retrieval system