ID W6UFV4_ECHGR Unreviewed; 1540 AA.
AC W6UFV4;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Papilin {ECO:0000313|EMBL:EUB59841.1};
GN ORFNames=EGR_05317 {ECO:0000313|EMBL:EUB59841.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB59841.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB59841.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EUB59841.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APAU02000038; EUB59841.1; -; Genomic_DNA.
DR SMR; W6UFV4; -.
DR STRING; 6210.W6UFV4; -.
DR EnsemblMetazoa; XM_024494566.1; XP_024351037.1; GeneID_36341032.
DR OrthoDB; 25347at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR CDD; cd00096; Ig; 1.
DR CDD; cd00109; Kunitz-type; 4.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 4.
DR Gene3D; 2.90.20.10; Plasmodium vivax P25 domain; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003645; Fol_N.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR PANTHER; PTHR22963:SF39; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR22963; ENDOGLIN-RELATED; 1.
DR Pfam; PF00014; Kunitz_BPTI; 4.
DR PRINTS; PR00759; BASICPTASE.
DR SMART; SM00181; EGF; 20.
DR SMART; SM00274; FOLN; 6.
DR SMART; SM00409; IG; 1.
DR SMART; SM00131; KU; 4.
DR SUPFAM; SSF57362; BPTI-like; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 2.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 4.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000019149}.
FT DOMAIN 42..92
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 185..235
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 252..302
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 311..361
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 1015..1051
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1086..1177
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DISULFID 1019..1029
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1540 AA; 168626 MW; 1DD86BC22C682940 CRC64;
MKIGAEVFNV ATMRIAATVD ASATRATAAT PTANVSQSED VCSLPTEVGP CLAAIQRYAY
NRSSGRCEVF YYGGCQGNAN NFDNLVECQQ RCESLSQRDP CANVRCDRNA YCVNGYCYCN
PGYEGDGNTY CRPASSGGDL CESVRCGENA ECLGGQCKCM EGYYGDPFHS CQSIGGDKSP
SNPICRFPID AGQCYDRMEK YGYDSRTGRC EKFFYTGCLG NENRFDTFEE CERECGAAQT
RVPFSNQVPD KCFLPLITGT CGNAQTRFGF YPPTQRCEEY SYSGCGGNEN NFETRLDCET
VCNALKRSGI CYLPLRTGPC HNRSTRYGYF PPLKRCVRYT YGGCRGTQNN FLTEAECEQL
CKDPCDGVRC GYNARCVEGQ CVCEPGFGGD PNYECKAEKM DKCFGVRCSV NARCQDGYCV
CEAGHRGDPY RECRPADACR GVRCGYYARC VEGRCECEPG YTGDAYSECR PEAKLDLCEV
LGCDENAVCL NGECKCRQGF TGNPYEGCVA IQYLKPATGT LPDPCHRVAC GKGAFCDLGT
CRCPEGFQGN PYIKCQFVSI FLGKLHAPEV PAPRASPPVE ESCRGHRCGP NAYCRDEVCH
CETGYQGNPY TGCVQISDRC DGVQCASNAY CANGYCRCNP GYAGDGFIEC HYQGSCANVQ
CGVNAHCVDG RCVCWPGYTG DPDSRCDPAA AHKPERCGNT YCHERARCHS DTCYCEYGYA
GDGVSVCEKV EEDLCKRVHC AENAECEAGL CQCKPGFKGD GFSECNPVEV DPSSCNGRYC
GANAECRDNV CVCVSGHTGD PYDICTRERP LSDSCHGIEC GSNAYCQNGG CVCYEGFEGD
PSLACKPIYD PSCMGIRCGA NAYCRGGRCI CPQGYMGDAN QVCYPVWGSS TVDVCNNLAC
HPNASCSEGQ CHCNYGFEGD GFIDCWSKDP ANLCDCRGVP ASMAGCSNGK CRCMPGFRMT
RDNYCEECRG NGGCAANAQC IYDRQIQHYR CACDPDYLGD GAIACIPGVV ANRTEAAQCR
APCHRFATCD EYDGRCKCRP GFIGNGYTYC NFDCNQCLSE AQCVPESSQC VCPPGYIGDG
VRVCRPATSQ GLFTLRIVKE SETIRVREDS GALTLRCVLS GDVRNVQARW LTPGDVGRTD
EQYTHEGREL WLTISQPSPK DSGLYVCQAS RVSDTINVVV EPQQKIQTKQ LFLTSDNGIL
TVQTQSESTT AAQIWHIAEN NKHRPVALAL DCKTDRLVYT SDAGRALRFG NASAARLNQP
PELIFQDNSA KFTWIAVDPA SGNIFAIDED NSRIVVVNSD RPNQVHTYKK LTDRRSDRDF
VAGGIAVHPG LSLVYWAQVA NSDSEKRESV IKVASMADPE RVSEITRVTG ALISLSLAVT
DDVTGGGNTA GRLCWLQRRQ LMPYSRTEIH CAQLETSGRT IHSKRLHKSF DTNEEPSCGL
IQDDDTILWT SLYRKIYRSL NPSTSIYVKG VCCSNGFQSM AIHNICKRSM TNACSYENGR
CRYFCLPGGR EMAHICRCPD DQPNCIAEHA KSRFLGYAYS
//