ID U6JH05_ECHGR Unreviewed; 457 AA.
AC U6JH05;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Enteropeptidase {ECO:0000313|EMBL:EUB58856.1, ECO:0000313|WBParaSite:EgrG_000085400};
DE SubName: Full=Mastin {ECO:0000313|EMBL:CDS23343.1};
GN Name=EGR_06280 {ECO:0000313|WBParaSite:EgrG_000085400};
GN ORFNames=EGR_06280 {ECO:0000313|EMBL:EUB58856.1}, EgrG_000085400
GN {ECO:0000313|EMBL:CDS23343.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB58856.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB58856.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
RN [2] {ECO:0000313|EMBL:CDS23343.1, ECO:0000313|Proteomes:UP000492820}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23485966; DOI=10.1038/nature12031;
RA Tsai I.J., Zarowiecki M., Holroyd N., Garciarrubio A., Sanchez-Flores A.,
RA Brooks K.L., Tracey A., Bobes R.J., Fragoso G., Sciutto E., Aslett M.,
RA Beasley H., Bennett H.M., Cai J., Camicia F., Clark R., Cucher M.,
RA De Silva N., Day T.A., Deplazes P., Estrada K., Fernandez C., Holland P.W.,
RA Hou J., Hu S., Huckvale T., Hung S.S., Kamenetzky L., Keane J.A., Kiss F.,
RA Koziol U., Lambert O., Liu K., Luo X., Luo Y., Macchiaroli N., Nichol S.,
RA Paps J., Parkinson J., Pouchkina-Stantcheva N., Riddiford N., Rosenzvit M.,
RA Salinas G., Wasmuth J.D., Zamanian M., Zheng Y., Cai X., Soberon X.,
RA Olson P.D., Laclette J.P., Brehm K., Berriman M., Garciarrubio A.,
RA Bobes R.J., Fragoso G., Sanchez-Flores A., Estrada K., Cevallos M.A.,
RA Morett E., Gonzalez V., Portillo T., Ochoa-Leyva A., Jose M.V., Sciutto E.,
RA Landa A., Jimenez L., Valdes V., Carrero J.C., Larralde C.,
RA Morales-Montor J., Limon-Lason J., Soberon X., Laclette J.P.;
RT "The genomes of four tapeworm species reveal adaptations to parasitism.";
RL Nature 496:57-63(2013).
RN [3] {ECO:0000313|EMBL:CDS23343.1}
RP NUCLEOTIDE SEQUENCE.
RA Aslett M.;
RL Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases.
RN [4] {ECO:0000313|WBParaSite:EgrG_000085400}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (OCT-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LK028590; CDS23343.1; -; Genomic_DNA.
DR EMBL; APAU02000054; EUB58856.1; -; Genomic_DNA.
DR AlphaFoldDB; U6JH05; -.
DR SMR; U6JH05; -.
DR STRING; 6210.U6JH05; -.
DR EnsemblMetazoa; XM_024495529.1; XP_024350052.1; GeneID_36341995.
DR WBParaSite; EgrG_000085400; EgrG_000085400; EgrG_000085400.
DR OMA; DWIHENV; -.
DR OrthoDB; 5404167at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR Proteomes; UP000492820; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR PANTHER; PTHR24256; TRYPTASE-RELATED; 1.
DR PANTHER; PTHR24256:SF565; ZGC:92313-RELATED; 1.
DR Pfam; PF00089; Trypsin; 2.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..457
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008431673"
FT DOMAIN 156..457
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 457 AA; 51472 MW; 863DB4ACD8E35E44 CRC64;
MQWVVRLCFL SLLLHLALTQ RSQSSASPPL PPPTPTPSQS KFFAHYVMDR STLLLRRKIF
IYKNELYTPW SEWSNCSTRD CTELRYRQCL NDSYESWIPN LFQTNNCPFQ FIAETRRCTN
DAGCRKEAPS KLLKELSSTC GKRPNFKGKR GVSPKILGGR EAKPHSWPWQ VALYVRPLVV
EGRSLRSPAI ESPFCGATLI APSWLITAAH CLSELVPDKV LTVGHFFSVE EELEQTIRAR
IGDHVRGKRD GSHEVTRQIE LAIIHPDYRR GFSEQGFDVA LLKLDQPVEF GDKVSSICIP
NRSLHLPEGQ ICYAAGWGAT APDTAVLPEP LGFIDFFSGG LFPRPCGLGQ TLASNRRCRG
PRQPLRLLEV DLPLVSLQRC RRTFRNLREW VHICAGEKGK DTCRGDSGGG LFCQNPEDGR
WYIYGVTSFG SVLGCGEHYG VYACTRGISD WIHENVK
//