GenomeNet

Database: UniProt
Entry: U6JH05_ECHGR
LinkDB: U6JH05_ECHGR
Original site: U6JH05_ECHGR 
ID   U6JH05_ECHGR            Unreviewed;       457 AA.
AC   U6JH05;
DT   22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT   22-JAN-2014, sequence version 1.
DT   27-MAR-2024, entry version 40.
DE   SubName: Full=Enteropeptidase {ECO:0000313|EMBL:EUB58856.1, ECO:0000313|WBParaSite:EgrG_000085400};
DE   SubName: Full=Mastin {ECO:0000313|EMBL:CDS23343.1};
GN   Name=EGR_06280 {ECO:0000313|WBParaSite:EgrG_000085400};
GN   ORFNames=EGR_06280 {ECO:0000313|EMBL:EUB58856.1}, EgrG_000085400
GN   {ECO:0000313|EMBL:CDS23343.1};
OS   Echinococcus granulosus (Hydatid tapeworm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC   Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC   Echinococcus granulosus group.
OX   NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB58856.1, ECO:0000313|Proteomes:UP000019149};
RN   [1] {ECO:0000313|EMBL:EUB58856.1, ECO:0000313|Proteomes:UP000019149}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24013640; DOI=10.1038/ng.2757;
RA   Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA   Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA   Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA   Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT   "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL   Nat. Genet. 45:1168-1175(2013).
RN   [2] {ECO:0000313|EMBL:CDS23343.1, ECO:0000313|Proteomes:UP000492820}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23485966; DOI=10.1038/nature12031;
RA   Tsai I.J., Zarowiecki M., Holroyd N., Garciarrubio A., Sanchez-Flores A.,
RA   Brooks K.L., Tracey A., Bobes R.J., Fragoso G., Sciutto E., Aslett M.,
RA   Beasley H., Bennett H.M., Cai J., Camicia F., Clark R., Cucher M.,
RA   De Silva N., Day T.A., Deplazes P., Estrada K., Fernandez C., Holland P.W.,
RA   Hou J., Hu S., Huckvale T., Hung S.S., Kamenetzky L., Keane J.A., Kiss F.,
RA   Koziol U., Lambert O., Liu K., Luo X., Luo Y., Macchiaroli N., Nichol S.,
RA   Paps J., Parkinson J., Pouchkina-Stantcheva N., Riddiford N., Rosenzvit M.,
RA   Salinas G., Wasmuth J.D., Zamanian M., Zheng Y., Cai X., Soberon X.,
RA   Olson P.D., Laclette J.P., Brehm K., Berriman M., Garciarrubio A.,
RA   Bobes R.J., Fragoso G., Sanchez-Flores A., Estrada K., Cevallos M.A.,
RA   Morett E., Gonzalez V., Portillo T., Ochoa-Leyva A., Jose M.V., Sciutto E.,
RA   Landa A., Jimenez L., Valdes V., Carrero J.C., Larralde C.,
RA   Morales-Montor J., Limon-Lason J., Soberon X., Laclette J.P.;
RT   "The genomes of four tapeworm species reveal adaptations to parasitism.";
RL   Nature 496:57-63(2013).
RN   [3] {ECO:0000313|EMBL:CDS23343.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Aslett M.;
RL   Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|WBParaSite:EgrG_000085400}
RP   IDENTIFICATION.
RG   WormBaseParasite;
RL   Submitted (OCT-2020) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LK028590; CDS23343.1; -; Genomic_DNA.
DR   EMBL; APAU02000054; EUB58856.1; -; Genomic_DNA.
DR   AlphaFoldDB; U6JH05; -.
DR   SMR; U6JH05; -.
DR   STRING; 6210.U6JH05; -.
DR   EnsemblMetazoa; XM_024495529.1; XP_024350052.1; GeneID_36341995.
DR   WBParaSite; EgrG_000085400; EgrG_000085400; EgrG_000085400.
DR   OMA; DWIHENV; -.
DR   OrthoDB; 5404167at2759; -.
DR   Proteomes; UP000019149; Unassembled WGS sequence.
DR   Proteomes; UP000492820; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   PANTHER; PTHR24256; TRYPTASE-RELATED; 1.
DR   PANTHER; PTHR24256:SF565; ZGC:92313-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 2.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..457
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5008431673"
FT   DOMAIN          156..457
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   457 AA;  51472 MW;  863DB4ACD8E35E44 CRC64;
     MQWVVRLCFL SLLLHLALTQ RSQSSASPPL PPPTPTPSQS KFFAHYVMDR STLLLRRKIF
     IYKNELYTPW SEWSNCSTRD CTELRYRQCL NDSYESWIPN LFQTNNCPFQ FIAETRRCTN
     DAGCRKEAPS KLLKELSSTC GKRPNFKGKR GVSPKILGGR EAKPHSWPWQ VALYVRPLVV
     EGRSLRSPAI ESPFCGATLI APSWLITAAH CLSELVPDKV LTVGHFFSVE EELEQTIRAR
     IGDHVRGKRD GSHEVTRQIE LAIIHPDYRR GFSEQGFDVA LLKLDQPVEF GDKVSSICIP
     NRSLHLPEGQ ICYAAGWGAT APDTAVLPEP LGFIDFFSGG LFPRPCGLGQ TLASNRRCRG
     PRQPLRLLEV DLPLVSLQRC RRTFRNLREW VHICAGEKGK DTCRGDSGGG LFCQNPEDGR
     WYIYGVTSFG SVLGCGEHYG VYACTRGISD WIHENVK
//
DBGET integrated database retrieval system