ID A0A077ZA46_TRITR Unreviewed; 2063 AA.
AC A0A077ZA46;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Trypsin domain containing protein {ECO:0000313|EMBL:CDW55590.1};
GN ORFNames=TTRE_0000386301 {ECO:0000313|EMBL:CDW55590.1};
OS Trichuris trichiura (Whipworm) (Trichocephalus trichiurus).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichuridae; Trichuris.
OX NCBI_TaxID=36087 {ECO:0000313|EMBL:CDW55590.1};
RN [1] {ECO:0000313|EMBL:CDW55590.1}
RP NUCLEOTIDE SEQUENCE.
RA Aslett M.;
RL Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:CDW55590.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Foth B.J., Tsai I.J., Reid A.J., Bancroft A.J., Nichol S., Tracey A.,
RA Holroyd N., Cotton J.A., Stanley E.J., Zarowiecki M., Liu J.Z.,
RA Huckvale T., Cooper P.J., Grencis R.K., Berriman M.;
RT "The whipworm genome and dual-species transcriptomics of an intimate host-
RT pathogen interaction.";
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG805966; CDW55590.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A077ZA46; -.
DR STRING; 36087.A0A077ZA46; -.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 7.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 10.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF65; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00089; Trypsin; 11.
DR SMART; SM00020; Tryp_SPc; 8.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 8.
DR PROSITE; PS50240; TRYPSIN_DOM; 8.
DR PROSITE; PS00134; TRYPSIN_HIS; 5.
DR PROSITE; PS00135; TRYPSIN_SER; 5.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034}.
FT DOMAIN 26..172
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 227..505
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 522..790
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 815..1051
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1057..1335
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1360..1574
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1571..1864
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1863..2057
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 2063 AA; 232609 MW; E34A077E9F26F507 CRC64;
MRIYSLAICG KPYFDPASEY KRTNRIVRGF NAVPHSHPWQ ALIYVAINGT KYKCGGSLID
WNDHNSSDLV LTAAHCVIDM DAKAGVPLVH PEDVHVYFGI HNLKRLKETR QIYAVTHIIP
ATFHEFNKAN DIAILKLDRN VYYNKYVQGI CLPSINEELA PQGGRCFVTG WGSIGKLKSF
VDAFCLHEME EFTVYTKVSR YLDWMRKAIC GKPYFDPSPE YKWTNRIIRG FNAVPHSHPW
QALIYITING LVSKCGGSLI DWDDRNSSDL VLTAAHCVID TDQFGKSTSR WEELAYHVNR
LIKKNPKVAV PLANPEDVDV YFGVHHLKGP EDTRQKRAVT HIIPGIFHEF NKKDDIAILK
LDRKVNYNKY IQGICLPSID EKFTPYKGRC FVTGWGSIGE LQSFLFYAGN GQNPKELQQF
EVSMYDGKVH YSGYKEKNML CHRMNLGGSA EGDSGGPLAC LKDDKYVLYG IISFSVDVSC
LRHMEEFAVF TKVSRYLDWM KKAICGKPYF DPVPEYKRTN RIVRGFNAVP HSHPWQALIY
VTINGRKYKC GGSLIDWNDH NSSDLVLTAA HCVIDTDQFR NSTSLFEELM FHVNRLIKWD
AKAGVPLANP KDVKVYFGAH NIKHPEDTRQ KYAVTHIIPG TFNEFNEEDD IAILKLNRKV
NYNKYIQGIC LPSIDEELPP QGGRCFVTGW GSIGNGQNPD ELQQFEVSMY DGNVHHSGYN
KKNMLCSRKN EDGIDEGDSG GPLACLKDSK YVLYGIISFS VGVCGLYRMQ EFAVFTKVTK
YLNWMRKVIP TTVNHAICGK PYFDPVPKPK RNNRIVDGFE AVPHSHPWQA VIYVNMNGSL
SLCGGSLIDW NGHNSSDLIL TAAHCVIDID QFRKSTSRFD ELKFHVNRLI KRDTKAGVPL
VKPEDVHVYV GAHNIKKVER TRQEYPVTLI IPGTFHEFNE KEDIAILKLG RKSSISRQIL
CSVSLFYAGN QENPDKLQQF EVSLYDGKVR YSGYNKNNML CHRMNKGGSA EGDSGGPLAC
MKDDKYVIYG IISFSVEAIC GKPYFDPVPE YKWNNRIVRG FKAVPHSHPW QALIYVTTNG
TKYKCGGSLI DWNNHNSSDL VLTAAHCIID IDQFGKSTSG WEELVYHVNR LIKNDPKVAV
PLANPEDVDV YFGAHNIKHP EDTRQKYAVT HIIPGIFHEF DKKDDIAILK LDRKVNYNKY
IQGICLPSVD EELAPQGGRC FVTGWGTIGK LQFFLFYSGN GQYPNELQQF EVSMHDGKVR
YSGYNKKNML CSRKNGGGID QGDSGGPLAC LKDNRYVLYG IISFSVDVFC LYKSQEFAVF
TKVSRYLNWM RKVIPTIVNH AICGKPYFDP IPLHKRNNRI VHGFEAVPHS HPWQAAIYVD
MIDSVSICGG SLIDWNGHNS SDLVLTAAHC VIDTKAKVEV PLANPENVTV YLGAHNLGRL
ENTRQEYGVT HIIAGAFNEF NEKDDIAILK LGRKVYYNKY IQGICLPSIN EEAVPKGSRC
FVTGWGTIGN EQNPDELQQF EVSLYDGKVR YSAYNKKKML CHPMNEGGCA EAVCGKPHFD
PLPEHKRNNR ILHGFEAVPH SHPWQAVVYV NINGPLSFCG GSLIDWNDHN SSDLVLTAAH
CVIDTKGKVE VPLANPENVT VYLGAHNLKR VETTREKYAV THIIPGAFHE FNEKDDIAIL
KLGRKVYYNK YIQGICLPSI NEEIASKGSR CFVTGWGRTG NGRNPGELQQ LEVSLYGGIV
QYSGYSKKKM LCHRMNEGGS AQGDSGGPLA CLKDNKFVLY GIISFSVDVF CLHEKEEFGV
FTKAVCGKPY FDPVPVHGRN NRIVHGFEAV PHSHPWQALI YVNINGSLSK CGGSLIDWNG
HNSSDLVLTA AHCVIYIDRF RNSISLFKEL MFHLEKSIKR KAKVEVPLAN PENVTVYLGA
HNLKRVETTR EKYAVTHIIP GTYHKFNEKD DIAILNLFYT GNGRNPDELQ QLEVSLYGGK
VLYSGFNKKN TLHHRKDEGG SAEGDSGGPL ACLKDNKFVL YGIISFSIDV VGFDKTEEFS
VFTEVSKYLN WMREVIPTTV NYV
//