ID A0A085N5U3_9BILA Unreviewed; 1581 AA.
AC A0A085N5U3;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN ORFNames=M514_01651 {ECO:0000313|EMBL:KFD64839.1};
OS Trichuris suis (pig whipworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichuridae; Trichuris.
OX NCBI_TaxID=68888 {ECO:0000313|EMBL:KFD64839.1};
RN [1] {ECO:0000313|EMBL:KFD64839.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DCEP-RM93F {ECO:0000313|EMBL:KFD64839.1};
RX PubMed=24929829; DOI=10.1038/ng.3012;
RA Jex A.R., Nejsum P., Schwarz E.M., Hu L., Young N.D., Hall R.S.,
RA Korhonen P.K., Liao S., Thamsborg S., Xia J., Xu P., Wang S.,
RA Scheerlinck J.P., Hofmann A., Sternberg P.W., Wang J., Gasser R.B.;
RT "Genome and transcriptome of the porcine whipworm Trichuris suis.";
RL Nat. Genet. 46:701-706(2014).
CC -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC {ECO:0000256|ARBA:ARBA00024195}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL367549; KFD64839.1; -; Genomic_DNA.
DR Proteomes; UP000030758; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00190; Tryp_SPc; 4.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 6.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR PANTHER; PTHR24256:SF470; IP10114P-RELATED; 1.
DR PANTHER; PTHR24256; TRYPTASE-RELATED; 1.
DR Pfam; PF00089; Trypsin; 7.
DR SMART; SM00020; Tryp_SPc; 6.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 6.
DR PROSITE; PS50240; TRYPSIN_DOM; 5.
DR PROSITE; PS00134; TRYPSIN_HIS; 3.
PE 3: Inferred from homology;
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1581
FT /note="Peptidase S1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001795661"
FT DOMAIN 45..285
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 281..629
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 632..871
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 862..1081
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 1078..1417
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 1581 AA; 178459 MW; BD567C374DEFA6BE CRC64;
MDLRQGTLLA IAYTLCLLIN YSEETCGRPF YKPLLTPKRG DGNRITNGIE AREHSHPWQA
LVFTTMQGYR KRCGGSLIHW KDGNTSDLVL TAAHCVVDTS DFDTLQLWEK VRYSFSRLVN
REMYNGPLVN ASDVHVYLGA HDVRLLPYSS EHISVKEIAI GEFNKVRDVE DIALLKLQKV
VTYGKFIQGI CLPNEDEKEL PIHSHCLVAG WGLAADGKPA ARLQQVDAFI YEGRVNCSFF
YKDRMICARG RTNSCGPEEE RCGRPFYKPV LPSVSRAGNR ISNGIEARKH SHPWQALVIT
HERGAVKRCG GSLIHWKEAN SSDLILTAAH CVIDVSDFNS TSMWEEMTYF FRRLFNSHQK
NGPLAKPSDV HVYLGAHDVD QLDYSTERIG VKEIFAGEFN KVSSIEDLTI LRLEKEVAYN
KFIQGICLPA ENEKEPPKES SCVVTGWGLL ADGKSATKLQ QIDAYIFDGE ISSDFFHKDQ
MICALGGGKD TGPDSVRGDI LSFIYQPPII NIIPVTPANF RQYLLSYIHY LNSSRKMLFQ
AEMFYENRYS LLIVISLEFL LFQSSEEICG QPYYKPFIPS SRLSNRITNG VEVRKHSHPW
QALVHKKFDH GHLGICGGSL IHWKESNASD LVLTAAHCVF NGDTSEQATP WKMITNVFSS
VFGGHESKLI TNVTDVLVYL GVHDTELFDS KVKRFRVAAM AIGDFDAKSK AHDIALLKLE
NKATYNEFIQ GVCLPTQDEN LPDAPRYCMV AGWGLLDDGT LGTKLQQVKA YIYEDVVYQS
TFHKSRMICT DSSKRKAGAR EFQYLKSAPK LLFRVEMFYE NTCALLIVIS LECLLFHGTE
EICGQPHYKP ILPTNDTVGN RIANGIEARR HSHPWQALVY KLVGEEQRIC GGSLIHWQEQ
NASDLVLTAA HCVLHGYDSN GTTPWQIVTY AFTKIFNGQE SKLITNVTDV EVYLGMHSIT
RVSSYAKRYR VAAMMVGEFN KINKPHDIAV LKLDSKATYN RFIQGICLPT EDENLPALSS
QCLVVGWGLL DRNAYSDMRK VLLLIICANL LCRDSEEFCG QPVVKPILST KDQAANRISN
GIEVRMHSHP WQALILIKDL AADMLCGGSL MHWNHENASE FILTAAHCLV NWVRYRKERT
AQEKTEGPSL SKRYESYPLV NVSDVIVYLG LHDIKQPTSA MKKLRITAMA VAQFDVYNRT
DDIALLKLEK EITYSAEIQG ICLPTKDEVL PPESLCLVAG WGRLRDGERG TKLQQFQTHI
IIGKVNNSHY QEDLMICTGS KTRNTGARKS QYLYYLNTVL KLDLKIAFAG RKMYFDMRKV
LLLIICANLL LRVSEAFCGR PFFKPIFSTE GPAGNRISNG IEARTHSHPW QALILIKDPQ
TIGLCGGSLI HWKHENASDV ILTAAHCLID WVRYHKEKTT WQKMKQFFNR VFKRRQRPPF
TNLDYLIVYL GLHDINRPTS VLQRLRASAM AAAEFDVPKK TDDIALIKLE KEVTYNVAIQ
GVCLPAENEG LPPPESPCLV AGWGRLRDGK HGTKLQQFQT YIIDGKVNNP FFQQDLMICT
GSKVKDTGAR KARKQILLIV T
//