ID A0A6A5DDU4_SCHHA Unreviewed; 745 AA.
AC A0A6A5DDU4;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 27-MAR-2024, entry version 8.
DE SubName: Full=Cathepsin F (C01 family) {ECO:0000313|EMBL:KAF1320129.1};
GN ORFNames=MS3_0012103 {ECO:0000313|EMBL:KAF1320129.1};
OS Schistosoma haematobium (Blood fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=6185 {ECO:0000313|EMBL:KAF1320129.1};
RN [1] {ECO:0000313|EMBL:KAF1320129.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22246508; DOI=10.1038/ng.1065;
RA Young N.D., Jex A.R., Li B., Liu S., Yang L., Xiong Z., Li Y.,
RA Cantacessi C., Hall R.S., Xu X., Chen F., Wu X., Zerlotini A., Oliveira G.,
RA Hofmann A., Zhang G., Fang X., Kang Y., Campbell B.E., Loukas A.,
RA Ranganathan S., Rollinson D., Rinaldi G., Brindley P.J., Yang H., Wang J.,
RA Wang J., Gasser R.B.;
RT "Whole-genome sequence of Schistosoma haematobium.";
RL Nat. Genet. 44:221-225(2012).
RN [2] {ECO:0000313|EMBL:KAF1320129.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=31494670;
RA Stroehlein A.J., Korhonen P.K., Chong T.M., Lim Y.L., Chan K.G.,
RA Webster B., Rollinson D., Brindley P.J., Gasser R.B., Young N.D.;
RT "High-quality Schistosoma haematobium genome achieved by single-molecule
RT and long-range sequencing.";
RL Gigascience 8:0-0(2019).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAF1320129.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMPZ02000445; KAF1320129.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A6A5DDU4; -.
DR EnsemblMetazoa; XM_035729638.1; XP_035586447.1; MS3_0012103.
DR OrthoDB; 1085298at2759; -.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF959; CATHEPSIN F; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT TRANSMEM 127..148
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 446..502
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 531..743
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 745 AA; 84833 MW; 2EF953C26439E6D4 CRC64;
MHDSWSGCAG TRVNNLTNLS CKDRPRGLNL FPSFCLRYGR PIPELRGLSY KEQLEKLDLF
TLSYRRLRGD LILMYRILRN DFGPNLFSLF LPTRSGHLKG HSRRVEKPRT NKIPVAYRFS
HRVINTWNLL PVAVSGMISL SVLLALLIGG KDDLLYYNEH FEGELSDAPL GRVIPTLVTF
YNQYSSEPYR YVAFDANQTL QTVSSNLNIS FFSQNTRGYR YRFSVLLAPT ICPKYQTEYI
SIAHLGPCSL KVYGYQQCNF VLSFRKGLLY DDSTLYLEGC RSAVPRMPVN LETACMQLDD
LDFTDDLALL SHTHKQMQIK TTGVASASAS VGLNIHNGKS KVLKYNTENT NQFTLDGEAL
EDVESSTYLN SNIDERVQSD ADVKARIGKF FCTELKLRKI PNHRHNNASI HKRLSTQDIE
YLLTEYHQQQ PTYLGFKLPA NVEEKYAQFK LTYRKQYHET EDEIRFNIFK SNILKAQLYQ
VFERGSAIYG VTPYSDLTTD EFARTHLTAS WVVSSSTNNT PISLEKEVKN IPKNFDWREK
GAVTEVKNQG MCGSCWAFST TGNVESQWFR KTGKLLSLSE QQLVDCDGLD DGCNGGLPSN
AYESIIKMGG LMLEDNYPYD AKNEKCHLKA DNVAAYINSS VNLTQDETEL AAWLYHNSAI
SVGMNAMLLQ FYRHGISHPW WIFCSKYLLD HAVLLVGYGV SEENEPFWIV KNSWGVEWGE
KGYFRVYRGD GTCGINTVAT SALIY
//