GenomeNet

Database: UniProt
Entry: A0A6A5DDU4_SCHHA
LinkDB: A0A6A5DDU4_SCHHA
Original site: A0A6A5DDU4_SCHHA 
ID   A0A6A5DDU4_SCHHA        Unreviewed;       745 AA.
AC   A0A6A5DDU4;
DT   17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 1.
DT   27-MAR-2024, entry version 8.
DE   SubName: Full=Cathepsin F (C01 family) {ECO:0000313|EMBL:KAF1320129.1};
GN   ORFNames=MS3_0012103 {ECO:0000313|EMBL:KAF1320129.1};
OS   Schistosoma haematobium (Blood fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX   NCBI_TaxID=6185 {ECO:0000313|EMBL:KAF1320129.1};
RN   [1] {ECO:0000313|EMBL:KAF1320129.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=22246508; DOI=10.1038/ng.1065;
RA   Young N.D., Jex A.R., Li B., Liu S., Yang L., Xiong Z., Li Y.,
RA   Cantacessi C., Hall R.S., Xu X., Chen F., Wu X., Zerlotini A., Oliveira G.,
RA   Hofmann A., Zhang G., Fang X., Kang Y., Campbell B.E., Loukas A.,
RA   Ranganathan S., Rollinson D., Rinaldi G., Brindley P.J., Yang H., Wang J.,
RA   Wang J., Gasser R.B.;
RT   "Whole-genome sequence of Schistosoma haematobium.";
RL   Nat. Genet. 44:221-225(2012).
RN   [2] {ECO:0000313|EMBL:KAF1320129.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=31494670;
RA   Stroehlein A.J., Korhonen P.K., Chong T.M., Lim Y.L., Chan K.G.,
RA   Webster B., Rollinson D., Brindley P.J., Gasser R.B., Young N.D.;
RT   "High-quality Schistosoma haematobium genome achieved by single-molecule
RT   and long-range sequencing.";
RL   Gigascience 8:0-0(2019).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAF1320129.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMPZ02000445; KAF1320129.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A6A5DDU4; -.
DR   EnsemblMetazoa; XM_035729638.1; XP_035586447.1; MS3_0012103.
DR   OrthoDB; 1085298at2759; -.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF959; CATHEPSIN F; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   TRANSMEM        127..148
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          446..502
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          531..743
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   745 AA;  84833 MW;  2EF953C26439E6D4 CRC64;
     MHDSWSGCAG TRVNNLTNLS CKDRPRGLNL FPSFCLRYGR PIPELRGLSY KEQLEKLDLF
     TLSYRRLRGD LILMYRILRN DFGPNLFSLF LPTRSGHLKG HSRRVEKPRT NKIPVAYRFS
     HRVINTWNLL PVAVSGMISL SVLLALLIGG KDDLLYYNEH FEGELSDAPL GRVIPTLVTF
     YNQYSSEPYR YVAFDANQTL QTVSSNLNIS FFSQNTRGYR YRFSVLLAPT ICPKYQTEYI
     SIAHLGPCSL KVYGYQQCNF VLSFRKGLLY DDSTLYLEGC RSAVPRMPVN LETACMQLDD
     LDFTDDLALL SHTHKQMQIK TTGVASASAS VGLNIHNGKS KVLKYNTENT NQFTLDGEAL
     EDVESSTYLN SNIDERVQSD ADVKARIGKF FCTELKLRKI PNHRHNNASI HKRLSTQDIE
     YLLTEYHQQQ PTYLGFKLPA NVEEKYAQFK LTYRKQYHET EDEIRFNIFK SNILKAQLYQ
     VFERGSAIYG VTPYSDLTTD EFARTHLTAS WVVSSSTNNT PISLEKEVKN IPKNFDWREK
     GAVTEVKNQG MCGSCWAFST TGNVESQWFR KTGKLLSLSE QQLVDCDGLD DGCNGGLPSN
     AYESIIKMGG LMLEDNYPYD AKNEKCHLKA DNVAAYINSS VNLTQDETEL AAWLYHNSAI
     SVGMNAMLLQ FYRHGISHPW WIFCSKYLLD HAVLLVGYGV SEENEPFWIV KNSWGVEWGE
     KGYFRVYRGD GTCGINTVAT SALIY
//
DBGET integrated database retrieval system