ID A0A5A7RHR3_STRAF Unreviewed; 484 AA.
AC A0A5A7RHR3;
DT 13-NOV-2019, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2019, sequence version 1.
DT 27-MAR-2024, entry version 12.
DE SubName: Full=Cysteine proteinases superfamily protein {ECO:0000313|EMBL:GER56745.1};
GN ORFNames=STAS_34485 {ECO:0000313|EMBL:GER56745.1};
OS Striga asiatica (Asiatic witchweed) (Buchnera asiatica).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Lamiales; Orobanchaceae; Buchnereae; Striga.
OX NCBI_TaxID=4170 {ECO:0000313|EMBL:GER56745.1, ECO:0000313|Proteomes:UP000325081};
RN [1] {ECO:0000313|Proteomes:UP000325081}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. UVA1 {ECO:0000313|Proteomes:UP000325081};
RX PubMed=31522940; DOI=10.1016/j.cub.2019.07.086;
RA Yoshida S., Kim S., Wafula E.K., Tanskanen J., Kim Y.M., Honaas L.,
RA Yang Z., Spallek T., Conn C.E., Ichihashi Y., Cheong K., Cui S., Der J.P.,
RA Gundlach H., Jiao Y., Hori C., Ishida J.K., Kasahara H., Kiba T., Kim M.S.,
RA Koo N., Laohavisit A., Lee Y.H., Lumba S., McCourt P., Mortimer J.C.,
RA Mutuku J.M., Nomura T., Sasaki-Sekimoto Y., Seto Y., Wang Y., Wakatake T.,
RA Sakakibara H., Demura T., Yamaguchi S., Yoneyama K., Manabe R.I.,
RA Nelson D.C., Schulman A.H., Timko M.P., dePamphilis C.W., Choi D.,
RA Shirasu K.;
RT "Genome Sequence of Striga asiatica Provides Insight into the Evolution of
RT Plant Parasitism.";
RL Curr. Biol. 29:3041-3052.e4(2019).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GER56745.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BKCP01012737; GER56745.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A5A7RHR3; -.
DR Proteomes; UP000325081; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF741; CATHEPSIN K; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 2.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 2.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 2.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000325081};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..484
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5022668499"
FT DOMAIN 36..89
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 120..338
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 341..474
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 484 AA; 54223 MW; A606340D0F6DDF29 CRC64;
MKISFIVLLL LLCTLLLSGL GFEYSADELE DMGSLYERWR AYYNKLDEPT NDRFPAFREN
VLRIHHFNQL DEQYKLGLNQ FSDLTAHEFA ARHGCQPGPH QPLDLLEEEF SQLEYHTSDV
PTSVDWRAKG AVTYVKNQLN CKSCWAFSAV GAVEGINFVR TGRLVSLSAQ ELVDCDKRNN
GCRKGNVVRA FDFIRDRGIT SERVYPYVGK QRICDSTKVK SPVVKIAGYK RIPPNNEKAL
MQAVAQQPVS AAIAADNDFV YYKTGIYNGN CSTASSSKDL NHAVTVVGYG MTKQGIKFWT
VKNSWGGSWG ENGYIRMARD VKYKSGMCGI AIEASYPISF NVLLLFLCTL LLNGLGFGYS
ADELEDLGNL YEYARSSAEA LMRAVAHQPV SAFVSIDGEF KSYKKGVFTG WCDVNLTHIV
TVVGYGATEQ GTKFWIVKNS WGSWWGDNGY IRLARDVTDE RGQCGIAMRA SYPMINEFNA
DGFF
//