ID A0A2P5AGF0_TREOI Unreviewed; 345 AA.
AC A0A2P5AGF0;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE SubName: Full=Cysteine protease {ECO:0000313|EMBL:PON35618.1};
GN Name=TorCP2 {ECO:0000313|EMBL:PON35618.1};
GN ORFNames=TorRG33x02_351050 {ECO:0000313|EMBL:PON35618.1};
OS Trema orientale (Charcoal tree) (Celtis orientalis).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Rosales; Cannabaceae; Trema.
OX NCBI_TaxID=63057 {ECO:0000313|EMBL:PON35618.1, ECO:0000313|Proteomes:UP000237000};
RN [1] {ECO:0000313|Proteomes:UP000237000}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. RG33-2 {ECO:0000313|Proteomes:UP000237000};
RA Van Velzen R., Holmer R., Bu F., Rutten L., Van Zeijl A., Liu W.,
RA Santuari L., Cao Q., Sharma T., Shen D., Roswanjaya Y., Wardhani T.,
RA Kalhor M.S., Jansen J., Van den Hoogen J., Gungor B., Hartog M.,
RA Hontelez J., Verver J., Yang W.-C., Schijlen E., Repin R., Schilthuizen M.,
RA Schranz E., Heidstra R., Miyata K., Fedorova E., Kohlen W., Bisseling T.,
RA Smit S., Geurts R.;
RT "Parallel loss of symbiosis genes in relatives of nitrogen-fixing non-
RT legume Parasponia.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PON35618.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JXTC01000875; PON35618.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P5AGF0; -.
DR STRING; 63057.A0A2P5AGF0; -.
DR InParanoid; A0A2P5AGF0; -.
DR OrthoDB; 5472443at2759; -.
DR Proteomes; UP000237000; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF1032; SENESCENCE-SPECIFIC CYSTEINE PROTEASE SAG12; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000313|EMBL:PON35618.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000237000};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..345
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018523337"
FT DOMAIN 41..98
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 127..344
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 345 AA; 37884 MW; E2B17923F940C16C CRC64;
MALTLQTTSI FVTTLFLVLV MWASQAQCRT LNSEISMSDR HEQWMARYGR TYKDDEEKEL
RFQIFKKNVE FIESFNKAKN KAYKLGLNEF TDLTNEEFRT SHNGYKRSSN LLRSSHTSFK
YENMTAVPSS LDWRKKGAVT PVKDQGQCGC CWAFSAVAAT EGITKLSTGK LISLSEQELV
DCDTSGTDQG CEGGLMDDAF EFIINNGGLN SEAKYPYMGV DGTCNKKKSS SDAAKISGYE
DVPANSEKAL LKAVANQPVS VAIDAGGPEF QSYSSGVFTG ECGTQLDHGV TAVGYGTAED
GTKFWLVKNS WGTSWGENGY IRMQRDVDAN EGLCGIAMEA SYPTA
//