ID B8C725_THAPS Unreviewed; 244 AA.
AC B8C725;
DT 03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 03-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Probable papain cysteine protease {ECO:0000313|EMBL:EED90907.1};
DE Flags: Fragment;
GN ORFNames=THAPSDRAFT_263404 {ECO:0000313|EMBL:EED90907.1};
OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=35128 {ECO:0000313|EMBL:EED90907.1, ECO:0000313|Proteomes:UP000001449};
RN [1] {ECO:0000313|EMBL:EED90907.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED90907.1};
RX PubMed=15459382; DOI=10.1126/science.1101156;
RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA Wilkerson F.P., Rokhsar D.S.;
RT "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT metabolism.";
RL Science 306:79-86(2004).
RN [2] {ECO:0000313|EMBL:EED90907.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED90907.1};
RX PubMed=18923393; DOI=10.1038/nature07410;
RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA Grigoriev I.V.;
RT "The Phaeodactylum genome reveals the evolutionary history of diatom
RT genomes.";
RL Nature 456:239-244(2008).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000644; EED90907.1; -; Genomic_DNA.
DR RefSeq; XP_002292056.1; XM_002292020.1.
DR AlphaFoldDB; B8C725; -.
DR STRING; 35128.B8C725; -.
DR PaxDb; 35128-Thaps263404; -.
DR EnsemblProtists; EED90907; EED90907; THAPSDRAFT_263404.
DR GeneID; 7449808; -.
DR KEGG; tps:THAPSDRAFT_263404; -.
DR eggNOG; KOG1543; Eukaryota.
DR HOGENOM; CLU_012184_2_1_1; -.
DR InParanoid; B8C725; -.
DR OMA; ETHGAWA; -.
DR Proteomes; UP000001449; Chromosome 8.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR12411:SF929; CATHEPSIN Z; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000313|EMBL:EED90907.1};
KW Protease {ECO:0000313|EMBL:EED90907.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000001449};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT DOMAIN 19..244
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EED90907.1"
FT NON_TER 244
FT /evidence="ECO:0000313|EMBL:EED90907.1"
SQ SEQUENCE 244 AA; 27636 MW; FE27D26AED49DC52 CRC64;
PVQHIIHPLP HHYLTAEDLP QNFTWQNVNA HSYLTRMRNQ HIPQYCGSCW AHSALSSLAD
RVKIMRSYTG PDIDLSVQYL LNCGIANETE THPHKLSCHG GNSLYAYDYI HSTLGFIPED
SCLNYIACSS ESDEGWCPEV RSLTTCAAWN VCRTCDNIEP HNIHAIQSEI YARGPIKAAI
NANPLRNYTG GILGSDDDPA MLDTHHNHGV SIVGWGYDEE RKTQHWIVRN SWGVYWGEMG
FFRI
//