ID R7Q2H1_CHOCR Unreviewed; 655 AA.
AC R7Q2H1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=Tail specific protease domain-containing protein {ECO:0000259|Pfam:PF03572};
GN ORFNames=CHC_T00001688001 {ECO:0000313|EMBL:CDF32787.1};
OS Chondrus crispus (Carrageen Irish moss) (Polymorpha crispa).
OC Eukaryota; Rhodophyta; Florideophyceae; Rhodymeniophycidae; Gigartinales;
OC Gigartinaceae; Chondrus.
OX NCBI_TaxID=2769 {ECO:0000313|EMBL:CDF32787.1, ECO:0000313|Proteomes:UP000012073};
RN [1] {ECO:0000313|Proteomes:UP000012073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Stackhouse {ECO:0000313|Proteomes:UP000012073};
RX PubMed=23503846; DOI=10.1073/pnas.1221259110;
RA Collen J., Porcel B., Carre W., Ball S.G., Chaparro C., Tonon T.,
RA Barbeyron T., Michel G., Noel B., Valentin K., Elias M., Artiguenave F.,
RA Arun A., Aury J.M., Barbosa-Neto J.F., Bothwell J.H., Bouget F.Y.,
RA Brillet L., Cabello-Hurtado F., Capella-Gutierrez S., Charrier B.,
RA Cladiere L., Cock J.M., Coelho S.M., Colleoni C., Czjzek M., Da Silva C.,
RA Delage L., Denoeud F., Deschamps P., Dittami S.M., Gabaldon T.,
RA Gachon C.M., Groisillier A., Herve C., Jabbari K., Katinka M., Kloareg B.,
RA Kowalczyk N., Labadie K., Leblanc C., Lopez P.J., McLachlan D.H.,
RA Meslet-Cladiere L., Moustafa A., Nehr Z., Nyvall Collen P., Panaud O.,
RA Partensky F., Poulain J., Rensing S.A., Rousvoal S., Samson G.,
RA Symeonidi A., Weissenbach J., Zambounis A., Wincker P., Boyen C.;
RT "Genome structure and metabolic features in the red seaweed Chondrus
RT crispus shed light on evolution of the Archaeplastida.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:5247-5252(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG001587; CDF32787.1; -; Genomic_DNA.
DR RefSeq; XP_005712588.1; XM_005712531.1.
DR AlphaFoldDB; R7Q2H1; -.
DR STRING; 2769.R7Q2H1; -.
DR EnsemblPlants; CDF32787; CDF32787; CHC_T00001688001.
DR GeneID; 17320305; -.
DR Gramene; CDF32787; CDF32787; CHC_T00001688001.
DR KEGG; ccp:CHC_T00001688001; -.
DR OrthoDB; 4157989at2759; -.
DR Proteomes; UP000012073; Unassembled WGS sequence.
DR GO; GO:0008236; F:serine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR InterPro; IPR029045; ClpP/crotonase-like_dom_sf.
DR InterPro; IPR005151; Tail-specific_protease.
DR PANTHER; PTHR32060:SF22; CARBOXYL-TERMINAL-PROCESSING PEPTIDASE 2, CHLOROPLASTIC-RELATED; 1.
DR PANTHER; PTHR32060; TAIL-SPECIFIC PROTEASE; 1.
DR Pfam; PF03572; Peptidase_S41; 1.
DR SUPFAM; SSF52096; ClpP/crotonase; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000012073};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..655
FT /note="Tail specific protease domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004442636"
FT DOMAIN 367..591
FT /note="Tail specific protease"
FT /evidence="ECO:0000259|Pfam:PF03572"
FT REGION 620..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 633..655
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 655 AA; 72970 MW; 2877DE550197DDC0 CRC64;
MVARCATFPI LVTLLLFRVA HSIRCQHSKH CTTDDPKLLG LCAPRHRVCF TIPHPARHIT
DDDDHDDEDA PLFVSLSELI EEANADKLSV AAKRDVIDAV KTIYMDVNPH RFLHENLLKI
DFPAALDSID ITEEMTNVEF HENMMNAFQL MDDYHSVFLA PEPLRTSVAT LGFAVAKFFE
EGSRQRQYIV NDLLAELIPS NSSFGIGSEL LLIDGVPIDK YVLTLGKNSS ASNLAAQIDS
GIFLVSFRTL AFDPIPFSST VDIVYLTTEG IRKSITLPWF FTKLYTQEAA DTMSHAVHPA
AYQPLHGRPN RVVFSEADKR KLYEEVTAEP LDITTRVIEN GRVPIEVSSE FTERFTAEVI
LTKSGPIGRF VIPDFGASVS LELALEIARI LRLMPLNGLI MDLRSNAGGD GDYVKLLAES
LVSETVPPNP NTLRVSQFLQ DLLLSADTKN VTAEELILLP AVIRALNTSL AIGEQFTGPT
VDIYSSEFRE RFAPRAYFGP VLTLVDGICY SAGDLYTSLQ KDYGFSRVVG VSDNVGAGGA
SVYRYSQLAE LFPQKIKKVE SEFTMAYVRF FRSGTSKGAI IENFGIKPDV RYYPTRNDAF
SDDCDLYEFL GSMLKDMREE KDGTKIEREE FDATSEEEPM PEEGLTLEEG PTPEE
//