ID H2ZGI0_CIOSA Unreviewed; 446 AA.
AC H2ZGI0;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
OS Ciona savignyi (Pacific transparent sea squirt).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000016696.1, ECO:0000313|Proteomes:UP000007875};
RN [1] {ECO:0000313|Proteomes:UP000007875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000016696.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H2ZGI0; -.
DR Ensembl; ENSCSAVT00000016878.1; ENSCSAVP00000016696.1; ENSCSAVG00000009815.1.
DR GeneTree; ENSGT00940000157103; -.
DR HOGENOM; CLU_006842_19_6_1; -.
DR OMA; GINCWIS; -.
DR Proteomes; UP000007875; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 4.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 4.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR PANTHER; PTHR24252:SF7; HYALIN; 1.
DR Pfam; PF00057; Ldl_recept_a; 4.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00192; LDLa; 4.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 4.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS01209; LDLRA_1; 2.
DR PROSITE; PS50068; LDLRA_2; 4.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW Serine protease {ECO:0000256|RuleBase:RU363034}.
FT DOMAIN 223..446
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DISULFID 14..26
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 21..39
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 33..48
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 73..85
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 80..98
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 92..107
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 135..150
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 179..194
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 446 AA; 49388 MW; C02DFC0F318DF453 CRC64;
TETLPASTDV EPKCGVGMFS CKDEQCIDVR WRCDGYNDCG EWEDEQDCAK LIPQFHNGST
LTPLLHSNIS TVCTYDQFRC SNRQCVEGSR ACDGYYDCLD RSDELQCEEC SYGQFKCRNS
TDGPGHQCLP MTSRCDGVTQ CNDGSDEVDC NCNDLNRCVE TGTFRCRSGG KCLTMSHVCD
GHNDCPENSD EDSCSTHVQI TCKPRSCGKR NDVTDGRRIY TRILGGRTSR DRTWPWVTSL
TLPGMKHVCG ATLVRPEWLI SAAHCFQEEG LGVWKARLGT GNGGVMEFNV TDVVVHPRYN
AEAVDFDIAL VKLGGRVRSF NAEPLCLPTF DIDPGSYCVI AGWGVTEGLQ GGTRKLQEGS
VHVIERSTCQ RYYPNHVISD RMMCAGRGGT VDACSGDSGG PMMCWHPQRR QWQLSGVTSW
GSSCTPHSAP GVYTDVKYFS KWAERQ
//