ID E4YZV1_OIKDI Unreviewed; 677 AA.
AC E4YZV1;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
DE Flags: Fragment;
GN ORFNames=GSOID_T00023024001 {ECO:0000313|EMBL:CBY40979.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY40979.1};
RN [1] {ECO:0000313|EMBL:CBY40979.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN656205; CBY40979.1; -; Genomic_DNA.
DR AlphaFoldDB; E4YZV1; -.
DR Proteomes; UP000011014; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF65; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00089; Trypsin; 3.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 3.
DR PROSITE; PS50240; TRYPSIN_DOM; 3.
DR PROSITE; PS00134; TRYPSIN_HIS; 2.
DR PROSITE; PS00135; TRYPSIN_SER; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034}.
FT DOMAIN 1..58
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 103..350
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 425..673
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 356..377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 356..374
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:CBY40979.1"
SQ SEQUENCE 677 AA; 71984 MW; DACF1FCF68074963 CRC64;
GGQDACQGDS GGPLVCLENG VATLVGVTSW GHGCAFQGYP GIWASTVDNF EWIDGIISGT
ITLPPTTTTT RNTGPPPDGW NTYNATVFDY SNTVYLPWDD NKIVGGQEVE EGQWPFIVGL
GGDSNGNTHW CGGTIISKND YGDDWILTAA HCCDGTGMTM NNIRIGDWQQ STLQANEFST
SGIQYVHPNY DDWFLSNDIC LIRVPNLSAV NADAFEKTCL PSERPAHGSK CYIAGWGTLH
QDDWDGPDIL HDVGIWVMDN EYCENTGNAN ELDQSMICAG SPDMNGNGWP DGGIDACYGD
SGGPLVCLDD NNEPIVVGLT SWGFGCAQEN FPGVWASIAD NLDWIYGTMD GTYTTTTGSS
TSTVSTTTTT GGGGGPVNPP DGWEEAPETM GTVSKCPAPG TPYSPAARSA HKFSPWDDSR
RNSRIVGGQE VLEGEWPFQV GLFKDSWGGF FCGGSVITKN EIGFDYILSA AHCCEAVTGT
IDIHVGDWKH GNHQTSGGEF VVTGTKLSHP NYNDVNLAND LCIIEVPNLL AAAPSADSFQ
AACLAESRPE HGRRVYTAGW GAIAEGAWGS ETLKDAGLWY MSNEYCEGTS NGAYGIQQNM
ICAGVPDFDG DGITDGGQDA CQGDSGGPLV VLENGVPLLI GATSWGIGCA RPNYPGVWAS
VPDNMEWILE NISTNKI
//