GenomeNet

Database: UniProt
Entry: E4XC22_OIKDI
LinkDB: E4XC22_OIKDI
Original site: E4XC22_OIKDI 
ID   E4XC22_OIKDI            Unreviewed;       633 AA.
AC   E4XC22;
DT   08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT   08-FEB-2011, sequence version 1.
DT   27-MAR-2024, entry version 37.
DE   RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN   ORFNames=GSOID_T00007674001 {ECO:0000313|EMBL:CBY09147.1};
OS   Oikopleura dioica (Tunicate).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC   Oikopleuridae; Oikopleura.
OX   NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY09147.1};
RN   [1] {ECO:0000313|EMBL:CBY09147.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21097902; DOI=10.1126/science.1194167;
RA   Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA   Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA   Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA   Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA   Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA   Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA   Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA   Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA   Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA   Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA   Roest Crollius H., Wincker P., Chourrout D.;
RT   "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT   pelagic tunicate.";
RL   Science 330:1381-1385(2010).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; FN653035; CBY09147.1; -; Genomic_DNA.
DR   AlphaFoldDB; E4XC22; -.
DR   InParanoid; E4XC22; -.
DR   Proteomes; UP000001307; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24264:SF15; RIKEN CDNA 2210010C04 GENE; 1.
DR   PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001307};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW   ECO:0000256|RuleBase:RU363034}.
FT   DOMAIN          383..631
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   633 AA;  69116 MW;  A88CD512E9C49D98 CRC64;
     MKLSQLVLLS SSAEAAWRDK WVNKKLMFSA RSSGDMQNCL ETAKNTTIKH PIDLGKWSCS
     DASSGTVSCK GSCLEEGKKG QWKKMEFSVF CKPTRPQPVR KKIKGPASCK KQRSCFENVQ
     DSVVKHALKS ELDNQNIAHG YWGECRGSSS KETCRALCKE GYEPKWRSNA PVSLTLDCRK
     GKKKLTPKTS AIHGELACKP SREQLCKQPE IETPQGSLSF YGRKGDHDER HVYDLVCNDG
     SVVGQGECEN GFWGSSLGDW PNSCARFDCS QDDVNELYPL FNGQWKCTDD DGNKNCAPDC
     NNPNSRVRWL VTCSAYGDKV GWHISSGQAF VSVDECLADG EEAPVSEFKQ TTKKLPPGLE
     TCSQSINFPD GVDGARLLGG DRIVGGVTAD AHSMPWMALL GVKTSSGWVG QCAGSIIADR
     WVVTAAHCCR NIVSITAKFG EHDRWGSSAD EFALTTTRMF IHPKYYDASD DGTKMNYDIC
     LLRFDDNILD KAPNKEVVKT ACLPTEDVQH GDACWVGGWG AMRYGSGAAR VLQSVGVNIM
     DHAYCENFSN SIIPRPDDIC ATTPDEDGDG KTDGGMDACQ GDSGGALICE RNGFANVVGV
     VSRGNGCAWA GEPGLYTSTF VTRDWIRDIV SKN
//
DBGET integrated database retrieval system