ID E4XKM6_OIKDI Unreviewed; 672 AA.
AC E4XKM6;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY10729.1};
GN ORFNames=GSOID_T00014238001 {ECO:0000313|EMBL:CBY10729.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY10729.1};
RN [1] {ECO:0000313|EMBL:CBY10729.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00059}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653065; CBY10729.1; -; Genomic_DNA.
DR AlphaFoldDB; E4XKM6; -.
DR InParanoid; E4XKM6; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00190; Tryp_SPc; 2.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 3.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24252; ACROSIN-RELATED; 1.
DR PANTHER; PTHR24252:SF7; HYALIN; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF00089; Trypsin; 3.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00020; Tryp_SPc; 2.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 2.
DR PROSITE; PS01180; CUB; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 2.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00059}; Reference proteome {ECO:0000313|Proteomes:UP000001307}.
FT DOMAIN 1..229
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 270..386
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 431..666
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DISULFID 331..348
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
SQ SEQUENCE 672 AA; 75119 MW; 0BE821CA01E624F7 CRC64;
MILAGVHNLD HLTADGVDDV PDEMYDNIQV RNARSYYIHP EYNSISDEHD IAVLELDEPL
TLTDYVQPAC LPDGEPRVNE YCEVAGWGSS TPDNAHQDRE HQSDNWLLQM FELGTSYPHL
PSWGFELEES KQTGNELKSG FLQIISQLEC QKKYENDDIT GNMFCAGSDR GVDTCLGDSG
GPLVCHNPAS QRWEITGVTS WGRGCGVEEF PGVYTKVVQY LGWFAQIEAG KGTPSRVDHL
QFDNDDDNND NDIFDYEASG DDLSADFYKC GFTIEPEIPH NGIPASGVIT SPNFPKKYSS
DEKCRWLITA PAGFHVELKF TNFKVEYDSS CRKDRVEIHH DGQGFFLCGN GIPEDTFTTT
EKQMEIYFSS NKLINFSGFS AIYTIKQNKQ NIGDISASFE RADNGVIVSE ADGINGICGK
PLIQPDRMVR MLGGQPIKPT QWPWLGMLLE EDGEYIHMKC GVALICRQWA ITSGDCAREL
KYGEKYVHKV KFGNMRWDQQ SEHQEELFID QIIEHPLFNG GYDYDIALVK FARKVSFNNY
IKPICMKDYV HKNKGGFNCF AAGWGMNSQT MSTRQAHSAK INIVNDGYCS NIYRKTYNTQ
QMVCTGGDNR PCQGDGGAPL ICNADNGEWF LHGISIYGPG CNKPGGGPSV FVKPSVFLTF
IEDATGGCVK SY
//