GenomeNet

Database: UniProt
Entry: K0SP19_THAOC
ID   K0SP19_THAOC            Unreviewed;       413 AA.
AC   K0SP19;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   27-MAR-2024, entry version 26.
DE   RecName: Full=OTU domain-containing protein {ECO:0000259|PROSITE:PS50802};
GN   ORFNames=THAOC_10778 {ECO:0000313|EMBL:EJK68083.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK68083.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK68083.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK68083.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK68083.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01012103; EJK68083.1; -; Genomic_DNA.
DR   AlphaFoldDB; K0SP19; -.
DR   EnsemblProtists; EJK68083; EJK68083; THAOC_10778.
DR   eggNOG; ENOG502S253; Eukaryota.
DR   OMA; KESYWGG; -.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   GO; GO:0019538; P:protein metabolic process; IEA:UniProt.
DR   CDD; cd22744; OTU; 1.
DR   Gene3D; 3.90.70.80; -; 1.
DR   InterPro; IPR003323; OTU_dom.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   PANTHER; PTHR12419; OTU DOMAIN CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR12419:SF7; OTU DOMAIN-CONTAINING PROTEIN 3; 1.
DR   Pfam; PF02338; OTU; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS50802; OTU; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..41
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           42..413
FT                   /note="OTU domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003841311"
FT   DOMAIN          137..346
FT                   /note="OTU"
FT                   /evidence="ECO:0000259|PROSITE:PS50802"
FT   REGION          42..76
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        45..76
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   413 AA;  46892 MW;  90E709EF8BEBEFFD CRC64;
     MNFLSSANTR RKRRRRPVER SDIPWALCLA ICSLFFRSAE ADPPVTSSIP RSNNRYTHYN
     DGASMGDSSS SLADISSPLN RESWSPPWNP SSRIDTQGFL TESYLRVNGE WESLANIRGK
     HGPRNRRNFE RMKEQPVRIR QVPGDGNCLF HSIAVCLYRA VNGTDIPMDS HECIGRLRDQ
     SLALRNAAVD VLQQQPTKPG VQFIDSIGPK TRRVLFLQGD EYLEAHELLN AAAAQFDLDG
     EEYCNLMRKE SYWGGGPEIV ALCNYLQRPI HIYELIPSDE KASSRASQEK YKKLKVSNQF
     SLRRMACFGS PKFDRKEPLH ILSADSRFPD VEARRMGNHF LALHPVHNQD ALLGFRRHAM
     LRGGSRSEEF IDRERHQEDV GGAVSRTKLG SNFHSKVIGS ALKWLHFLGH FAS
//