GenomeNet

Database: UniProt
Entry: K0S8W4_THAOC
LinkDB: K0S8W4_THAOC
Original site: K0S8W4_THAOC 
ID   K0S8W4_THAOC            Unreviewed;      1695 AA.
AC   K0S8W4;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   27-MAR-2024, entry version 24.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK61394.1};
GN   ORFNames=THAOC_18128 {ECO:0000313|EMBL:EJK61394.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK61394.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK61394.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK61394.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK61394.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01020062; EJK61394.1; -; Genomic_DNA.
DR   EnsemblProtists; EJK61394; EJK61394; THAOC_18128.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 1.
DR   InterPro; IPR026906; LRR_5.
DR   InterPro; IPR032675; LRR_dom_sf.
DR   PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR   PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR   Pfam; PF13306; LRR_5; 1.
DR   SUPFAM; SSF52058; L domain-like; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT   REGION          848..869
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1073..1122
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1610..1695
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1081..1118
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1622..1662
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1695 AA;  192144 MW;  DE1F6E0D16CF2198 CRC64;
     MSIQGRIRVG KLEIAYNYAQ EKFSWNEANE KIVYFDQLPL TKKVWDRVHL DKGVFARKNL
     YFDEIKQYIN KSQLFKISED DFILGYLKEC KRFNSGLRLP PEMMKTSKTG GNPQLTTESG
     AIIRHILHDN PFIGYASFNI LLQYFSVLLC DRVLAEKDFP SISTIRHAIC RLDNFDLYDL
     GQMVLKHVSS RSKFGNRIMI GAGSDDTKHD KKDAKSHFFA IVGSRGQPDP ENPYMIDPFV
     FPLTAGRGIS SNSDGNATLN IESMAPLPPE VIASLSVFSS DQAPDALKEG RLTIEKARAT
     AEERGCGELT FVNGVELRPT SIPDSFHNHQ NSAKHFSEAA MGKTEAGQHD QNHHRQFIQN
     DWDIFIRNPG LYYNTAKRIL GDLWDAEDGF QPSQKREINT RFGSNGRACG TIVQGYYVRN
     EQGVNFWTLL YSELKDYFCD FRSQRCQEQI EMIENESIVT NMHLEYEVGT FFNAHENFHN
     RPGELCQHAT FKTLELPFEL LDHSYPFWTA CCEDWTKHLV LTSRYLDSLE SEAETADDED
     KKRLQNIVKL KKEQIDAGVK AGSDELLKLV DMFYNPGLLV LFFTHPIYGP CAARAMLQRA
     KLGGLDLDEV FEHDADNAGS INVDLTWQFL DEDSKSDWEK ACYAKLEGKE SDCAHFLRQY
     GLMNAKIRDD LKMLVMERGG HARSNESDTR LMDFRDVHEP VFDCIHSAFA LTFSATRIFE
     SAHGFNRLTW DTQRPQERID NQIRYMMITS HEQRFERRGR VYSRMAVDED EFKEGKRGTA
     SHKDSKPSQI LAGEQVRNMV IRKYTSSKVA ARIPEDILAE NTVSAIVKSS KLRKKDIVYE
     AKTVNDALQR QSKKRRNKRT KNKTVDEARQ SARNLEIAHD RDWRGREQKT RKQAMLQICT
     KNWWKGVPRT MIEGEMKRVI PLFWKTFGKH LRKNKGSGTS AIYLKPGKAS QGDNHQNNLG
     MFLQLVKNIA KGKENNTLST KSRAKKLKEM KATEYDLLSE FILVDESQHH EDTLTKTQER
     IEMHRGIARA TGTAISSHAH WNRKLPAFEE RTVKFVSTET REEMAEEVAA ALGMLDDDSD
     TGPTEYMDEE YCSGSEEEEE EDSSDEEDTQ SGEEEDDREN PDMAWKAEEQ QAYAGGLMLN
     EGLQVIGANA FNGCESLRSV TLPLTVTELG WRAFVDCSSL IELQLNEGLK VIASCAFQGC
     KSLQSVTLPS TVTELGDGIF NCCSSLIEVY LNEGLQNIGA SAFAFCSALR SVTIPSTVTE
     LGVMAFSDCN KLSEVIFLEG QRLLNQEFFA CGFRREEQGL LNQEALNEMF YDEDGDFAFD
     GCTELRTIKI SISWAVSERM ARLPHKCMLS VEERIHYLSR FEFVLLQDGE FLACFPVVVS
     RAPGVNADGD YDNETEDERY EVLDTNLETA RSLYQVLQLI AFHELKESSI LLELAIWKSR
     IDGTMSIARE DCRVAIPDLE LERPTKQANR ALGRGFPCES SESKPHEFFT TASNCDLLTN
     CTPPARRQTS DPTKYHSKVA LHNLKCTMEV AEGCNPEQNR PSIPILASVK AVDPAITDDF
     GLKVERCPTP PNYTANGTPP FADRSLFNPC YPKHYYGKAY WVKCDSKVDS GQGAERSRRP
     PLPSMLSDRE RSQAGEKSWR SEDKGVTRFL GVHTDERARN GDVGKNSSSS ERGGGGAGRK
     TSLGKERGTK SEFRA
//
DBGET integrated database retrieval system