ID K0S8W4_THAOC Unreviewed; 1695 AA.
AC K0S8W4;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK61394.1};
GN ORFNames=THAOC_18128 {ECO:0000313|EMBL:EJK61394.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK61394.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK61394.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK61394.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK61394.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01020062; EJK61394.1; -; Genomic_DNA.
DR EnsemblProtists; EJK61394; EJK61394; THAOC_18128.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 1.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF13306; LRR_5; 1.
DR SUPFAM; SSF52058; L domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT REGION 848..869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1073..1122
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1610..1695
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1081..1118
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1622..1662
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1695 AA; 192144 MW; DE1F6E0D16CF2198 CRC64;
MSIQGRIRVG KLEIAYNYAQ EKFSWNEANE KIVYFDQLPL TKKVWDRVHL DKGVFARKNL
YFDEIKQYIN KSQLFKISED DFILGYLKEC KRFNSGLRLP PEMMKTSKTG GNPQLTTESG
AIIRHILHDN PFIGYASFNI LLQYFSVLLC DRVLAEKDFP SISTIRHAIC RLDNFDLYDL
GQMVLKHVSS RSKFGNRIMI GAGSDDTKHD KKDAKSHFFA IVGSRGQPDP ENPYMIDPFV
FPLTAGRGIS SNSDGNATLN IESMAPLPPE VIASLSVFSS DQAPDALKEG RLTIEKARAT
AEERGCGELT FVNGVELRPT SIPDSFHNHQ NSAKHFSEAA MGKTEAGQHD QNHHRQFIQN
DWDIFIRNPG LYYNTAKRIL GDLWDAEDGF QPSQKREINT RFGSNGRACG TIVQGYYVRN
EQGVNFWTLL YSELKDYFCD FRSQRCQEQI EMIENESIVT NMHLEYEVGT FFNAHENFHN
RPGELCQHAT FKTLELPFEL LDHSYPFWTA CCEDWTKHLV LTSRYLDSLE SEAETADDED
KKRLQNIVKL KKEQIDAGVK AGSDELLKLV DMFYNPGLLV LFFTHPIYGP CAARAMLQRA
KLGGLDLDEV FEHDADNAGS INVDLTWQFL DEDSKSDWEK ACYAKLEGKE SDCAHFLRQY
GLMNAKIRDD LKMLVMERGG HARSNESDTR LMDFRDVHEP VFDCIHSAFA LTFSATRIFE
SAHGFNRLTW DTQRPQERID NQIRYMMITS HEQRFERRGR VYSRMAVDED EFKEGKRGTA
SHKDSKPSQI LAGEQVRNMV IRKYTSSKVA ARIPEDILAE NTVSAIVKSS KLRKKDIVYE
AKTVNDALQR QSKKRRNKRT KNKTVDEARQ SARNLEIAHD RDWRGREQKT RKQAMLQICT
KNWWKGVPRT MIEGEMKRVI PLFWKTFGKH LRKNKGSGTS AIYLKPGKAS QGDNHQNNLG
MFLQLVKNIA KGKENNTLST KSRAKKLKEM KATEYDLLSE FILVDESQHH EDTLTKTQER
IEMHRGIARA TGTAISSHAH WNRKLPAFEE RTVKFVSTET REEMAEEVAA ALGMLDDDSD
TGPTEYMDEE YCSGSEEEEE EDSSDEEDTQ SGEEEDDREN PDMAWKAEEQ QAYAGGLMLN
EGLQVIGANA FNGCESLRSV TLPLTVTELG WRAFVDCSSL IELQLNEGLK VIASCAFQGC
KSLQSVTLPS TVTELGDGIF NCCSSLIEVY LNEGLQNIGA SAFAFCSALR SVTIPSTVTE
LGVMAFSDCN KLSEVIFLEG QRLLNQEFFA CGFRREEQGL LNQEALNEMF YDEDGDFAFD
GCTELRTIKI SISWAVSERM ARLPHKCMLS VEERIHYLSR FEFVLLQDGE FLACFPVVVS
RAPGVNADGD YDNETEDERY EVLDTNLETA RSLYQVLQLI AFHELKESSI LLELAIWKSR
IDGTMSIARE DCRVAIPDLE LERPTKQANR ALGRGFPCES SESKPHEFFT TASNCDLLTN
CTPPARRQTS DPTKYHSKVA LHNLKCTMEV AEGCNPEQNR PSIPILASVK AVDPAITDDF
GLKVERCPTP PNYTANGTPP FADRSLFNPC YPKHYYGKAY WVKCDSKVDS GQGAERSRRP
PLPSMLSDRE RSQAGEKSWR SEDKGVTRFL GVHTDERARN GDVGKNSSSS ERGGGGAGRK
TSLGKERGTK SEFRA
//