ID B8C1S8_THAPS Unreviewed; 1599 AA.
AC B8C1S8;
DT 03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 03-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED92272.1};
GN ORFNames=THAPSDRAFT_22549 {ECO:0000313|EMBL:EED92272.1};
OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=35128 {ECO:0000313|EMBL:EED92272.1, ECO:0000313|Proteomes:UP000001449};
RN [1] {ECO:0000313|EMBL:EED92272.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED92272.1};
RX PubMed=15459382; DOI=10.1126/science.1101156;
RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA Wilkerson F.P., Rokhsar D.S.;
RT "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT metabolism.";
RL Science 306:79-86(2004).
RN [2] {ECO:0000313|EMBL:EED92272.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED92272.1};
RX PubMed=18923393; DOI=10.1038/nature07410;
RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA Grigoriev I.V.;
RT "The Phaeodactylum genome reveals the evolutionary history of diatom
RT genomes.";
RL Nature 456:239-244(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000642; EED92272.1; -; Genomic_DNA.
DR RefSeq; XP_002290520.1; XM_002290484.1.
DR STRING; 35128.B8C1S8; -.
DR PaxDb; 35128-Thaps22549; -.
DR EnsemblProtists; EED92272; EED92272; THAPSDRAFT_22549.
DR GeneID; 7448467; -.
DR KEGG; tps:THAPSDRAFT_22549; -.
DR eggNOG; KOG0619; Eukaryota.
DR HOGENOM; CLU_244392_0_0_1; -.
DR InParanoid; B8C1S8; -.
DR OMA; NRCADFA; -.
DR Proteomes; UP000001449; Chromosome 5.
DR Gene3D; 2.40.10.120; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 2.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR003591; Leu-rich_rpt_typical-subtyp.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR PANTHER; PTHR16083:SF92; DISEASE RESISTANCE PROTEIN (TIR-NBS-LRR CLASS) FAMILY PROTEIN; 1.
DR PANTHER; PTHR16083; LEUCINE RICH REPEAT CONTAINING PROTEIN; 1.
DR Pfam; PF13855; LRR_8; 1.
DR Pfam; PF13365; Trypsin_2; 1.
DR SMART; SM00369; LRR_TYP; 4.
DR SUPFAM; SSF52058; L domain-like; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51450; LRR; 1.
PE 4: Predicted;
KW Leucine-rich repeat {ECO:0000256|ARBA:ARBA00022614};
KW Reference proteome {ECO:0000313|Proteomes:UP000001449};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 389..418
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 616..644
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..79
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1599 AA; 179211 MW; 8AF069DA7B883A1C CRC64;
MSSDKRHTHS SQSQHSSSES LLTELVHIPP YKDRTLSAGS GYGHAAKKLP DPNETFENEN
GNGDVTPLPP SFSNRDVSNL SNQSTIDSAL SSGSRGVLYT DSGPAYLEKL RRVKNLIEQC
ESTKFPFKKK LVLANMNLKY DDVPVKRICS DKLGATLNKL SLVGNHLISI PPPLLVKLTG
LRALDLSQCN IKTVPETWDI PFLKKLNLSN NRLVEFPTEN VFRGLPELQQ LDIHGNKLCE
IVLPKDLSVL SKLDYLDIGF NILLSLPDEL SSLHTLKTLR CMNNILEIVP STICDMDLRV
LDVSSNPLLQ PPLETCERGI ESMRRYYHAL KLEEQMAGTG PNKEPMPTNN KSIFKIGLKH
ERSRNKMRKK KDSVKKTFQA SLCRTNLFRS ASEPAHSTSS DLPSTIRYPT DQAQQETPPL
RAVSFLLSEE GDLSISELKS SQTDSLVETS EVLTKVEDGG ESVLNKRDSE ASDVSVTSDW
EPESAESSVQ LVGLTEAEEK DVSDVIAVND TLKVIFVGMA ESGKTSIIKR LIEGDGAMIP
KKDERTIGVD IFEWIPSAAK GLGKLNTKIS VDRDLQSRLK GDVHVKFSMW DATHELFFSS
QTLYVLVWDM GANNSSTLPT REEEKGTFKL TYESSDDDDD DLDKEREREH RRLIRALEQD
IDEKLQFWVD CIQSSAPGAA ILPVASFDDH FSDRNGNEDA LFRCNLMRER LQKHEEKRVR
GMKQRMEEYT STFGVNSEPA QRLRKLLCPF NRPKIIFGLD GSENSVVRVS SSEYTGFTTL
AQGIIDVATG NERGGWPYPL FRGHIGVRIP RMRLEVRDVV KQMRERFKVV EWGFFLNEVK
KRGIDNIGDI TDSLHFLMNI GELSYFDEVG ERKNENPNLV RKASSLTFDG RTNVCDDQST
LGSRNITPQY YSDTDTVSPF IFLNPRWLVA CVGCILRHDL SREIYEVRRS LLKPETAYTK
NLSWNDGRYH ERELKTDVNY PVISARDACL LWDAKRYTRK AAERALQYSN DRSVTPFDFL
QRLLVRFGIF IPIDLLGLEK VDLGGRDYTR FSGYPDVSSD VVDSEEEAPK YFFLPSLLGG
GEPPDIWTFK TAESWKTTLC HSILFPDGVP PGLMERITST VLGDLYTNPS SAGDSDAFEM
PYGRGAVGQL RIKETLCWRS AFFLKLSREE VDSATGEVQQ SIVEVFATLV DQESKLCVAA
DSMGVGMRRL IFSAKGQSGD LTSKIWSGGY RHVLKKAVKY IVNEYAGLEM ERQAVCPQCL
ATRPIAQCSV WDSSTLEAFR SGSSNDKMIR CRYGHSIDIR ILCGLTDSMM SRDKVPVIET
QSFHGEADTL VSDLLKSVVI VAIWDETAQR IVHAGSGFIV DKKRGFIVTA GHNLMDNNTW
REIPGKIVIG IIPSNDSPRD HVAVYRYFAR IVAKDPSINQ TGICRLDACI LQVTTRMEND
VHTPGREIGD QPETLLMNNP EAMKREKFHQ LSVSEKFELD EAVRILGFNQ GGEGLIEPGD
ELNRCADFAR GYVVMRFAAN EVTEQTASRR LQPRSEIVVI CPTIGGHSGG PCVNQQGEVI
GILSRADPAD KQRCYLVPSS EFKPMVKEAK RILSSAPSL
//