GenomeNet

Database: UniProt
Entry: K0R6L6_THAOC
LinkDB: K0R6L6_THAOC
Original site: K0R6L6_THAOC 
ID   K0R6L6_THAOC            Unreviewed;      2441 AA.
AC   K0R6L6;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   22-FEB-2023, entry version 30.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK48510.1};
DE   Flags: Fragment;
GN   ORFNames=THAOC_32683 {ECO:0000313|EMBL:EJK48510.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK48510.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK48510.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK48510.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK48510.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01045748; EJK48510.1; -; Genomic_DNA.
DR   EnsemblProtists; EJK48510; EJK48510; THAOC_32683.
DR   eggNOG; ENOG502RRCM; Eukaryota.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   CDD; cd00161; RICIN; 1.
DR   Gene3D; 2.80.10.50; -; 1.
DR   Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR   Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 1.
DR   InterPro; IPR005046; DUF285.
DR   InterPro; IPR013517; FG-GAP.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR013519; Int_alpha_beta-p.
DR   InterPro; IPR028994; Integrin_alpha_N.
DR   InterPro; IPR035992; Ricin_B-like_lectins.
DR   InterPro; IPR000772; Ricin_B_lectin.
DR   PANTHER; PTHR36220:SF1; -; 1.
DR   PANTHER; PTHR36220; UNNAMED PRODUCT; 1.
DR   Pfam; PF03382; DUF285; 2.
DR   Pfam; PF14312; FG-GAP_2; 6.
DR   SMART; SM00191; Int_alpha; 4.
DR   SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR   SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR   PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE   4: Predicted;
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   REGION          364..424
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          573..624
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2184..2205
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        573..623
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:EJK48510.1"
SQ   SEQUENCE   2441 AA;  265410 MW;  D01A160A1448A6A3 CRC64;
     FTSDAELRQA MKEYLDPDTR DAAVSTYGPI ESWGVSAVED FSLLFQKAPY TALPGASTFN
     DDISGWDVSS GTNFNAMFLW TSSFNQDISG WDVSSGIDFE GMFNQASSFN QDISGWDTSS
     GTKFRRMFYS TQSFNQELCS CGSHFSSSKV YTDMFEGSSC PIKSQPTGAS GPWCRQCVGR
     LGFTSDAELR QAMKEYLDPD TRDAAVSTYG PIESWGVSAV EDFSRLFVVG GSYTALPGAS
     TFNDDISGWD VSSGTNFNAM FLWTSSFNQD ISGWDVSSGI DFEGMFNQAS SFNQDISGWD
     TSSGTKFRRM FYSTQSFNQE LCSWGSHYSS SKVYTDMFDD SSCLIKLSPN NALGPWCQLC
     TPTNTPTTSP TSSPSLGPSM APSTSPSLSP SKVPSTSPSL SPSEVLSTSP SLSPSEVPSA
     IPSLSPSAND DYAIDIKGLS FNITTPSMVG VVNVYHRPGS HGGYERESQY WTLVYGNSSV
     LASDGAVSVE LSTSVFMEIS SKHSFFVYTT FGVDETDATR DMSPEGSVYR TTDDLDYHMG
     KTFNAEFGGG CLFTPRAWNG DLLFTREFLS TEPSGYPSTQ PSVEPSAKPS LSPSVLPTSE
     PSDMPSYSFQ PSLSPSISSR PTLAPTKYGR GVDISGDTII VGDGINPSCG GASIYKRING
     VWTVQSQLAP PVCSYEEGVT ERPIGFGFSV AVDGDSAIIG SPMDEENGFN SGAAYVYRKS
     NGTSWDLETK LLASDGAPFD EFGWSVDIKG DFAIAGAFVA DIASYVAVRK IKVKLNNGES
     VLNLGEIQAF DVNGVNVALG KPVTQSSTYG NFVASNVVDG DPNTISHTKL QHGAWLEVDL
     QSEVDLTEVV ITNRLSPSVP EKRAYRYVKL SIIYNGGALR TCVDEFEIFD GTTLHLPISA
     SVDMATESRH GTVRTYDRLV GGQDRWGYPT HMCPAHNGTT SWNLIYDMGA MVGITKYAIT
     PEPDISDHSY SPRDWKMYGS NDRNSWWLLD SVENYTSWEI QERSYFDVAP LNVQKRLSSS
     TLTLWSGNNR QVKQYDIGDT SSTTKLSYSL GDREMTGHAY IYGRSGTEWE QASKIDSPAD
     SDDFFGHSVS IGHKVAVLGT FGYSFIYLQS PDGTWNQRDK LSNANGDFWT DVTVHGNSIL
     KAGGGSALIT DYPSLFETSS ETERDTFLNR EWELISRGTV PWNNSSIGQW AVAGSKITSK
     FSQGDEKHTF SEIDLFDSNW EWYTEYKVTW QPSNSSAASI QVKQVELPGL LGQERPVATI
     SNAHKGKWLA NILQFSNDIS VHNGNAVGGD AFSVAKSGSR IFDGNTKRFK MHLSGDGVPA
     LSFRPNSGRY SIATGIRVYT ANNDPDSDPL FYVVEGRDKP GVKIKNVGSN SCWYITDNST
     IDMGDCDSSE SEYLFYTNNW GEIRSSAPNH IGRCVDPSDH VLGLLKFKTC NSYEHGENHA
     QAQNQYFAFD GDLPNGIFSI VTAKTGLCMN QLVGHNITLE TCNSNITSQQ YYFKSEGRHN
     VDDANGWNEV SSGKLPWVSP EARNGIGQAI SSKFNEGDDN LHFTEAKFYS SATPYYEYKI
     SFPKLRSETA TSLHFAEIEL PGTLIFSHMD DGAAVSHNLT RQECAQACVD QGSLCHGIGY
     SAMNGNVYLA NDCILYSSVV GNGCNNRHYQ LELFSISDSK HPEDDPYVRL PMQSLATTTS
     FNFGRECVSG DKLAVYFNVT RKECRQLCVD HGSNCFGVEY HTSATRNAEC AITNSTNTNG
     CDNTVLDIEI LFQGRAPTIV PVPTVSPSTS PITESPTASF ELHLLDNLEM TDAAPYDRLG
     VDIGIDIVNN FIITGSPYAD TYGFIDSGAA NIYTLDEDGT SWSHLIRLQA NDLDENALFG
     TAVAVSEPYA VVGNHKSGNG AAYVFGRIGE DGPWVQQAKL EDSSGSGSDQ FGVDVDIFNN
     TIIVGSDHYN STGSKCGAVF IYKRKYFSWL KYQTIEPTNC TSESFFGRSV HFEAGTDSRF
     IVGSNGDSSV NGAHSGSAYI YSYNENTTLW ELEAKLFAHD GQPGDSFGIS TAISSGRAIV
     GAYLDDTDFG GFDVGSAYIF QKVNSTSWVQ ETKLVNSDGM SGDGFGSRVA IYRDVVVVGA
     PEDDISDVGQ DTRGSAYIFA ENKETHEWDE LKKVSGAHTN VTLGTSVAIH EKTVFIGAPR
     ETINNVNESG HVLIYEMSGP TIFQPTSAPS KNPTSHPSSS PISVSNSSLV LADGSVYNKC
     PGGTEVSQSN CLEALITVTA DLDYNLFNSD VLSVDDWVGL PCGCFLYNNT FLNFDTNCQN
     AGTNADSQLV CLTDEVLDGV CSRYESTGYH SGDIVMTLLS NVAPNGIQWA WQECAQVCLQ
     YSNCLRDGRK GRSTPAPVDG TAAGASENTR QIEADLLGAG HPRGYPSPDM LRGTHADIQA
     LSTKYSTNRL AKRGNPGGVQ YEPMEGTAMR LRTQDMLQGG S
//
DBGET integrated database retrieval system