ID K0R6L6_THAOC Unreviewed; 2441 AA.
AC K0R6L6;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 22-FEB-2023, entry version 30.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK48510.1};
DE Flags: Fragment;
GN ORFNames=THAOC_32683 {ECO:0000313|EMBL:EJK48510.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK48510.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK48510.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK48510.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK48510.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01045748; EJK48510.1; -; Genomic_DNA.
DR EnsemblProtists; EJK48510; EJK48510; THAOC_32683.
DR eggNOG; ENOG502RRCM; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 1.
DR InterPro; IPR005046; DUF285.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013519; Int_alpha_beta-p.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR PANTHER; PTHR36220:SF1; -; 1.
DR PANTHER; PTHR36220; UNNAMED PRODUCT; 1.
DR Pfam; PF03382; DUF285; 2.
DR Pfam; PF14312; FG-GAP_2; 6.
DR SMART; SM00191; Int_alpha; 4.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT REGION 364..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 573..624
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2184..2205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 573..623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EJK48510.1"
SQ SEQUENCE 2441 AA; 265410 MW; D01A160A1448A6A3 CRC64;
FTSDAELRQA MKEYLDPDTR DAAVSTYGPI ESWGVSAVED FSLLFQKAPY TALPGASTFN
DDISGWDVSS GTNFNAMFLW TSSFNQDISG WDVSSGIDFE GMFNQASSFN QDISGWDTSS
GTKFRRMFYS TQSFNQELCS CGSHFSSSKV YTDMFEGSSC PIKSQPTGAS GPWCRQCVGR
LGFTSDAELR QAMKEYLDPD TRDAAVSTYG PIESWGVSAV EDFSRLFVVG GSYTALPGAS
TFNDDISGWD VSSGTNFNAM FLWTSSFNQD ISGWDVSSGI DFEGMFNQAS SFNQDISGWD
TSSGTKFRRM FYSTQSFNQE LCSWGSHYSS SKVYTDMFDD SSCLIKLSPN NALGPWCQLC
TPTNTPTTSP TSSPSLGPSM APSTSPSLSP SKVPSTSPSL SPSEVLSTSP SLSPSEVPSA
IPSLSPSAND DYAIDIKGLS FNITTPSMVG VVNVYHRPGS HGGYERESQY WTLVYGNSSV
LASDGAVSVE LSTSVFMEIS SKHSFFVYTT FGVDETDATR DMSPEGSVYR TTDDLDYHMG
KTFNAEFGGG CLFTPRAWNG DLLFTREFLS TEPSGYPSTQ PSVEPSAKPS LSPSVLPTSE
PSDMPSYSFQ PSLSPSISSR PTLAPTKYGR GVDISGDTII VGDGINPSCG GASIYKRING
VWTVQSQLAP PVCSYEEGVT ERPIGFGFSV AVDGDSAIIG SPMDEENGFN SGAAYVYRKS
NGTSWDLETK LLASDGAPFD EFGWSVDIKG DFAIAGAFVA DIASYVAVRK IKVKLNNGES
VLNLGEIQAF DVNGVNVALG KPVTQSSTYG NFVASNVVDG DPNTISHTKL QHGAWLEVDL
QSEVDLTEVV ITNRLSPSVP EKRAYRYVKL SIIYNGGALR TCVDEFEIFD GTTLHLPISA
SVDMATESRH GTVRTYDRLV GGQDRWGYPT HMCPAHNGTT SWNLIYDMGA MVGITKYAIT
PEPDISDHSY SPRDWKMYGS NDRNSWWLLD SVENYTSWEI QERSYFDVAP LNVQKRLSSS
TLTLWSGNNR QVKQYDIGDT SSTTKLSYSL GDREMTGHAY IYGRSGTEWE QASKIDSPAD
SDDFFGHSVS IGHKVAVLGT FGYSFIYLQS PDGTWNQRDK LSNANGDFWT DVTVHGNSIL
KAGGGSALIT DYPSLFETSS ETERDTFLNR EWELISRGTV PWNNSSIGQW AVAGSKITSK
FSQGDEKHTF SEIDLFDSNW EWYTEYKVTW QPSNSSAASI QVKQVELPGL LGQERPVATI
SNAHKGKWLA NILQFSNDIS VHNGNAVGGD AFSVAKSGSR IFDGNTKRFK MHLSGDGVPA
LSFRPNSGRY SIATGIRVYT ANNDPDSDPL FYVVEGRDKP GVKIKNVGSN SCWYITDNST
IDMGDCDSSE SEYLFYTNNW GEIRSSAPNH IGRCVDPSDH VLGLLKFKTC NSYEHGENHA
QAQNQYFAFD GDLPNGIFSI VTAKTGLCMN QLVGHNITLE TCNSNITSQQ YYFKSEGRHN
VDDANGWNEV SSGKLPWVSP EARNGIGQAI SSKFNEGDDN LHFTEAKFYS SATPYYEYKI
SFPKLRSETA TSLHFAEIEL PGTLIFSHMD DGAAVSHNLT RQECAQACVD QGSLCHGIGY
SAMNGNVYLA NDCILYSSVV GNGCNNRHYQ LELFSISDSK HPEDDPYVRL PMQSLATTTS
FNFGRECVSG DKLAVYFNVT RKECRQLCVD HGSNCFGVEY HTSATRNAEC AITNSTNTNG
CDNTVLDIEI LFQGRAPTIV PVPTVSPSTS PITESPTASF ELHLLDNLEM TDAAPYDRLG
VDIGIDIVNN FIITGSPYAD TYGFIDSGAA NIYTLDEDGT SWSHLIRLQA NDLDENALFG
TAVAVSEPYA VVGNHKSGNG AAYVFGRIGE DGPWVQQAKL EDSSGSGSDQ FGVDVDIFNN
TIIVGSDHYN STGSKCGAVF IYKRKYFSWL KYQTIEPTNC TSESFFGRSV HFEAGTDSRF
IVGSNGDSSV NGAHSGSAYI YSYNENTTLW ELEAKLFAHD GQPGDSFGIS TAISSGRAIV
GAYLDDTDFG GFDVGSAYIF QKVNSTSWVQ ETKLVNSDGM SGDGFGSRVA IYRDVVVVGA
PEDDISDVGQ DTRGSAYIFA ENKETHEWDE LKKVSGAHTN VTLGTSVAIH EKTVFIGAPR
ETINNVNESG HVLIYEMSGP TIFQPTSAPS KNPTSHPSSS PISVSNSSLV LADGSVYNKC
PGGTEVSQSN CLEALITVTA DLDYNLFNSD VLSVDDWVGL PCGCFLYNNT FLNFDTNCQN
AGTNADSQLV CLTDEVLDGV CSRYESTGYH SGDIVMTLLS NVAPNGIQWA WQECAQVCLQ
YSNCLRDGRK GRSTPAPVDG TAAGASENTR QIEADLLGAG HPRGYPSPDM LRGTHADIQA
LSTKYSTNRL AKRGNPGGVQ YEPMEGTAMR LRTQDMLQGG S
//