ID K0RYU6_THAOC Unreviewed; 1152 AA.
AC K0RYU6;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=RAP domain-containing protein {ECO:0000259|PROSITE:PS51286};
GN ORFNames=THAOC_20863 {ECO:0000313|EMBL:EJK58973.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK58973.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK58973.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK58973.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK58973.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01023896; EJK58973.1; -; Genomic_DNA.
DR AlphaFoldDB; K0RYU6; -.
DR EnsemblProtists; EJK58973; EJK58973; THAOC_20863.
DR eggNOG; ENOG502S18V; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR InterPro; IPR013584; RAP.
DR PANTHER; PTHR21228; FAST LEU-RICH DOMAIN-CONTAINING; 1.
DR PANTHER; PTHR21228:SF40; GH07286P-RELATED; 1.
DR Pfam; PF08373; RAP; 1.
DR SMART; SM00952; RAP; 1.
DR PROSITE; PS51286; RAP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 1087..1144
FT /note="RAP"
FT /evidence="ECO:0000259|PROSITE:PS51286"
FT REGION 1..165
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 191..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 287..306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 338..387
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 446..486
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 81..99
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..165
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 291..306
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..366
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..474
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1152 AA; 126563 MW; 83339A23E1B0762A CRC64;
MGRTEKPRRR VAMDHQRAGD ALSSAGRSPA VAADGPSPSE DAATEEADAE DGAPSPRPPL
RERSPGTANK QSAETTGGAE EDAAASKRSK RADASPSRGP KRAKRHRKDG QSAKDEPDSA
LDALQESARC TILASRKDGE DASRRVSDTG ERGRSDAAHA RLRDSSVREH GALVRYSPSL
HTRRQWHWER ERLEPNGPGS ANREYDDEDG SRYQDRVPGR RRRPRFQPPE LGTSRPHRER
ECFEPYRLGS AKREYDGEDG RRCNGRYQHQ GPGQQRQCRF QPPSWAPVVH GGSGSASSRT
GRDRLSVSTT VRTAAAASIT RPPASSVSPA CNVPSWTPVV HGRNASRRGT SRARLSAGTT
TRTAAGAGIA GSGGREAGET TTAGTRPRGT RSEFLLARRT GFEVRRQVAV QVAIEIEGPI
EIEGPAVGLE QGALTIPEGS AVVHRSTMPD RNEGRGRGWD RAQGRGHGRG RGGRDRGATS
MQGRGPDFRT CQTVAELVDL AHCSLDSMSN RDIAAFWSIL PRLLRNRGAQ DPNLEEKLRC
VIGTTCSRMH NFQYRDLAQT SLGIAKTISQ VSRGNQQYRA DDPRQIIRGL FVKESQCSPV
FDRIARSAVE MLNEFDARTL SNLIYSFGLV ERNPDIGEET LFNVFGKAAV KILNTFNSQD
ISNMLLAFVK VDAKNSRLFH ETCGVISGMD LDNFKPQALA NILWSFAKSG EADPELFQAL
GNHIAVMGSL DSFKPQDLSN TAWAFATARE SNPKLFKKIG DNIAGLGSLD SFNPQELSNT
AWAFATAGDS NPKLFNKIGH HVAGLDSLNS FNPQNLSNTI WAFATAGVSY PELFNKIGNH
IAGLGSLDSF NSQALSNTVW AFATAGESNP KLFNKIGDHV TRLDSIDSFN SQNLSNTAWA
YATARVFHSR LFEKLTTAVA ARKAHFIETQ HIANLLWACA TVGYIDERLF SALAPVVASK
LDQCNGQDIA NIAWAYSVAN FPKQDLFNEG YVSALASNEK DFSTEELFQL HQWQLWQQEL
KSGIELPRSL QEKCRNVVTY ASYSESKLQN DVVGELRAAG LDLDEEVLLG SGYRIDALVK
FGGGRKVAVE VDGPFHFIDR RPAGRAILKH RQVARLDRIE VVPVPYWEWD ELKNSEMKQH
YLRVKLSNGQ IM
//