ID F0Y2I9_AURAN Unreviewed; 837 AA.
AC F0Y2I9;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGB10711.1};
GN ORFNames=AURANDRAFT_70085 {ECO:0000313|EMBL:EGB10711.1};
OS Aureococcus anophagefferens (Harmful bloom alga).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Pelagophyceae; Pelagomonadales;
OC Aureococcus.
OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729};
RN [1] {ECO:0000313|EMBL:EGB10711.1, ECO:0000313|Proteomes:UP000002729}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729};
RX PubMed=21368207; DOI=10.1073/pnas.1016106108;
RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A.,
RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., Dill B.D.,
RA Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., Lindquist E.A.,
RA Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., Talmage S.C., Walker E.A.,
RA Koch F., Burson A.M., Marcoval M.A., Tang Y.Z., Lecleir G.R., Coyne K.J.,
RA Berg G.M., Bertrand E.M., Saito M.A., Gladyshev V.N., Grigoriev I.V.;
RT "Niche of harmful alga Aureococcus anophagefferens revealed through
RT ecogenomics.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011).
CC -!- SIMILARITY: In the C-terminal section; belongs to the trehalose
CC phosphatase family. {ECO:0000256|ARBA:ARBA00006330}.
CC -!- SIMILARITY: In the N-terminal section; belongs to the
CC glycosyltransferase 20 family. {ECO:0000256|ARBA:ARBA00005409}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL833123; EGB10711.1; -; Genomic_DNA.
DR RefSeq; XP_009034302.1; XM_009036054.1.
DR AlphaFoldDB; F0Y2I9; -.
DR EnsemblProtists; EGB10711; EGB10711; AURANDRAFT_70085.
DR GeneID; 20227725; -.
DR KEGG; aaf:AURANDRAFT_70085; -.
DR eggNOG; KOG1050; Eukaryota.
DR InParanoid; F0Y2I9; -.
DR OMA; HTASHWG; -.
DR OrthoDB; 1023at2759; -.
DR Proteomes; UP000002729; Unassembled WGS sequence.
DR GO; GO:0016758; F:hexosyltransferase activity; IEA:UniProt.
DR GO; GO:0005992; P:trehalose biosynthetic process; IEA:InterPro.
DR CDD; cd03788; GT20_TPS; 1.
DR CDD; cd01627; HAD_TPP; 1.
DR Gene3D; 3.40.50.2000; Glycogen Phosphorylase B; 2.
DR Gene3D; 3.40.50.1000; HAD superfamily/HAD-like; 1.
DR InterPro; IPR001830; Glyco_trans_20.
DR InterPro; IPR036412; HAD-like_sf.
DR InterPro; IPR006379; HAD-SF_hydro_IIB.
DR InterPro; IPR023214; HAD_sf.
DR InterPro; IPR003337; Trehalose_PPase.
DR NCBIfam; TIGR01484; HAD-SF-IIB; 1.
DR NCBIfam; TIGR00685; T6PP; 1.
DR PANTHER; PTHR10788:SF106; BCDNA.GH08860; 1.
DR PANTHER; PTHR10788; TREHALOSE-6-PHOSPHATE SYNTHASE; 1.
DR Pfam; PF00982; Glyco_transf_20; 1.
DR Pfam; PF02358; Trehalose_PPase; 1.
DR SUPFAM; SSF56784; HAD-like; 1.
DR SUPFAM; SSF53756; UDP-Glycosyltransferase/glycogen phosphorylase; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000002729}.
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 837 AA; 94925 MW; A4FAB95DCA0167B1 CRC64;
MAEHVARDPS PHAATPSAAQ LSVEEQQIIA EIQELQRHLR ELKADQDEQE VPEVKEKARP
RRVLIVANRL PIRCSRDART GRWSFDDAPG GLATALRGIS MEVQFVWVGW VGCDVPPQER
ARVAERLMRE HGCFPVFLEP QLVENYYNGF CNDVLWPLFH YVPLPMYQAG AEKKFDAHLW
DAYTEANRRF AEAVMEGADV VWVHDYHLMR LPLELRRLDP RVAVAWFLHT PFPSSEIYRI
LPYRRELLEG LLHADLVGFH TYDYARHFLS ACSRVLDVST TTPKGIEFGG RFCSIGVFPI
GIDPLLIRNT LRSRAVKKRT GELGDTFEGR KIIIGVDRLD YIKGMPHKLL AFELFLSRNP
SWVGKVTLIQ VGVPTRVDVA EYQTLAKHVN ELVGRINGYS PIHYINQSIP QDELIAIYHL
ADACLVTSVR DGMNLVSHEY VAAQEDPFAK DGPGALVLSE FAGSAQSLSG AIRVNPWNTE
ELARAIHEAL TLTRVERELR WAKLHRYVTT NTASYWARSF VSEFRDVCEH PPLLSKLPKL
NAAEFRSLVG TPQTSGRIPR RLVVTDYDGT LTQIQSLPQL ATPAPVVTQL LATLARDTRN
TVYVMSGRER RFMDKWLGKL KVGLAAEFGY CHRAPDDEQG VWKSLGRELD TSWKDVVRPI
MQYFAERTPG TYIESKESSL AWHYRDADPH FGAWQAKDMQ IHMEDVMSNL PLEIIQGNRL
VEVRHVGVNK SLVLEEVLRM GPRGETADVD FDFVLCVGDD RSDEDMYQLL KAWHARRAEH
DADKGGDDAP AQLYLVHIGT GATQAEYYLE SVIELRKLLR GMASISLRDL RGAPRAF
//