ID F0Y7F0_AURAN Unreviewed; 1780 AA.
AC F0Y7F0;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGB09190.1};
GN ORFNames=AURANDRAFT_71484 {ECO:0000313|EMBL:EGB09190.1};
OS Aureococcus anophagefferens (Harmful bloom alga).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Pelagophyceae; Pelagomonadales;
OC Aureococcus.
OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729};
RN [1] {ECO:0000313|EMBL:EGB09190.1, ECO:0000313|Proteomes:UP000002729}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729};
RX PubMed=21368207; DOI=10.1073/pnas.1016106108;
RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A.,
RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., Dill B.D.,
RA Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., Lindquist E.A.,
RA Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., Talmage S.C., Walker E.A.,
RA Koch F., Burson A.M., Marcoval M.A., Tang Y.Z., Lecleir G.R., Coyne K.J.,
RA Berg G.M., Bertrand E.M., Saito M.A., Gladyshev V.N., Grigoriev I.V.;
RT "Niche of harmful alga Aureococcus anophagefferens revealed through
RT ecogenomics.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011).
CC -!- SIMILARITY: Belongs to the aldehyde dehydrogenase family.
CC {ECO:0000256|RuleBase:RU003345}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL833126; EGB09190.1; -; Genomic_DNA.
DR RefSeq; XP_009036300.1; XM_009038052.1.
DR EnsemblProtists; EGB09190; EGB09190; AURANDRAFT_71484.
DR GeneID; 20228246; -.
DR KEGG; aaf:AURANDRAFT_71484; -.
DR eggNOG; KOG0540; Eukaryota.
DR eggNOG; KOG2456; Eukaryota.
DR eggNOG; KOG4547; Eukaryota.
DR InParanoid; F0Y7F0; -.
DR OrthoDB; 864at2759; -.
DR Proteomes; UP000002729; Unassembled WGS sequence.
DR GO; GO:0031981; C:nuclear lumen; IEA:UniProt.
DR GO; GO:0016874; F:ligase activity; IEA:InterPro.
DR GO; GO:0016620; F:oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR034733; AcCoA_carboxyl_beta.
DR InterPro; IPR016161; Ald_DH/histidinol_DH.
DR InterPro; IPR016163; Ald_DH_C.
DR InterPro; IPR029510; Ald_DH_CS_GLU.
DR InterPro; IPR016162; Ald_DH_N.
DR InterPro; IPR015590; Aldehyde_DH_dom.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR029045; ClpP/crotonase-like_dom_sf.
DR InterPro; IPR011763; COA_CT_C.
DR InterPro; IPR011762; COA_CT_N.
DR InterPro; IPR010754; OPA3-like.
DR InterPro; IPR007148; SSU_processome_Utp12.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR43842:SF2; BIOTIN-DEPENDENT ACETYL-_PROPIONYL-COENZYME A CARBOXYLASE BETA5 SUBUNIT; 1.
DR PANTHER; PTHR43842; PROPIONYL-COA CARBOXYLASE BETA CHAIN; 1.
DR Pfam; PF00171; Aldedh; 1.
DR Pfam; PF01039; Carboxyl_trans; 1.
DR Pfam; PF07047; OPA3; 1.
DR Pfam; PF04003; Utp12; 1.
DR SMART; SM00320; WD40; 3.
DR SUPFAM; SSF53720; ALDH-like; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF52096; ClpP/crotonase; 2.
DR SUPFAM; SSF101908; Putative isomerase YbhE; 1.
DR PROSITE; PS00687; ALDEHYDE_DEHYDR_GLU; 1.
DR PROSITE; PS50989; COA_CT_CTER; 1.
DR PROSITE; PS50980; COA_CT_NTER; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 1.
PE 3: Inferred from homology;
KW Oxidoreductase {ECO:0000256|RuleBase:RU003345};
KW Reference proteome {ECO:0000313|Proteomes:UP000002729};
KW WD repeat {ECO:0000256|PROSITE-ProRule:PRU00221}.
FT DOMAIN 135..391
FT /note="CoA carboxyltransferase N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS50980"
FT DOMAIN 395..636
FT /note="CoA carboxyltransferase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS50989"
FT REPEAT 792..822
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 1285..1337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1291..1337
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 1561
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10007"
SQ SEQUENCE 1780 AA; 188608 MW; CE327208048F9F79 CRC64;
MEKSELVVAQ YPLPFFIDIE QTKDGNMVTK DGSQQENDGV ERVGDRLRAF TYYEVGQSLD
EPGGGGVISM LGSFGGASIK WRRKLFDATF TPWDKALEVL ITNEPRRTDS VTMVFERRLE
YAGAPARRLL SSSQKAAVRA EVAALRAQSL VGGGERRIEA QHKKGKLTAR ERLSVLLDEG
SFVELDPFVS HTCVDFGMEH EKPPGDGVVT GHGLVDGRPV CVFAQDFTVF GGSLSSAHAS
KIVKVMERAL RVRCPVVGLS DSGGARIQEG VDSLGGYADV FQANVDASGV VPQLSLIMGP
CAGGAVYSPV MTDFLYMVRR SSYMFVTGPD VVKSVTNEAV TQEELGGAEV HTKLSGVAQG
AFDDDVAALR GARKLLGYLP SSYADAAPAV AAYDDPARED PALRLLVPDD PNVPYDMREV
VRRVVDKQSL LEIAEDYAPN ILTAFARLGG RAVGVVANNP KSKAGCLDID ASVKAARFVR
FCDAFGIPLV TFVDVPGFLP GVKQEHGGII RHGAKLLFAY SEANVPKLTV ITRKAYGGAY
DVMASKHLRG DANYAWPSAE IAVMGAKGAV EILYRDLDAD GQQAKADEYA RRFANPIVAA
SRGFVDDVID PAETRAKLCA DLDQLARKDL PNRPRKHTFT RRLSHATSLE SFRKPYKHVD
LKDSDCLNRG SEIVSEGLVL GVAIAVAAYE YEKSAHKKVV AEQKQEAREA QAAADLEARL
VRIEAGLADV SAALRDRPKA TSCIMSLALS PDALSIAVAS ASRLSVHGAS AGSRERQYVQ
QGQLARRYAC CAWSKDAALV ACGSGDGAVV VWDVKRGLVK ATLAPPGALA NAGVVGVDFS
ADGAELYACY ALPGGAFARH VARWAVAGGE PAAWDGDKRG VSALAAHPSG GAVAVGSTRV
RLLQTTGTGA KRALAGTHAT EVRRLAWTAS GRYVVSLARD ARSVLVHDCT KAKDGVWSLR
LAAPGLEVCA RAAPRAEGEE AVEVAVCLGD GRLQVASTAS PDAAPLLDGR RDALAVAFVG
DDLRAARGAA SAPSAAAVDV ARDAGVCAAV DGGAAPAKAA AAGDAPAARE PRRPAVLGFA
ALGAAPRLRP DGEDGAARKR AKAAAADDDA ALGDRLAALE ALATREEAQT AADLAADAGH
AAAADDGSGA SLAAVLDQAV RCGDDALLET VLRRTEPATV AATCAKLAPS LALPTLDALA
ARLERSPLRA ADLAGWVKAL LLQHASHLAT LPKLADTLAR LQYVVDARVA ALPKFLSLLG
KFDLLLHKKT GAAVSGAASA EVTPLNVVAA DATDDEDEEE EDDDMEEEED DDDDESEEEE
DDDDDDDSSD DDDDDSATKI DTMLASDVEH GGVAKAVLGL RRFFESGGTM SYEFRLGQLK
AFQRMLVNER ATLQAAMKAD LHKNATEGQY VEVNQVEHEC QHAIDHLKQW MTPKAVSTNL
LNVPGLSYVH PDPLGVCLVI GAWNFPILLS LQPMIGALAA GNCVCLKTPS QHYSAACSDA
MAAMLQRYLD PRAVAVVAGD RMATQAVLQE TWDHIFFTGG KYVGTMVAEA AAKHLTPCVL
ELGGKSPCVV DRSASLDVAA RRICWGMFQN AGQTCVRPDY LLVHEDVADA FAGLLLKWAT
ASYSKDPKAT EWFGRLINER AAQRDWALAS RFQYGTTSGS FVVNDVLVQG SNHALPFGGV
GPSGMGAYHG EHSFRAHSHQ KAVLYKTPYL DLDARYPPYS PLRAWTLGTV QAVRSANAIL
AVQLGLFAVA AAGLWAACPA LGEELALLLA YLRSRAEARL
//