ID F0Y584_AURAN Unreviewed; 1440 AA.
AC F0Y584;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 13-SEP-2023, entry version 30.
DE RecName: Full=Exonuclease 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=AURANDRAFT_71308 {ECO:0000313|EMBL:EGB09453.1};
OS Aureococcus anophagefferens (Harmful bloom alga).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Pelagophyceae; Pelagomonadales;
OC Aureococcus.
OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729};
RN [1] {ECO:0000313|EMBL:EGB09453.1, ECO:0000313|Proteomes:UP000002729}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729};
RX PubMed=21368207; DOI=10.1073/pnas.1016106108;
RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A.,
RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., Dill B.D.,
RA Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., Lindquist E.A.,
RA Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., Talmage S.C., Walker E.A.,
RA Koch F., Burson A.M., Marcoval M.A., Tang Y.Z., Lecleir G.R., Coyne K.J.,
RA Berg G.M., Bertrand E.M., Saito M.A., Gladyshev V.N., Grigoriev I.V.;
RT "Niche of harmful alga Aureococcus anophagefferens revealed through
RT ecogenomics.";
RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL833125; EGB09453.1; -; Genomic_DNA.
DR RefSeq; XP_009035520.1; XM_009037272.1.
DR EnsemblProtists; EGB09453; EGB09453; AURANDRAFT_71308.
DR GeneID; 20228159; -.
DR KEGG; aaf:AURANDRAFT_71308; -.
DR eggNOG; ENOG502SAF2; Eukaryota.
DR InParanoid; F0Y584; -.
DR OrthoDB; 68365at2759; -.
DR Proteomes; UP000002729; Unassembled WGS sequence.
DR CDD; cd15482; Sialidase_non-viral; 1.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 3.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR013519; Int_alpha_beta-p.
DR InterPro; IPR028994; Integrin_alpha_N.
DR PANTHER; PTHR36220:SF1; -; 1.
DR PANTHER; PTHR36220; UNNAMED PRODUCT; 1.
DR Pfam; PF14312; FG-GAP_2; 7.
DR SMART; SM00191; Int_alpha; 7.
DR SUPFAM; SSF110296; Oligoxyloglucan reducing end-specific cellobiohydrolase; 1.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000002729};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1440
FT /note="Exonuclease 1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003264386"
SQ SEQUENCE 1440 AA; 150687 MW; 53E816622D852E03 CRC64;
MRAWQSIVIL FLLGSLAPKA SGQQLAKLLA SDAAAEDYFG YSVAISGDLV VVGAYRDDDA
GSKTGSAYVF RTTNDGGSWT QTAKLVASDA AKDNYFGKSV AISGDLVVVG ADGNNEDVGS
AYVFRTRNGG ASWSQTAKLL ASDAAKDDVF GESVAISGDL VVVGAYGNND AGSSSGSAYV
FRTRNSGASW TQTAKLLASD AAAEDYFGYS VAISGDLVVV GAYRDDDAGS KTGSAYVFRT
TNDGGSWTQT AKLVASDAAK DNYFGKSVAI SGDLVVVGAD GNNEDVGSAY VFRTRNGGAS
WSQTAKLLAS DAAKDDVFGE SVAISGDLVV VGAYGNDDAG SSSGSAYVFR TRNSGASWTQ
TAKLVASDAA AKDNFGYSVA ISGDLVVVGA YEDDDAGSSS GSAYVLPADG SAVTPRPTLR
PTRGGTSGGG TDTLVVVAAA VAGALVAVLA CLCIKRRHGA PPERNVPFAP AWMAPGDGLE
THSVEELKAM ANTRGVSLAG RFNATMPLHI LTAIALLSAA HQGIKVAASS SNLGAPSSKT
AATPKKRFLR SSASAQDLVK SKEPVQLDKV AHAQESLVAL SGDSSSSDVE GAASLKDRFL
RVSAQDLLAV KESVQLDDVA HGQESLVALS GDSPSSDVEA AASLKGRFLR SSTQDILASR
HPFMHSNWYN TDEKQAERVV VCDFAAVAYH VLSKVETDAS KLLGGQYAVM ARETRAFVER
FERVGAALVF VVGRTATHAD HLKHGHAAHT LASKAIHEAR AVELVAAGRA DEAAAHSKKA
LAPMKNGAVV DALLDLAAEG RVEVLFGDDE DDPGVAYEAA ARHGWVLSGD TDMVAYRYEA
VGSIQGVIFL KDLDWTTAGL TFLHTTPALV AAALGLVRDG VGRGELMPLV ATLLGNDYVD
DIELATVHDH ALHNALLKQQ ERWHALDPAV HREAMRELPG NVRRPCLLGH RCKNTDCPYD
HSRSLRFFCN ANVNDDNNIG PNYCTRPYTC RWRHRGDAGR VPVWCAQVPG PAVADAAAAA
DAVARPVLLR GEPTGATTRW VADVRAARDV VANLRHVGAE ASVERQFSWA IIRGVADAVA
GHAYHIDDES AVLDAVCGAG TDLADKVRTA VAAYEPRRAD LRVADAPYAV GDGVAARYAY
HQMLSPLRDR HLGDHFDAFA HFPRRLGDLK LGLVHGAGAA GVCWLMQADG SIALVERRAS
SPEALPDLAQ LLASDAARRD GVLALLGASD LGDLPAPLLW PLATLRFWLA AGGEAMPGVE
PPPTLGPRQI RAIVEATTQR CLEARLGCDA REPDRWAPPL VGALAEPAFR ALMAWVRALE
GVRGLLAIGA TGDRPELPAA VFDGELLLAI WNAHAGGGAT PTSAWERYDE LAARAAGAAP
LVDALVARVT EGLAVDAMPQ VAFQARSVSE TVDSWEDEGD DAPTLPVDAN AAWDAVVGGV
//