ID G8YGJ3_PICSO Unreviewed; 993 AA.
AC G8YGJ3;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE RecName: Full=5'-3' exoribonuclease {ECO:0000256|PIRNR:PIRNR037239};
DE EC=3.1.13.- {ECO:0000256|PIRNR:PIRNR037239};
GN Name=Piso0_003664 {ECO:0000313|EMBL:CCE81310.1};
GN ORFNames=GNLVRS01_PISO0G17298g {ECO:0000313|EMBL:CCE80545.1},
GN GNLVRS01_PISO0H17299g {ECO:0000313|EMBL:CCE81310.1};
OS Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC
OS 10061 / NRRL Y-12695) (Hybrid yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Debaryomycetaceae; Millerozyma.
OX NCBI_TaxID=559304 {ECO:0000313|EMBL:CCE81310.1, ECO:0000313|Proteomes:UP000005222};
RN [1] {ECO:0000313|EMBL:CCE81310.1}
RP NUCLEOTIDE SEQUENCE.
RA Genoscope - CEA;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000005222}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL
RC Y-12695 {ECO:0000313|Proteomes:UP000005222};
RX DOI=10.1534/g3.111.000745;
RA Leh Louis V., Despons L., Friedrich A., Martin T., Durrens P.,
RA Casaregola S., Neuveglise C., Fairhead C., Marck C., Cruz J.A.,
RA Straub M.L., Kugler V., Sacerdot C., Uzunov Z., Thierry A., Weiss S.,
RA Bleykasten C., De Montigny J., Jacques N., Jung P., Lemaire M., Mallet S.,
RA Morel G., Richard G.F., Sarkar A., Savel G., Schacherer J., Seret M.L.,
RA Talla E., Samson G., Jubin C., Poulain J., Vacherie B., Barbe V.,
RA Pelletier E., Sherman D.J., Westhof E., Weissenbach J., Baret P.V.,
RA Wincker P., Gaillardin C., Dujon B., Souciet J.L.;
RT "Pichia sorbitophila, an interspecies yeast hybrid reveals early steps of
RT genome resolution following polyploidization.";
RL G3 (Bethesda) 2:299-311(2012).
CC -!- FUNCTION: Possesses 5'->3' exoribonuclease activity. May promote
CC termination of transcription by RNA polymerase II.
CC {ECO:0000256|PIRNR:PIRNR037239}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the 5'-3' exonuclease family. XRN2/RAT1
CC subfamily. {ECO:0000256|ARBA:ARBA00006994,
CC ECO:0000256|PIRNR:PIRNR037239}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FO082053; CCE80545.1; -; Genomic_DNA.
DR EMBL; FO082052; CCE81310.1; -; Genomic_DNA.
DR AlphaFoldDB; G8YGJ3; -.
DR STRING; 559304.G8YGJ3; -.
DR eggNOG; KOG2044; Eukaryota.
DR HOGENOM; CLU_006038_1_1_1; -.
DR InParanoid; G8YGJ3; -.
DR Proteomes; UP000005222; Chromosome G.
DR Proteomes; UP000005222; Chromosome H.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0004534; F:5'-3' RNA exonuclease activity; IEA:UniProtKB-UniRule.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006353; P:DNA-templated transcription termination; IEA:UniProtKB-KW.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-UniRule.
DR CDD; cd18673; PIN_XRN1-2-like; 1.
DR Gene3D; 1.25.40.1050; -; 1.
DR Gene3D; 3.40.50.12390; -; 2.
DR InterPro; IPR027073; 5_3_exoribonuclease.
DR InterPro; IPR041412; Xrn1_helical.
DR InterPro; IPR004859; Xrn1_N.
DR InterPro; IPR017151; Xrn2/3/4.
DR PANTHER; PTHR12341:SF41; 5'-3' EXORIBONUCLEASE 1; 1.
DR PANTHER; PTHR12341; 5'->3' EXORIBONUCLEASE; 1.
DR Pfam; PF17846; XRN_M; 1.
DR Pfam; PF03159; XRN_N; 1.
DR PIRSF; PIRSF037239; Exonuclease_Xrn2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Exonuclease {ECO:0000256|ARBA:ARBA00022839, ECO:0000256|PIRNR:PIRNR037239};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PIRNR:PIRNR037239};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664,
KW ECO:0000256|PIRNR:PIRNR037239};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722, ECO:0000256|PIRNR:PIRNR037239};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000005222};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552};
KW Transcription {ECO:0000256|ARBA:ARBA00022472};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00022472};
KW Transcription termination {ECO:0000256|ARBA:ARBA00022472}.
FT DOMAIN 1..251
FT /note="Xrn1 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF03159"
FT DOMAIN 310..817
FT /note="Xrn1 helical"
FT /evidence="ECO:0000259|Pfam:PF17846"
FT REGION 108..129
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 491..537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 887..929
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 950..993
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 698..725
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 113..129
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 491..508
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 509..537
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 961..984
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 993 AA; 114894 MW; 2C23A8CF424C144B CRC64;
MGVPALFRWL SRKYPKIISP VIEDELDEEY GGSRYTDPNP NGEIDNLYLD MNGIVHPCSH
PEHKPPPETE DEMFLDVFKY TDRVLMMARP RKVLVIAVDG VAPRAKMNQQ RSRRFRSAQD
AKIAHEEKER QINERELRGE MIDEAIKGKK SWDSNAITPG TPFMDRLALA LRYWVAYKLS
TEPGWANLQV IISDATVPGE GEHKLMSFIR SQRSDPEYNP NTSHCIYGLD ADLIFLGLAT
HEPHFRVLRE DVFANQSRQL RVSDQLSMSE DQKDALKKKE ERKPFLWLHL NVLREYLEIE
LYNPYISFPF DLERAIDDWV FICFFAGNDF LPHLPSLDVR DNGIDNLVRC WKRILPKLSD
YITCDGNLNL ESVEKLLEGL ASKEDEIFRR KHEQELRQEE NERRRKETLD EEKALKKRYV
SQISKGHDKA PLYADVNMPL LTTSGENVDG FAQLSNKDIV ANRDIITKAN MANANAAEAL
KKLLDSKSNK ANEGQISASS ALNQEIDSKS ETENGDNNRK RGFDEVQSDS DDQRLDNDKI
RMWEPGYRER YYKSKFHITS EEDIDKVRKD MVKHYLEGIS WVLLYYYQGC PSWQWYYPYH
YAPFAADFKN IREIVGEEGI TFKLGQPFRP YEQLMSVLPA ASGHNLPEVF RKLMSDPESE
IIDFYPEEFE IDMNGKKMSW QGIALLPFID EKRLLEAVQK KYELLTEAEK ERNTLKDEVL
IISKQHKMYE TFCKELYEKS QNEVEFSFGK TGLAGGVRKA AFDINGVFKY PLNQGEMGDL
DNRNFLLLFF HVPQKKHGKS MILNGYIPHT KVLNQEDRDA IIYGPQRNNG YRFRKPNDNS
DYINTGPSGK DEYLIYSMRR GGYRAFLEHL NNRTSYYEQQ SNASRMVNSR YGNNGYNQRQ
GGYNGNDYNQ GPYKNNYRNY NNYNNYGPKG AGQYNNSGYS GNYSRSGPNN AYDVGGRGRS
SGPSGYSNYA NQGYNGSYYN GANQGPKNGY RRR
//