ID V5E8V4_KALBG Unreviewed; 1209 AA.
AC V5E8V4;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE RecName: Full=CCHC-type domain-containing protein {ECO:0000259|PROSITE:PS50158};
GN ORFNames=PSEUBRA_SCAF25g00968 {ECO:0000313|EMBL:EST06766.1};
OS Kalmanozyma brasiliensis (strain GHG001) (Yeast) (Pseudozyma brasiliensis).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Kalmanozyma.
OX NCBI_TaxID=1365824 {ECO:0000313|EMBL:EST06766.1, ECO:0000313|Proteomes:UP000019377};
RN [1] {ECO:0000313|Proteomes:UP000019377}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GHG001 {ECO:0000313|Proteomes:UP000019377};
RX PubMed=24356824; DOI=10.1128/genomea.00920-13;
RA Oliveira J.V.D.C., dos Santos R.A.C., Borges T.A., Riano-Pachon D.M.,
RA Goldman G.H.;
RT "Draft genome sequence of Pseudozyma brasiliensis sp. nov. strain GHG001, a
RT high producer of endo-1,4-xylanase isolated from an insect pest of
RT sugarcane.";
RL Genome Announc. 1:E0092013-E0092013(2013).
CC -!- FUNCTION: Possesses 5'->3' exoribonuclease activity. Required for the
CC processing of nuclear mRNA and rRNA precursors. May promote termination
CC of transcription by RNA polymerase II. {ECO:0000256|ARBA:ARBA00025537}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI545868; EST06766.1; -; Genomic_DNA.
DR RefSeq; XP_016291755.1; XM_016436814.1.
DR AlphaFoldDB; V5E8V4; -.
DR STRING; 1365824.V5E8V4; -.
DR GeneID; 27419458; -.
DR eggNOG; KOG2044; Eukaryota.
DR HOGENOM; CLU_006038_2_1_1; -.
DR OMA; CLHYYVH; -.
DR OrthoDB; 167745at2759; -.
DR Proteomes; UP000019377; Unassembled WGS sequence.
DR GO; GO:0004534; F:5'-3' RNA exonuclease activity; IEA:UniProt.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd18673; PIN_XRN1-2-like; 1.
DR Gene3D; 1.25.40.1050; -; 1.
DR Gene3D; 3.40.50.12390; -; 2.
DR InterPro; IPR027073; 5_3_exoribonuclease.
DR InterPro; IPR041412; Xrn1_helical.
DR InterPro; IPR004859; Xrn1_N.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR12341:SF81; 5'-3' EXORIBONUCLEASE 2; 1.
DR PANTHER; PTHR12341; 5'->3' EXORIBONUCLEASE; 1.
DR Pfam; PF17846; XRN_M; 1.
DR Pfam; PF03159; XRN_N; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Exonuclease {ECO:0000256|ARBA:ARBA00022839};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Reference proteome {ECO:0000313|Proteomes:UP000019377};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 272..285
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 20..48
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 413..487
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 525..616
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 923..962
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1095..1126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1163..1209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 413..442
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..486
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 525..545
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 557..596
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..616
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 923..950
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1209 AA; 130001 MW; 50FBFA884CDE858F CRC64;
MGVPALFRWL SKKYPRIVSS VQEEDPKTMP GPDGTEVPVP LDTSTPNPNG EEFDCLYLDM
NGIVHPCTHP EGKPAPETEE EMMVEVFAYT ERVVNMVRPR RLLMMAIDGV APRAKMNQQR
SRRFRAAKEA REKHEEKEAA LAEWKAKGLP VTDDALESKK AWDSNAITPG TPFMDLLAAS
LRYWVAQKIN TDPGWKDVQV IISDASVPGE GEHKIMEHIR RQRSHPEHDP NTKHVIYGLD
ADLIMLSLAT HEPYFKVLRE DVFANDNKPK TCNLCGQPGH FAASCTGAAK KKSGEHDEIA
PIPEKKPFIL LDVATLREYL EVELNIPQLP FAFDLERAID DWVFLIFFVG NDFLPHLPSL
EIRDGAIDTL LRIWKKELPA MGGYLTNHGK VELGRAQLIL SGLASEEDEI FRRRKEDEDR
RENNKKRREE MQKRREKELD EGNFGNGSMV EVMKKRPAHE TEKPAFNGVS AQEARDQANS
SNKSYDPRKK AVVLGGDNNQ VVKDRNAARQ ANMDAAEALK AELMGGEAKK EEPEADAEPA
TKKVKTEVGA EAAVSATDKQ EADEDEKPVV AGTKRKADDV DAEVNGADAV VKTEGEDDAE
AADEDDNDDD VDGNEDVDPV VTVKKRTVNA DGTVDYEDTV KMWEPGYRER YYQEKFGVDL
SDTDFRRQVV KSYIEGLSWV LAYYYQGVPS WQWYYPFHFS PFAADFEDLA SLDIHFELGA
PFKPFEQLMG VLPADSRASI PTPFHPLMTQ SDSDIIDFYP SEFDIDMNGK KMAWQGVALL
PFIDEKRLLD ALADKYPLLS DDEVRRNGFG NNTLFVGSES RLYDFLCEEI YSKPPTSEVL
VPLNPALSGG ITGTVKPDPA CVPGSTFNSP LTSQNLGDIH NDRSISVLYD FPEQKTPHRS
VLLKGLKVPK RVLSAAEVDW VKRGSPERGR GRGRGGHGGP GRDFHRNNRE HGNGYGGGGE
GNGNYRNGGG GGGRGGYNGG NGGGYGGQGG QGGYQQDYQQ YPSNGGGGYG GYGGGGGGGY
GGAGGYTGGG YGGYGGGGGY APAPAAAPYG GNGYDAYAGY GAGGGGGGYG GYGGGSSYPP
PAGPARDDRY AAYTQPSAHV GGGRGGRHAP PPSQPNPAYA GLPSLGGPPA GAPAPYGGYG
APAAAAPAAA PYGGYGGGYG GYGAPPAQGY GGPPPRGGGR GGRGGNRGGR GGNNNASYGG
GGGAYGSFY
//