ID A0A0F9ZJT2_TRIHA Unreviewed; 980 AA.
AC A0A0F9ZJT2;
DT 22-JUL-2015, integrated into UniProtKB/TrEMBL.
DT 22-JUL-2015, sequence version 1.
DT 03-MAY-2023, entry version 31.
DE RecName: Full=mRNA 3'-end-processing protein RNA14 {ECO:0000256|RuleBase:RU369035};
GN ORFNames=THAR02_07323 {ECO:0000313|EMBL:KKP00567.1};
OS Trichoderma harzianum (Hypocrea lixii).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Hypocreaceae; Trichoderma.
OX NCBI_TaxID=5544 {ECO:0000313|EMBL:KKP00567.1, ECO:0000313|Proteomes:UP000034112};
RN [1] {ECO:0000313|Proteomes:UP000034112}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=T6776 {ECO:0000313|Proteomes:UP000034112};
RX PubMed=26067977; DOI=10.1128/genomeA.00647-15;
RA Baroncelli R., Piaggeschi G., Fiorini L., Bertolini E., Zapparata A.,
RA Pe M.E., Sarrocco S., Vannacci G.;
RT "Draft whole-genome sequence of the biocontrol agent Trichoderma harzianum
RT T6776.";
RL Genome Announc. 3:E0064715-E0064715(2015).
CC -!- FUNCTION: Component of the cleavage factor IA (CFIA) complex, which is
CC involved in the endonucleolytic cleavage during polyadenylation-
CC dependent pre-mRNA 3'-end formation. {ECO:0000256|ARBA:ARBA00002863,
CC ECO:0000256|RuleBase:RU369035}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU369035}.
CC Cytoplasm {ECO:0000256|RuleBase:RU369035}. Note=Nucleus and/or
CC cytoplasm. {ECO:0000256|RuleBase:RU369035}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KKP00567.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JOKZ01000244; KKP00567.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0F9ZJT2; -.
DR OMA; VQLWSVY; -.
DR OrthoDB; 23291at2759; -.
DR Proteomes; UP000034112; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006378; P:mRNA polyadenylation; IEA:UniProtKB-UniRule.
DR Gene3D; 1.25.40.1040; -; 1.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR045243; Rna14-like.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR19980:SF0; CLEAVAGE STIMULATION FACTOR SUBUNIT 3; 1.
DR PANTHER; PTHR19980; RNA CLEAVAGE STIMULATION FACTOR; 1.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 6.
DR SUPFAM; SSF48452; TPR-like; 2.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|RuleBase:RU369035};
KW mRNA processing {ECO:0000256|RuleBase:RU369035};
KW Nucleus {ECO:0000256|RuleBase:RU369035};
KW Reference proteome {ECO:0000313|Proteomes:UP000034112}.
FT DOMAIN 590..885
FT /note="Suppressor of forked"
FT /evidence="ECO:0000259|Pfam:PF05843"
FT REGION 1..122
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 779..846
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 952..980
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 782..799
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 980 AA; 109413 MW; 1AEEF401B6506738 CRC64;
MASEGTSWGN EGFAAAEGDH TQHSDEQVDN FAQDSHVSGE SADNGTEDGG DYDPEYVPVE
SVLPDPPVPE NAAPAVLSQR PTPKPKMSGG FLVEASDDED DDQEDNAGPG EPSASQSQVG
QPTEIKMAIA PAPPLPTNAP PNVTATPVAS LDPVALFEAR IKEDPRGDLD AWENLIADHR
RNSRLDEVRK VYNRFLEVFP QSADVWVEWI KMELGMDNFV DAEQLFGRCL MTVPNVNLWT
VYLNYIRRRN DLNNDSTGQA RRTVTQSYEF VIDNIGVDRD SGNIWQDYVQ FIKSGPGQIG
GTGWQDQQKM DQLRKAYHRA INVPMSTVNN LWKEYDQFEM GLNKVTGRKF IQERSPGYMS
AKSGNIALDN ITRDLKRTNL PRLPPAPGFE GDQEFRNQVA LWKKWIEWEK EDPLVLKTDE
PKVFAQRVLY CYKQALMAMR FWPELWVDAA EWCLQNDIGD VDKELGTEFL VQGIAANPES
VLLALKHADH IEATSPLREG DKSEFAKAVR KPFDDVLNTL YALGDKAKER EKLEISTLKQ
AAALEAPVQQ SIEDDDYEEG TPKKASTEER ISAIQKGYAA ETNLLSRTIS YVWIAMARAI
RRIQGKGNQT DGGLRKVFTD ARQKGRLTSD VYVAVALLES VVYKDPVGAK IFERGARLFP
NDEGFVVEYL KFLHSKDDTT NARVVFETCV NRLVANPETL HKAKPLYAYF HKYESQYGEL
SQIHKIEARM LELFPEDPKL ANFAARYSSD KFDPIAAPVI ISKAAQMRPR LAAPIIEQPA
SARNSFPPQR EQSPRPQYIR ATASPKRPLG PDEEELNPPK RLARGSSPLK GAAGRRLDQQ
RRNQSSALHR DITFLLGILP PAHTYDSQRL SAQGMVSLLR DTPLPDYGTW KAQTGGQYRF
TAPVHGRQAS GDFPRPISPY GRIAPASNVY RQSPLRTETG NAYPTLPYGA TDTSGTPSQW
PLAPAGYSAP PPGQYGGYRF
//