ID A0A0G0AIC7_TRIHA Unreviewed; 563 AA.
AC A0A0G0AIC7;
DT 22-JUL-2015, integrated into UniProtKB/TrEMBL.
DT 22-JUL-2015, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Splicing factor U2AF subunit {ECO:0000256|RuleBase:RU364135};
DE AltName: Full=U2 snRNP auxiliary factor large subunit {ECO:0000256|RuleBase:RU364135};
GN ORFNames=THAR02_03232 {ECO:0000313|EMBL:KKP04689.1};
OS Trichoderma harzianum (Hypocrea lixii).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Hypocreaceae; Trichoderma.
OX NCBI_TaxID=5544 {ECO:0000313|EMBL:KKP04689.1, ECO:0000313|Proteomes:UP000034112};
RN [1] {ECO:0000313|Proteomes:UP000034112}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=T6776 {ECO:0000313|Proteomes:UP000034112};
RX PubMed=26067977; DOI=10.1128/genomeA.00647-15;
RA Baroncelli R., Piaggeschi G., Fiorini L., Bertolini E., Zapparata A.,
RA Pe M.E., Sarrocco S., Vannacci G.;
RT "Draft whole-genome sequence of the biocontrol agent Trichoderma harzianum
RT T6776.";
RL Genome Announc. 3:E0064715-E0064715(2015).
CC -!- FUNCTION: Necessary for the splicing of pre-mRNA.
CC {ECO:0000256|RuleBase:RU364135}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU364135}.
CC -!- SIMILARITY: Belongs to the splicing factor SR family.
CC {ECO:0000256|RuleBase:RU364135}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KKP04689.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JOKZ01000070; KKP04689.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0G0AIC7; -.
DR OMA; MTQWDIK; -.
DR OrthoDB; 101932at2759; -.
DR Proteomes; UP000034112; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR CDD; cd12232; RRM3_U2AF65; 1.
DR Gene3D; 3.30.70.330; -; 3.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR003954; RRM_dom_euk.
DR InterPro; IPR006529; U2AF_lg.
DR NCBIfam; TIGR01642; U2AF_lg; 1.
DR PANTHER; PTHR23139; RNA-BINDING PROTEIN; 1.
DR PANTHER; PTHR23139:SF9; SPLICING FACTOR U2AF 65 KDA SUBUNIT; 1.
DR Pfam; PF00076; RRM_1; 2.
DR SMART; SM00360; RRM; 2.
DR SMART; SM00361; RRM_1; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 2.
DR PROSITE; PS50102; RRM; 3.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664,
KW ECO:0000256|RuleBase:RU364135};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187,
KW ECO:0000256|RuleBase:RU364135};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU364135};
KW Reference proteome {ECO:0000313|Proteomes:UP000034112};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 236..329
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 355..434
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 464..555
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REGION 1..161
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..161
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 563 AA; 62973 MW; F918F5BC926263CF CRC64;
MNGDSYSSRD GGRRNRDYPS SRGERDDRRD RHRDRGDRDR RRSRSPDHRG GHRRPEGDVD
AYSSSRSHRD REREDRYSSA RDRRGDREWD RDRGSYRRDA RRDDDERPNR RDRDGYDDRR
RGGRGERERR GGDDGFARQE QPRRSPSPPK KREPTPDLTD VIPVLERKRR LTQWDIKPPG
YDLVTAEQAK LSGMFPLPGA PRQQPMDPTK LQAFMTQPGG QVTSAGLKAS NSRQAKRLLV
SNFASGVTED ALISFFNLQL NGLNVIESSD PCVLCQFSQD KAFAVLEFRN ASDATVALAL
DGITMEADDA QNGTANGGNH GLNIRRPKDY VMPALPDEMP YDPEVISNVV PDTVHKLCIT
NIPPFLSEDQ VIELLAAFGK PKAFVLVKDR STEESRGIAF TEYLEPSTAN EPALNSLNGM
DVGGKKLKVT KASIGPTQVA NFDVGITAIS GLASQTSNDI ERSSVIQLLN MVTPEELLDN
DDYEEICEDV QDECSKFGKV VELKVPRPSG GSRQSTGVGK IYVKFDSEES ATKALTALAG
RKFADRTVVS TYFPEENFEV GAW
//