ID G9NTB5_HYPAI Unreviewed; 2480 AA.
AC G9NTB5;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=TRIATDRAFT_241884 {ECO:0000313|EMBL:EHK45961.1};
OS Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) (Trichoderma
OS atroviride).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Hypocreaceae; Trichoderma.
OX NCBI_TaxID=452589 {ECO:0000313|EMBL:EHK45961.1, ECO:0000313|Proteomes:UP000005426};
RN [1] {ECO:0000313|EMBL:EHK45961.1, ECO:0000313|Proteomes:UP000005426}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 20476 / IMI 206040 {ECO:0000313|Proteomes:UP000005426};
RX PubMed=21501500; DOI=10.1186/gb-2011-12-4-r40;
RA Kubicek C.P., Herrera-Estrella A., Seidl-Seiboth V., Martinez D.A.,
RA Druzhinina I.S., Thon M., Zeilinger S., Casas-Flores S., Horwitz B.A.,
RA Mukherjee P.K., Mukherjee M., Kredics L., Alcaraz L.D., Aerts A., Antal Z.,
RA Atanasova L., Cervantes-Badillo M.G., Challacombe J., Chertkov O.,
RA McCluskey K., Coulpier F., Deshpande N., von Doehren H., Ebbole D.J.,
RA Esquivel-Naranjo E.U., Fekete E., Flipphi M., Glaser F.,
RA Gomez-Rodriguez E.Y., Gruber S., Han C., Henrissat B., Hermosa R.,
RA Hernandez-Onate M., Karaffa L., Kosti I., Le Crom S., Lindquist E.,
RA Lucas S., Luebeck M., Luebeck P.S., Margeot A., Metz B., Misra M.,
RA Nevalainen H., Omann M., Packer N., Perrone G., Uresti-Rivera E.E.,
RA Salamov A., Schmoll M., Seiboth B., Shapiro H., Sukno S.,
RA Tamayo-Ramos J.A., Tisch D., Wiest A., Wilkinson H.H., Zhang M.,
RA Coutinho P.M., Kenerley C.M., Monte E., Baker S.E., Grigoriev I.V.;
RT "Comparative genome sequence analysis underscores mycoparasitism as the
RT ancestral life style of Trichoderma.";
RL Genome Biol. 12:R40.1-R40.15(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHK45961.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABDG02000023; EHK45961.1; -; Genomic_DNA.
DR RefSeq; XP_013944171.1; XM_014088696.1.
DR STRING; 452589.G9NTB5; -.
DR GeneID; 25778488; -.
DR eggNOG; KOG1874; Eukaryota.
DR HOGENOM; CLU_000511_1_0_1; -.
DR OMA; QERWTCI; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000005426; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000005426}.
FT DOMAIN 144..877
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 879..954
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1236..1568
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1..142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 569..613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1147..1206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1268..1295
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1428..1451
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1629..2480
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 24..38
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 72..96
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 97..122
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..142
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..613
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1149..1165
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1185..1204
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1629..1643
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1740..1820
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1831..1849
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1850..1865
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1885..1906
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1919..1972
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1996..2017
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2112..2126
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2144..2159
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2176..2193
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2205..2224
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2259..2277
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2329..2398
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2411..2480
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2480 AA; 276786 MW; 6ECED7A8DFC05087 CRC64;
MPPKRKRFER PPGDGGRPSP HNPGDSEIAH HERDDSMNHM NQGQGQGQGR GRGRGRHQNA
GRRDSSRGHN IRAARRPSTS SSQQTPQQSP AAPKASSPPV SKQPPPPPTF AKPPPAPLPP
TNNPAESVSE NGTPTGSNYR YDNLTEEKIR SWAERGREEI VQHGVQSRED VDITELSSLF
QEFIHAVVEG RLDATDAGKC VKEILGDEAT EVNKDSYVAP HTLLLDSLAI VMDNEPEIYR
PSLRDFLVAT EVSPALMRQV LDAPLLQQLG LIRDTFARLG VRQATNLLYR QANYNLLREE
TEGYSKLVTE LFTTSSIPSP APELAEQTFE RVKALIGTFD LDVGRVLDVT LDVAAAVLIK
QFKFFVKFLR ISSWWPRSHL TLGSSVYTGG LPTWAQPDYL YWNTTEEDEE LNAQQRLARD
TAFWARAREV HLAAFFELGG REPSNLASYR PKLTNGNSSE STTDIERQWI EETKTLPPPG
NKVAAQLLGF KLLFYNSELR DKLDVLPANL LYLAALLIKV GFISLTDIYP HLSPPDENME
HVREEQTKVI EQEEKESRGG PMNALLMAGV LPQGDDDNPT PTNTSRREPL KKAEPEQKTS
GNAEAHEDSK SLPEPLEQKV RLLVQLLTIG AIPESLFILG KFPWIPELFP EVLSRIHRVL
HVSLEKVFND SRPKPFNKGA EVDCPTKEIF SADQSGVAKG SVRLTRLPVK KIWRWPYPDK
WDTNESQNYR FYWDEWADNI PVCQTVDDVF TLCNTFLNIS GVSIGKDETL LSKLASIGNK
SLSEDTSESN YSRWHDLLRR LLVPALSHTK ANVAVVNAVW DMLRRYPLTT RYSIYAEWFE
GQISRLPTMR AVFARATAET RGTMKRVSLT NISEMAKQLA KTSYSSPGVV FRVAFEQLES
YPNLIEGFVE CAKYFTDLSY DVLVWSLMNS LGKSRSRTQA DHALTTSKWL QALSRFSGKI
FKRYSAIDPI PVLQYVNDQL QRGNSTDLII LKEFITSMGG IVDSVDFTDA QVLSMAGGER
LRRHTLIRGQ DRRFDSVKSS KRLILALTDS KLAARVLLNL AQYRQSAIYQ VPEDEAHIKY
LSAVIDDSHQ ILIQYLDFIW SNLDPSAFDA LVPSINELIS SYGLDTSIAF LIGRSSLSHR
LYPWGQREAE STKENSQSTQ EATDKEGDVS MSDEPKDKSG LAASANDEQV GNNDASPGSR
STTDAKYKQK LDESLTLAPL QPIVEALKEA VRPEVWQKIN PELYSVFWTL QLGDLFCPED
MYKEEKDRLE SEGQAILRDR SDMSRRGQER KNEKRQELLK QQLSLSMELK EHRQRRTKWM
ECLAEQFQSI CPDSKIKTDS LSDILLEQCF LPRALLSPAD TEYTYKFILA LHELRAPNFK
LMSLYDRLFN ANRLRALIFT SSVREAEYLG RLINLILRDL SRWHKNETTE KGRHNKDQPR
LGAYDKEGKG PSDRPYLGFA VSLKEDGEPD TLMEHAQFKD LLFRWHKNLN TALKSCLAGT
EWMHIRNAIT VLKAVLDYFP AIDFMATQFT TQLQKITKQE AAPKTSPDSE EGHRVDLSVA
AQGAMSELQK RKSRWVMVQA FRPNAGGGSQ SEVDKSTSAA TNLRATATDF QPHTARYVNR
FTSNHQCLIK SGSDSQNHRS PANRPSTAER EDGEVQDGKS QSNSALPVKP SGSKRDVSMT
REDASGIPRS STPMAAGQGG AGSFGPRNDP RSHTLPDRPA HNLPSRPDVP IPSHFTQERY
PQNRGHDRRD MRDSRENHRQ RDGREPRETW DGRDAREGKE HRDAREPREP REPREPRNLE
SDRPDRSREY VDRRGNETMP RDAGPTDLPP RPRQQDREWG SRDSWANRSQ DRPNDTSSQL
PTAPSGPADA SEPAMNPQRA ALFAQDDTDR TRRGPEQDRG PRSRRPGPQD SAEAINPERA
ALIDDRDDGT HGRDGRERGP RIQSPRRSGR YGHEHGPPSG PYEDRHGHNF QQDSRQAGRS
ARGRSPGGGG GENYRSSKGA DRDGDRAHMD KIRDSSSAAF HRSALGQEPD HRPPLYQDQN
YGRLNPVPTT TPDIPLGPRG RGRGASRNNQ PGGPPVMSNR PDNRFSAPDA PRAPSQERHP
PSGPASGRNR RGGYEHNNNG PSAPSGTPAG PQADRMRNFG GSGGAQENPP SSQGSGAGAS
SGVHPDRLAQ MGSTSLPNAP PPPPPPPGPP PFGHGQHGQH GQHGHSHNRQ SMSSSGNAER
SGPRMSTGSL PLEPNVPTGP ASGGDRPSRS GGGRRQLAGI NNMLQQAQAS MPNEGRPTGP
RSNPPRQMLG HSDIQVLAGG DMAPPTPDRP EAQWHDSSAR GGTLNGEEAS GRGEHERARR
DRDGRNDRSK RPSRRSSRER ERGEGKEHSE HRERRSAGGT EGGAREDRDN RRSTRDGSSR
DAAAATPSSG RESRHRNDGG AGSRGGDDRS GNRVNRGNQR DGTQRTEEQR REHRDDRGRK
RRGDEADGAL LSDREKRVRR
//