Database: UniProt
Entry: G9NTB5_HYPAI
ID   G9NTB5_HYPAI            Unreviewed;      2480 AA.
AC   G9NTB5;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   22-FEB-2012, sequence version 1.
DT   27-MAR-2024, entry version 34.
DE   RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN   ORFNames=TRIATDRAFT_241884 {ECO:0000313|EMBL:EHK45961.1};
OS   Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) (Trichoderma
OS   atroviride).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC   Hypocreomycetidae; Hypocreales; Hypocreaceae; Trichoderma.
OX   NCBI_TaxID=452589 {ECO:0000313|EMBL:EHK45961.1, ECO:0000313|Proteomes:UP000005426};
RN   [1] {ECO:0000313|EMBL:EHK45961.1, ECO:0000313|Proteomes:UP000005426}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 20476 / IMI 206040 {ECO:0000313|Proteomes:UP000005426};
RX   PubMed=21501500; DOI=10.1186/gb-2011-12-4-r40;
RA   Kubicek C.P., Herrera-Estrella A., Seidl-Seiboth V., Martinez D.A.,
RA   Druzhinina I.S., Thon M., Zeilinger S., Casas-Flores S., Horwitz B.A.,
RA   Mukherjee P.K., Mukherjee M., Kredics L., Alcaraz L.D., Aerts A., Antal Z.,
RA   Atanasova L., Cervantes-Badillo M.G., Challacombe J., Chertkov O.,
RA   McCluskey K., Coulpier F., Deshpande N., von Doehren H., Ebbole D.J.,
RA   Esquivel-Naranjo E.U., Fekete E., Flipphi M., Glaser F.,
RA   Gomez-Rodriguez E.Y., Gruber S., Han C., Henrissat B., Hermosa R.,
RA   Hernandez-Onate M., Karaffa L., Kosti I., Le Crom S., Lindquist E.,
RA   Lucas S., Luebeck M., Luebeck P.S., Margeot A., Metz B., Misra M.,
RA   Nevalainen H., Omann M., Packer N., Perrone G., Uresti-Rivera E.E.,
RA   Salamov A., Schmoll M., Seiboth B., Shapiro H., Sukno S.,
RA   Tamayo-Ramos J.A., Tisch D., Wiest A., Wilkinson H.H., Zhang M.,
RA   Coutinho P.M., Kenerley C.M., Monte E., Baker S.E., Grigoriev I.V.;
RT   "Comparative genome sequence analysis underscores mycoparasitism as the
RT   ancestral life style of Trichoderma.";
RL   Genome Biol. 12:R40.1-R40.15(2011).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the THOC2 family.
CC       {ECO:0000256|ARBA:ARBA00007857}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHK45961.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ABDG02000023; EHK45961.1; -; Genomic_DNA.
DR   RefSeq; XP_013944171.1; XM_014088696.1.
DR   STRING; 452589.G9NTB5; -.
DR   GeneID; 25778488; -.
DR   eggNOG; KOG1874; Eukaryota.
DR   HOGENOM; CLU_000511_1_0_1; -.
DR   OMA; QERWTCI; -.
DR   OrthoDB; 179356at2759; -.
DR   Proteomes; UP000005426; Unassembled WGS sequence.
DR   GO; GO:0000347; C:THO complex; IEA:InterPro.
DR   GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR   GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR   InterPro; IPR040007; Tho2.
DR   InterPro; IPR021418; THO_THOC2_C.
DR   InterPro; IPR021726; THO_THOC2_N.
DR   InterPro; IPR032302; THOC2_N.
DR   PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR   PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR   Pfam; PF11262; Tho2; 1.
DR   Pfam; PF11732; Thoc2; 1.
DR   Pfam; PF16134; THOC2_N; 1.
PE   3: Inferred from homology;
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005426}.
FT   DOMAIN          144..877
FT                   /note="THO complex subunit 2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF16134"
FT   DOMAIN          879..954
FT                   /note="THO complex subunit THOC2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11732"
FT   DOMAIN          1236..1568
FT                   /note="THO complex subunit THOC2 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11262"
FT   REGION          1..142
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          569..613
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1147..1206
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1268..1295
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1428..1451
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1629..2480
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        24..38
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        72..96
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        97..122
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        123..142
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        587..613
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1149..1165
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1185..1204
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1629..1643
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1740..1820
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1831..1849
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1850..1865
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1885..1906
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1919..1972
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1996..2017
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2112..2126
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2144..2159
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2176..2193
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2205..2224
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2259..2277
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2329..2398
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2411..2480
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2480 AA;  276786 MW;  6ECED7A8DFC05087 CRC64;
     MPPKRKRFER PPGDGGRPSP HNPGDSEIAH HERDDSMNHM NQGQGQGQGR GRGRGRHQNA
     GRRDSSRGHN IRAARRPSTS SSQQTPQQSP AAPKASSPPV SKQPPPPPTF AKPPPAPLPP
     TNNPAESVSE NGTPTGSNYR YDNLTEEKIR SWAERGREEI VQHGVQSRED VDITELSSLF
     QEFIHAVVEG RLDATDAGKC VKEILGDEAT EVNKDSYVAP HTLLLDSLAI VMDNEPEIYR
     PSLRDFLVAT EVSPALMRQV LDAPLLQQLG LIRDTFARLG VRQATNLLYR QANYNLLREE
     TEGYSKLVTE LFTTSSIPSP APELAEQTFE RVKALIGTFD LDVGRVLDVT LDVAAAVLIK
     QFKFFVKFLR ISSWWPRSHL TLGSSVYTGG LPTWAQPDYL YWNTTEEDEE LNAQQRLARD
     TAFWARAREV HLAAFFELGG REPSNLASYR PKLTNGNSSE STTDIERQWI EETKTLPPPG
     NKVAAQLLGF KLLFYNSELR DKLDVLPANL LYLAALLIKV GFISLTDIYP HLSPPDENME
     HVREEQTKVI EQEEKESRGG PMNALLMAGV LPQGDDDNPT PTNTSRREPL KKAEPEQKTS
     GNAEAHEDSK SLPEPLEQKV RLLVQLLTIG AIPESLFILG KFPWIPELFP EVLSRIHRVL
     HVSLEKVFND SRPKPFNKGA EVDCPTKEIF SADQSGVAKG SVRLTRLPVK KIWRWPYPDK
     WDTNESQNYR FYWDEWADNI PVCQTVDDVF TLCNTFLNIS GVSIGKDETL LSKLASIGNK
     SLSEDTSESN YSRWHDLLRR LLVPALSHTK ANVAVVNAVW DMLRRYPLTT RYSIYAEWFE
     GQISRLPTMR AVFARATAET RGTMKRVSLT NISEMAKQLA KTSYSSPGVV FRVAFEQLES
     YPNLIEGFVE CAKYFTDLSY DVLVWSLMNS LGKSRSRTQA DHALTTSKWL QALSRFSGKI
     FKRYSAIDPI PVLQYVNDQL QRGNSTDLII LKEFITSMGG IVDSVDFTDA QVLSMAGGER
     LRRHTLIRGQ DRRFDSVKSS KRLILALTDS KLAARVLLNL AQYRQSAIYQ VPEDEAHIKY
     LSAVIDDSHQ ILIQYLDFIW SNLDPSAFDA LVPSINELIS SYGLDTSIAF LIGRSSLSHR
     LYPWGQREAE STKENSQSTQ EATDKEGDVS MSDEPKDKSG LAASANDEQV GNNDASPGSR
     STTDAKYKQK LDESLTLAPL QPIVEALKEA VRPEVWQKIN PELYSVFWTL QLGDLFCPED
     MYKEEKDRLE SEGQAILRDR SDMSRRGQER KNEKRQELLK QQLSLSMELK EHRQRRTKWM
     ECLAEQFQSI CPDSKIKTDS LSDILLEQCF LPRALLSPAD TEYTYKFILA LHELRAPNFK
     LMSLYDRLFN ANRLRALIFT SSVREAEYLG RLINLILRDL SRWHKNETTE KGRHNKDQPR
     LGAYDKEGKG PSDRPYLGFA VSLKEDGEPD TLMEHAQFKD LLFRWHKNLN TALKSCLAGT
     EWMHIRNAIT VLKAVLDYFP AIDFMATQFT TQLQKITKQE AAPKTSPDSE EGHRVDLSVA
     AQGAMSELQK RKSRWVMVQA FRPNAGGGSQ SEVDKSTSAA TNLRATATDF QPHTARYVNR
     FTSNHQCLIK SGSDSQNHRS PANRPSTAER EDGEVQDGKS QSNSALPVKP SGSKRDVSMT
     REDASGIPRS STPMAAGQGG AGSFGPRNDP RSHTLPDRPA HNLPSRPDVP IPSHFTQERY
     PQNRGHDRRD MRDSRENHRQ RDGREPRETW DGRDAREGKE HRDAREPREP REPREPRNLE
     SDRPDRSREY VDRRGNETMP RDAGPTDLPP RPRQQDREWG SRDSWANRSQ DRPNDTSSQL
     PTAPSGPADA SEPAMNPQRA ALFAQDDTDR TRRGPEQDRG PRSRRPGPQD SAEAINPERA
     ALIDDRDDGT HGRDGRERGP RIQSPRRSGR YGHEHGPPSG PYEDRHGHNF QQDSRQAGRS
     ARGRSPGGGG GENYRSSKGA DRDGDRAHMD KIRDSSSAAF HRSALGQEPD HRPPLYQDQN
     YGRLNPVPTT TPDIPLGPRG RGRGASRNNQ PGGPPVMSNR PDNRFSAPDA PRAPSQERHP
     PSGPASGRNR RGGYEHNNNG PSAPSGTPAG PQADRMRNFG GSGGAQENPP SSQGSGAGAS
     SGVHPDRLAQ MGSTSLPNAP PPPPPPPGPP PFGHGQHGQH GQHGHSHNRQ SMSSSGNAER
     SGPRMSTGSL PLEPNVPTGP ASGGDRPSRS GGGRRQLAGI NNMLQQAQAS MPNEGRPTGP
     RSNPPRQMLG HSDIQVLAGG DMAPPTPDRP EAQWHDSSAR GGTLNGEEAS GRGEHERARR
     DRDGRNDRSK RPSRRSSRER ERGEGKEHSE HRERRSAGGT EGGAREDRDN RRSTRDGSSR
     DAAAATPSSG RESRHRNDGG AGSRGGDDRS GNRVNRGNQR DGTQRTEEQR REHRDDRGRK
     RRGDEADGAL LSDREKRVRR
//