ID A0A1Y2B0D4_9TREE Unreviewed; 2073 AA.
AC A0A1Y2B0D4;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=BCR39DRAFT_535278 {ECO:0000313|EMBL:ORY28281.1};
OS Naematelia encephala.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Tremellomycetes;
OC Tremellales; Naemateliaceae; Naematelia.
OX NCBI_TaxID=71784 {ECO:0000313|EMBL:ORY28281.1, ECO:0000313|Proteomes:UP000193986};
RN [1] {ECO:0000313|EMBL:ORY28281.1, ECO:0000313|Proteomes:UP000193986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=68-887.2 {ECO:0000313|EMBL:ORY28281.1,
RC ECO:0000313|Proteomes:UP000193986};
RG DOE Joint Genome Institute;
RA Mondo S.J., Dannebaum R.O., Kuo R.C., Labutti K., Haridas S., Kuo A.,
RA Salamov A., Ahrendt S.R., Lipzen A., Sullivan W., Andreopoulos W.B.,
RA Clum A., Lindquist E., Daum C., Ramamoorthy G.K., Gryganskyi A., Culley D.,
RA Magnuson J.K., James T.Y., O'Malley M.A., Stajich J.E., Spatafora J.W.,
RA Visel A., Grigoriev I.V.;
RT "Pervasive Adenine N6-methylation of Active Genes in Fungi.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY28281.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFC01000032; ORY28281.1; -; Genomic_DNA.
DR STRING; 71784.A0A1Y2B0D4; -.
DR InParanoid; A0A1Y2B0D4; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000193986; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000193986}.
FT DOMAIN 37..744
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 762..819
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1152..1463
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 443..464
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1078..1130
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1482..2073
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1078..1095
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1501..1519
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1536..1554
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1583..1617
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1658..1672
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1696..1922
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1931..1970
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1992..2013
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2034..2073
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2073 AA; 228429 MW; C77D008EDF034447 CRC64;
MPPRRSQRAQ VEETPAASSV SMMSELEGLS QQLASAIPTW DDGGQETITT LLESLLSSTA
SSSSSVPPIA ISHVILTLVT SSLPVEAVYE VFTTAIDSLD GEAKETQVAV FAEALSDLVD
ELHETEEDCK DIEGYEISTE EGGQRQGEKG LEVIKRLYAS GHIPSIIPNL ILTPDYPAPG
GSRNPPRLAS LSKADLYPPN MGPPGVQRAS VKRNTTLFYK QSKYNLLRES SEGFAALIVL
LTGPDALPQC STTSTDDEDE ARQKRAQRVW AKILSLIGYF NLSPPRVLDL VLEIASKNIT
NHWHFYLELL KCSPWGQTAL AAEQEGKKGK GRATGSSWAS DELRELEECL KVDGNRVLAQ
VLGVKFSFAR KTDVPKTEEG VVTGQYMLAA LLVKHRFISL ADLLPFLSPD DEQMEAICRN
WKDAIGAKQG NALANAAMLV DDEPPTANGD KPTTVEENAE PAKPPPEQRI ILLQALLAIG
DVPSAQYLLA KHPWVAQSHP AVADLIVRIV DNALEDLYVS SGAVPPEFDD VGETSFGETV
PPPLNAQSSI APHDTTLTLL APTPPPTQTK QFAFFYPGWT DGLERWHDPL DIHIRGLRWL
SLVRGLAGRS PSTMVKICRI GAAYFQSLRK DKEIDTGLGH RAKTLSEQSA VEPTPEQMRP
WLDIIRRGLL PALSASDVAA SFDLELWALI RHFPYTVRYA LYGEWRDSTC FVGSNDSCPI
AARAARQSKA EVKAQLKRVT APTTTPGQGT TAVIDRRPAR KLAKLSHANP CALWIVALNQ
VKSYSNIGEF IIEAGRYMNQ LSMDVATFTL VDALSNPHDS DPEGRKAPAT YDWRLGDSGS
PSAPLQNVAA FVADFNRRYG TMDLLPVLQF IINRLIARES VDLVILDKLL EVMSGNPPIE
NNAFSTDQLQ AFAGGPELLR EAFTATMIDI ADPAGGQGTT HKAKSTKKSL PRFLNMLKEA
GMAIPLEIAL AQTKLELLDK MEGLPIKAIA ATQDTVHDTF IRYTDLLSEQ LSPAEHIALI
PDLKSLVTDY GLKYSLAFQI LRPRLLAELS AATADDKATL RKRLATAKEA LSLQITSPVK
ETASLPPSPT SPSLDSDGER EGDVSMQDTN GDGSAGGTIP PPKKRNRSRD FPKALRPTME
QARDLLPSEV NNLLSAPFFV IFWQLVMSDI AVSSDAYAKI DSRLKSLIAR VQGFIVPSTK
ESEKQSEIKR LEARRTALGV ERDAQQKTVD VTRRRLQKES ARWFGKGIAA SNQQKALSFQ
LHQYCFLPRA VHHPADAVFV ARFIKMAHQF GTLSFSTLFA YERFFDSSLA PCLFSMTIDE
ARNYGRCLSA IMTDLDAWHK DDAIYKREAL GIPDKVEEGT QVVQLPGMFF RTKTGVEPRA
MTWNQFRQIY AKFHTNLSRA LDSCFAEGEF MHVQNALTVA LEVIKSFPVL KDHRDTLSAT
LQGLDLKHMP DLETQTDSYK KRLEALSRQR ALLSFDEFIG IHKAKPPAPP SGPAKGPPTA
PRSASNANIT ANGTSTPVPT STGTPPPAPA QTRDAQTLRR ELEEKRKEAK GVEPASVASD
KGAVSSPSAP EPKRETAIAV PSGPRATRST SSAPVGANGT TTSIKGSATN ESKPVPTGPA
HPSNALPSTM APPAPVSMEE ERRLARARKF GTIAQALPPT GPASNSDSTS LARPKVEEAI
KADSPAASPL HRATRRSGSV ESRMSERRRD AEDKERGRGS RDRDASGRDE KPKDGRTPTS
VEGETERQRQ ERLLAAREGA LDSHREGARR SSREERRRET DKERDERKAK ERERDRDRDR
DKRSDDPAGP KRKREDDAPR RIDDPRDRDR DRDRDRDRIA SHHRDGRDVR RHERRDDPPD
ERRNRRDDRR DAPPRDDPRD DRDRHRRERE TRELLPHPSR KDRSPDKAPS RDLDRRETRS
TRGSTADVQN GDRTPRREEI SNARPETRGE ANGRPDTRGP ADRIVEAPRT RPSEGARPLP
ARPERVEPPR GSKPPEPTNG VPARGPYALP PRPGASGEPS SRSASTTHLV ARMGPARERS
DDIRAPEGST AVDDRAGSRK RPMDGELITI HIR
//