ID A0A1Y1Z9I0_9FUNG Unreviewed; 2365 AA.
AC A0A1Y1Z9I0;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=LY90DRAFT_709169 {ECO:0000313|EMBL:ORY06930.1};
OS Neocallimastix californiae.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Neocallimastix.
OX NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY06930.1, ECO:0000313|Proteomes:UP000193920};
RN [1] {ECO:0000313|EMBL:ORY06930.1, ECO:0000313|Proteomes:UP000193920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G1 {ECO:0000313|EMBL:ORY06930.1,
RC ECO:0000313|Proteomes:UP000193920};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY06930.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCOG01000435; ORY06930.1; -; Genomic_DNA.
DR STRING; 1754190.A0A1Y1Z9I0; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000193920; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000193920}.
FT DOMAIN 154..711
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 713..788
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1047..1355
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 409..459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1368..1396
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1478..1960
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1975..2365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1070..1135
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 415..430
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 431..459
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1380..1396
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1478..1702
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1704..1722
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1723..1880
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1881..1929
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1930..1954
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1983..2009
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2010..2024
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2025..2159
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2160..2209
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2211..2358
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2365 AA; 280197 MW; C0239D0123BD2944 CRC64;
MNTNNDNSSL NVDFQKIFKN WKTEQNAYKK LLKNYIIECN GETQSTSKNL IPVKVCIRNI
IKAYMNGFIS ASEVSVFFHT LYIDLVKALS IELYNIKTSG YAFMNPTNSK DPLFLPLLIV
DWIWLIDQEI EGINSNFNDQ VENAKFKSEQ KAKIAALAKE LLSKNFIPAE LMKERFEVEM
LEAVGLINSS KLFTKKSARI NTSLLYKQTK FNLLREESEG YAKLITEIST NMYTFHPFGL
SKEEIEKNEA ILNERVHLVL TNIKSLIGYF ALDPNRVLDI IFDIFIANVV QYYKFFINLL
KASPWRSIDC SNEINNKDKM DIDDEDNNQS VKLEDLRGRS SNGQILGYKF AVYQDPESPD
NASVTLYIVA ALLIKNNLVN LNDLYPHLTP ENDSMDEEYS LYLKEIKKST ERETEPNPLL
SAGSLTDDSA PSDHKISKES NKEKEEREKE KEKKEEKKKI IKTNQKAGLL AALLSIGDLK
HSKIILEISP KLGIMYHEIV HLIGRIIHVS IQPLYKKIMK IPDIEVPLEP KPYLTHHYEL
CNPLYEANLV IPKKKGSRVT RYMFFYQQWK DEIPLCNTYE ELLQNIPTFI KYVGIQLHQE
ITVLTKLVRL AKVHIKEKNI SEEYNNMWLS IISSYFLPSV SLLYSNPGFF MELGDLIKLY
PFQIRYGLYS NWKNKSYKIN KELGIIKNSA LQEIRRLMRR LSKDNTKQIG RMFGKLVHSN
PVIAFDKMLE MIEGYENLIN PLVDGCKYLT SLEFDILSFV LIERLSTTLR NRIKDDGTSL
AQWLSNLATF VGTIYKKYAN MELPGILRYI IYQLKCNNVT DLVILKELIQ KMSGIETTEN
LSDSQLDALA GGETLKKEAY NAEHLKNTKK SSLRLIDALI DNNLITPLFI SLCQQRSICV
YNTNLTHLKL IGNMFDQTND TMMQYAEFLM ANIDKHSYSK YIPTIKELCK NYYIDPETAF
YIIRPKLNYL MKKAEQKESS NSIVNKTENE KTEKTEKSEI NKSAMDIDEK ISKPMEAESS
LEVLNIWHKG LLSTIEEVST ILPVNVWEGI SPQFYVTFWQ LSLYDIYIPH SRYQDQIKKE
KQAIHALEND RDESSNANRR RRERERCLDN ISHLEKELNI QNENNKKVIA RLKKEKDYWF
ANCTNIYNII SYLIQYCLYP RCLFSNIDSI YCSKFIKLMH NLGTENFSSL TLFDRIMNVT
DITNIISSCT ENEAHHYGKF LKEILKIIYY WYNDQKLYDK EAKGDNLPGF RKKWSTGSSK
SNQAKISENE ILQFEDYRKV VYKWHIKCHK AFKTCLESTE YMNIRNSIII LNIIIDYFPI
INKVGVSIEK NVNNVVKNEE REDLKLVAKQ YAGKLSLKKS SWIPNEMFNI DPHSSKPSRT
ASTHSHDRDR ERDRDRDRDR DRERERDRER ERERDRERDR DRERDRDRDR DRERDRERER
DRERERDRER DRERDRDRDR EREKERDRDR DRDRDRERDR DREREKERDR DREKERERER
DRERERDKDR ERDKDRERDR DRDRERDRDR DRERDRDRDR EKSREKDKDK DKSIKERSDT
PSSHYKLDNL DSKSYSKERE REREKDRERD RDRERNRDRE REHDKDRERE KDREKEKEKE
RERDRNHERE KEHEKEKDHD RERERDKERE RDRERERDRE RERDRERDNK DIRENESRLS
NNDIKETSNR DVRSDRRSSS RNSQRHQRLA SDTHSLNNQK SSSKDRINRD KDLKMADRNN
NRDNHDQRNE YKDKERIERD RNERNREYRN SRDSRDWEHS NSQRNSEKDK EIEKEKEKEK
EKEREKDRDR ERERERERER ERERERERER DKERDKERDK DRERERERDK SKDTKDRNNI
KDKNNNSDKI DKDNKESSSK RLVHNSSVAS INSNNDVHKS RSSSLSSNKP NSDIHHPRSN
SFSSNKPNND IHRSRSDSTS SNKTDNEMHR SHSDSLSTSK LSNEIQRMHN DSYQHNKNNE
AHHTHNSSIQ FNKTNTNDNN SNSSKQYSKE KTTNHSDQHS NYKLNSKDSY DNQSLSRTSS
DNNINSDRNN DNRKTISNNS SNNNSQNTHI SNNNSNRNSN ANTNNINNNN SQFKNTNKQN
SNIPTNIDNN NNNIINLNKN SSKINKSNNN INNATVITSN SNTNNNNNNN NNNNKTTKIL
SHHDNNENYK KDMKTNSSSI DKLPIKDERK RERNDKEDNT REAKRRKAEG VSIQGSSSSS
KKNNNIDDNN NNNNNNTKMI NTHGSNNSNN ISIGGSSNNK INYNKLSSNS SNGSISISNS
NYNKKSQSEI KSNNEIYGNN SNKRQRFDRP SMNNSSNSTI RSNNSNNDNQ NRIYSHSNNN
NNNNNNNSSS SNNKNSMNKI YNRRE
//