ID J4TU57_SACK1 Unreviewed; 1594 AA.
AC J4TU57;
DT 31-OCT-2012, integrated into UniProtKB/TrEMBL.
DT 31-OCT-2012, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN Name=YNL139C {ECO:0000313|EMBL:EJT41870.1};
GN Synonyms=SKDI14G1850 {ECO:0000313|EMBL:CAI4049840.1};
GN ORFNames=SKDI_14G1850 {ECO:0000313|EMBL:CAI4049840.1}, SKUD_185505
GN {ECO:0000313|EMBL:EJT41870.1};
OS Saccharomyces kudriavzevii (strain ATCC MYA-4449 / AS 2.2408 / CBS 8840 /
OS NBRC 1802 / NCYC 2889) (Yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.
OX NCBI_TaxID=226230 {ECO:0000313|EMBL:EJT41870.1, ECO:0000313|Proteomes:UP000002753};
RN [1] {ECO:0000313|EMBL:EJT41870.1, ECO:0000313|Proteomes:UP000002753}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-4449 / AS 2.2408 / CBS 8840 / NBRC 1802 / NCYC 2889
RC {ECO:0000313|Proteomes:UP000002753}, and IFO 1802
RC {ECO:0000313|EMBL:EJT41870.1};
RX PubMed=12775844; DOI=10.1126/science.1084337;
RA Cliften P.F., Sudarsanam P., Desikan A., Fulton L., Fulton B., Majors J.,
RA Waterston R., Cohen B.A., Johnston M.;
RT "Finding functional features in Saccharomyces genomes by phylogenetic
RT footprinting.";
RL Science 301:71-76(2003).
RN [2] {ECO:0000313|EMBL:EJT41870.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=IFO 1802 {ECO:0000313|EMBL:EJT41870.1};
RA Cliften P.F., Johnston M.;
RL Submitted (APR-2003) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Proteomes:UP000002753}
RP GENOME REANNOTATION.
RC STRAIN=ATCC MYA-4449 / AS 2.2408 / CBS 8840 / NBRC 1802 / NCYC 2889
RC {ECO:0000313|Proteomes:UP000002753};
RX PubMed=22384314; DOI=10.1534/g3.111.000273;
RA Scannell D.R., Zill O.A., Rokas A., Payen C., Dunham M.J., Eisen M.B.,
RA Rine J., Johnston M., Hittinger C.T.;
RT "The awesome power of yeast evolutionary genetics: New genome sequences and
RT strain resources for the Saccharomyces sensu stricto genus.";
RL G3 (Bethesda) 1:11-25(2011).
RN [4] {ECO:0000313|EMBL:EJT41870.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=IFO 1802 {ECO:0000313|EMBL:EJT41870.1};
RA Cliften P., Hittinger C.T., Wang B., Sudarsanam P., Desikan A., Fulton L.,
RA Fulton B., Majors J., Waterston R., Cohen B.A., Johnston M.;
RL Submitted (AUG-2012) to the EMBL/GenBank/DDBJ databases.
RN [5] {ECO:0000313|EMBL:CAI4049840.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=IFO1802 {ECO:0000313|EMBL:CAI4049840.1};
RA Byrne P K.;
RL Submitted (OCT-2022) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; OX365909; CAI4049840.1; -; Genomic_DNA.
DR EMBL; AACI03001913; EJT41870.1; -; Genomic_DNA.
DR STRING; 226230.J4TU57; -.
DR HOGENOM; CLU_003123_0_0_1; -.
DR Proteomes; UP000002753; Unassembled WGS sequence.
DR Proteomes; UP001162087; Chromosome 14.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 2.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002753}.
FT DOMAIN 29..656
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 658..732
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 938..1106
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT DOMAIN 1115..1192
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1248..1274
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1385..1594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1387..1403
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1417..1439
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1444..1490
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1491..1521
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1522..1561
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1562..1583
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1594 AA; 183955 MW; 7EB7F31BCA2FB8A3 CRC64;
MAEQALLSKL NALSQEVISP ASLGQPTILT EEAVQNWPQR SRTLCSEFFA LESNDEKEDW
LRIVFIELFE FINKGEENFI LKLSDIALFI EELVNNDRQA PQASLVGKMF IAVSSTVPNI
DDTNTISLCR LIPSLHEELF KFSWVSSKLL NKEQTTLLRH LLKKSKYELK KYNLLAENSV
GYAQLVTLLI LAYHDPDNSS KVSAYLKEIY HIMGKYSLDS IRALDVILDV SSQFITEDYQ
FLTSFLQESD FWPSNHVADS SEYSALNEGG SMIAANIISF NLSQCKEQTD KENYKRYMDM
CCILIKIGFI NFYSIWDNVK PEMESLQKYT QDLETELEAE STKGIENPLA MAAALSAENE
NDEDAVVVNV DDNDKDVIPV TTNGEANLKD KQKIFQEDTL SYGKIKLLER LLVHGCVVPV
MHVLKEYPKL LYVSQSIPKY FGRVFEHLLD PLYTSMTSCC ESNGMTSALM ITRIDNGILA
HKPRLIHQYK THDPFESLEL NTKYFFYYSE WDCAFTPFTS VDDLFENSHA YLSIIGPYLA
KVPMLLSKIS RIGVADIQKK QSSESQQDTV DKWIDYVRKF IFPATPLLQN NPIATSEVYE
LMKLFPFEKR YFIYNEMMTK LSLENLPIKV SFNKAEREAK SILKALSIDT IAKESRRFAK
LISTNPLASL IPAVKQIENY DKVSELVVYT TKYFNDFAYD VLQFVLLLRL TYNRPAVQFD
GVNQAMWVQR LSIFIAGLAK NCPDMDISNI ITYILKTLHN GNIIAVSILK ELITTVGGIR
DLNEVNMKQL LMLNSGKPLK QYARHLIYDF RDDNSDIAAR LTSFFTNQNA ISEIILLLYA
LNLKANTQDS HYKILSTRCD EMNTLLWSYI ELIKHCLKTK AFEENVLSFV ELTNRFHLST
PWAFHIWRDY LDNQLNTNEN ISIEQLIEGV EFNDVDLTKI SKDLFTTFWR LSLYDIHFDK
SLYDERKKTL SGENTDQLSN RKKHLIQNQI KDILVTGISH QRAFKKTAEF VSEKSTIWDK
DCGENQIKIF LQNCVVPRVL FSPSDALFSS YFIFMAFSTE NLMSILNTFI TSNILKTLLF
CCTSSEAGNL GLFFTNVLKR LEEMRLSGEF NDQASRKLYE WHLVITEQII DLLSEKNYMS
IRNGIEFMKH VTSVFPIVKT HIQLVYTTLE ENLVNEERED IKLPSSALIG HLKARLKNAL
ELDEFCTLTE KEAEQKKILE MEMEEIKSYE TAYQNELKQM ALRKKLELNK SQRLQNDSSK
TATSDTTEPH TKEKYTYSRD EPVIPIKPSS SQWSYSKVTR HLDDINHYLA SNHLQKAISL
VENESETWNL KNLSKQNMPI FDFRNLTLEI FERYFRSLIQ NPQNPDFAEK IDALKRHIKN
LSREPYAGTL NPHSESSAPE YTKRSSRYGG VDAYGSSNYR ASGNDRSGLK NSKPSNSYPH
KRSELPARPS KGKAYNDRSR PVRSTGADRG DNFEQQRENR SREDYKKTNS QRSQLRFPEK
PSQEIKGSNK NTAYQPSSYK RDLPSDNDEK PNKRFKRDDE NRSKFQVQDY RNTRDSGANR
RPNESQRYNT NRKGNTQVLP QGPKGGNHVS RYQR
//