ID A0A1D8PH99_CANAL Unreviewed; 1608 AA.
AC A0A1D8PH99;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 24-JAN-2024, entry version 31.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN OrderedLocusNames=CAALFM_C204870CA {ECO:0000313|EMBL:AOW27510.1},
GN orf19.4123 {ECO:0000313|CGD:CAL0000181452};
OS Candida albicans (strain SC5314 / ATCC MYA-2876) (Yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida.
OX NCBI_TaxID=237561 {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559};
RN [1] {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559};
RX PubMed=15123810; DOI=10.1073/pnas.0401648101;
RA Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., Magee B.B.,
RA Newport G., Thorstenson Y.R., Agabian N., Magee P.T., Davis R.W.,
RA Scherer S.;
RT "The diploid genome sequence of Candida albicans.";
RL Proc. Natl. Acad. Sci. U.S.A. 101:7329-7334(2004).
RN [2] {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559}
RP GENOME REANNOTATION.
RC STRAIN=SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559};
RX PubMed=17419877; DOI=10.1186/gb-2007-8-4-r52;
RA van het Hoog M., Rast T.J., Martchenko M., Grindle S., Dignard D.,
RA Hogues H., Cuomo C., Berriman M., Scherer S., Magee B.B., Whiteway M.,
RA Chibana H., Nantel A., Magee P.T.;
RT "Assembly of the Candida albicans genome into sixteen supercontigs aligned
RT on the eight chromosomes.";
RL Genome Biol. 8:RESEARCH52.1-RESEARCH52.12(2007).
RN [3] {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559};
RX PubMed=24025428; DOI=10.1186/gb-2013-14-9-r97;
RA Muzzey D., Schwartz K., Weissman J.S., Sherlock G.;
RT "Assembly of a phased diploid Candida albicans genome facilitates allele-
RT specific measurements and provides a simple model for repeat and indel
RT structure.";
RL Genome Biol. 14:RESEARCH97.1-RESEARCH97.14(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP017624; AOW27510.1; -; Genomic_DNA.
DR RefSeq; XP_710282.2; XM_705190.2.
DR AlphaFoldDB; A0A1D8PH99; -.
DR SMR; A0A1D8PH99; -.
DR STRING; 237561.A0A1D8PH99; -.
DR EnsemblFungi; C2_04870C_A-T; C2_04870C_A-T-p1; C2_04870C_A.
DR GeneID; 3648111; -.
DR KEGG; cal:CAALFM_C204870CA; -.
DR CGD; CAL0000181452; orf19.4123.
DR VEuPathDB; FungiDB:C2_04870C_A; -.
DR eggNOG; KOG1874; Eukaryota.
DR InParanoid; A0A1D8PH99; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000000559; Chromosome 2.
DR GO; GO:0000445; C:THO complex part of transcription export complex; IBA:GO_Central.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0006406; P:mRNA export from nucleus; IBA:GO_Central.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000000559}.
FT DOMAIN 6..612
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 614..687
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 877..1160
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 326..349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1238..1608
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 940..971
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1175..1232
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1238..1262
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1272..1302
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1303..1323
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1337..1367
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1381..1415
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1442..1460
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1466..1514
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1515..1530
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1539..1576
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1577..1608
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1608 AA; 183694 MW; 370CDEE6C0F227E3 CRC64;
MSLLYITEEV VENFSGSGMD SLFEVLESYQ ENDAESEEIL AQIFTELLIT FEENKLDVND
IKKFFTQAIK SDDQARIFFQ VLNSFAVSKN IHDLLHLLFR DNKIKIETLA LHLSSDFLKS
VEIVPKDYFQ RTLSHKIRDE YFTQKKFNLL HEEVEGYSKF TSEMHSILDS SDAEFQLDYA
IQVMEKLIGH YDLDPNRCLS LLFQVFTGTI VPHYRFILNV LKKSRWWPNV ESDNSSLLSL
GNGGSETAAK VIGLELVGEC GTRDLPETYK CLVAILIKEG LISFGSLYKF MGPDESEMDE
LEAEYKKKLD RDVLVAGATA LALAAPLADE EDEEGEKGES KTKNKTSQAS TEKDLSSLLK
SNMKFQFLKV FLGVGLYWPA IFILTKYPYL AQIDEEIPIL INRLFATMID PLYNKIRIFS
DEEISNLQKP KGITFSRPYN TVFVEHSPVA HLFSFNPLMR GYGNRKYTYF YREFSTDLPK
IQDIDSLIAA SNELLKFNGP NLAKDTDIFI KTCEVTRYLL SQEEDKSKVF FYFKNFIFPA
MPLIEENSIA IDKAFEILLF FPTEDRFSLY GELYNILAKN NPLIKIAYSK AEKSTKDVLK
RLSKENVRPM MRRLAKICFS NPLPCLLTIL QQIESYDNLN TLVVETARYF NAYGWDNLTA
AILIRLASSR SSTYNGMSER QWVQSLASFI GKICQRYPHA IDIKTILAFI LKSFYSGDTI
GLLVLKEMFI SMGGIQHITN LTINQIDMIN CGSSLQKIVY NTIDDLRFER RATGKYLIKC
MNEIDAVNEL LVLLCRISND VTFTGNESYL KVLVSKSDDV NAVIRLFVTL INLYDEDLNL
MPIQQLSDLG VPWSWAYEVW RFRGKTEKDN LSLESTSSIF DNFWKLSLHD INYTNELYDN
ETIKLESNIK SLKDSIAINL KNKELPRTVI DKQRKDLETC EEYQKTLVDE RAKHKDENEK
IEEQLKQISF NWFVDLSFEE FIQKCIFPRA ITSSFDAVYS ARFLFKLNSM KIDNYSLVNV
LDLLFKSKSL FGTLCSSTPT EAENIGLFFA DILRTLQGWR DESKYGEIGL QDQDGDAITF
DDFKKLLYDY HSLLLEEIRI GLQAPEYISR NNTIIFLKNL LAVYPTVDDH GAQIVNLIEK
LSTTEKRNDL KLASNALLVH VKSKSKNWVP IWDFISMSEE EKEEIIKAKE EEKQRIIKEE
AEAKRQKELE LEKEKQMKLA KEEEEKKKLL AAASLNYDSS TAGGSGGGAR TQTRTTQIGR
TYEKYAIETK SEAHSRQTTP IPTQPRTLSS VPTSPSSLSK ENKLQERVNK MKQAYKESRF
SSDTENKVLN SESLDGHEAT SNEKEAKQEE SNDQEKLDSE AKEKANPGQN NEETQLSTEK
EDSAGDNSTN TSNLAAQKES EQKRSPLPPQ NEIKKSISSE DFKGPTEPKR TPLPPQTMVA
RHKDDSYGRS EGRRAPLPPQ HEIKKQSLGG TDSSNKVGSS RPPLGQSNAN GFRNDMRTNK
GNSSSNQQNS RQQPPASIPP SPPPPLPPIQ HHRGRNDYGN SRGYQSNRDN YGRHSNSRND
SRPASSRTPT YDNRARNQGN RDNRNTGRNE KRNADSFGGR GYDKRPRH
//