GenomeNet

Database: UniProt
Entry: A0A1D8PH99_CANAL
ID   A0A1D8PH99_CANAL        Unreviewed;      1608 AA.
AC   A0A1D8PH99;
DT   18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT   18-JAN-2017, sequence version 1.
DT   24-JAN-2024, entry version 31.
DE   RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN   OrderedLocusNames=CAALFM_C204870CA {ECO:0000313|EMBL:AOW27510.1},
GN   orf19.4123 {ECO:0000313|CGD:CAL0000181452};
OS   Candida albicans (strain SC5314 / ATCC MYA-2876) (Yeast).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC   Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida.
OX   NCBI_TaxID=237561 {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559};
RN   [1] {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559};
RX   PubMed=15123810; DOI=10.1073/pnas.0401648101;
RA   Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., Magee B.B.,
RA   Newport G., Thorstenson Y.R., Agabian N., Magee P.T., Davis R.W.,
RA   Scherer S.;
RT   "The diploid genome sequence of Candida albicans.";
RL   Proc. Natl. Acad. Sci. U.S.A. 101:7329-7334(2004).
RN   [2] {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559}
RP   GENOME REANNOTATION.
RC   STRAIN=SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559};
RX   PubMed=17419877; DOI=10.1186/gb-2007-8-4-r52;
RA   van het Hoog M., Rast T.J., Martchenko M., Grindle S., Dignard D.,
RA   Hogues H., Cuomo C., Berriman M., Scherer S., Magee B.B., Whiteway M.,
RA   Chibana H., Nantel A., Magee P.T.;
RT   "Assembly of the Candida albicans genome into sixteen supercontigs aligned
RT   on the eight chromosomes.";
RL   Genome Biol. 8:RESEARCH52.1-RESEARCH52.12(2007).
RN   [3] {ECO:0000313|EMBL:AOW27510.1, ECO:0000313|Proteomes:UP000000559}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559};
RX   PubMed=24025428; DOI=10.1186/gb-2013-14-9-r97;
RA   Muzzey D., Schwartz K., Weissman J.S., Sherlock G.;
RT   "Assembly of a phased diploid Candida albicans genome facilitates allele-
RT   specific measurements and provides a simple model for repeat and indel
RT   structure.";
RL   Genome Biol. 14:RESEARCH97.1-RESEARCH97.14(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the THOC2 family.
CC       {ECO:0000256|ARBA:ARBA00007857}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CP017624; AOW27510.1; -; Genomic_DNA.
DR   RefSeq; XP_710282.2; XM_705190.2.
DR   AlphaFoldDB; A0A1D8PH99; -.
DR   SMR; A0A1D8PH99; -.
DR   STRING; 237561.A0A1D8PH99; -.
DR   EnsemblFungi; C2_04870C_A-T; C2_04870C_A-T-p1; C2_04870C_A.
DR   GeneID; 3648111; -.
DR   KEGG; cal:CAALFM_C204870CA; -.
DR   CGD; CAL0000181452; orf19.4123.
DR   VEuPathDB; FungiDB:C2_04870C_A; -.
DR   eggNOG; KOG1874; Eukaryota.
DR   InParanoid; A0A1D8PH99; -.
DR   OrthoDB; 179356at2759; -.
DR   Proteomes; UP000000559; Chromosome 2.
DR   GO; GO:0000445; C:THO complex part of transcription export complex; IBA:GO_Central.
DR   GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR   GO; GO:0006406; P:mRNA export from nucleus; IBA:GO_Central.
DR   GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR   InterPro; IPR040007; Tho2.
DR   InterPro; IPR021418; THO_THOC2_C.
DR   InterPro; IPR021726; THO_THOC2_N.
DR   InterPro; IPR032302; THOC2_N.
DR   PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR   PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR   Pfam; PF11262; Tho2; 1.
DR   Pfam; PF11732; Thoc2; 1.
DR   Pfam; PF16134; THOC2_N; 1.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000559}.
FT   DOMAIN          6..612
FT                   /note="THO complex subunit 2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF16134"
FT   DOMAIN          614..687
FT                   /note="THO complex subunit THOC2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11732"
FT   DOMAIN          877..1160
FT                   /note="THO complex subunit THOC2 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11262"
FT   REGION          326..349
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1238..1608
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          940..971
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1175..1232
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        1238..1262
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1272..1302
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1303..1323
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1337..1367
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1381..1415
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1442..1460
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1466..1514
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1515..1530
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1539..1576
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1577..1608
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1608 AA;  183694 MW;  370CDEE6C0F227E3 CRC64;
     MSLLYITEEV VENFSGSGMD SLFEVLESYQ ENDAESEEIL AQIFTELLIT FEENKLDVND
     IKKFFTQAIK SDDQARIFFQ VLNSFAVSKN IHDLLHLLFR DNKIKIETLA LHLSSDFLKS
     VEIVPKDYFQ RTLSHKIRDE YFTQKKFNLL HEEVEGYSKF TSEMHSILDS SDAEFQLDYA
     IQVMEKLIGH YDLDPNRCLS LLFQVFTGTI VPHYRFILNV LKKSRWWPNV ESDNSSLLSL
     GNGGSETAAK VIGLELVGEC GTRDLPETYK CLVAILIKEG LISFGSLYKF MGPDESEMDE
     LEAEYKKKLD RDVLVAGATA LALAAPLADE EDEEGEKGES KTKNKTSQAS TEKDLSSLLK
     SNMKFQFLKV FLGVGLYWPA IFILTKYPYL AQIDEEIPIL INRLFATMID PLYNKIRIFS
     DEEISNLQKP KGITFSRPYN TVFVEHSPVA HLFSFNPLMR GYGNRKYTYF YREFSTDLPK
     IQDIDSLIAA SNELLKFNGP NLAKDTDIFI KTCEVTRYLL SQEEDKSKVF FYFKNFIFPA
     MPLIEENSIA IDKAFEILLF FPTEDRFSLY GELYNILAKN NPLIKIAYSK AEKSTKDVLK
     RLSKENVRPM MRRLAKICFS NPLPCLLTIL QQIESYDNLN TLVVETARYF NAYGWDNLTA
     AILIRLASSR SSTYNGMSER QWVQSLASFI GKICQRYPHA IDIKTILAFI LKSFYSGDTI
     GLLVLKEMFI SMGGIQHITN LTINQIDMIN CGSSLQKIVY NTIDDLRFER RATGKYLIKC
     MNEIDAVNEL LVLLCRISND VTFTGNESYL KVLVSKSDDV NAVIRLFVTL INLYDEDLNL
     MPIQQLSDLG VPWSWAYEVW RFRGKTEKDN LSLESTSSIF DNFWKLSLHD INYTNELYDN
     ETIKLESNIK SLKDSIAINL KNKELPRTVI DKQRKDLETC EEYQKTLVDE RAKHKDENEK
     IEEQLKQISF NWFVDLSFEE FIQKCIFPRA ITSSFDAVYS ARFLFKLNSM KIDNYSLVNV
     LDLLFKSKSL FGTLCSSTPT EAENIGLFFA DILRTLQGWR DESKYGEIGL QDQDGDAITF
     DDFKKLLYDY HSLLLEEIRI GLQAPEYISR NNTIIFLKNL LAVYPTVDDH GAQIVNLIEK
     LSTTEKRNDL KLASNALLVH VKSKSKNWVP IWDFISMSEE EKEEIIKAKE EEKQRIIKEE
     AEAKRQKELE LEKEKQMKLA KEEEEKKKLL AAASLNYDSS TAGGSGGGAR TQTRTTQIGR
     TYEKYAIETK SEAHSRQTTP IPTQPRTLSS VPTSPSSLSK ENKLQERVNK MKQAYKESRF
     SSDTENKVLN SESLDGHEAT SNEKEAKQEE SNDQEKLDSE AKEKANPGQN NEETQLSTEK
     EDSAGDNSTN TSNLAAQKES EQKRSPLPPQ NEIKKSISSE DFKGPTEPKR TPLPPQTMVA
     RHKDDSYGRS EGRRAPLPPQ HEIKKQSLGG TDSSNKVGSS RPPLGQSNAN GFRNDMRTNK
     GNSSSNQQNS RQQPPASIPP SPPPPLPPIQ HHRGRNDYGN SRGYQSNRDN YGRHSNSRND
     SRPASSRTPT YDNRARNQGN RDNRNTGRNE KRNADSFGGR GYDKRPRH
//