ID C1G654_PARBD Unreviewed; 2572 AA.
AC C1G654;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=PADG_02659 {ECO:0000313|EMBL:EEH46561.1};
OS Paracoccidioides brasiliensis (strain Pb18).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Onygenales incertae sedis; Paracoccidioides.
OX NCBI_TaxID=502780 {ECO:0000313|EMBL:EEH46561.1, ECO:0000313|Proteomes:UP000001628};
RN [1] {ECO:0000313|EMBL:EEH46561.1, ECO:0000313|Proteomes:UP000001628}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Pb18 {ECO:0000313|EMBL:EEH46561.1,
RC ECO:0000313|Proteomes:UP000001628};
RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345;
RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., Goldberg J.,
RA Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., Grynberg M.,
RA Gujja S., Heiman D.I., Henn M.R., Kodira C.D., Leon-Narvaez H.,
RA Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., Morais F.V., Pereira M.,
RA Rodriguez-Brito S., Sakthikumar S., Salem-Izacc S.M., Sykes S.M.,
RA Teixeira M.M., Vallejo M.C., Walter M.E., Yandava C., Young S., Zeng Q.,
RA Zucker J., Felipe M.S., Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G.,
RA Puccia R., San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.;
RT "Comparative genomic analysis of human fungal pathogens causing
RT paracoccidioidomycosis.";
RL PLoS Genet. 7:E1002345-E1002345(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN275959; EEH46561.1; -; Genomic_DNA.
DR RefSeq; XP_010758555.1; XM_010760253.1.
DR STRING; 502780.C1G654; -.
DR GeneID; 22582102; -.
DR KEGG; pbn:PADG_02659; -.
DR VEuPathDB; FungiDB:PADG_02659; -.
DR eggNOG; KOG1874; Eukaryota.
DR HOGENOM; CLU_000511_1_0_1; -.
DR InParanoid; C1G654; -.
DR OMA; QERWTCI; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000001628; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001628}.
FT DOMAIN 134..876
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 878..953
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1247..1552
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 586..615
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 681..703
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1143..1226
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1583..1642
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1663..2572
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 33..54
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 65..123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 588..615
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1190..1222
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1592..1627
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1666..1698
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1711..1760
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1795..1905
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1935..2006
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2030..2060
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2074..2089
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2091..2107
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2108..2130
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2155..2172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2173..2193
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2234..2270
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2298..2412
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2413..2428
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2429..2447
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2454..2469
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2477..2514
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2572 AA; 287107 MW; 63774E6DD030F74C CRC64;
MPSAAGGKRK RGDRSWSGDA GNDGWRPSPH RPGNLNLAQQ HQQNQFNGQS RDSTDNRNRG
GRRTSRGGNR NGTPARRASE GHNSHQKENS PSLRSAMISQ RDPPELSQPS RMPSTPIPTS
SSQPQVMFYS YEYLTRECVD AWTLTGRKAV IDIGVGARKA EDALVISSVF QEIIQSVIDG
RISAVDGGTS VKEIIGEDVA DDMPSVDGST GTSQTFDVRS LFLDTLSVTA DANTSHSSLR
TLVFSTGISP TLMRQQLETP LLQSLGLIRD TFARMGIRKQ TNLLYRQSNY NLLREESEGY
SKLLTELFTT SSNEPPSGEV VEDTFERVKA MIGAFDMDVG RVLDVTLDVF AAVLVKKNRF
FVKFLRVSSW WPKEESFLRR YGGISEPGLP KWALPGSAGW LTTEEDREEA LRANEKRDRL
FWDRAREVGI RAYFEIGRNQ SLEPKQLKSI SELKINAPEE DNDTLKWIEQ SGTLPPKSNG
VAAQLLGFKL RFYSSDARNP ADILPDNLIY LAALLIKVGF ISLRDLYPHL WRPDEFMDEL
KEEKMKEKEQ RELAARPGAG AMNALMMAGA LSDDTIPVPM SRLRETDTRA ATPAKDQDVD
KTAQTKVDEK KESLPEPADQ KVLLLKSLLA IGALPESLYI LGRFPWLLDA YPELPEYIHR
IIHHSLSKLC DSSRPLSSRS DIKTEKKITS SDQAGLPRGN IRLTDPPPRR VLRWALLDKE
DTNDGTDYRF YWDDWTDNIP VCQTVDDVFT LCGSFLNLSG VKIGQDPGLL TKLARLGNES
LQSDPSESNQ MRWRDLCKRL LVPALSLTRA NPGVVNEVFE LLSHFSRDVR YSIYAEWYSG
QTSRLPDIKS AFDQARAETK DALKRLSKTN IKPMARNLAK IAFANPGIVI SVAINQIEAY
ENLIEVVVEC ARYFTYLGYD ILTWSLINSL GQKGRSRVQD GGLLTSRWLN ALASFVGRVF
KRYSTIMDPV PVLQYVGEQL RHNNSTDLVI LEQLISSMAG IVTDTNFNDS QIQAMAGGAL
LQSQTMLQLL DKRHESKITS RRLIKSLANS KLAGQLLIAV AQERATCIFK ESEADGELKL
LGNIFDEIHR VLTQYIDLLR SNFTVEEFDS FVPDVASLIG EFGLQPEVAF WITRPSVAHQ
IAEVDTKKRE QATKKQETET IPASKSPDGD LEMADDGEAV EKEDSSGDVT ISTEESTSST
NDNQGEAKTN ASITSSTSKA DPESVPWHPV LEGLMDKIKT AMPRSSWEVV GLPFFVTFWQ
LSLYDIHVPQ RAYEEELERL KKKIQAISQD RSDVSIAGTL KKDKEKKLIN DLHDRILAEN
KAHVRCYGLN RARLQKEKDR WFVGLRGKHE ALNIAVMEQC LLPRLLLSPI DAFYSFKMLK
YLHSSGTPNF RTVGFLDQLF REQRLTAIIF QSTSKEADNF GRFLNEVLRD LTRWHADKAV
YEKEAYGMKR DLPGFAMAVD QEGKPKSFLD YEDFRRIFYK WHRIFGACLK TCLSGGEYMH
IRNAISVLKA VVQHFPAVNW IGRDMHTCVN NLKTMDPRDD VKIPAASLIG DLNRREKKWL
LPQAFMIVSL PICNVEEYGA NPPKNESLPS DKAKARTPQP QSTTPKSLNA SAPDFKPSGS
STTVNDARPH GPGKLEVEDG EIEDAKMVGK LNNAAMKAKA ARGDAQSLQV SDTSSAVETA
STSKLDKVEQ NNKTPVLQET EPAKVAEPAT GHSQTPAQPS SQQPSTTQAT LGDSQTPVSG
HRSSSPAPSR VASRPPSESS HVPSIPKRPD VDCYPPQHSA SVRPQANLPN RPEPPRPFRH
PDERMSMRPP NMPDERRDAR DNRHPDRSGR FGGQDRERPF EHALPIDSRS HGRPNERPGD
RDRLDGHRID REFPSRPLED NFGRPGYRDA RPSPRDQEWS DRMGRGRISQ ADAFQVRQDS
DRSFREGDVH QYRAANVPDL HDRDHPSRPH TETGIHQRLE LPRPERDDRR NRSSRPSTPP
KADDSRLQNR SDRREEREER KPTNSQPPAR SEDLPTGPKG DRGNYSSAAD NRHALDSRST
YQSTTDSSYG RLNQDSKFPL RPQESFDRPQ DIPSGPRKTS TQRGGRNPSL SQPLPIPPQP
TDRQPPTGPS NRPLARNPAQ HDQQLATAPS SAPAPVEKID TSGIHPDRLK AIQSPRDDGT
QTTASPTSLP PSGPRSGGHP PSGPSPTTRG PPGVPFPGER SRGDKRFAGL NNMLQQFGAP
ADKSGMGTSI RGRGANRTGG NSVNAPSPQS TRPQTPVGSK GDNYSGTAPP SGSEKPDLFP
SRSDASAPPV VPSQEEPRAH GRAGRRSEII EEAAAESRRS PRHSSGTRTP DRERDREKDR
ERERDRERDR EHRDRERERD TSRRGEEEAS RASTKRDEYR ERHREGDRDR DRDRDRGRAA
DISSREQARG SRESSSRGPP QNSSANREPV STRRRDKRER DVGTYESQSR GKSDVSPPPP
PPPPPPLLAS NETENRRWGS GGREEDRNRD RERGRDRERD RDRDRDRERR DLGNGRGGNT
GAAGGASGGG NTGGGSWSRK RGRLAGSNVD DGASAAGSSR IGGENKRPRR GH
//