ID C1H1Y4_PARBA Unreviewed; 2551 AA.
AC C1H1Y4;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=PAAG_04920 {ECO:0000313|EMBL:EEH33871.1};
OS Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) (Paracoccidioides
OS brasiliensis).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Onygenales incertae sedis; Paracoccidioides.
OX NCBI_TaxID=502779 {ECO:0000313|EMBL:EEH33871.1, ECO:0000313|Proteomes:UP000002059};
RN [1] {ECO:0000313|EMBL:EEH33871.1, ECO:0000313|Proteomes:UP000002059}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-826 / Pb01 {ECO:0000313|Proteomes:UP000002059};
RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345;
RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., Goldberg J.,
RA Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., Grynberg M.,
RA Gujja S., Heiman D.I., Henn M.R., Kodira C.D., Leon-Narvaez H.,
RA Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., Morais F.V., Pereira M.,
RA Rodriguez-Brito S., Sakthikumar S., Salem-Izacc S.M., Sykes S.M.,
RA Teixeira M.M., Vallejo M.C., Walter M.E., Yandava C., Young S., Zeng Q.,
RA Zucker J., Felipe M.S., Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G.,
RA Puccia R., San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.;
RT "Comparative genomic analysis of human fungal pathogens causing
RT paracoccidioidomycosis.";
RL PLoS Genet. 7:E1002345-E1002345(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN294003; EEH33871.1; -; Genomic_DNA.
DR RefSeq; XP_002793391.1; XM_002793345.2.
DR AlphaFoldDB; C1H1Y4; -.
DR STRING; 502779.C1H1Y4; -.
DR GeneID; 9096501; -.
DR KEGG; pbl:PAAG_04920; -.
DR VEuPathDB; FungiDB:PAAG_04920; -.
DR eggNOG; KOG1874; Eukaryota.
DR HOGENOM; CLU_000511_1_0_1; -.
DR OMA; QERWTCI; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000002059; Partially assembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002059}.
FT DOMAIN 134..877
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 879..954
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1248..1553
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1..124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 587..616
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 672..703
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1145..1228
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1572..1620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1644..2551
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 33..54
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 65..124
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 589..616
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1191..1223
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1576..1612
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1651..1668
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1694..1727
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1779..1889
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1919..1990
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1999..2016
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2017..2044
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2058..2073
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2089..2114
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2139..2156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2157..2175
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2218..2254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2282..2395
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2396..2410
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2411..2429
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2456..2493
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2551 AA; 284810 MW; 7D7EC108F094C2FF CRC64;
MPSAAGGKRK RGDRSWSGDA GNDGWRPSPH RPGNLNLAQQ HQQNQFNGQG RDSTDIRNRG
GRRTSRGGNR NGTPARRASE GQNSHQKENS PSSRSAMTSQ RDPPELSQPS RTPSTPTPTS
SSQPQAMFYS YEYLTREYVD AWTLMGRKAV IDIGVSARKA EDALVISSVF QEIIQSVIDS
RISAVDGGTS VKEIIGEDVA ADDIPSVDGS TSTAQTFDVR SLFLDTLSVI ADANTSHSSL
RTLVFSTGIS PTLMRQQLET PLLQSLGLIR DTFARMGIRK QTNLLYRQSN YNLLREESEG
YSKLLTELFT TSSNEPPSAE VVEDTFERVK AMIGAFDMDV GRVLDVTLDV FAAVLVKKNR
FFVKFLRVSS WWPKEESFLR RYGGISEPGL PKWALPGSAG WSTTEEDREE ALRANEKRDR
LFWDRAREVG IRAFFEIGRN QSLEQKQLKS LSEFKSNSPE EDNDTLKWIE QTGTLPPKGN
GVAAQLLGFK LRFYSSDARN PADILPDNLI YLAALLIKVG FISLRDLYPH LWRPDESMDE
LKEEKMKEKE QRELAARPGA GAMNALMMAG ALSDDTIPVP MSRLREADTR AATPAKDQDV
DKTAQTKVDE KKESLPEPAD QKVLLLKSLL AIGALPESLY ILGRFPWLLD AYPELPEFIH
RIIHHSLSKL CDSSRPLSSR SDIKSEKKIT SSDQASLPKG NIRLTDQPPR RVLRWALLDK
EDTNDGTDYR FYWDDWTDNI PVCQTVDDVF TLCGSFLNLS GVKIGQDPGL LTKLARLGNK
SLQSDPSESN QMRWRDLCKR LLVPALSLTR ANPGVVNEVF ELLSHFARDV RYSIYAEWYS
GQTSRLPDIK SAFDQARAET KDALKRLSKT NIKPMARNLA KIAFANPGIV ISVAINQIEA
YENLIEVVVE CARYFTYLGY DILTWSLINS LGQKGRSRVQ DGGLLTSRWL NALASFVGRV
FKRYSTIMDP VPVLQYVGEQ LRHNNSTDLV ILEQLISSMA GIVTDTNFND SQIQAMAGGA
LLQSQTMLQL LDKRHESKIT SRRLIKSLAN SKLAGQLLIA VAQERATCIF KESEADGELK
LLGNIFDEIH RVLTQYIDLL RSNFTVEEFD SFVPDVASLI GEFGLQPEVA FWITRPSVAH
QIAEVDTKKR EQAAKKPETE TIPASKSPDG DLEMADDGEA VEKEDSSGDV TISTEESTSN
TNDNQGEAKT NASNTSSTSK ADPESVPWHP VLEGLMDKIK TAMPRSSWEV VGLPFFVTFW
QLSLYDIHVP QRAYEEELER LKKKIQAISQ DRSDVSIAGT LKKDKEKKLI NDLHDRILAE
NKAHVRCYGL NRARLQKEKD RWFVGLRGKH EALNIAVMEQ CLLPRLLLSP IDAFYSFKML
KYLHSSGTPN FRTVGFLDQL FREQRLTAII FQSTSKEADN FGRFLNEVLR DLTRWHADKA
VYEKEAYGIK RDLPGFAMAV DQEGKPKSFL DYEDFRRIFY KWHRIFGACL KTCLSGGEYM
HIRNAISVLK AVVQHFPAVN WIGRDMHTCV NNLKTMDPRD DVKIPAASLI GDLNRREKKW
LLPQAFMINE SLPSDKAKAR TPQPQSTTPK SLNASAPDFK PSGSSTTVND AQPHGPGKLE
VEDGEIEDAK MAEKLNNAAI KAKAARGDAQ SLQVSDASRA METASTSKLD KVEQNDKTPV
IQETEPAKVA EPATGHSQTP SQPSSQQPST TPATLGDSQT PVSGHRSSSP APLRVASRPP
SESSHPPSIP KRPDVDRYPP QHNTSVRPQA NLPNRPEPPR SFRHPDERMT MRPPNVPDER
RDARDNRHPD RSGRFGGQDR ERPFEHALPI DPRSHGRPNE RPGDRDRLDG HRIDREFPSR
PLEDNFGRPG YRDARPSPRD QEWSDRMGRG RISQADAFQV RQDSDRSFRE GDVHQYRAAN
VPDLHDRDHP SRLHTETGIH QRLELPRAER DGRRNRSSRP STPPKADDSR LQNRSDRREE
REERKPTSSQ PPARSDDLPT GPKGDRGNHS SAADNRHAPD SRSTYQSTTD SSYGRLNQDS
KFPLRPQESF DRPQDIPSGP RKTSTQRGGR NPSLSQPLPI PPQSTDRQPP TGPSNRPLAR
NPPQHDQQLA TAPSSAPAPV EKIDTSGIHP DRLKAIQSPR DDGTQTTASP TSLPPSGPRS
GGHPPSGPSP TTRGPPGVPF SGERSRGDKR FAGLNNMLQQ FGGPADKSVM GTSIRGRGAN
RTGGNSLNAP SPQSTRPQTP IGSKGDTYSG TAPPSGSEKP DLFPSRSDAS APQVAPSQEE
PRGHGRAGRR SEIIEEAAAE SRRSPRHSSG TRTPDRERDR EKDRERERER ERDREHRDRE
RERDTSRRGE EEASRASTKR DEYRERHREG DRDRDRDRGR AADISSREQA RGSRESSSRR
PPQNSSANRE PVSTRRRDKR ERDVGTYESQ SRGKSDVSPP PPPPPPLASN ETENRRWGSG
GREEDRNRDR ERGRDRERDR DRDRDRERRD LGNGRGGNSG AAGGASGGGN TGGGSWSRKR
GRLAGSNVDD GASAAGSSRI GGENKRPRRG H
//