GenomeNet

Database: UniProt
Entry: C1H1Y4_PARBA
ID   C1H1Y4_PARBA            Unreviewed;      2551 AA.
AC   C1H1Y4;
DT   26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT   26-MAY-2009, sequence version 1.
DT   27-MAR-2024, entry version 51.
DE   RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN   ORFNames=PAAG_04920 {ECO:0000313|EMBL:EEH33871.1};
OS   Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) (Paracoccidioides
OS   brasiliensis).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC   Eurotiomycetidae; Onygenales; Onygenales incertae sedis; Paracoccidioides.
OX   NCBI_TaxID=502779 {ECO:0000313|EMBL:EEH33871.1, ECO:0000313|Proteomes:UP000002059};
RN   [1] {ECO:0000313|EMBL:EEH33871.1, ECO:0000313|Proteomes:UP000002059}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC MYA-826 / Pb01 {ECO:0000313|Proteomes:UP000002059};
RX   PubMed=22046142; DOI=10.1371/journal.pgen.1002345;
RA   Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., Goldberg J.,
RA   Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., Grynberg M.,
RA   Gujja S., Heiman D.I., Henn M.R., Kodira C.D., Leon-Narvaez H.,
RA   Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., Morais F.V., Pereira M.,
RA   Rodriguez-Brito S., Sakthikumar S., Salem-Izacc S.M., Sykes S.M.,
RA   Teixeira M.M., Vallejo M.C., Walter M.E., Yandava C., Young S., Zeng Q.,
RA   Zucker J., Felipe M.S., Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G.,
RA   Puccia R., San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.;
RT   "Comparative genomic analysis of human fungal pathogens causing
RT   paracoccidioidomycosis.";
RL   PLoS Genet. 7:E1002345-E1002345(2011).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the THOC2 family.
CC       {ECO:0000256|ARBA:ARBA00007857}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KN294003; EEH33871.1; -; Genomic_DNA.
DR   RefSeq; XP_002793391.1; XM_002793345.2.
DR   AlphaFoldDB; C1H1Y4; -.
DR   STRING; 502779.C1H1Y4; -.
DR   GeneID; 9096501; -.
DR   KEGG; pbl:PAAG_04920; -.
DR   VEuPathDB; FungiDB:PAAG_04920; -.
DR   eggNOG; KOG1874; Eukaryota.
DR   HOGENOM; CLU_000511_1_0_1; -.
DR   OMA; QERWTCI; -.
DR   OrthoDB; 179356at2759; -.
DR   Proteomes; UP000002059; Partially assembled WGS sequence.
DR   GO; GO:0000347; C:THO complex; IEA:InterPro.
DR   GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR   GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR   InterPro; IPR040007; Tho2.
DR   InterPro; IPR021418; THO_THOC2_C.
DR   InterPro; IPR021726; THO_THOC2_N.
DR   InterPro; IPR032302; THOC2_N.
DR   PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR   PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR   Pfam; PF11262; Tho2; 1.
DR   Pfam; PF11732; Thoc2; 1.
DR   Pfam; PF16134; THOC2_N; 1.
PE   3: Inferred from homology;
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002059}.
FT   DOMAIN          134..877
FT                   /note="THO complex subunit 2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF16134"
FT   DOMAIN          879..954
FT                   /note="THO complex subunit THOC2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11732"
FT   DOMAIN          1248..1553
FT                   /note="THO complex subunit THOC2 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11262"
FT   REGION          1..124
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          587..616
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          672..703
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1145..1228
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1572..1620
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1644..2551
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        33..54
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        65..124
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        589..616
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1191..1223
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1576..1612
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1651..1668
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1694..1727
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1779..1889
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1919..1990
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1999..2016
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2017..2044
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2058..2073
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2089..2114
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2139..2156
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2157..2175
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2218..2254
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2282..2395
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2396..2410
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2411..2429
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2456..2493
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2551 AA;  284810 MW;  7D7EC108F094C2FF CRC64;
     MPSAAGGKRK RGDRSWSGDA GNDGWRPSPH RPGNLNLAQQ HQQNQFNGQG RDSTDIRNRG
     GRRTSRGGNR NGTPARRASE GQNSHQKENS PSSRSAMTSQ RDPPELSQPS RTPSTPTPTS
     SSQPQAMFYS YEYLTREYVD AWTLMGRKAV IDIGVSARKA EDALVISSVF QEIIQSVIDS
     RISAVDGGTS VKEIIGEDVA ADDIPSVDGS TSTAQTFDVR SLFLDTLSVI ADANTSHSSL
     RTLVFSTGIS PTLMRQQLET PLLQSLGLIR DTFARMGIRK QTNLLYRQSN YNLLREESEG
     YSKLLTELFT TSSNEPPSAE VVEDTFERVK AMIGAFDMDV GRVLDVTLDV FAAVLVKKNR
     FFVKFLRVSS WWPKEESFLR RYGGISEPGL PKWALPGSAG WSTTEEDREE ALRANEKRDR
     LFWDRAREVG IRAFFEIGRN QSLEQKQLKS LSEFKSNSPE EDNDTLKWIE QTGTLPPKGN
     GVAAQLLGFK LRFYSSDARN PADILPDNLI YLAALLIKVG FISLRDLYPH LWRPDESMDE
     LKEEKMKEKE QRELAARPGA GAMNALMMAG ALSDDTIPVP MSRLREADTR AATPAKDQDV
     DKTAQTKVDE KKESLPEPAD QKVLLLKSLL AIGALPESLY ILGRFPWLLD AYPELPEFIH
     RIIHHSLSKL CDSSRPLSSR SDIKSEKKIT SSDQASLPKG NIRLTDQPPR RVLRWALLDK
     EDTNDGTDYR FYWDDWTDNI PVCQTVDDVF TLCGSFLNLS GVKIGQDPGL LTKLARLGNK
     SLQSDPSESN QMRWRDLCKR LLVPALSLTR ANPGVVNEVF ELLSHFARDV RYSIYAEWYS
     GQTSRLPDIK SAFDQARAET KDALKRLSKT NIKPMARNLA KIAFANPGIV ISVAINQIEA
     YENLIEVVVE CARYFTYLGY DILTWSLINS LGQKGRSRVQ DGGLLTSRWL NALASFVGRV
     FKRYSTIMDP VPVLQYVGEQ LRHNNSTDLV ILEQLISSMA GIVTDTNFND SQIQAMAGGA
     LLQSQTMLQL LDKRHESKIT SRRLIKSLAN SKLAGQLLIA VAQERATCIF KESEADGELK
     LLGNIFDEIH RVLTQYIDLL RSNFTVEEFD SFVPDVASLI GEFGLQPEVA FWITRPSVAH
     QIAEVDTKKR EQAAKKPETE TIPASKSPDG DLEMADDGEA VEKEDSSGDV TISTEESTSN
     TNDNQGEAKT NASNTSSTSK ADPESVPWHP VLEGLMDKIK TAMPRSSWEV VGLPFFVTFW
     QLSLYDIHVP QRAYEEELER LKKKIQAISQ DRSDVSIAGT LKKDKEKKLI NDLHDRILAE
     NKAHVRCYGL NRARLQKEKD RWFVGLRGKH EALNIAVMEQ CLLPRLLLSP IDAFYSFKML
     KYLHSSGTPN FRTVGFLDQL FREQRLTAII FQSTSKEADN FGRFLNEVLR DLTRWHADKA
     VYEKEAYGIK RDLPGFAMAV DQEGKPKSFL DYEDFRRIFY KWHRIFGACL KTCLSGGEYM
     HIRNAISVLK AVVQHFPAVN WIGRDMHTCV NNLKTMDPRD DVKIPAASLI GDLNRREKKW
     LLPQAFMINE SLPSDKAKAR TPQPQSTTPK SLNASAPDFK PSGSSTTVND AQPHGPGKLE
     VEDGEIEDAK MAEKLNNAAI KAKAARGDAQ SLQVSDASRA METASTSKLD KVEQNDKTPV
     IQETEPAKVA EPATGHSQTP SQPSSQQPST TPATLGDSQT PVSGHRSSSP APLRVASRPP
     SESSHPPSIP KRPDVDRYPP QHNTSVRPQA NLPNRPEPPR SFRHPDERMT MRPPNVPDER
     RDARDNRHPD RSGRFGGQDR ERPFEHALPI DPRSHGRPNE RPGDRDRLDG HRIDREFPSR
     PLEDNFGRPG YRDARPSPRD QEWSDRMGRG RISQADAFQV RQDSDRSFRE GDVHQYRAAN
     VPDLHDRDHP SRLHTETGIH QRLELPRAER DGRRNRSSRP STPPKADDSR LQNRSDRREE
     REERKPTSSQ PPARSDDLPT GPKGDRGNHS SAADNRHAPD SRSTYQSTTD SSYGRLNQDS
     KFPLRPQESF DRPQDIPSGP RKTSTQRGGR NPSLSQPLPI PPQSTDRQPP TGPSNRPLAR
     NPPQHDQQLA TAPSSAPAPV EKIDTSGIHP DRLKAIQSPR DDGTQTTASP TSLPPSGPRS
     GGHPPSGPSP TTRGPPGVPF SGERSRGDKR FAGLNNMLQQ FGGPADKSVM GTSIRGRGAN
     RTGGNSLNAP SPQSTRPQTP IGSKGDTYSG TAPPSGSEKP DLFPSRSDAS APQVAPSQEE
     PRGHGRAGRR SEIIEEAAAE SRRSPRHSSG TRTPDRERDR EKDRERERER ERDREHRDRE
     RERDTSRRGE EEASRASTKR DEYRERHREG DRDRDRDRGR AADISSREQA RGSRESSSRR
     PPQNSSANRE PVSTRRRDKR ERDVGTYESQ SRGKSDVSPP PPPPPPLASN ETENRRWGSG
     GREEDRNRDR ERGRDRERDR DRDRDRERRD LGNGRGGNSG AAGGASGGGN TGGGSWSRKR
     GRLAGSNVDD GASAAGSSRI GGENKRPRRG H
//