ID K5V975_PHACS Unreviewed; 2126 AA.
AC K5V975;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 22-FEB-2023, entry version 31.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=PHACADRAFT_191704 {ECO:0000313|EMBL:EKM59356.1};
OS Phanerochaete carnosa (strain HHB-10118-sp) (White-rot fungus) (Peniophora
OS carnosa).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC Polyporales; Phanerochaetaceae; Phanerochaete.
OX NCBI_TaxID=650164 {ECO:0000313|EMBL:EKM59356.1, ECO:0000313|Proteomes:UP000008370};
RN [1] {ECO:0000313|EMBL:EKM59356.1, ECO:0000313|Proteomes:UP000008370}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HHB-10118-sp {ECO:0000313|EMBL:EKM59356.1,
RC ECO:0000313|Proteomes:UP000008370};
RX PubMed=22937793; DOI=10.1186/1471-2164-13-444;
RA Suzuki H., MacDonald J., Syed K., Salamov A., Hori C., Aerts A.,
RA Henrissat B., Wiebenga A., vanKuyk P.A., Barry K., Lindquist E.,
RA LaButti K., Lapidus A., Lucas S., Coutinho P., Gong Y., Samejima M.,
RA Mahadevan R., Abou-Zaid M., de Vries R.P., Igarashi K., Yadav J.S.,
RA Grigoriev I.V., Master E.R.;
RT "Comparative genomics of the white-rot fungi, Phanerochaete carnosa and P.
RT chrysosporium, to elucidate the genetic basis of the distinct wood types
RT they colonize.";
RL BMC Genomics 13:444-444(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH930469; EKM59356.1; -; Genomic_DNA.
DR RefSeq; XP_007391920.1; XM_007391858.1.
DR STRING; 650164.K5V975; -.
DR GeneID; 18910779; -.
DR KEGG; pco:PHACADRAFT_191704; -.
DR HOGENOM; CLU_000511_4_0_1; -.
DR InParanoid; K5V975; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000008370; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008370}.
FT DOMAIN 11..732
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 737..812
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1108..1424
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 445..465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1027..1085
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1433..2126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1033..1070
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1446..1470
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1604..1648
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1655..1672
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1673..1690
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1695..1732
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1739..1845
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1866..1951
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2041..2070
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2112..2126
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2126 AA; 234959 MW; 685E46047AA895B7 CRC64;
MDVVETVRGF IGCWEESGQA ECRDLLTAPH SNPSDPAASD VLSTAYHTLI TATLKTWSPS
KALTPQALIS FLQSLLESLP SPSSTKSPHA IAFGDILVDI LWAIDVELDD IHQDTKMALA
NAEQGNAPVV AEGVDVTAVL ARVAQAKQNA ESDKEILAGA VKLLVASGIL DADICRERLE
LSMIHHADLI PDDQAFSKKE VRMRTALFYK QNKFNLLREQ SEGYSKLTTE LTSSLGPPHS
SVTGRPIDSW SVIEARARPA WERVVGLIGY FDLDPNRALD IILDVFSVHL ATHYSFFLAL
LSCSPWGASL KPKPEDAMNA EASSDAYKGK DLDEILRIAE LQSGHASSEL LLSATSNGSR
VLAQVLGFKF TYYQLPDVTE SAPKNLYLTT ALLIREGFIT LEDIHPHLSP AEDAMGTEHK
KYLDSVNARI ANAKVSMLAM AAPLESSSSS SGPKIHQATP AEPKKVEKEV SNQKIGLLHA
LLSLGALRPG LALLSKYPWI VDASPDIADL LLRILKHSIA PVYAAVGSGK ESRPAFAVPK
RRFGATGVVS APERKPQLTL WAPTPPSTSA IDFVFFFPCW TERIPLCTTT EDVINVLEPL
MRFVGLHASR DLIFLAQFSR VGRADVVATV SVDPDTKRPL PADLDHPTCR FWYKMARLYL
LPALSLVRGN AVCNVEIWTL IRMFETTQRW RLYGEWKATT YQSHPELRVR YIQADRESKG
ILRRLSHQTV EKLSCSVAKL AHSNPLILFS NTINQVMAYE NLAEAVIRTL TYTTVMGFDV
LLYVIIEAFS NPNKARVKDD GVNTSDWLQN LASFTGLLFR KYTGDITYLL KYLVHQLQNG
QVSEIIILRE LIWKIAGIEP LPNLTDSQVV AMAGGPNLRI EVLGSDKRGA RLDPQDVGAK
GSIRLGKVMI ESGLALPLLI QIAQQRQACV FQAANTHLKS LANLYDITHG VLLQYVEFLI
TPSIVPPEDY AKKVLPSLAD MHLKYGIGVP ICMQILRPTL HLALLSAALE MQKRERVASE
EAEKRLKAAL TAKREPTTTS RTGSPVVAEG TASTDQAAEA KPSGSSEDVT MESADHPEPA
PAPEKPWLPQ LYALFDDVKA IAPGSVNEVI GSGFYMTFWQ LSTYDLSPPA SKYEEETSNL
RKLSLEADRA YNAAEKSDSW VERNNSYKHR ERRNRYNVFI DMLRDELKQQ TAARAFTIKR
LAREKQHWFA HNPKGTTVAA AIVEHCLHPR AMLSPMDADY CAQIIKVVHL QGTPGFSTLN
IYDKLLGDHL RTLIFSCSEV EARNYGRFLL GILGDIWKWH QDEQLFMQDN RVKSGGKVSY
LPGFMLIYSN KANVAIDDII KWQQFRQVCK KWHRKLAKAF IDCIESGEFM HVYNTIVVLK
EILPVFPVAS VSEASGPALE IAMEKFIEKE ERDDLKILGR AYVASFKKRE PHWKLQERGV
RPAAPTPPAK STPAATPAPP ERPRTSGPPP LGPTSNDRSA AAPPTGPRAD IAQVNGSAAL
GAEKPSMPGS TRIAIDSIPR PEVVKRVRPD TRSPAPRVNG EAEAKTNGQL DPMQVDKLHA
SVRPPVTATT PPTAPRRDEQ GIPRGAAGPG ATLRPGSPNI PAGSPVLRSI SQMEGSRPPT
PSPLVRSVSN RDLTQSPRIG TSDSRGRVEA SQVMPPPANP SQTTSAQELR ETAKQSRPID
KVEEKPARPP PAEPRGQQTS ASTPASASRR RSPSPLSRPG TRNPSLESRA SGGRSRGATG
DSERSDDKRD RESRHESRRD VHGRRSTRES DREKESDRER RDRHGDRERP RDRDRERDRE
RDGHRDKDRD RDRDRHRDSE RDRDRHRKDE KDRDRESRKE RDGSGRGIPT GPSSSATPAV
DERGLPVRPD TSRRREDEAL GKRRRPTEDE PDRASKRPSR KESHHEDRSR RASDKDRDDR
GRESDRRRKE REQPENDPRA LTIDIKAAEK RVPEGPASAS ARSLPPTTPS APRAMAGSAE
GPKNVKPDRD WRPRDQPPRG SPTMPSAVQP GPEVQGPGVS LRSRIADKET RPIPSGPRGE
ASSDRKQEGG SSAAKDEREG SRKRASSERE TGDPSAGPSE LIGPAKRPRI NRTRWQGTAT
AAAAPFAKKS LMDAEKNGRG GIGRKD
//