ID A0A0M8P719_9EURO Unreviewed; 2345 AA.
AC A0A0M8P719;
DT 09-DEC-2015, integrated into UniProtKB/TrEMBL.
DT 09-DEC-2015, sequence version 1.
DT 22-FEB-2023, entry version 18.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=ACN38_g6478 {ECO:0000313|EMBL:KOS42624.1};
OS Penicillium nordicum.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium.
OX NCBI_TaxID=229535 {ECO:0000313|EMBL:KOS42624.1, ECO:0000313|Proteomes:UP000037696};
RN [1] {ECO:0000313|EMBL:KOS42624.1, ECO:0000313|Proteomes:UP000037696}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DAOMC 185683 {ECO:0000313|EMBL:KOS42624.1,
RC ECO:0000313|Proteomes:UP000037696};
RA Nguyen H.D., Seifert K.A.;
RT "Genome sequencing of Penicillium nordicum.";
RL Submitted (AUG-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KOS42624.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHQQ01000101; KOS42624.1; -; Genomic_DNA.
DR STRING; 229535.A0A0M8P719; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000037696; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000037696}.
FT DOMAIN 163..897
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 899..974
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 1256..1561
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1..159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 475..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 601..639
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1161..1208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1582..2345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1279..1346
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 77..95
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..137
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 478..492
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 606..638
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1599..1623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1624..1643
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1648..1694
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1715..1805
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1822..1860
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1861..1875
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1898..1918
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1949..1973
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2149..2226
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2250..2269
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2272..2305
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2345 AA; 259984 MW; D623B488C9BB4B2F CRC64;
MAPGYGHGGK RKRIDRSWSG ESGSNDLRPS PHRPGTLNMA HHPTNPGHQS PTPREQNDTR
DRSRRQSHRS RAGSRRESYD GQNFSNSQHR EANAMSPPAI NQPREPAEPV PHNGDSATAR
PTPSPAPNTP ASQPQPNPPS RAGSETITHP PSLPPPPYDY EYVTDNAVEE WASAGKQKVV
EEGTAARLQQ DLARLASVYQ ELIRSAIYGR LSPSNAGNAV KEIIGEESVS QDVDMEGDSG
KPSTQGIDPC SLFLDTLSIV TDADTSNPAL KPLVFATGID PALMRLQLET PLLQALGLVR
ETFARMGIRK QTNQLYRQSN YNLLREESEG YSKLITELFT TSNNEPPSSE VVEDTFERVK
AMIGAFDMDV GRVLDVTLDV FAAVLVKQYR FFVKLLRVSS WWPKEDTFYS LEQGRRHTGI
PNWALPGSTG WITTEEERSA TMRANEERDS QFWDRVREIG LQAFFEIGRK PISEEERQQS
LSETNGSSFE EDATRSWLEE TGTLPPKGNR VAAQLLGFKL RFYSSSARTK ADVLPDNLIY
LAALLIKVGF ISLRDLYAHL WRSDDTMEIL KTEKMAEKAE RVKAGRPGGG INALMMAGAL
SDDTVPPSRL RDEPRAATPG KDQESNKGTP KTENELPDPS DQKVLLLKSL LAIGAIPESL
FVISKFPWLM EAYPELPEFI HRILHHSLNK VYIQFRPLSS TGDFPGAQYM VTSDQNNAKG
QIGLTPPPAR RVLRWAQLDK EDTNDGTDYR FYWDDWADNI PICQSVDDVF ALCSSFLNLS
GHKIGQDASL LAKLVRIGKG SLLQDGSEEN RTRWRDLCKR LLLPAVSLTK ANPGVVNEVF
DLVSFFPRDV RYNMYAEWYS GQTSRLPDIK EAFDQARAET KDTLKRLSKT NIRPMARALA
KIAYANPGIV INVAMSQIES YENLIEVVVE CARYFTYLGY DILSWSLINS LGQKGRSRVQ
EGGLLTSRWL NALSTFAGRT YKRYSVMDPT PVLQYVVEQL RHNNSTDLIV LEQLISSMAG
IISDNDFNDA QIQAMAGGDV LQSQTILQLL DKRHESKTTS KRLVKSLTHT RLAGQLLVAI
AQERLTCVYN ETSSELKLLG NVFDEIHRIL TQYLDLLRSN LSVDEFDSFV PDFASLITEF
GIQPEIAFWI RRPSIARKID DVEESKQEKE RSASAPKSNG DNRMDTAEDG EAPPKSEESP
ADSAMDVDKG ETEITIPVEG PDAILVPAVN AEPLAANPVM QELIDDVKTA LSAEKWETIG
AHFYATFWQL SLYDVHIPQK SYEDEIDRLK RRVISINSDR SDISVAGTTR KENLKRQVTQ
LQERILDENK NHLKAYGNTR ARLQKEKDKW FAGMRLKHDA LNVALLEQCF IPRLLLSPLD
AFFCFKVLKF LHSSGTPNFR TVGLIDQLFR EQRLTALIFL CTSKEADNLG RFLNEILRDL
TRWHADKAVY EKEAFGAKRD LPGFAKNVDS GGVPAMFLEF EDFRRLLYKW HRWFSNALKS
CLGSGEYMHI RNAISVLKAV VKHFPAVNWI GRDILNSVDH LSKNDERDDV KIPAASLIGD
LNRREKKWML PQAFYFVAAQ PGSHPSADAA GKPGTPQPAA TPLNASAPEF NPTGSSISDA
KQEQPSKAEV EDGEIEDAKM TDVSTGKGGE AKPTSQAGST TPSVVAADTS QDAMSGVSPS
VQSRPHSRAP TSSRQPDIPK RPDIRQQVHP PVRPPPRHSD GRLPPRPEGL DDRRERHPDF
GPRGRHGGPD HNRSFDGSLG DVRGRLNEHL NERDRDFPMR PPADDLMRGP PRDARSTREN
GWPMDRPGRM RGTGPNDSFH GRDPSVRGEL MPDHMDRPVD IPRRGDQSRP EKDDRRAYPP
RPLSPPRPAD LPNRPERFPP PDDRRPAGFP PSSRIDDLPR GPRTDRQADP RDGAPGPDMT
HGRLRQPEPP SEIPAGPRRR GGRNGPGQAP PMLSSSNATL PAPERQTPTG PARQSGRGVP
EQPAPPTQPA GPTTAPGVHP DRLRNLVNEP AAPAAAPSGP RGLGPQQPPR GPSAQSAQSG
PLGQGFGGER GRGDKRFAGL NNMLQQSGGG GGGGDRGGNP PPVRGRGANR PPNTMESPSS
QPANRPPMGP AGPQEDLSRN RPSGGRGGDL IDDAIPEAGR SGPTGGRSRE TEPTREKEPE
RRDGSSSGRN RRDGRRNDRE RSRRSDVGGG SRDEKGTGEP RETLRRVQSS REELRRRDRR
DRQDGPSEVT GPTPSTNETH DTEGRLRPPS SMDAPPPPPP PPPPPMPEGN ERRFNSGDRG
GRSDNRDRER ERRGDRRDRD HQREGGSSSG ASGHRKRGRQ GPDDNTDGGR GMRMGNDNKR
PRRGA
//