ID A0A250WYS1_9CHLO Unreviewed; 1720 AA.
AC A0A250WYS1;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=CEUSTIGMA_g3437.t1 {ECO:0000313|EMBL:GAX75994.1};
OS Chlamydomonas eustigma.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Chlorophyceae;
OC CS clade; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas.
OX NCBI_TaxID=1157962 {ECO:0000313|EMBL:GAX75994.1, ECO:0000313|Proteomes:UP000232323};
RN [1] {ECO:0000313|EMBL:GAX75994.1, ECO:0000313|Proteomes:UP000232323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NIES-2499 {ECO:0000313|EMBL:GAX75994.1,
RC ECO:0000313|Proteomes:UP000232323};
RA Hirooka S., Hirose Y., Kanesaki Y., Higuchi S., Fujiwara T., Onuma R.,
RA Era A., Ohbayashi R., Uzuka A., Nozaki H., Yoshikawa H., Miyagishima S.Y.;
RT "Acidophilic green algal genome provides insights into adaptation to an
RT acidic environment.";
RL Submitted (AUG-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAX75994.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BEGY01000015; GAX75994.1; -; Genomic_DNA.
DR STRING; 1157962.A0A250WYS1; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000232323; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 2.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000232323}.
FT DOMAIN 54..266
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 435..567
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 586..660
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 916..1237
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 275..316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 863..883
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1248..1278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1308..1720
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..308
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1313..1354
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1422..1445
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1452..1532
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1550..1585
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1594..1629
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1637..1652
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1656..1670
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1720 AA; 189495 MW; A450DF3449E539CE CRC64;
MDTKLLDLIS KSIEEKVDID KIYSEIQNIG ADQGLLIEVS WLLWVRLDGP SHNDKRLRLA
EIVKALVKDK KIDRRQAMEV FELDCMQAVG LLSDKEATDR WKRAEVRFNT KTAYTQNKFN
LLREDTEGYA KLVTCLNHVG DGSLTEETVP LLFEEIQALI GYFDLDPNRV LDLVLDAAEH
QPHNDAYIKL VPMFKAEAVV QLLGFKFQGF QEDSPAAPAD LYLLAARLLR DGLITMEGLL
THLSPSDEEI KVSQAAAADA MKNKVEDIGQ VSLKGPETAP DWLRPNSYTS RSQAPYTSGS
SAAAANGGSG QASEVPVRRP LCSETLELDA GPYLTMVLAP GVAKNQKLGL ALALVQAGVW
DVARPLMERL TVLGLPPAAW PPMCKALCDL ASKDIDLLHN TLVPRGLSSV NPLESRKGLH
KDNLQTPEIQ SRTFDLLKRI GVFIHSDITL LTKVIRVLRH YVLAHCGGIL EPGIVPPAAS
SPGSVAAAEA AVSQVVLPAS MMVPSNVALV SEVWCLLSLF PFQTRFRMYG EIKELSVLSP
YLNASSKMAA WEARKVLKRL ARSERDDAGR EKRETREVMR PYARMLAKVA HASPLSVMEA
LVTLVENYSN QIEPVADALK YLTPFAFDVL TYVVLSRLTS GRDRVKEDGV NISDWLQNLA
AFNAAVCRKH SDMEMNAMCQ LLSNALRAGD AFDLLVLKEL LEVLTGIAAH SDVSEKQLEG
LSGGPALRDR AVVRTSEERE RSSKAWTKGG RKLLAALQQG SPQQRMALPL LLLIAQQRHV
IATQTESKHV KLIAEMFDKC QETCKQYISF LQMALPEEEY ASLLPPIEVL RKQYSLDPEV
VYCMYRPMLR NLFLLTRPLP EDGELEEGEE APGSNDGDAG NGVSEKVTSL APKVCGKTWA
DCLSSMQASL SGRKGWELMT MELYGTFWSL SIEDVYVPTQ RYQDEIALIE AELTGIERQL
SQPRGGGIGG GYGMGMGSYG VGGAVNPNLT PEEVDRLKKQ RSGLQDLLTK LQSEYESQVE
HCREIRLRLE RDKGNFCVYT PAMKTGVCTL MEECLLPRVL YSPEDALYCA KMILLLHEIE
TPNLSTLIFY NEMLTLLVPV LKCITMREAS NLGIFLREVL SLLSHWRVAD VYSTECSTKV
GFSFSLVASD SRKTTHDQFV KLMFKWHSNI MRTVKAGLEN EEWTSRRNAL AVLSSVVEFF
PVVQEHLKII RHEVEKLRDS EEREDIKTIA NGYLNLLKRE DTKPRRFFPA VDWGGPPMKP
KTGATRPASS NPPRSAPVNA ITKAEPEMLS AAPSDTAAPL STASLRAEAE AFQPPNSRTS
KPTNPSGPPQ SGATGKTEEN SGRSHGSSKR EQIPSPEFGN KREHATTAGL DTVSKRPRHE
EPSVQGGTVA TRSNTPPASR DKALGGAGGQ VDVGSGAGKE AVVRDSSREK ERSGGRGEAG
EERGGGVGRA GEVGEIKERV SREHRSEKET GKERDHGGES VKEGRRSKSD GRREEGVRGG
GGTAEDTRVS RDEREKGRAQ EREERRPPPS DKHNSSSVHP ASEQLHQSLP ALGEERKDRI
SGERESSSRK MKSKDAKEKD VDNANPVRSG EVGVGGQDKE RTRESKEGGG KVTERDPTEH
RERQDAAASQ KDPDSLMEVS SSKRRRHDQH VADPVIATSS RDQQQLQGLA GDLRSRLGSR
PTGRMDSDRS GEVGGDGAAP EVEASPRHQQ GRERSSNRRK
//