ID A0A453GQI7_AEGTS Unreviewed; 1701 AA.
AC A0A453GQI7;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
OS Aegilops tauschii subsp. strangulata (Goatgrass).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Aegilops.
OX NCBI_TaxID=200361 {ECO:0000313|EnsemblPlants:AET3Gv21158100.9, ECO:0000313|Proteomes:UP000015105};
RN [1] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=25035499; DOI=10.1126/science.1250092;
RG International Wheat Genome Sequencing Consortium,;
RA Marcussen T., Sandve S.R., Heier L., Spannagl M., Pfeifer M.,
RA Jakobsen K.S., Wulff B.B., Steuernagel B., Mayer K.F., Olsen O.A.;
RT "Ancient hybridizations among the ancestral genomes of bread wheat.";
RL Science 345:1250092-1250092(2014).
RN [2] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=29158546; DOI=10.1038/s41477-017-0067-8;
RA Zhao G., Zou C., Li K., Wang K., Li T., Gao L., Zhang X., Wang H., Yang Z.,
RA Liu X., Jiang W., Mao L., Kong X., Jiao Y., Jia J.;
RT "The Aegilops tauschii genome reveals multiple impacts of transposons.";
RL Nat. Plants 3:946-955(2017).
RN [3] {ECO:0000313|EnsemblPlants:AET3Gv21158100.9}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (MAR-2019) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EnsemblPlants; AET3Gv21158100.9; AET3Gv21158100.9; AET3Gv21158100.
DR Gramene; AET3Gv21158100.9; AET3Gv21158100.9; AET3Gv21158100.
DR Proteomes; UP000015105; Chromosome 3D.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 2.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000015105}.
FT DOMAIN 6..333
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 343..477
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 479..554
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 797..1092
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1118..1248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1261..1701
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 820..875
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1132..1156
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1157..1180
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1221..1238
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1264..1309
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1326..1494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1518..1615
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1638..1693
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1701 AA; 193897 MW; 12B6E6C1D4176099 CRC64;
RSRLVKMTKS LVESSLIVPR LLQERCEEEF LWEVELSKSK GQDLKAKEVR VNTRLLYQQT
KFNLVREESE GYAKLVTLLC QVGSDLACQN ASSATISIVK SLIGHFDLDP NRVFDIVLEC
FELYPDNSIF YQLIPLFPKS HAAQILGFKF QYYQQLDVNS PVPSGLFRIA ALLVKSGLID
LDNLYAHLLP NDDEAFEHFG SFVSRKIDEA TKIGKINLAA TGKDLMDEEK QEITIDLYTA
LEMENDIIDE RAPEIEKNQK LGLLLGFLSV HDWDHAQLLF ERLAQLNPVE HVEICDALFR
IVEKTISSAY STYCQTHHKI TRNMDTHMMD ASSVSSPSYL VDLPKEFFQM LVACGPYLHR
DTQLFQKVCR VLKVYHASSK ESARTAGVMS PESQVEEALG SCLLPSLQLI PANPAVDMEI
WGVLSLLPYE VRYRLYGEWE KDTEQNPIVL AARQTAKLDT RRLLKRLAKE NLKQLGRMVA
KLAHANPMTV LRTIVQQVEA YRDMINPVVD AFKYLTQLEY DILQYIVIER LAQGGREKVK
DDGLNLSDWL QCLASFWGHL CKKHLSMELK CLFQYIVNQL KKGLGTELVV LEELIQQMAN
VQYTENMTDE QVDAMAGSET LRLQSSLFGS TRNYKVLNKS TNKLRDSLLP KDEPKLAIPL
LLLIAQHRSK IIINADATYI KMVSEQFDRC HGILLQYAEF LSSAVAPSTY VQLIPPLEDL
VYKYHIEPDV AFLIYRPVMR LFKSANGGEA CWPLDDNEEG ESVSYDEMIL HGDSSQKSIM
WSDLLNTIRT ILPAKAWNGL SPELYATFWG LTLYDLNFPK DRYDAEIKKL HENLKQLEDN
SDNSSIAISR RKKDKERIQD LLDKLNNESD KHQQHVISVL QRLTREKDKW LSSSPDALKI
NMEFLQRCIY PRCVLSMQDA VYCATFVQMM HSLGTPFFNT VNHIDVFICK TLQPMICCCT
EYEAGRLGRF LHETLKMAYH WKSDESVYER ECGNKPGFAV YFRFPNSQRV SYPQFVKVHW
KWSGRITKVL NQCMESKEYM EIRNALIVLT KITSIFPVMR KSGINIEKRV AKLKGDERED
LKVLATGVAA ALAARKSSWV SEEEFGMGHL DLKPVPAKPI AGNQYADPST AKDHSVRAKS
VEGRHERSEN AMKPDAHKKN ASTTNGSDIQ MPSSSAQGKG SGLVRGVDEP PKLLSDDGVK
VLKPTAEPET RAPQKRAVQN AAKVSKHDVV KEDGKPGRST SRGLNQQACA IPVDREVLSQ
AADGVLDTNP TSPLVGTNGN VHPAPRKVSA SSQRSTVLAA HSGGTANPTG EGESADLIDS
TVKQQKRSVP VEEQERTGKR RKGEIEGRDG DLTEHHTDKE KKLDPRSVDK FRSVDHERGA
SEEQNLIRTE KLKEKFDDKY DRDHREKADR SERRRGEDVV ERPTDRSLER RERSIEKMQD
RVPEKGREDR NKEERNKVKH EPIDRAHTIK NEPIDRAYTI KHEPIDRAHT SDERFRGQSL
PPPPPLPTSF VPQSVAANRR DEDSDRRGGS TRHTQRSSPR RDEKERWHLE ENAPLSQDDG
KHRREEDLRD RKREDRDVSS SKVDDRDRDK GNTVKEDSDP NSASKRRKIK REQSALEAGE
YAPSAPQPPS VGPGNSQFEI RERERKGAIS QHRPSHADDL PRMHAKDSTS KTSRREADQT
HDREWEEEKR PRTEAKRKHR K
//