ID A0A3B6H670_WHEAT Unreviewed; 1774 AA.
AC A0A3B6H670;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
OS Triticum aestivum (Wheat).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Triticum.
OX NCBI_TaxID=4565 {ECO:0000313|EnsemblPlants:TraesCS3D02G504300.1};
RN [1] {ECO:0000313|EnsemblPlants:TraesCS3D02G504300.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Chinese Spring
RC {ECO:0000313|EnsemblPlants:TraesCS3D02G504300.1};
RX PubMed=30115783; DOI=10.1126/science.aar7191;
RG International wheat genome sequencing consortium (IWGSC);
RT "Shifting the limits in wheat research and breeding using a fully annotated
RT reference genome.";
RL Science 361:EAAR7191-EAAR7191(2018).
RN [2] {ECO:0000313|EnsemblPlants:TraesCS3D02G504300.1}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (OCT-2018) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EnsemblPlants; TraesCS3D02G504300.1; TraesCS3D02G504300.1; TraesCS3D02G504300.
DR Gramene; TraesCAD_scaffold_037431_01G000300.1; TraesCAD_scaffold_037431_01G000300.1; TraesCAD_scaffold_037431_01G000300.
DR Gramene; TraesCLE_scaffold_073970_01G000100.1; TraesCLE_scaffold_073970_01G000100.1; TraesCLE_scaffold_073970_01G000100.
DR Gramene; TraesCS3D02G504300.1; TraesCS3D02G504300.1; TraesCS3D02G504300.
DR Gramene; TraesCS3D03G1112600.1; TraesCS3D03G1112600.1.CDS; TraesCS3D03G1112600.
DR Gramene; TraesPAR_scaffold_034588_01G000100.1; TraesPAR_scaffold_034588_01G000100.1; TraesPAR_scaffold_034588_01G000100.
DR Gramene; TraesROB_scaffold_036210_01G000100.1; TraesROB_scaffold_036210_01G000100.1; TraesROB_scaffold_036210_01G000100.
DR Gramene; TraesWEE_scaffold_033731_01G000100.1; TraesWEE_scaffold_033731_01G000100.1; TraesWEE_scaffold_033731_01G000100.
DR Proteomes; UP000019116; Chromosome 3D.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 2.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 2.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000019116}.
FT DOMAIN 37..430
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 440..574
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 576..651
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 894..1079
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT DOMAIN 1080..1154
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1180..1309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1331..1774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 917..972
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1194..1218
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1219..1242
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1283..1300
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1331..1371
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1388..1567
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1591..1688
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1711..1766
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1774 AA; 201989 MW; 3497DDCDE71C0D86 CRC64;
MSPPLQAPDY RYVTEECLRE WKGQSAAAFR LPDPVPMARF LYELCWAMVL GDLPPQKCRA
ALDSVVFVEE ARQEESASVL ADIIAHLGQD ITISGEYRSR LVKMTKSLVE SSLIVPRLLQ
ERCEEEFLWE VELSKSKGQD LKAKEVRVNT RLLYQQTKFN LVREESEGYA KLVTLLCQVG
SDLACQNASS ATISIVKSLI GHFDLDPNRV FDIVLECFEL YPDNSIFYQL IPLFPKSHAA
QILGFKFQYY QQLDVNSPVP SGLFRIAALL VKSGLIDLDN LYAHLLPNDD EAFEHFGSFV
SRKIDEATKI GKINLAATGK DLMDEEKQEI TIDLYTALEM ENDIIDERAP EIEKNQKLGL
LLGFLSVHDW DHAQLLFERL AQLNPVEHVE ICDALFRIVE KTISSAYSTY CQTHHKITRN
MDTHMMDASS VSSPSYLVDL PKEFFQMLVA CGPYLHRDTQ LFQKVCRVLK VYHASSKESA
RTAGVMSPES QVEEALGSCL LPSLQLIPAN PAVDMEIWGV LSLLPYEVRY RLYGEWEKDT
EQNPIVLAAR QTAKLDTRRL LKRLAKENLK QLGRMVAKLA HANPMTVLRT IVQQVEAYRD
MINPVVDAFK YLTQLEYDIL QYIVIERLAQ GGREKVKDDG LNLSDWLQCL ASFWGHLCKK
HLSMELKCLF QYIVNQLKKG LGTELVVLEE LIQQMANVQY TENMTDEQVD AMAGSETLRL
QSSLFGSTRN YKVLNKSTNK LRDSLLPKDE PKLAIPLLLL IAQHRSKIII NADATYIKMV
SEQFDRCHGI LLQYAEFLSS AVAPSTYVQL IPPLEDLVYK YHIEPDVAFL IYRPVMRLFK
SANGGEACWP LDDNEEGESV SYDEMILHGD SSQKSIMWSD LLNTIRTILP AKAWNGLSPE
LYATFWGLTL YDLNFPKDRY DAEIKKLHEN LKQLEDNSDN SSIAISRRKK DKERIQDLLD
KLNNESDKHQ QHVISVLQRL TREKDKWLSS SPDALKINME FLQRCIYPRC VLSMQDAVYC
ATFVQMMHSL GTPFFNTVNH IDVFICKTLQ PMICCCTEYE AGRLGRFLHE TLKMAYHWKV
HWKWSGRITK VLNQCMESKE YMEIRNALIV LTKITSIFPV MRKSGINIEK RVAKLKGDER
EDLKVLATGV AAALAARKSS WVSEEEFGMG HLDLKPVPAK PIAGNQYADP STAKDHSVRA
KSVEGRHERS ENAMKPDAHK KNASTTNGSD IQMPSSSAQG KGSGLVRGVD EPPKLLSDDG
VKVLKPTAEP ETRAPQKRAV QNAAKVSKHD VVKEDGKPGR STSRGLNQQA CAIPVDREVL
YQAADGVLNT NPTSPLVGTN GNVHLAPRKV SASSQRSTVL AAHSGGTANP TGEGESADLI
DSTVKQQKRS VPVEEQERTG KRRKGEIEGR DGDLTEHHTD KEKKLDPRSV DKFRSVDHER
GASEEQNLIR TEKLKEKFDD KYDRDHREKA DRSERRRGED VVERPTDRSL ERRERSIEKM
QDRVPEKGRE DRNKEERNKV KHEPIDRAHT IKNEPIDRAY TIKNEPIDRA HTVKHEPIDR
AHTSDERFRG QSLPPPPPLP TSFVPQSVAA NRRDEDSDRR GGSTRHTQRS SPRRDEKERW
HLEENAPLSQ DDGKHRREED LRDRKREDRD VSSSKVDDRD RDKGNTVKED SDPNSASKRR
KIKREQSALE AGEYAPSAPQ PPSVGPGNSQ FEIRERERKG AISQHRPSHA DDLPRMHAKD
STSKTSRREA DQTHDREWEE EKRPRTEAKR KHRK
//