ID A0A2P5D240_PARAD Unreviewed; 1903 AA.
AC A0A2P5D240;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=PanWU01x14_103520 {ECO:0000313|EMBL:PON67371.1};
OS Parasponia andersonii (Sponia andersonii).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Rosales; Cannabaceae; Parasponia.
OX NCBI_TaxID=3476 {ECO:0000313|EMBL:PON67371.1, ECO:0000313|Proteomes:UP000237105};
RN [1] {ECO:0000313|Proteomes:UP000237105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. WU1-14 {ECO:0000313|Proteomes:UP000237105};
RA Van Velzen R., Holmer R., Bu F., Rutten L., Van Zeijl A., Liu W.,
RA Santuari L., Cao Q., Sharma T., Shen D., Roswanjaya Y., Wardhani T.,
RA Kalhor M.S., Jansen J., Van den Hoogen J., Gungor B., Hartog M.,
RA Hontelez J., Verver J., Yang W.-C., Schijlen E., Repin R., Schilthuizen M.,
RA Schranz E., Heidstra R., Miyata K., Fedorova E., Kohlen W., Bisseling T.,
RA Smit S., Geurts R.;
RT "Parallel loss of symbiosis genes in relatives of nitrogen-fixing non-
RT legume Parasponia.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PON67371.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JXTB01000072; PON67371.1; -; Genomic_DNA.
DR STRING; 3476.A0A2P5D240; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000237105; Unassembled WGS sequence.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 2.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000237105}.
FT DOMAIN 37..414
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 440..592
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 594..669
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 923..1218
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1276..1466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1479..1825
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 981..1015
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1290..1304
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1322..1348
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1407..1460
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1504..1560
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1567..1667
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1688..1794
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1903 AA; 215842 MW; 7CEE4180C9A49F01 CRC64;
MSLPQIECLY VTEDCLREWK SGNPSFKLHH PVPMLRFLSE LCSTMARGEL PFQKCRVALD
SVVFTQRVSD EEIGSSFADI LTQMAQDLTI PGEYRARLIK LAKWLVESGL VPLRLLQERC
EEEFLWEAEM IKIKAQELKN KEVRVNTRLL YQQTKFNLLR EESEGYAKLV TLLCGNTESS
SCKVSGATIG IIKSLIGHFD LDPNRVFDIV LECFELQPDN IDLLELIPLF PKSHAAQILG
FKFQYYQRMD VDIPVPFGLY KLTALLVKED FIDLDSIYAH LLPQDDKAFE HYNALSSKRL
DEARKIGKIN LAATGKDLMD DEKLGDVTID LFAAVDMESE AVVERSEELE NNQILGLLSG
FLSVDDWYHA NLLFDRLSHL NPVEHIQICN SLFRLIEKSL SSIYKTVHQA HLQNIGSSSG
VSIDSVDTET SLARRSFIDL PKEFFQMLVS AGPYLYRDTL LLQKVCRVLR GYYLSALELV
GSDDAVLNSE SVNSVNRDSC LHLKGARLRI EEALGTCLLP SLQLIPANPA VGQEIWEVMN
LLPYEVRYRL YGEWEKEDER MPMLLAARQT AKLDTRRILK RLAKENLKQL GRMVAKLAHA
NPMTVLRTIV HQIEAYRDMI TPVVDAFKYL TQLEYDILEY VVIERLAQGG RDKLKDDGLN
LSDWLQSLAS FWGHLCKKYP SMELRGLFQY LVNQLKKGLG IELVLLQELI QQMANVQYTE
NLTEEQLDAM AGSETLRFQA TSFGVTRNNK ALIKSTNRLR DSLLPKDEPK LAIPLLLLIA
QHRSLVVINA DAPYIKMVSE QFDRCHGTLL QYVEFLCSSV TPASTYAQLI PSLDDLVHKY
HLDPEVAFLI YRPVMRLFKI QGTSDIFWPL DNNDTSSVAI ANSDSEAAEH SDDVVLDLGS
SWMPVMWSDL LITVKTMLPP KAWNSLSPDL YTTFWGLTLY DLYVPRNRYE SEISKQHAAL
KALEEFSDNS SSAITKRKKD KERIQESLDR LTGELRKHEE NVASVRRRLF REKDNWLSSC
PDTLKINMEF LQRCIFPRCT FSMPDAVYCA VFVHTLHSLG TPFFNTVNHI DVLICKTLQP
MICCCTEYEA GRLGRFLYET LKLAYYWKSD ESVYERECGN MPGFAVYYRY PNSQRVNYGQ
FIKVHWKWSQ RITRLLIQCL ESTEYMEIRN SLIILTKISG VFPVTRKSGI NLEKRVSKIK
SDEREDLKVL ATGVAAALAA RKPSWVTDEE FGMGYLELKP APSSLSAKSS VGNLVAIQSG
SAINVSQNEY AGVKTNVSDS SNSVKDQMLK ARPADGRTER TEGVSNMKSD PGNVKLKGGS
LINGTDAQSA LPSAGLSSGT SRSLENQKQV DDSINRPDEN LAKVAPKNSS EPELRAQTKR
SMPAGSLSKP PKQDLTKEDG RSGKSVGRVP GSSTTDREIA SQTSERMGGA ANVSSAVTAN
GNTVSASAKA SAPSTRTSDI HGSDSKLENA AAKVSVLKDD AAEALDAPRH TSSRPLHSPR
HESSTASKSN DKLQKRASPV EEADRLTKRR KGETEVRDFD GEVRLSDRER SVDARFGGLD
KSGTDEQSVY RATDKLSDRS KDKASERHEK DYKERSERLD KSRGDDLIEK PRDRSMERYG
RERSVERAQE RGSDRNFDRL SEKAKDDRSK VRYNDTSADK SHIDDRFHGQ NLPPPPPLPP
HVIPQSVNSG RRDEDVDRRF GTTRHSQRLS PRHEEKERRR SEESLVSQDD TKRRREDDFR
DRKREDREGL SMKVEDRERE REREREKANI LKEDIDATAA SKRRKLKRDH LPSGEAGEYS
PVGPPPPLGI NMSQSFDGRD RGDRKGAVIQ RAGYLEEPSL RVHGKEIASK MTRRDTDPED
PYLNFALAEE GSDGFLITKQ KIYDREWDDE KRQRAEQKRR HRK
//