ID A0A151T380_CAJCA Unreviewed; 1820 AA.
AC A0A151T380;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN ORFNames=KK1_015986 {ECO:0000313|EMBL:KYP61495.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP61495.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP61495.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003610; KYP61495.1; -; Genomic_DNA.
DR STRING; 3821.A0A151T380; -.
DR OMA; QERWTCI; -.
DR Proteomes; UP000075243; Chromosome 8.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 3.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243}.
FT DOMAIN 37..215
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 233..415
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 440..590
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 592..667
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 921..1220
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1265..1424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1447..1672
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1685..1731
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 944..1006
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1288..1304
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1325..1348
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1349..1365
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1397..1424
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1452..1480
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1481..1580
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1603..1672
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1702..1716
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1820 AA; 206554 MW; 94E4A419A9115DFE CRC64;
MSLPPIECVY VTEDYVREWR SGNPALKVSE PVPMLRFLYE LCWTMVRGEL PFQKCKVALD
SVIFSDKAST EKIASNFADI VTQMAQDHTM SGEFRSRLVK LARWLVESEM VPVRLLQERC
EEEFLGEAEL IKIKAQELKG KEVRVNTRLL YQQTKFNLLR EESEGYAKLV TLLCRDSEAP
TQKASAATIG IIKSLIGHFD LDPNRVFDIV LECFELQPDD GVFVELIPIF PKSHASQILG
FKFQYYQRME VNSPVPFGLY RLTALLVKQD FIDLDSIYTH LLPKDDEAFE HYNTFSSKRL
DEANKIGRIN LAATGKDLMD DEKQGDVNID LFAALDMETD AIEERTTELQ NSQTLGLLTG
FLSVDDWYHA RLLFERLSPL NAVEHIQICD SLFRLIEKSM SSAYDVIRQT HLQNPGSSTG
GSTDVMDVDN SSGHDSFIDL PKDLFQMLAC TGPYLYRDTV LLQKVCRVLR GYYLSALELV
SHGEGALDTQ LHFSGNPHLH LKEARLRVED ALGTCLLPSL QLIPANPAVG QGIWELMSLL
PYEVRYRLYG EWEKDDERVP MLLAARQTAK LDTRRILKRL AKENLKQLGR MVAKLAHANP
MTVLRTIVYQ IEAYRDMITP VVDAFKYLTQ LEYDILEYVV IERLALGGRD KLKDDGLNLS
DWLQSLASFW GHLCKKYPSM ELRGLFQYLV NQLKKGQGIE LVLLQELIQQ MANVQYTENL
TEEQLDSMAG SETLRYQATS FGVTRNNKAL IKSTSRLRDA LLPKDEPKLA IPLLLLIAQH
RSLVVINADA PYIKMLSEQF DRCHGTLLQY VEFLCSAVTP ASNYAILIPS LNDLVHLYHL
DPEVAFLIYR PVMRLFKSQR YPDVCWPLDD KNTASDASMN LESDPSDHSS SMVLNLGSAQ
SPISWSYLLD TVKTMLPSKA WNSLSPDLYA TFWGLTLYDL YVPKNRYESE IAKLHANLKS
LEELSDNSSS AITKRKKEKE RIQESLDRLI SELHKHEENV ASVHRRLSHE KDNWLSSCPD
TLKINMEFLQ RCIFPRCTFS MPDAVYCAMF VHTLHSLGTP FFNTVNHIDV LICKTLQPMI
CCCTEYEAGR LGRFLYETLK IAYHWKSDES IYERECGNMP GFAVYYRYPN SQRVTYGQFI
KASSIVHWKW SQRITRLLIQ CLESSEYMEI RNALIMLTKI SSVFPVTRKS GINLEKRVAK
IKSDEREDLK VLATGVAAAL AARKPSWVTD EEFGMGYLEL KPAPSVTKTS AGNSATVQSG
INLNVSQTEP AGGKHADSGN PAKDQVIRTK NADGKSDRTE SITATKSDSG HTKLKGGSMV
NGLDAPSSLP PSVQPGTSKS MENTKQVEES INRASDEHGT RIAESRTSAK RSVPAGSLSK
SSKLDPIKED GRSGKPVARS SGSSSSDKDL QTHASEGRHT VTTNVSSSVS ANDFIGWSWA
SMVKDDGNDI ADFTRGSSSR VVHSPRHENT GVTSKSNDKI QKRAGSAEEP DRLGKRRKGD
VELRDFESDK PLERPKDKGN ERYEREHRER LDRLDKSRGD DFVAEKPRDR SIERYGRERS
VERMQERGNE RKKSHADDRF HGQSLPPPPP LPPNMVPQSV GAGRRDEDAD RRYGATRHSQ
RLSPRHEEKE RRRSEETVVS QDEAKRRKED DFRDRKREEI KVEERERERE KSNILKEDLD
LNAASKRRKL KREHLPTSEP GEYSPVAPPP PPPGIGMSVG YDGRDRGDRK GPIIQHPSYI
DEPSLRIHGK EVASKLNRRD SDPYPKLQNA YIDIHAYYLI HVYVSFNISF GGAPLDCLLL
CLICIYFVYH LFHHSWKESA
//