ID F1M0V4_RAT Unreviewed; 1604 AA.
AC F1M0V4;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 3.
DT 27-MAR-2024, entry version 74.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN Name=Thoc2 {ECO:0000313|Ensembl:ENSRNOP00000009755.7,
GN ECO:0000313|RGD:1561623};
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Rattus.
OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000009755.7, ECO:0000313|Proteomes:UP000002494};
RN [1] {ECO:0000313|Ensembl:ENSRNOP00000009755.7, ECO:0000313|Proteomes:UP000002494}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000009755.7,
RC ECO:0000313|Proteomes:UP000002494};
RX PubMed=15057822; DOI=10.1038/nature02426;
RG Rat Genome Sequencing Project Consortium;
RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J.,
RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G.,
RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G.,
RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G.,
RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S.,
RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T.,
RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., Smith D.,
RA Lee H.-M., Gustafson E., Cahill P., Kana A., Doucette-Stamm L.,
RA Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., Green E.D.,
RA Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., Zhu B., Marra M.,
RA Schein J., Bosdet I., Fjell C., Jones S., Krzywinski M., Mathewson C.,
RA Siddiqui A., Wye N., McPherson J., Zhao S., Fraser C.M., Shetty J.,
RA Shatsman S., Geer K., Chen Y., Abramzon S., Nierman W.C., Havlak P.H.,
RA Chen R., Durbin K.J., Egan A., Ren Y., Song X.-Z., Li B., Liu Y., Qin X.,
RA Cawley S., Cooney A.J., D'Souza L.M., Martin K., Wu J.Q.,
RA Gonzalez-Garay M.L., Jackson A.R., Kalafus K.J., McLeod M.P.,
RA Milosavljevic A., Virk D., Volkov A., Wheeler D.A., Zhang Z., Bailey J.A.,
RA Eichler E.E., Tuzun E., Birney E., Mongin E., Ureta-Vidal A., Woodwark C.,
RA Zdobnov E., Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J.,
RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., Schmidt J.,
RA Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., Abril J.F.,
RA Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., Poliakov A.,
RA Huebner N., Ganten D., Goesele C., Hummel O., Kreitler T., Lee Y.-A.,
RA Monti J., Schulz H., Zimdahl H., Himmelbauer H., Lehrach H., Jacob H.J.,
RA Bromberg S., Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E.,
RA Lazar J., Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M.,
RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., Webber C.,
RA Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., Elnitski L.,
RA Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., Miller W.,
RA Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., Zhang Y.,
RA Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., Clarke L., Curwen V.,
RA Durbin R.M., Eyras E., Searle S.M., Cooper G.M., Batzoglou S., Brudno M.,
RA Sidow A., Stone E.A., Payseur B.A., Bourque G., Lopez-Otin C., Puente X.S.,
RA Chakrabarti K., Chatterji S., Dewey C., Pachter L., Bray N., Yap V.B.,
RA Caspi A., Tesler G., Pevzner P.A., Haussler D., Roskin K.M., Baertsch R.,
RA Clawson H., Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J.,
RA Rosenbloom K.R., Trumbower H., Weirauch M., Cooper D.N., Stenson P.D.,
RA Ma B., Brent M., Arumugam M., Shteynberg D., Copley R.R., Taylor M.S.,
RA Riethman H., Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S.,
RA Mockrin S., Collins F.S.;
RT "Genome sequence of the Brown Norway rat yields insights into mammalian
RT evolution.";
RL Nature 428:493-521(2004).
RN [2] {ECO:0007829|PubMed:22673903}
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=22673903;
RA Lundby A., Secher A., Lage K., Nordsborg N.B., Dmytriyev A., Lundby C.,
RA Olsen J.V.;
RT "Quantitative maps of protein phosphorylation sites across 14 different rat
RT organs and tissues.";
RL Nat. Commun. 3:876-876(2012).
RN [3] {ECO:0000313|Ensembl:ENSRNOP00000009755.7}
RP IDENTIFICATION.
RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000009755.7};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBUNIT: Component of the THO complex, which is composed of THOC1,
CC THOC2, THOC3, THOC5, THOC6 and THOC7; together with at least
CC ALYREF/THOC4, DDX39B, SARNP/CIP29 and CHTOP, THO forms the
CC transcription/export (TREX) complex which seems to have a dynamic
CC structure involving ATP-dependent remodeling. Interacts with THOC1,
CC POLDIP3 and ZC3H11A. {ECO:0000256|ARBA:ARBA00025995}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR PaxDb; 10116-ENSRNOP00000009755; -.
DR Ensembl; ENSRNOT00000009755.8; ENSRNOP00000009755.7; ENSRNOG00000007315.8.
DR AGR; RGD:1561623; -.
DR RGD; 1561623; Thoc2.
DR VEuPathDB; HostDB:ENSRNOG00000007315; -.
DR eggNOG; KOG1874; Eukaryota.
DR GeneTree; ENSGT00710000106792; -.
DR HOGENOM; CLU_000511_5_0_1; -.
DR TreeFam; TF313127; -.
DR Proteomes; UP000002494; Chromosome X.
DR Bgee; ENSRNOG00000007315; Expressed in spleen and 18 other cell types or tissues.
DR ExpressionAtlas; F1M0V4; baseline and differential.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; ISO:RGD.
DR GO; GO:0000347; C:THO complex; ISO:RGD.
DR GO; GO:0000445; C:THO complex part of transcription export complex; ISO:RGD.
DR GO; GO:0000346; C:transcription export complex; ISO:RGD.
DR GO; GO:0003729; F:mRNA binding; ISO:RGD.
DR GO; GO:0001824; P:blastocyst development; ISO:RGD.
DR GO; GO:0000902; P:cell morphogenesis; ISO:RGD.
DR GO; GO:0048699; P:generation of neurons; ISO:RGD.
DR GO; GO:0006406; P:mRNA export from nucleus; ISO:RGD.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR GO; GO:0010977; P:negative regulation of neuron projection development; IMP:RGD.
DR GO; GO:0048666; P:neuron development; ISO:RGD.
DR GO; GO:0016973; P:poly(A)+ mRNA export from nucleus; ISO:RGD.
DR GO; GO:0010468; P:regulation of gene expression; ISO:RGD.
DR GO; GO:0010793; P:regulation of mRNA export from nucleus; ISO:RGD.
DR GO; GO:0017145; P:stem cell division; ISO:RGD.
DR GO; GO:0046784; P:viral mRNA export from host cell nucleus; ISO:RGD.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF1; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 2.
PE 1: Evidence at protein level;
KW Coiled coil {ECO:0000256|SAM:Coils}; Membrane {ECO:0000256|SAM:Phobius};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Proteomics identification {ECO:0007829|PeptideAtlas:F1M0V4};
KW Reference proteome {ECO:0000313|Proteomes:UP000002494};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1580..1598
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 11..416
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 420..566
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 568..643
FT /note="THO complex subunitTHOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 873..1173
FT /note="THO complex subunitTHOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
FT REGION 1183..1573
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 896..959
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1201..1220
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1221..1236
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1237..1264
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1265..1381
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1398..1416
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1438..1500
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1509..1573
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1604 AA; 184085 MW; E9AE5E925FBF3EE0 CRC64;
MAAATVVVPA EWIKNWEKSG RGDFLHLCRI LSENKSHDSS TYRDFQQALY ELSYHVIKGN
LKHEQASSVL NDISEFREDM PSILADVFCI LDIETNCLEE KSKRDYFTQL VLACLYLVSD
TVLKERLDPE TLESLGLIKQ SQQFNQKSVK IKTKLFYKQQ KFNLLREENE GYAKLIAELG
QDLSGTITSD LILENIKSLI GCFNLDPNRV LDVILEVFEC RPEHDDFFIS LLESYMSMCE
PQTLCHILGF KFKFYQEPSG ETPSSLYRVA AVLLQFNLID LDDLYVHLLP ADNCIMDEYK
REIVEAKQIV RKLTMVVLSS EKLDERDKEK DKDDEKVEKP PDNQKLGLLE ALLKIGDWQH
AQNIMDQMPP YYAASHKLIA LAICKLIHIT VEPLYRRVGV PKGAKGSPVS ALQNKRAPKQ
VESFEDLRRD VFNMFCYLGP HLSHDPILFA KVVRIGKSFM KEFQSDGSKQ EDKEKTEVIL
SCLLSITDQV LLPSLSLMDC NACMSEELWG MFKTFPYQHR YRLYGQWKNE TYNGHPLLVK
VKAQTIDRAK YIMKRLTKEN VKPSGRQIGK LSHSNPTILF DYILSQIQKY DNLITPVVDS
LKYLTSLNYD VLAYCIIEAL ANPEKERMKH DDTTISSWLQ SLASFCGAVF RKYPIDLAGL
LQYVANQLKA GKSFDLLILK EVVQKMAGIE ITEEMTMEQL EAMTGGEQLK AEGGYFGQIR
NTKKSSQRLK DALLDHDLAL PLCLLMAQQR NGVIFQEGGE KHLKLVGKLY DQCHDTLVQF
GGFLASNLST EDYIKRVPSI DVLCNEFHTP HDAAFFLSRP MYAHHISSKY DELKKSEKGS
KQQHKVHKYI TSCEMVMAPV HEAVVSLHVA KVWDDISPQF YATFWSLTMY DLAVPHTSYE
REVNKLKVQM KAIDDNQEMP PNKKKKEKER CTALQDKLLE EEKKQMEHVQ RVLQRLKLEK
DNWLLAKSTK NETITKFLQL CIFPRCIFSA IDAVYCARFV ELVHQQKTPN FSTLLCYDRV
FSDIIYTVAS CTENEASRYG RFLCCMLETV TRWHSDRATY EKECGNYPGF LTILRATGFD
GGNKADQLDY ENFRHVVHKW HYKLTKASVH CLETGEYTHI RNILIVLTKI LPWYPKVLNL
GQALERRVNK ICQEEKEKRP DLYALAMGYS GQLKSRKSHM IPENEFHHKD PPPRNAVASV
QNGPGGGTSS SSIGSASKSD ESGAEETDKS RERSQCGTKA VNKASSTTPK GNSSNGNSGS
NSNKAVKEND KEKVKEKEKE KKEKTPATTP EARVLGKESK EKPKEERPNK DDKARETKER
TPKSDKEKEK FKKEEKAKDE KFKTTVPSVE SKSTQERERE KEPSRERDVA KEMKSKENVK
GGEKPPVSGS LKSPVPRSDI SEPDREQKRR KIDTHPSPSH SSTVKVSILY INHNPPPLSK
SKEREMDKKD LDKSRERSRE REKKDEKDRK ERKRDHSNND REVPPDITKR RKEENGTMGV
SKHKSESPCE SQYPNEKDKE KNKSKSSGKE KGSSDSFKSE KMDKISSGGK KESRHDKEKI
EKKEKRDSSG GKEEKKQYPF YLYISLTVVE GFLIPFMFRI KPFM
//