ID A0A2Y9SZ29_PHYMC Unreviewed; 1625 AA.
AC A0A2Y9SZ29;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN Name=THOC2 {ECO:0000313|RefSeq:XP_023983931.1};
OS Physeter macrocephalus (Sperm whale) (Physeter catodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Physeteridae; Physeter.
OX NCBI_TaxID=9755 {ECO:0000313|Proteomes:UP000248484, ECO:0000313|RefSeq:XP_023983931.1};
RN [1] {ECO:0000313|RefSeq:XP_023983931.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_023983931.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBUNIT: Component of the THO complex, which is composed of THOC1,
CC THOC2, THOC3, THOC5, THOC6 and THOC7; together with at least
CC ALYREF/THOC4, DDX39B, SARNP/CIP29 and CHTOP, THO forms the
CC transcription/export (TREX) complex which seems to have a dynamic
CC structure involving ATP-dependent remodeling. Interacts with THOC1,
CC POLDIP3 and ZC3H11A. {ECO:0000256|ARBA:ARBA00025995}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the THOC2 family.
CC {ECO:0000256|ARBA:ARBA00007857}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_023983931.1; XM_024128163.2.
DR KEGG; pcad:102973141; -.
DR OrthoDB; 179356at2759; -.
DR Proteomes; UP000248484; Chromosome 21.
DR GO; GO:0000347; C:THO complex; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR040007; Tho2.
DR InterPro; IPR021418; THO_THOC2_C.
DR InterPro; IPR021726; THO_THOC2_N.
DR InterPro; IPR032302; THOC2_N.
DR PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR Pfam; PF11262; Tho2; 1.
DR Pfam; PF11732; Thoc2; 1.
DR Pfam; PF16134; THOC2_N; 2.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000248484}.
FT DOMAIN 11..419
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 420..566
FT /note="THO complex subunit 2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF16134"
FT DOMAIN 568..643
FT /note="THO complex subunit THOC2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF11732"
FT DOMAIN 873..1173
FT /note="THO complex subunit THOC2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11262"
SQ SEQUENCE 1625 AA; 186255 MW; 4200B05CB85C678E CRC64;
MAAASVVVPV EWIKNWEKSG RGEFLHLCRI LSENKSHDSS TYRDFQQALY ELSYHVIKGN
LKHEQASNVL NDISEFREDM PSILADVFCI LDIETNCLEE KSKRDYFTQL VLACLYLVSD
TVLKERLDPE TLESLGLIKQ SQQFNQKSVK IKTKLFYKQQ KFNLLREENE GYAKLIAELG
QDLSGNITSD LILENIKSLI GCFNLDPNRV LDVILEVFEC RPEHDDFFIS LLESYMSMCE
PQTLCHILGF KFKFYQEPNG ETPSSLYRVA AVLLQFNLID LDDLYVHLLP ADNCIMDEHK
REIVEAKQIV RKLTMVVLSS DKIDEREKEK EKEEEKVEKP PDNQKLGLLE ALLKIGDWQH
AQNIMDQMPP YYAASHKLIA LAICKLIHIT IEPLYRRVGV PKGAKGSPVN ALQNKRAPKQ
AESFEDLRRD VFNMFCYLGP HLSHDPILFA KVVRIGKSFM KEFQSDGSKQ EDKEKTEVIL
SCLLSITDQV LLPSLSLMDC NACMSEELWG MFKTFPYQHR YRLYGQWKNE TYNSHPLLVK
VKAQTIDRAK YIMKRLTKEN VKPSGRQIGK LSHSNPTILF DYILSQIQKY DNLITPVVDS
LKYLTSLNYD VLAYCIIEAL ANPEKERMKH DDTTISSWLQ SLASFCGAVF RKYPIDLAGL
LQYVANQLKA GKSFDLLILK EVVQKMAGIE ITEEMTMEQL EAMTGGEQLK AEGGYFGQIR
NTKKSSQRLK DALLDHDLAL PLCLLMAQQR NGVIFQEGGE KHLKLVGKLY DQCHDTLVQF
GGFLASNLST EDYIKRVPSI DVLCNEFHTP HDAAFFLSRP MYAHHISSKY DELKKSEKGS
KQQHKVHKYI TSCEMVMAPV HEAVVSLHVS KVWDDISPQF YATFWSLTMY DLAVPHTSYE
REVNKLKVQM KAIDDNQEMP PNKKKKEKER CTALQDKLLE EEKKQMEHVQ RVLQRLKLEK
DNWLLAKSTK NETITKFLQL CIFPRCIFSA IDAVYCARFV ELVHQQKTPN FSTLLCYDRV
FSDIIYTVAS CTENEASRYG RFLCCMLETV TRWHSDRATY EKECGNYPGF LTILRATGFD
GGNKADQLDY ENFRHVVHKW HYKLTKASVH CLETGEYTHI RNILIVLTKI LPWYPKVLNL
GQALERRVHK ICQEEKEKRP DLYALAMGYS GQLKSRKSYM IPENEFHHKD PPPRNAVASV
QNGPGGGPSS SSVGSASKSD ESSTEETDKS RERSQCGVKA VNKASSATPK GNSSNGNSGS
NSSKAVKEND KEKGKEKEKE KKEKTPATTP EARVLGKDGK EKPKEERPSK DEKARETKER
TPKSDKEKEK FKKEEKAKDE KFKTTVPNVE SKSTQEKERE KEPSRERDIA KEMKSKENVK
GGEKTPVSGS LKSPVPRSDI AEPEREQKRR KIDTYPSPSH SSTVKLVYFQ VTAILPKVPL
GSENYASSPV ISIHFLQDSL IELKESSAKL YINHTPPPLS KSKEREMDKK DLDKSRERSR
EREKKDEKDR KERKRDHSNN DREVPPDLTK RRKEENGTMG VSKHKSESPC ESPYPNEKDK
EKNKSKSSGK EKGGDSFKSE KMDKISSGGK KESRHDKEKI EKKEKRDSSG GKEEKKHHKS
SDKHR
//