#=GF ID THP2
#=GF AC PF09432.11
#=GF DE Tho complex subunit THP2
#=GF AU Mistry J;0000-0003-2479-5322
#=GF AU Wood V;0000-0001-6330-7526
#=GF SE manual
#=GF GA 28.00 28.00;
#=GF TC 30.40 88.60;
#=GF NC 27.90 22.40;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 47079205 -E 1000 --cpu 4 HMM pfamseq
#=GF TP Family
#=GF RN [1]
#=GF RM 11060033
#=GF RT A protein complex containing Tho2, Hpr1, Mft1 and a novel
#=GF RT protein, Thp2, connects transcription elongation with mitotic
#=GF RT recombination in Saccharomyces cerevisiae.
#=GF RA Chavez S, Beilharz T, Rondon AG, Erdjument-Bromage H, Tempst P,
#=GF RA Svejstrup JQ, Lithgow T, Aguilera A;
#=GF RL EMBO J. 2000;19:5824-5834.
#=GF RN [2]
#=GF RM 12093753
#=GF RT The yeast THO complex and mRNA export factors link RNA
#=GF RT metabolism with transcription and genome instability.
#=GF RA Jimeno S, Rondon AG, Luna R, Aguilera A;
#=GF RL EMBO J. 2002;21:3526-3535.
#=GF DR INTERPRO; IPR018557;
#=GF DR SO; 0100021; polypeptide_conserved_region;
#=GF CC The THO complex plays a role in coupling transcription
#=GF CC elongation to mRNA export. It is composed of subunits THP2,
#=GF CC HPR1, THO2 and MFT1 [1].
#=GF SQ 25
#=GS G0VDT0_NAUCC/118-246 AC G0VDT0.1
#=GS A0A0C7N7V1_9SACH/121-248 AC A0A0C7N7V1.1
#=GS I6NCT5_ERECY/120-250 AC I6NCT5.1
#=GS G0W885_NAUDC/117-245 AC G0W885.1
#=GS I2GZR0_TETBL/114-242 AC I2GZR0.1
#=GS Q6CXV8_KLULA/119-249 AC Q6CXV8.2
#=GS Q75D17_ASHGO/119-247 AC Q75D17.2
#=GS A7TEX7_VANPO/115-244 AC A7TEX7.1
#=GS C5E1Y6_LACTC/120-243 AC C5E1Y6.1
#=GS J4U4S3_SACK1/115-243 AC J4U4S3.1
#=GS G8BNX6_TETPH/133-261 AC G8BNX6.1
#=GS W0T571_KLUMD/122-252 AC W0T571.1
#=GS THP2_YEAST/115-243 AC O13539.1
#=GS A0A1G4JI36_9SACH/121-251 AC A0A1G4JI36.1
#=GS S6E1D2_ZYGB2/116-244 AC S6E1D2.1
#=GS C5DTV5_ZYGRC/116-244 AC C5DTV5.1
#=GS A0A1G4MJ07_LACFM/120-248 AC A0A1G4MJ07.1
#=GS A0A0X8HTE9_9SACH/120-250 AC A0A0X8HTE9.1
#=GS A0A1G4IRN9_9SACH/121-245 AC A0A1G4IRN9.1
#=GS A0A1G4JP12_9SACH/121-247 AC A0A1G4JP12.1
#=GS H2AVH8_KAZAF/115-243 AC H2AVH8.1
#=GS A0A1X7R5N4_9SACH/118-247 AC A0A1X7R5N4.1
#=GS Q6FQZ2_CANGA/122-250 AC Q6FQZ2.1
#=GS J7RYT2_KAZNA/124-252 AC J7RYT2.1
#=GS G8ZYI7_TORDC/112-240 AC G8ZYI7.1
G0VDT0_NAUCC/118-246 .LEYVNLLERLSVDLAKQVEISDPSVSK.FVLNDWNPPKGVQAILDKFAD....PSADAALLKMELVHYLDDIKMSRAKYSLENKYSLQDKVVNLNTELNRWRKELDDIEMMMFGDGATSIKKMLANVESLRSKI..
A0A0C7N7V1_9SACH/121-248 .MEHINLVGRLSSKLCDPALASRS----.-AAGDRDDAFDLKQILEAYNSlgdeDVEDTANLKLQLLRWTDSMKMENARFTLENQHILRDKLKSITAEVTQWKKNYESVENTMFGEDPHSIMQTIQRIQKMKPQL..
I6NCT5_ERECY/120-250 .LKHLKLLNALAVDMCYPLVNQEDTEN-.IAVNKEHYPRELAPVLEEYDAy.gaDIEDIRNLRSKLMQYFENIKSSRAKYLLENKYLLADSLKELTKLVAAWSQKWEHLENILFGDSPASLRKLLQTMETVKASL..
G0W885_NAUDC/117-245 .LEYINLLQRLSVDLVRQIEISDPNVSK.INVDGWNPPKKIQVLLDKFGE....PDADTRELKIQVQRYLDDIKMSRAKYSLENKYSLQEKLSEVTKAVNQWRAEWDNIEMMLFGDGSNSMKNMLANVESIKSKL..
I2GZR0_TETBL/114-242 q-KYINMVNRLSVDLAKQIETADIRKDK.YIVDNWLPPKEIEEILQEFTD....DDSEAVRLRARLEQYLDQLKMERVKYTLENRYTIEDKLILANKEVNRWRIEWDKLETLMFGRGPNSLKNMLQKNEQLAEKL..
Q6CXV8_KLULA/119-249 .MRHLSLVANLSDDLVHKLESSDESNK-.VLVNKNPLPAVLKETVKQYEEi.gdEQQRIENIRAKLFQYLDEIKAGRAKYALENKYILNSTLQQITKEVSEWSQRWTHIENSLFGDSPTSLKKLVQKAENIKEL-l.
Q75D17_ASHGO/119-247 .LQHLQLVNQLSVELAYPLGRRGSEH--.VTVNREGPPPELVAALAAYDAg..pDAPAAAELRAELLRYLDDIKATRARYLLENKYLLADSLRQLTRDVSSWSQKWESLEGTLFGDAPSSLRSLLRSVDTTKATI..
A7TEX7_VANPO/115-244 .LVYINLLERLASDLFIQVEDAQFKDNEvIMVDEVAAPVEVQDVLKKYIT....ESSETSVLRDELDKYLNEIKMERAELTIKNKFSLQPTLNELSKEVNYWRKEWDNMEMLMFGDGPNSMKRMMKNIESLRAK-a.
C5E1Y6_LACTC/120-243 .LEHINLIGRLSSVLTEMLP--------.SELDDSTQNLELAQILEAYNTd.skGSEDTDCLKEKLLDWIDSIKMEKARYSLENQHILRDSLKALTMEVTRWRENYESIEGMMFGENANSISQMLHKVQRLRPQL..
J4U4S3_SACK1/115-243 .LKYINLLKRLSVDLAKQVEVSDPSVTV.YELDNWVPSEKLQGILEEYCA....PETDIRGVDAQIKNYLGQIKMARAKFGLENKYSLKEGLSTLTKELNHWRKEWDDIEMLMFGDDAHSMKKMIQKIDSLKSEI..
G8BNX6_TETPH/133-261 .MIYTNLLGRLSVGLIQQVQVSNTENSE.IMINDYPPPEEIVSILEKFNT....ETTETDDLRGQLDDYLQKIKMDRAKYTLENEYLLKDSLLTLSKEVNYWRKEYDNLEMLMFSDGPNTIMKMMKNVDSLRLKV..
W0T571_KLUMD/122-252 .MRHLSLISNLSDDLVVKLESHDDSNL-.VVTNKDPLPPVLKNTIRKYEEl.gpEQQHIEDIRAALFQYLDDIKAGRAKYALENKYILNTSLQEITKEVSEWSQRWTNIENTLFGDSPNSLKKLIQKADEIKEL-l.
THP2_YEAST/115-243 .LRYINLLKRLSVDLAKQVEVSDPSVTV.YEMDKWVPSEKLQGILEQYCA....PDTDIRGVDAQIKNYLDQIKMARAKFGLENKYSLKERLSTLTKELNHWRKEWDDIEMLMFGDDAHSMKKMIQKIDSLKSEI..
A0A1G4JI36_9SACH/121-251 f-EHQSLLGRLSASLDLSESANERSV-R.PAKDDDATKNVFHQLLKQYSAt.nsSQDELTKLRDQLMELINDQKLEKAQYSLENQHTLKEVFSQLAHQVTEWKEQFQSLEDIMFGNGPRSMLSLFHEVDKMKP--ll
S6E1D2_ZYGB2/116-244 .LDYINLLQRLSVDLAKQIEISDREKSA.FEVNSWEPTDRMQTIVEQLAD....PNVDSALLNSQLVEYMDQIKMERAKYTIENKHSLQETLVELNKEVNYWRRNWNAIENLMFGDNSHSIKRMLHSIEILRSKL..
C5DTV5_ZYGRC/116-244 .LDYVNLLQRLSVDLAKQIEISDPEVSE.FVVDNWSPPDGMQSILEQLAN....PDKDSTHLQSQLDQYLDQIKMERAKYTIENKYSLQETLNEVNKEVNYWRRNWNAIENLMFGDSAHSIKKMLQSIDLLRAKL..
A0A1G4MJ07_LACFM/120-248 .LEHGNLLGRLSSNLGNQMK-QDIDTS-.ISVEV-SKRDTLREIMAKYDSi.dgNDDQPEILRQELLDYIDSMKMEKARYSLVNQYMLSDSYKQLTKEVTQWKHQYESLEGIMFGDNPNSIRSMVYKIESLKEKL..
A0A0X8HTE9_9SACH/120-250 .MKHLKLLNDFSTDMSFPIANQEGTD-H.IVVNREHFPAELLPVLERYDQh.geAVDDTMKLRSEMMQYFENIRSTRAKYHLENKYLIATSLKELTKSVAAWSSKWENLESLLFGDNPNSIRKLLQSVQSIKASI..
A0A1G4IRN9_9SACH/121-245 .LEHTNLIGRLSSNLCDAASAF------.SQVEKED-YFDLESWLQSYKSe.daSVETREILKSRLVSWITSIKMEKARYLVENQHILRDMLKSLTSDVAQWRTNYESIESMLFGDAHNSIAKTLLDITNLRQEL..
A0A1G4JP12_9SACH/121-247 .MEHVNLIGRLSSKFSDSAVSSSKDSS-.----NTGAEFDLEKVLDAYEAe.nsSPQDTEQLKRRLLDWIDSLKMNKARYSLENQYILRDMLKKLTSDVMQWRQNYESVENMMFGGTQNSIVQTLQKIEKLRPS-l.
H2AVH8_KAZAF/115-243 h-EYVNLLERLSVDLGKQVDISDSNVTE.LVVDDWTPPSELISLLEQYNE....SSSDIELQDSKVDRYLDQLKLLRAKYAMENNYLLKSTLNDLNDEVNYWRREYENIESMMFGNGPNSMKKMLHNVEVLKVK-a.
A0A1X7R5N4_9SACH/118-247 .LEYINLLNRLSVELVKQVDISDPDISE.FVFDNWKPPAELQKIIDNYYG...dENKNFTSLNGDLQDYFNSIKLSRAKYTLENRYVLQRHLTELNKEANYWRGELDNIELLLFGEGPHSIRKVLQNVEVLKNKL..
Q6FQZ2_CANGA/122-250 .LTYINLLTKLSVNLAKQIEFADHSVSE.FLLEDWKPPHELQSILEKFVD....MEEDPEVLNDQLNKYMDNIKMERAKYSLENKYSLQEQLKTLESELSRWRDAWVNIESLMFGDSPNSMKGMLQNIESMKKEL..
J7RYT2_KAZNA/124-252 .LEYLNLLGTYAVDLARQIEISDPSVSH.FDIDDWKPPRKLLEILDKFQS....EDCEPIKIRDELQSYLDNIKLSRAKFTLENKHILQDKLGVLSKEVSYWRKEWDNIENMMFGEGSDSMRSMLQTVDSLRSKI..
G8ZYI7_TORDC/112-240 .LRYVNLLERLSVDLVKEIEIADPTVTE.FVVNKWNPPKGIFEILDELAD....PATDVVAVRSRLNGYLDRIKMERAKYTIENKHSLQGTLRDLNKEVSNWRKEWDSIENVMFGDGSHSMKKMLQNIDSLKSKL..
#=GC seq_cons .LcalNLLsRLSVDLscpl-huDssss..hhlschssPpcLpslL-pYss....sss-sppL+ucLhpYlDsIKMpRAKYoLENKYhLp-sLppLoKEVspWRccW-sIEshMFGDussSl++MLpsl-sLKscl..
//