#=GF ID DUF4152
#=GF AC PF13680.10
#=GF DE Protein of unknown function (DUF4152)
#=GF AU Bateman A;0000-0002-6982-4660
#=GF SE [1]
#=GF GA 27.00 27.00;
#=GF TC 28.20 125.80;
#=GF NC 26.00 23.30;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 75585367 --cpu 4 -E 1000 HMM pfamseq
#=GF TP Domain
#=GF CL CL0219
#=GF RN [1]
#=GF RM 21203960
#=GF RT Crystal structure of a novel non-Pfam protein PF2046 solved
#=GF RT using low resolution B-factor sharpening and multi-crystal
#=GF RT averaging methods.
#=GF RA Su J, Li Y, Shaw N, Zhou W, Zhang M, Xu H, Wang BC, Liu ZJ;
#=GF RL Protein Cell. 2010;1:453-458.
#=GF DR INTERPRO; IPR025206;
#=GF DR SO; 0000417; polypeptide_domain;
#=GF CC This family of proteins is functionally uncharacterised and is
#=GF CC found in archaea. Proteins in this family are approximately
#=GF CC 230 amino acids in length. The structure of
#=GF CC PF2046 from Pyrococcus furiosus has been solved. It shows an
#=GF CC RNase H-like fold that conserves critical catalytic residues [1].
#=GF CC This suggests that these proteins may cleave nucleic acid.
#=GF SQ 5
#=GS A0A2U0RZE7_9ARCH/21-246 AC A0A2U0RZE7.1
#=GS Q5JIU0_THEKO/1-224 AC Q5JIU0.1
#=GS C6A5E2_THESM/1-224 AC C6A5E2.1
#=GS Q8TZE9_PYRFU/1-225 AC Q8TZE9.1
#=GS F0LJ97_THEBM/1-224 AC F0LJ97.1
A0A2U0RZE7_9ARCH/21-246 LRVVAADSGAAILDDQYEPVQVVAASAILTEPPYKTADSVL---AEPiFADANSGYQLIVHELELCQQLLKTVKADVVHLDLSLGGISLEEFSAVTISNMRKPGKTRAQILKILPHLRKTATSILLTYGVTVFAFGKQSIPVRIAELTSGAHAIRFAVEKANKEkTKLRLGLPTKCQTKILEDQIELQSLIPTENELFGYAKCdrELLERTKISEMLNPCARGFKMLEI.
Q5JIU0_THEKO/1-224 MRIVAADTGGALLDDAYNPLGLIATVAVLVEKPYRTASLSRVKYADP.FNYDMSGRQAIRDEAYLAVELAREVKPDVVHLDSTIGGIEVRKLDEPTIEALGITDRGKEVWKDLSRDLQPLAKKLWEETGIEIIAIGKWSVPVRIAEIYSGIYTAKWAIEYARKN.GKVLVGLPRYMRVDIKPGEIYGESLDPREGGLFGKIE-..TNTDGINWELYPNPLVRRYMVLEV.
C6A5E2_THESM/1-224 MRIVSADTGGALLDENYNPIGLIATAAVLVERPYKTAKLSIAKYSDP.FKYDLSGRMALKDETFLALKLAKKVKPDVIHLDSTLGGIEIRKLDDPTIDALRISDRGKAIWHELSKDLQPLAKRFWEETGIEILAIGKESVAVRIAEIYAGIYSVKWGIEYAMKE.GFVRIGLPRYMTIEINKEKILGKSLDSREGDLYGEIS-..MEGKEFEWEIYPNPIARTFMVFE-a
Q8TZE9_PYRFU/1-225 MRIVAADTGGAVLDETFEPIGLIATVAVLVEKPYRSAKEVMVKYANP.YDYDLTGRQAIRDEVLLAIELARKVKPDVIHLDSTLGGIELRKLDEPTIDALGISDKGKEVWKELSKDLQPLARKFWEETNIEIVAIGKSSVPVRIAEIYAGIYSAKWGIENVEKE.GHLIIGLPRYMEVNIKDGKIIGRSLDPREGGLYGSAEV..SVPEGVKWEIYPNPVARRFMIFEI.
F0LJ97_THEBM/1-224 MRIVSADTGGAVLDGAYEPIGLIATAAVLIEKPFKTATMSIVKYADP.FNYDLSGREAIKDEVFLAIKLAKKVKPDVIHLDSSLGGIELRKLDDPTIDALRISDRGKAIWHELSKDLQPLAKRFWEDTGIEILAIGKDSVAVRIAEIYAGIYSAKWAIEYAREE.GSIRIGLPRYMKVEIHESRIYGESLDPREGGLYGEIE-..SNNEGIGWELYPNPVARTFMVLEV.
#=GC seq_cons MRIVAADTGGAlLD-sYEPIGLIATuAVLVEKPYKTAchSlVKYADP.FsYDLSGRQAI+DElaLAlcLAKKVKPDVIHLDSTLGGIELRKLD-PTIDALRISDRGKAlW+ELSKDLQPLAK+FWEETGIEIlAIGKpSVPVRIAEIYAGIYSAKWAIEYAcKE.G+lRIGLPRYMcV-I+-u+IhGcSLDPREGGLYGcIE...ossEGlcWElYPNPlARsFMVLEl.
//