GenomeNet

Database: Pfam
Entry: DUF3855
#=GF ID   DUF3855
#=GF AC   PF12967.8
#=GF DE   Domain of Unknown Function with PDB structure (DUF3855)
#=GF AU   Ellrott K;0000-0002-6573-5900
#=GF SE   JCSG structure PDB:1O22
#=GF GA   27.00 27.00;
#=GF TC   359.80 359.60;
#=GF NC   22.90 21.10;
#=GF BM   hmmbuild HMM.ann SEED.ann
#=GF SM   hmmsearch -Z 47079205 -E 1000 --cpu 4 HMM pfamseq
#=GF TP   Family
#=GF WK   Domain_of_unknown_function
#=GF RN   [1]
#=GF RM   15229892
#=GF RT   Crystal structure of an orphan protein (TM0875) from Thermotoga
#=GF RT   maritima at 2.00-A resolution reveals a new fold.
#=GF RA   Bakolitsa C, Schwarzenbacher R, McMullan D, Brinen LS, Canaves
#=GF RA   JM, Dai X, Deacon AM, Elsliger MA, Eshagi S, Floyd R,
#=GF RA   Godzik A, Grittini C, Grzechnik SK, Jaroszewski L, Karlak C,
#=GF RA   Klock HE, Koesema E, Kovarik JS, Kreusch A, Kuhn P, Lesley SA,
#=GF RA   McPhillips TM, Miller MD, Morse A, Moy K, Ouyang J, Page R,
#=GF RA   Quijano K, Robb A, Spraggon G, Stevens RC, van den Bedem H, 
#=GF RA   Velasquez J, Vincent J, von Delft F, Wang X, West B, Wolf G, 
#=GF RA   Hodgson KO, Wooley J, Wilson IA.
#=GF RL   Proteins. 2004 Aug 15;56(3):607-10.
#=GF DR   INTERPRO; IPR024482;
#=GF DR   SO; 0100021; polypeptide_conserved_region;
#=GF CC   Family based on orphan protein (TM0875) from Thermotoga maritima
#=GF CC   that has been structurally determined as PDB:1O22. The TM0875
#=GF CC   gene of Thermotoga maritima encodes a hypothetical protein
#=GF CC   NP_228683 [1] of unknown function. Analysis of TM0875 genomic
#=GF CC   context reveals the presence of MMT1 (a predicted Co/Zn/Cd
#=GF CC   cation transporter) and an inactive homolog of metal-dependent
#=GF CC   proteases. 1O22 shows weak structural similarity to
#=GF CC   phosphoribosylformylglycinamidine synthase (PDB:1t4a; DALI
#=GF CC   Z-score=4.6), the yggU protein (PDB:1n91; DALI Z-score=3),
#=GF CC   and a thioesterase superfamily member (PDB:2cy9; found using
#=GF CC   FATCAT), even though they share very low sequence identity.
#=GF SQ   1
#=GS Q9WZX8_THEMA/1-158  AC Q9WZX8.1
#=GS Q9WZX8_THEMA/1-158  DR PDB; 1O22 A; 6-154;
Q9WZX8_THEMA/1-158             MRLMDILEILYYKKGKEFGILEKKMKEIFNETGVSLEPVNSELIGRIFLKISVLEEGEEVPSFAIKALTPKENAVDLPLGDWTDLKNVFVEEIDYLDSYGDMKILSEKNWYKIYVPYSSVKKKNRNELVEEFMKYFFESKGWNPGEYTFSVQEIDNLF
#=GR Q9WZX8_THEMA/1-158  SS    XXXXX-EEEEEEETT---HHHHHHHHHHHHHHS--------SSEEEEEEEEEEE-TT----SEEEEEE---S--TT--SSS-----S-EEEEEEE-EEETTEEEEEETTEEEEEEEGGGSTT--HHHHHHHHHHHHHHHTT--GGGEEEEEEE-XXXX
#=GC SS_cons                   XXXXX-EEEEEEETT---HHHHHHHHHHHHHHS--------SSEEEEEEEEEEE-TT----SEEEEEE---S--TT--SSS-----S-EEEEEEE-EEETTEEEEEETTEEEEEEEGGGSTT--HHHHHHHHHHHHHHHTT--GGGEEEEEEE-XXXX
#=GC seq_cons                  MRLMDILEILYYKKGKEFGILEKKMKEIFNETGVSLEPVNSELIGRIFLKISVLEEGEEVPSFAIKALTPKENAVDLPLGDWTDLKNVFVEEIDYLDSYGDMKILSEKNWYKIYVPYSSVKKKNRNELVEEFMKYFFESKGWNPGEYTFSVQEIDNLF
//