GenomeNet

Database: Pfam
Entry: DUF5046
LinkDB: DUF5046
Original site: DUF5046 
#=GF ID   DUF5046
#=GF AC   PF16465.9
#=GF DE   Domain of unknown function (DUF5046)
#=GF AU   Chang Y;0000-0002-2418-3433
#=GF AU   Bateman A;0000-0002-6982-4660
#=GF SE   Jackhmmer new metaHIT cluster
#=GF GA   25.00 25.00;
#=GF TC   29.60 89.60;
#=GF NC   24.40 20.00;
#=GF BM   hmmbuild HMM.ann SEED.ann
#=GF SM   hmmsearch --cpu 4 -Z 75585367 -E 1000 HMM pfamseq
#=GF TP   Repeat
#=GF WK   Domain_of_unknown_function
#=GF CL   CL0186
#=GF DR   INTERPRO; IPR032491;
#=GF DR   SO; 0001068; polypeptide_repeat;
#=GF CC   This small family consists of C-terminal of several
#=GF CC   uncharacterized proteins around 500 residues in length and is
#=GF CC   mainly found in various Faecalibacterium species. The function
#=GF CC   of this family is unknown. This family has distant similarity to
#=GF CC   WD40 repeats.
#=GF SQ   5
#=GS A0A1Y4TA14_9FIRM/260-508  AC A0A1Y4TA14.1
#=GS C7H388_FAEPA/269-536      AC C7H388.1
#=GS A0A174BFW5_9FIRM/267-536  AC A0A174BFW5.1
#=GS R6QB00_9FIRM/259-528      AC R6QB00.1
#=GS D4K2X1_9FIRM/265-528      AC D4K2X1.1
A0A1Y4TA14_9FIRM/260-508             egyrsfcgdgyvcceagdgryyvrsvddptpladygsscllyqpggltllgydetsnapllvmpdgvthplqdfgsyvieedg-----------------.----------------------------.---.-------------------.------..---------------------------MGFLLADGTLLLYGPDGSQTSFAVKAQPG---STLTLASLDSGYALL.LGHDS.DYNLVEMQIWNEDGLQAG...SyapSGLGYEYS....SVYYLA.TGPDGP.LY.AALRQGIGGSWLYDVLDHTGQPVLKGLASV...V..SGs....vASLPEGCFMARRGFEQGFMDAEGQWLYHESIFDAVGD-e
C7H388_FAEPA/269-536                 ...................................................................................LYNPTTGEELTGYQQY-.---TGAGTVSL--YNDGRYQLVDLVSTEqSAV.LCEYDQPIRYYVPGAAVTEpDASTPEmaGRYLFHDLLTGEEKDLYDANTD--DATLAIYAVDGTVRVFDLQTGVLLTDTTIDPVENQVRTHIYPEGNGWAWV.QQDDNdSYDATAIHICGPDGIHKTl.dP...AKLNETYN....YYSPLL.STEDGI.YF.YGCYNGPGSSWLYDVLDSDGDVVVSGLRTCagyYanSV......NGLPEGVFAAVKGFESGWMDLTGQWLYAESIFASSND-e
A0A174BFW5_9FIRM/267-536             ............................................................................qtsllna---------TTGEERSGfVVYLSAGVASFQSENGTYQLVDLTSTGQ.SEV.LCEFEDPISAYAPGVAVTY.RQDLGD..--YELHDLNTGDV--LEVRDESLGTDTLAVYAKDGTLRVYDQNTGAVLTDTVVTPIEGLQRTDLYNVDGGWVWLrQYDND.DHEVTTSTICGPNGTNKT...L...DLDAIKARygadFHGYLWpVTAAGGeFYfSVSYQGPGRNWLYDLIDSTGNVVLAGLGSCngyY..AA......NPLPDGVFVARKGFEIGWMDLHGQWVYCQNIFYSSGDD.
R6QB00_9FIRM/259-528                 ..............................................................................sstls--NTVTGEELDG----F.DQVCGNGTASFRT-EDGQYQLIDLASTE.QSEvLATFDRSVLYHAPGAVVCW.NA-GND..YRYQFHDLTTGEMKPVYNVDADDK--TLAVYATDGTLRVYDRTTGAILTDVNVDSVENQDSAQVWAVGEGYALVmLYPAG.DHSKPTIRLYDAQGLVRQlaiS...ASTPDGYY....YFAPLT.TTDGHA.YF.RYGYAGPNGKTLYDVLDENGNAVLKGLALC...Y..SYyggnglNDLPAGAFMARKGFYYGWMTPDGQWLYCRSIFASSTD-e
D4K2X1_9FIRM/265-528                 ...................................................................................LYNVTTGELLTGEDDSA.VSACGVGVACLQSRQDSRSVLYDLNGGE.AVE.LGRFDRGVNTYTPGCVVLS.GSDDPD..SPYTLIDLASGEKTAIQRRDTDYRSGNVAVLTTDNILKVYDGTTGALLTDVEAAPVEEAQYISVTALPDGYALL.QYDDE.NYNTIAIQTYGGEGLLWS...S...AGEAQQYT....YASYLT.STASGP.VL.TARRDSRDGSSLYDVLDMEGNLLLRRLGSC...Y..SP......DDLPDDCFIARQGFDYGLMDSTGQWLYRESIFSSPSDD.
#=GC seq_cons                        ..................................................................................u..NssTGE.LTG.pp.h.s.hsGsGsAShpo.pDup..LhDLsosE.us..LscFDcslphYsPGssVs..susssD....YphHDLsTGEhpsl.stDsD.tssTLAVYAsDGTLRVYDtsTGAlLTDVslsPVEstspsslaulssGYALL.QYDDs.DYssssIpIaGs-GLp+o...S...AupuppYs....YauYLs.oTsuGs.aa.susYsGPGGSWLYDVLDpsGNlVL+GLuSC...Y..Ss......NsLP-GsFhARKGFEhGWMDscGQWLYsESIFuSSuD.E
//
DBGET integrated database retrieval system