[ GenomeNet Home Page | Motif Search Home Page | Motif Help Page ]

Search with a protein sequence against PRINTS library


OVERVIEW

The PRINTS [ 1, 2 ] is a compendium of protein motif fingerprints. A fingerprint is a group of conserved motifs used to characterise a protein family. It is derived by the excision of conserved motifs from sequence alignments and refined by iterative dredging of the OWL,a non-redundand composite sequence database. Two types of fingerprint are represented in the database, i.e. they are either simple or composite, depending on their complexity: simple fingerprints are essentially single-motifs; while composite fingerprints encode multiple motifs.

This server uses the BLIMPS [3, 4] program to search a protein sequence against the PRINTS database. BLIMPS (BLocks IMProved Searcher) is a searching tool that scores a sequence against blocks or a block against sequences. Original PRINTS database was convered to BLOCKS format for each release of PRINTS. "Composite" fingerprints were decomposed into motif components under the unique PRINTS code (such as "GLABLOOD" held in gc line which is equivalent of "PR00001" in gx line in the PRINTS database.)


SCORE

During the search each entry in the database, a set of blocks (simple fingerprints or the components of composite ones) is converted to a position-specific score matrix (profile). This matrix is used to score the motif found in the query sequence. The motif blocks in the PRINTS database are calibrated based on the distribution of matches determined in a search as described by the authors. [3] Each raw score is divided by estimated score of 99.5 % of true negative sequence and multiplied by 1000. Thus, a derived score of 1000 means that match would be at the 99.5 % false matches which means scores less than around 1000 are therefore probably not important.


RESULTS

The found motifs are listed in a table. From the gc identifiers of PRINTS you can jump into DBGET to look at the hits as well as related informations precisely. Under Position (Score) column of the table the position (start and end sequence numbers) and the scores of found motifs are listed. Click Detail bottun to see actual positions of the blocks motif along the query sequence . (shown in red)

OUTPUT

Below a sample output table of the search is shown. Click the images to get detailed results discribed above.

prints-res1
prints-res2 prints-res3

References

1. Attwood, T.K., Blythe, M., Flower, D.R., Gaulton, A., Mabey, J.E., Maudling, N., McGregor, L., Mitchell, A., Moulton, G., Paine, K. and Scordis, P.
"PRINTS and PRINTS-S shed light on protein ancestry."
Nucl. Acids Res. 30(1):239-241, 2002.
PubMed: 11752304
2. Attwood T.K., Beck M.E.
PRINTS--a protein motif fingerprint database
Protein Engng, 7: 841-848 (1994)
PubMed: 95062131
3. Wallace J.C., Henikoff S.
PATMAT: A searching and extraction program for sequence, pattern, and block queries and databases.
CABIOS , 8:249-254 (1992)
PubMed: 1633566
4. Henikoff S., Henikoff J.G.
Automated assembly of protein blocks for database searching
Nucleic Acid Res., 19:6565-6572 (1991)
PubMed: 1754394