Abstract
We evaluate several methods for estimating the pairwise statistical significance of a pairwise local sequence alignment in terms of the accuracy of the significance estimates, and compare pairwise statistical significance with popular database search programs in terms of retrieval accuracy on a benchmark database. Results indicate that pairwise statistical significance computed with standard substitution matrices is significantly better than the database statistical significance reported by BLAST and PSI-BLAST, and that it is comparable to, and at times significantly better than, SSEARCH. We also present an application of pairwise statistical significance: empirically determining effective gap opening penalties for protein local sequence alignment with the widely used BLOSUM matrices.
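As a rough illustration of what a significance estimate for a single alignment looks like, the sketch below assumes that local alignment scores follow the Gumbel extreme value distribution commonly used for this purpose, with P(S ≥ x) = 1 − exp(−K·m·n·e^(−λx)). The function names and the numeric values of λ and K are hypothetical and are not taken from the paper; in practice these parameters would have to be estimated, presumably for the sequence pair at hand, by one of the methods under evaluation.

```python
import math


def expected_chance_hits(score, m, n, lam, K):
    """Expected number of chance alignments scoring >= score (E-value).

    m, n : lengths of the two sequences
    lam  : Gumbel scale parameter (lambda), assumed already estimated
    K    : Gumbel pre-factor, assumed already estimated
    """
    return K * m * n * math.exp(-lam * score)


def pairwise_p_value(score, m, n, lam, K):
    """P-value 1 - exp(-E) under the assumed extreme value model."""
    e_value = expected_chance_hits(score, m, n, lam, K)
    # -expm1(-E) equals 1 - exp(-E) but stays accurate when E is tiny.
    return -math.expm1(-e_value)


if __name__ == "__main__":
    # Hypothetical parameter values, chosen for illustration only.
    lam, K = 0.267, 0.041
    for s in (30, 50, 100):
        print(s, pairwise_p_value(s, m=200, n=200, lam=lam, K=K))
```

A higher alignment score gives a smaller P-value, and for very significant alignments the P-value is approximately equal to the expected number of chance hits.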
Original language | English (US) |
---|---|
Pages (from-to) | 347-367 |
Number of pages | 21 |
Journal | International Journal of Computational Biology and Drug Design |
Volume | 1 |
Issue number | 4 |
State | Published - Jan 1 2008 |
Keywords
- database statistical significance
- gap opening penalty
- homologs
- pairwise local alignment
- pairwise statistical significance
- sequence alignment
ASJC Scopus subject areas
- Drug Discovery
- Computer Science Applications