## Abstract

We give a greedy learning algorithm for reconstructing an evolutionary tree based on a certain harmonic average on triplets of terminal taxa. After the pairwise distances between terminal taxa are estimated from sequence data, the algorithm runs in script O sign(n^{2}) time using script O sign(n) work space, where n is the number of terminal taxa. These time and space complexities are optimal in the sense that the size of an input distance matrix is n^{2} and the size of an output tree is n. Moreover, in the Jukes-Cantor model of evolution, the algorithm recovers the correct tree topology with high probability using sample sequences of length polynomial in (1) n, (2) the logarithm of the error probability, and (3) the inverses of two small parameters.

Original language | English (US) |
---|---|

Pages (from-to) | 306-322 |

Number of pages | 17 |

Journal | SIAM Journal on Computing |

Volume | 31 |

Issue number | 1 |

DOIs | |

State | Published - 2001 |

## Keywords

- Computational learning
- Evolutionary trees
- Harmonic greedy triplets
- The Jukes-Cantor model of evolution

## ASJC Scopus subject areas

- General Computer Science
- General Mathematics