Skip to content

diverse-seq provides alignment-free algorithms to facilitate phylogenetic workflows

diverse-seq implements computationally efficient alignment-free algorithms that enable efficient prototyping for phylogenetic workflows. It can accelerate parameter selection searches for sequence alignment and phylogeny estimation by identifying a subset of sequences that are representative of the diversity in a collection.

You can read more about the methods implemented in diverse-seq in the published article.

Warning

We have rewritten a substantial part of the project in Rust. We now also use the Zarr storage format instead of HDF5. As a result, the output file from dvs prep now has the suffix .dvseqsz instead of .dvseq. Old-format files are not compatible with this version.

Installation

diverse-seq can be installed from PyPI as follows

pip install diverse-seq
Note

If you experience any errors during installation, we recommend using uv pip. This command provides much better error messages than the standard pip command. If you cannot resolve the installation problem, please open an issue.

The extra install option

To get the additional components for working within Jupyter notebooks, or just to reproduce all the examples in the documentation, do the install as follows

pip install "diverse-seq[extra]"

Installation and usage with uv

Speaking of uv, it provides a simplified approach to install dvs as a command-line only tool as

uv tool install diverse-seq

For the examples in this documentation, you can access the dvs command as

uvx --from diverse-seq dvs

without having to activate a virtual environment.