Skip to content

giotto-ai/giotto-ph

Repository files navigation

wheels ci docs

giotto-ph

giotto-ph is a high-performance implementation of Vietoris–Rips (VR) persistence on the CPU, and is distributed under the GNU AGPLv3 license. It consists of an improved reimplementation of Morozov and Nigmetov's "lock-free Ripser" and in addition makes use of a parallel implementation of the apparent pairs optimization used in Ripser v1.2. It also contains an improved reimplementation of GUDHI's Edge Collapse (EC) algorithm and offers support for weighted VR filtrations. See also Morozov's Ripser fork, Nigmetov's Oineus library, and GUDHI's EC implementation.

giotto-ph is part of the Giotto family of open-source projects and designed for tight integration with the giotto-tda and pyflagser libraries.

Project genesis

giotto-ph is the result of a collaborative effort between L2F SA, the Laboratory for Topology and Neuroscience at EPFL, and the Institute of Reconfigurable & Embedded Digital Systems (REDS) of HEIG-VD.

License

giotto-ph is distributed under the AGPLv3 license. If you need a different distribution license, please contact the L2F team.

Parallel persistent homology backend

Computing persistence barcodes of large datasets and in high homology degrees is challenging even on modern hardware. giotto-ph's persistent homology backend is able to distribute the key stages of the computation (namely, search for apparent pairs and coboundary matrix reduction) across an arbitrary number of available CPU threads.

On challenging datasets, the scaling is quite favourable as shown in the following figure (for more details, see our paper linked below):

Basic usage in Python

Basic imports:

import numpy as np
from gph import ripser_parallel

Point clouds

Persistence diagram of a random point cloud of 100 points in 3D Euclidean space, up to homology dimension 2, using all available threads:

pc = np.random.random((100, 3))
dgm = ripser_parallel(pc, maxdim=2, n_threads=-1)

Distance matrices and graphs

You can also work with distance matrices by passing metric="precomputed":

from scipy.spatial.distance import pdist, squareform

# A distance matrix
dm = squareform(pdist(pc))
dgm = ripser_parallel(pc, metric="precomputed", maxdim=2, n_threads=-1)

More generally, you can work with dense or sparse adjacency matrices of weighted graphs. Here is a dense square matrix interpreted as the adjacency matrix of a fully connected weighted graph with 100 vertices:

# Entries can be negative. The only constraint is that, for every i and j, dm[i, j] ≥ max(dm[i, i], dm[j, j])
# With dense input, the lower diagonal is ignored
adj_dense = np.random.random((100, 100))
np.fill_diagonal(adj_dense, 0)
dgm = ripser_parallel(adj_dense, metric="precomputed", maxdim=2, n_threads=-1)

And here is a sparse adjacency matrix:

# See API reference for treatment of entries below the diagonal
from scipy.sparse import random
adj_sparse = random(100, 100, density=0.1)
dgm = ripser_parallel(adj_sparse, metric="precomputed", maxdim=2, n_threads=-1)

Edge Collapser

Push the computation to higher homology dimensions and larger point clouds/distance matrices/adjacency matrices using edge collapses:

dgm_higher = ripser_parallel(pc, maxdim=5, collapse_edges=True, n_threads=-1)

(Note: not all datasets and configurations will benefit from edge collapses. For more details, see our paper below.)

Weighted Rips Filtrations

Use the weights and weight_params parameters to constructed a weighted Rips filtration as defined in this paper. weights can either be a custom 1D array of vertex weights, or the string "DTM" for distance-to-measure reweighting:

dgm_dtm = ripser_parallel(pc, weights="DTM", n_threads=-1)

Documentation and Tutorials

Jupyter notebook tutorials can be found in the examples folder. The API reference can be found at https://giotto-ai.github.io/giotto-ph.

Installation

Dependencies

The latest stable version of giotto-ph requires:

  • Python (>= 3.7)
  • NumPy (>= 1.19.1)
  • SciPy (>= 1.5.0)
  • scikit-learn (>= 0.23.1)

User installation

The simplest way to install giotto-ph is using pip

python -m pip install -U giotto-ph

If necessary, this will also automatically install all the above dependencies. Note: we recommend upgrading pip to a recent version as the above may fail on very old versions.

Developer installation

Please consult the dedicated page for detailed instructions on how to build giotto-ph from sources across different platforms.

Contributing

We welcome new contributors of all experience levels. The Giotto community goals are to be helpful, welcoming, and effective. To learn more about making a contribution to giotto-ph, please consult the relevant page.

Testing

After installation, you can launch the test suite from inside the source directory

pytest gph

Important links

Citing giotto-ph

If you use giotto-ph in a scientific publication, we would appreciate citations to the following paper:

giotto-ph: A Python Library for High-Performance Computation of Persistent Homology of Vietoris–Rips Filtrations, Burella Pérez et al, arXiv:2107.05412, 2021.

You can use the following BibTeX entry:

@misc{burella2021giottoph,
      title={giotto-ph: A Python Library for High-Performance Computation of Persistent Homology of Vietoris--Rips Filtrations},
      author={Julián Burella Pérez and Sydney Hauke and Umberto Lupo and Matteo Caorsi and Alberto Dassatti},
      year={2021},
      eprint={2107.05412},
      archivePrefix={arXiv},
      primaryClass={cs.CG}
}

Community

giotto-ai Slack workspace: https://slack.giotto.ai/

Contacts

[email protected]