giotto-ph
is a high-performance implementation of Vietoris–Rips (VR) persistence on the CPU, and is distributed under the GNU AGPLv3 license.
It consists of an improved reimplementation of Morozov and Nigmetov's "lock-free Ripser"
and in addition makes use of a parallel implementation of the apparent pairs optimization used in Ripser v1.2.
It also contains an improved reimplementation of GUDHI's Edge Collapse (EC) algorithm and offers support
for weighted VR filtrations. See also Morozov's Ripser fork, Nigmetov's
Oineus library, and GUDHI's EC implementation.
giotto-ph
is part of the Giotto family of open-source projects and designed for tight integration with
the giotto-tda and pyflagser libraries.
giotto-ph
is the result of a collaborative effort between L2F SA,
the Laboratory for Topology and Neuroscience at EPFL,
and the Institute of Reconfigurable & Embedded Digital Systems (REDS) of HEIG-VD.
giotto-ph
is distributed under the AGPLv3 license.
If you need a different distribution license, please contact the L2F team.
Computing persistence barcodes of large datasets and in high homology degrees is challenging even on modern hardware. giotto-ph
's persistent homology backend
is able to distribute the key stages of the computation (namely, search for apparent pairs and coboundary matrix reduction) across an arbitrary number of available CPU threads.
On challenging datasets, the scaling is quite favourable as shown in the following figure (for more details, see our paper linked below):
Basic imports:
import numpy as np
from gph import ripser_parallel
Persistence diagram of a random point cloud of 100 points in 3D Euclidean space, up to homology dimension 2, using all available threads:
pc = np.random.random((100, 3))
dgm = ripser_parallel(pc, maxdim=2, n_threads=-1)
You can also work with distance matrices by passing metric="precomputed"
:
from scipy.spatial.distance import pdist, squareform
# A distance matrix
dm = squareform(pdist(pc))
dgm = ripser_parallel(pc, metric="precomputed", maxdim=2, n_threads=-1)
More generally, you can work with dense or sparse adjacency matrices of weighted graphs. Here is a dense square matrix interpreted as the adjacency matrix of a fully connected weighted graph with 100 vertices:
# Entries can be negative. The only constraint is that, for every i and j, dm[i, j] ≥ max(dm[i, i], dm[j, j])
# With dense input, the lower diagonal is ignored
adj_dense = np.random.random((100, 100))
np.fill_diagonal(adj_dense, 0)
dgm = ripser_parallel(adj_dense, metric="precomputed", maxdim=2, n_threads=-1)
And here is a sparse adjacency matrix:
# See API reference for treatment of entries below the diagonal
from scipy.sparse import random
adj_sparse = random(100, 100, density=0.1)
dgm = ripser_parallel(adj_sparse, metric="precomputed", maxdim=2, n_threads=-1)
Push the computation to higher homology dimensions and larger point clouds/distance matrices/adjacency matrices using edge collapses:
dgm_higher = ripser_parallel(pc, maxdim=5, collapse_edges=True, n_threads=-1)
(Note: not all datasets and configurations will benefit from edge collapses. For more details, see our paper below.)
Use the weights
and weight_params
parameters to constructed a weighted Rips filtration as defined in this paper. weights
can either be a custom 1D array of vertex weights, or the string "DTM"
for distance-to-measure reweighting:
dgm_dtm = ripser_parallel(pc, weights="DTM", n_threads=-1)
Jupyter notebook tutorials can be found in the examples folder. The API reference can be found at https://giotto-ai.github.io/giotto-ph.
The latest stable version of giotto-ph
requires:
- Python (>= 3.7)
- NumPy (>= 1.19.1)
- SciPy (>= 1.5.0)
- scikit-learn (>= 0.23.1)
The simplest way to install giotto-ph
is using pip
python -m pip install -U giotto-ph
If necessary, this will also automatically install all the above dependencies. Note: we recommend
upgrading pip
to a recent version as the above may fail on very old versions.
Please consult the dedicated page
for detailed instructions on how to build giotto-ph
from sources across different platforms.
We welcome new contributors of all experience levels. The Giotto community goals are to be helpful, welcoming,
and effective. To learn more about making a contribution to giotto-ph
, please consult the relevant page.
After installation, you can launch the test suite from inside the source directory
pytest gph
- Issue tracker: https://github.com/giotto-ai/giotto-ph/issues
If you use giotto-ph
in a scientific publication, we would appreciate citations to the following paper:
giotto-ph: A Python Library for High-Performance Computation of Persistent Homology of Vietoris–Rips Filtrations, Burella Pérez et al, arXiv:2107.05412, 2021.
You can use the following BibTeX entry:
@misc{burella2021giottoph,
title={giotto-ph: A Python Library for High-Performance Computation of Persistent Homology of Vietoris--Rips Filtrations},
author={Julián Burella Pérez and Sydney Hauke and Umberto Lupo and Matteo Caorsi and Alberto Dassatti},
year={2021},
eprint={2107.05412},
archivePrefix={arXiv},
primaryClass={cs.CG}
}
giotto-ai Slack workspace: https://slack.giotto.ai/