Avro reader for Dask.
- Free software: MIT license
- Documentation: https://dask-avro.readthedocs.org.
- Python versions: 2.7, 3.5+
This projects provides an Avro format reader for Dask. Provides a convenient function to read one or more Avro files and partition them arbitrarily.
Usage:
import dask.bag import dask_avro delayeds = dask_avro.read_avro("data-*.avro", blocksize=2**26) data = dask.bag.from_delayed(delayeds)
This package was created with Cookiecutter and the rmax/cookiecutter-pypackage project template.