This repository stores the papers that motivate the project.
An Empirical Evaluation of Columnar Storage Formats gives a good illustration of how Apache Parquet is structured and argues for adding auxiliary data structures: disk speed is catching up and the bottleneck is shifting to compute, so it can be worth reading more structures from disk in exchange for faster compute.
Column Sketch presents the core data structure and algorithm we want to reproduce and extend in this project. In one sentence, it is an order-preserving lossy compression that lets scans run directly on the compressed values. The performance argument is that scans are bottlenecked by memory bandwidth, so touching fewer bytes per value makes the scan faster.
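Below is a minimal sketch of the idea in Python, not the paper's exact construction (which also dedicates unique codes to frequent values and targets SIMD-friendly byte codes): it builds an order-preserving map from values to 1-byte codes using equi-depth bucket boundaries, scans the small code column for a `x < v` predicate, and consults the base column only for positions whose code is ambiguous. All names here (`build_sketch_map`, `encode`, `scan_less_than`) are illustrative, not from the paper.

```python
import numpy as np

def build_sketch_map(values, num_codes=256):
    """Pick bucket boundaries (equi-depth quantiles) so that
    code(x) = number of boundaries strictly less than x fits in one byte."""
    qs = np.quantile(values, np.linspace(0.0, 1.0, num_codes + 1)[1:-1])
    return np.unique(qs)

def encode(values, boundaries):
    """Map each base value to its small code; the map is monotone, hence order-preserving."""
    return np.searchsorted(boundaries, values, side="left").astype(np.uint8)

def scan_less_than(base, codes, boundaries, v):
    """Return positions where base < v, reading mostly the 1-byte code column."""
    c = np.searchsorted(boundaries, v, side="left")
    hits = codes < c                        # code below code(v): definitely qualifies
    ambiguous = np.nonzero(codes == c)[0]   # same code as v: must check the base column
    hits[ambiguous] = base[ambiguous] < v
    return np.nonzero(hits)[0]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    base = rng.normal(size=1_000_000)
    boundaries = build_sketch_map(base)
    codes = encode(base, boundaries)
    hits = scan_less_than(base, codes, boundaries, 0.5)
    assert np.array_equal(hits, np.nonzero(base < 0.5)[0])
```

The point of the sketch is the access pattern: for most positions the scan reads one byte from the code column instead of the full-width base value, and only the single ambiguous code forces a lookup into the base column.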