chore: add compression #2345
Conversation
Warning: This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.

How to use the Graphite Merge Queue: add the label merge-queue to this PR to add it to the merge queue. You must have a Graphite account in order to use the merge queue. Sign up using this link. An organization admin has enabled the Graphite Merge Queue in this repository. Please do not merge from GitHub, as this will restart CI on PRs being processed by the merge queue. This stack of pull requests is managed by Graphite. Learn more about stacking.
PR Summary
This PR adds compression support to the SQLite VFS FDB implementation, enabling data to be stored more efficiently in FoundationDB with multiple compression algorithms (None, LZ4, Snappy, Zstd).
- Added CompressionType enum and Compression trait, with implementations for all supported algorithms, in compression.rs (a rough sketch of this surface follows after this list)
- Modified FdbFileMetadata to include the compression type, with backward compatibility for older formats
- Updated file operations in file.rs to compress/decompress page data during reads and writes
- Added comprehensive compression metrics tracking, including operation counts, bytes processed, ratios, and latencies
- Implemented VFS registration with compression-specific suffixes, allowing multiple VFS variants to coexist
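For orientation, here is a rough sketch of the surface the summary describes. The decompress signature is copied from the Snappy snippet quoted later in this review; the compress signature, derives, and error variants are assumptions about what compression.rs actually contains.

```rust
use bytes::Bytes;

// Compression algorithms supported by the VFS, per the PR summary.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum CompressionType {
    None,
    Lz4,
    Snappy,
    Zstd,
}

// Assumed error shape; the real CompressionError lives in compression.rs.
#[derive(Debug)]
pub enum CompressionError {
    UnknownType(u8),
    SnappyError(String),
}

// Sketch of the trait: decompress mirrors the Snappy snippet below,
// compress is an assumed counterpart.
pub trait Compression {
    fn compress(&self, data: &[u8]) -> Result<Bytes, CompressionError>;
    fn decompress(&self, compressed_data: &[u8], expected_size: usize)
        -> Result<Bytes, CompressionError>;
}
```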
Greptile AI
7 file(s) reviewed, 8 comment(s)
```rust
    // Convert from FdbBindingError to FdbError
    Err(FdbError::from_code(1))
}
```
style: Converting any FdbBindingError to a generic error code 1 loses specific error information. Consider preserving the original error code from the FdbBindingError if available, or at least logging the specific error code before converting.
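A minimal sketch of the suggestion, assuming a helper function and the tracing crate for logging; FdbError::from_code(1) is the call already used in the snippet above, and the helper name is hypothetical.

```rust
use foundationdb::{FdbBindingError, FdbError};

// Hypothetical helper: record the specific binding error before collapsing
// it into the generic code 1 used above, so the original cause is not lost.
fn to_fdb_error(err: FdbBindingError) -> FdbError {
    // `tracing` is an assumption; substitute the project's logging facility.
    tracing::warn!(error = ?err, "converting FdbBindingError to a generic FdbError");
    FdbError::from_code(1)
}
```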
```rust
pub fn create_safe_bytes(size: usize) -> bytes::BytesMut {
    // Check if the size is reasonable and chunk if needed
    let safe_size = std::cmp::min(size, MAX_SAFE_PAGE_SIZE);
    let buffer = vec![0u8; safe_size];
    bytes::BytesMut::from(&buffer[..])
}
```
style: This implementation creates a new buffer with zeros and then copies it into a BytesMut. Consider using BytesMut::with_capacity followed by resize for better performance.
Suggested change:

```rust
// Before
pub fn create_safe_bytes(size: usize) -> bytes::BytesMut {
    // Check if the size is reasonable and chunk if needed
    let safe_size = std::cmp::min(size, MAX_SAFE_PAGE_SIZE);
    let buffer = vec![0u8; safe_size];
    bytes::BytesMut::from(&buffer[..])
}

// After
pub fn create_safe_bytes(size: usize) -> bytes::BytesMut {
    // Check if the size is reasonable and chunk if needed
    let safe_size = std::cmp::min(size, MAX_SAFE_PAGE_SIZE);
    let mut bytes = bytes::BytesMut::with_capacity(safe_size);
    bytes.resize(safe_size, 0);
    bytes
}
```
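If the project's bytes version provides it, BytesMut::zeroed collapses the allocate-then-fill pattern into one call; this is an optional alternative sketch, not part of the suggestion above, and the MAX_SAFE_PAGE_SIZE value shown is a placeholder.

```rust
use bytes::BytesMut;

// Placeholder; the real constant is defined elsewhere in the crate.
const MAX_SAFE_PAGE_SIZE: usize = 65_536;

// Alternative sketch: allocate and zero-fill in a single call.
pub fn create_safe_bytes(size: usize) -> BytesMut {
    let safe_size = std::cmp::min(size, MAX_SAFE_PAGE_SIZE);
    BytesMut::zeroed(safe_size)
}
```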
```rust
impl From<u8> for CompressionType {
    fn from(value: u8) -> Self {
        match value {
            0 => CompressionType::None,
            1 => CompressionType::Lz4,
            2 => CompressionType::Snappy,
            3 => CompressionType::Zstd,
            _ => CompressionType::None,
        }
    }
}
```
style: This implementation silently defaults to CompressionType::None for invalid values, but there's an UnknownType error defined that's never used. Consider either returning that error or logging a warning when an unknown compression type is encountered.
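A minimal sketch of the fallible conversion the comment points toward, reusing the enum shown above; the UnknownType variant's exact shape is an assumption, so this is illustrative rather than the PR's actual code.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum CompressionType {
    None,
    Lz4,
    Snappy,
    Zstd,
}

// Assumed shape of the otherwise-unused error variant the review mentions.
#[derive(Debug)]
pub enum CompressionError {
    UnknownType(u8),
}

// Fail loudly on unknown discriminants instead of silently falling back
// to CompressionType::None.
impl TryFrom<u8> for CompressionType {
    type Error = CompressionError;

    fn try_from(value: u8) -> Result<Self, Self::Error> {
        match value {
            0 => Ok(CompressionType::None),
            1 => Ok(CompressionType::Lz4),
            2 => Ok(CompressionType::Snappy),
            3 => Ok(CompressionType::Zstd),
            other => Err(CompressionError::UnknownType(other)),
        }
    }
}
```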
```rust
impl fmt::Display for CompressionType {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            CompressionType::None => write!(f, "none"),
            CompressionType::Lz4 => write!(f, "lz4"),
            CompressionType::Snappy => write!(f, "snappy"),
            CompressionType::Zstd => write!(f, "zstd"),
        }
    }
}
```
style: This manual Display implementation is redundant since you're already deriving Display from strum at line 25. You can remove this implementation to avoid potential inconsistencies.
Suggested change (delete the manual implementation):

```rust
impl fmt::Display for CompressionType {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            CompressionType::None => write!(f, "none"),
            CompressionType::Lz4 => write!(f, "lz4"),
            CompressionType::Snappy => write!(f, "snappy"),
            CompressionType::Zstd => write!(f, "zstd"),
        }
    }
}
```
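For reference, a hedged sketch of the strum-derived form the review alludes to; whether it matches the derive already present at line 25 of compression.rs is an assumption, and it requires strum's derive feature.

```rust
use strum::Display;

// With serialize_all = "lowercase", the derived Display produces the same
// strings as the manual impl above: "none", "lz4", "snappy", "zstd".
#[derive(Debug, Clone, Copy, PartialEq, Eq, Display)]
#[strum(serialize_all = "lowercase")]
pub enum CompressionType {
    None,
    Lz4,
    Snappy,
    Zstd,
}
```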
```rust
// Store the original size at the beginning for decompression
let mut result = BytesMut::with_capacity(4 + compressed_size);
result.extend_from_slice(&(data.len() as u32).to_le_bytes());
result.extend_from_slice(&compressed[0..compressed_size]);
```
style: LZ4 implementation stores the original size in the compressed data, but Snappy doesn't. This creates an inconsistency in how the compressed data is formatted between different compression types.
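For context, a rough sketch of how the 4-byte length prefix written above would be read back on the LZ4 decompression side; it assumes the lz4_flex crate, and the PR may use a different LZ4 binding and error type.

```rust
use bytes::Bytes;

// Sketch only: recover the original length from the little-endian u32 prefix,
// then decompress the remaining payload.
fn lz4_decompress_prefixed(stored: &[u8]) -> Result<Bytes, String> {
    if stored.len() < 4 {
        return Err("buffer shorter than the 4-byte length prefix".to_string());
    }
    let original_len = u32::from_le_bytes(stored[..4].try_into().unwrap()) as usize;

    // lz4_flex needs the uncompressed size up front, which is why the
    // compress path stores it.
    let decompressed = lz4_flex::block::decompress(&stored[4..], original_len)
        .map_err(|e| e.to_string())?;
    Ok(Bytes::from(decompressed))
}
```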
```rust
fn decompress(&self, compressed_data: &[u8], expected_size: usize) -> Result<Bytes, CompressionError> {
    let timer = metrics::start_compression_operation();

    // Decompress the data
    let decompressed = snap::raw::Decoder::new()
        .decompress_vec(compressed_data)
        .map_err(|e| CompressionError::SnappyError(e.to_string()))?;

    // Record metrics
    metrics::complete_compression_operation(
        &timer,
        CompressionType::Snappy,
        decompressed.len(),
        compressed_data.len(),
        false,
    );

    Ok(Bytes::from(decompressed))
}
```
style: Unlike LZ4, the Snappy implementation doesn't use the expected_size parameter and doesn't store the original size in the compressed data. This could cause issues if the caller expects consistent behavior across compression types.
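A hedged sketch of one way the Snappy path could honor expected_size, as the comment suggests. The helper name is hypothetical and it returns a plain String error for brevity; the real code would map failures into CompressionError as in the snippet above.

```rust
// Hypothetical helper: decompress with Snappy and verify the result matches
// the caller-supplied expected_size, mirroring the LZ4 path's behavior.
fn snappy_decompress_checked(
    compressed_data: &[u8],
    expected_size: usize,
) -> Result<Vec<u8>, String> {
    let decompressed = snap::raw::Decoder::new()
        .decompress_vec(compressed_data)
        .map_err(|e| e.to_string())?;

    if decompressed.len() != expected_size {
        return Err(format!(
            "decompressed {} bytes, expected {}",
            decompressed.len(),
            expected_size
        ));
    }
    Ok(decompressed)
}
```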
```rust
// Record ratio (only if compression and sizes are valid)
if is_compression && original_size > 0 && compressed_size > 0 {
    let ratio = original_size as f64 / compressed_size as f64;
    COMPRESSION_RATIO
        .with_label_values(&[&comp_type, operation])
        .observe(ratio);
} else if !is_compression && original_size > 0 && compressed_size > 0 {
    let ratio = compressed_size as f64 / original_size as f64;
    COMPRESSION_RATIO
        .with_label_values(&[&comp_type, operation])
        .observe(ratio);
}
```
logic: The ratio calculation is inverted between compression and decompression. For compression, you calculate original/compressed (which is correct), but for decompression you calculate compressed/original. This is inconsistent and could be confusing when analyzing metrics. Consider using the same ratio definition (original/compressed) for both operations and adding a note in the metric description that values > 1.0 indicate effective compression.
Suggested change:

```rust
// Before
// Record ratio (only if compression and sizes are valid)
if is_compression && original_size > 0 && compressed_size > 0 {
    let ratio = original_size as f64 / compressed_size as f64;
    COMPRESSION_RATIO
        .with_label_values(&[&comp_type, operation])
        .observe(ratio);
} else if !is_compression && original_size > 0 && compressed_size > 0 {
    let ratio = compressed_size as f64 / original_size as f64;
    COMPRESSION_RATIO
        .with_label_values(&[&comp_type, operation])
        .observe(ratio);
}

// After
// Record ratio (only if compression and sizes are valid)
if is_compression && original_size > 0 && compressed_size > 0 {
    let ratio = original_size as f64 / compressed_size as f64;
    COMPRESSION_RATIO
        .with_label_values(&[&comp_type, operation])
        .observe(ratio);
} else if !is_compression && original_size > 0 && compressed_size > 0 {
    let ratio = original_size as f64 / compressed_size as f64;
    COMPRESSION_RATIO
        .with_label_values(&[&comp_type, operation])
        .observe(ratio);
}
```
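To go with the consistent ratio definition, a sketch of how the histogram's help text could spell out the direction of the ratio, per the note above; it assumes the prometheus and lazy_static crates, and the metric name and labels are illustrative rather than the PR's actual values.

```rust
use lazy_static::lazy_static;
use prometheus::{register_histogram_vec, HistogramVec};

lazy_static! {
    // Sketch only: the help string makes the ratio direction explicit.
    static ref COMPRESSION_RATIO: HistogramVec = register_histogram_vec!(
        "compression_ratio",
        "Original size divided by compressed size; values > 1.0 indicate effective compression",
        &["compression_type", "operation"]
    )
    .unwrap();
}
```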
```rust
/// Start measuring a compression operation
pub fn start_compression_operation() -> MetricsTimer {
    MetricsTimer::start()
}
```
style: Unlike other metric start functions (e.g., start_file_open, start_fdb_transaction), this function doesn't increment any counter before starting the timer. This is fine if the counter is incremented in complete_compression_operation, but it's inconsistent with the pattern used elsewhere in the file.
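A small sketch of the pattern the comment describes, bumping a counter before starting the timer; COMPRESSION_OPERATIONS_TOTAL is a hypothetical IntCounter and MetricsTimer is the type from the snippet above, so this is illustrative only.

```rust
/// Start measuring a compression operation.
pub fn start_compression_operation() -> MetricsTimer {
    // Hypothetical counter, following the start_file_open / start_fdb_transaction
    // pattern mentioned in the review; the real metric (if any) would live in
    // this module.
    COMPRESSION_OPERATIONS_TOTAL.inc();
    MetricsTimer::start()
}
```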
Deploying rivet with Cloudflare Pages

Latest commit: 45181cb
Status: ✅ Deploy successful!
Preview URL: https://a687fccd.rivet.pages.dev
Branch Preview URL: https://04-10-chore-add-compression.rivet.pages.dev
Force-pushed from e3c5f03 to 45181cb