
feat(rust/sedona-geoparquet): Ensure metadata cache is used in GeoParquet wrapper code#646

Merged
paleolimbot merged 13 commits into apache:main from paleolimbot:all-the-metadata-caches
Feb 23, 2026

Conversation

@paleolimbot (Member) commented Feb 19, 2026

In #251 we tried to use the file metadata cache and found that it actually slowed down queries. @yutannihilation kindly benchmarked the effect of the cache against DuckDB to demonstrate that the file cache there is effective for queries against large tables. @b4l kindly showed how to do this in #604.

This PR pipes through the requisite options to ensure the cache is used for GeoParquet reads. This is especially important because we need to pull two extra copies of the metadata after DataFusion has already pulled it: if we don't use the cached version, we issue three requests where we could have issued one. For most queries this happens in parallel/async in a non-blocking way and is hard to notice; however, queries against remote tables with large numbers of Parquet files perform very badly.
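The cost of the redundant fetches is easy to sketch. The snippet below uses a hypothetical `fetch_metadata` stand-in for one object-store request (not SedonaDB's actual code path) to show why sharing a cache across the three call sites turns three requests per file into one:

```python
from functools import lru_cache

calls = {"n": 0}

def fetch_metadata(path):
    """Stand-in for one object-store request for a Parquet footer (hypothetical)."""
    calls["n"] += 1
    return {"path": path, "schema": "..."}

# Without a shared cache: schema inference, planning, and file opening
# each re-fetch the same footer -- three requests per file.
for _ in range(3):
    fetch_metadata("s3://bucket/part-0.parquet")
uncached_requests = calls["n"]  # 3

# With a cache shared across the call sites, only the first call
# touches the object store.
cached_fetch = lru_cache(maxsize=None)(fetch_metadata)
calls["n"] = 0
for _ in range(3):
    cached_fetch("s3://bucket/part-0.parquet")
cached_requests = calls["n"]  # 1

print(uncached_requests, cached_requests)
```

Multiply the uncached behavior across thousands of remote Parquet files and the extra round trips dominate, which matches the slowdown described above.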

A secondary issue is that the default size of the cache is not well-equipped to deal with Overture buildings, which we were using to benchmark this. The buildings data requires almost 900 megabytes of cache space, and because it is a least-recently-used cache being queried roughly in order three times, if the cache size is even a little bit smaller than the full size of the dataset then it is 0% useful. The increase in time we previously observed is probably due to contention on the mutex guarding the in-memory cache.
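The eviction pathology above can be reproduced in miniature with a toy LRU cache (an illustration only, not DataFusion's implementation): with capacity just one entry short of the working set, an in-order scan always evicts the entry that will be needed next, so repeated scans never hit.

```python
from collections import OrderedDict

class LRUCache:
    """Minimal least-recently-used cache (toy illustration)."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()
        self.hits = 0
        self.misses = 0

    def get_or_load(self, key, loader):
        if key in self.data:
            self.data.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self.data[key]
        self.misses += 1
        value = loader(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # evict least recently used
        return value

# 100 "files" but room for only 99: each in-order access evicts the
# entry the scan will need next, so three passes produce zero hits.
cache = LRUCache(capacity=99)
for _ in range(3):
    for f in range(100):
        cache.get_or_load(f, lambda k: f"metadata-{k}")

print(cache.hits, cache.misses)  # 0 hits, 300 misses
```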

import os
os.environ["AWS_SKIP_SIGNATURE"] = "true"
os.environ["AWS_DEFAULT_REGION"] = "us-west-2"
import sedona.db

sd = sedona.db.connect()

sd.sql("SET datafusion.runtime.metadata_cache_limit = '900M'").execute()

# 16s on main, 10s on this PR with a big enough cache
sd.read_parquet(
    "s3://overturemaps-us-west-2/release/2026-02-18.0/theme=buildings/type=building/"
).to_view("buildings", overwrite=True)

# Second time: 16s on main, 0s with this PR
sd.read_parquet(
    "s3://overturemaps-us-west-2/release/2026-02-18.0/theme=buildings/type=building/"
).to_view("buildings", overwrite=True)

I took the opportunity to redo the Overture buildings documentation page to include this and a few other improvements we added in the last few months.

Closes #250.

Copilot AI (Contributor) left a comment


Pull request overview

This PR implements metadata caching for GeoParquet reads to reduce redundant metadata fetches from remote object stores. The PR addresses a performance issue where each GeoParquet query would fetch the same metadata multiple times without using DataFusion's built-in metadata cache.

Changes:

  • Pipes through file metadata cache to DFParquetMetadata operations in both schema inference and file opening
  • Adds metadata_cache field to GeoParquetFileSource and GeoParquetFileOpener structs with proper propagation through all transformation methods
  • Updates Overture Buildings documentation to demonstrate cache configuration and modern SedonaDB API patterns

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| rust/sedona-geoparquet/src/format.rs | Adds metadata cache support to GeoParquetFormat, including cache retrieval in infer_schema and create_physical_plan, and proper propagation through GeoParquetFileSource methods |
| rust/sedona-geoparquet/src/file_opener.rs | Adds metadata_cache field to GeoParquetFileOpener and uses it when fetching parquet metadata |
| docs/overture-examples.md | Comprehensive rewrite demonstrating cache configuration, parameterized queries, and modern SedonaDB patterns for Overture data |
| docs/overture-examples.ipynb | Corresponding Jupyter notebook with consistent examples and output |


paleolimbot and others added 2 commits February 19, 2026 17:44
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@paleolimbot paleolimbot marked this pull request as ready for review February 19, 2026 23:44
@yutannihilation (Contributor)

Thanks for this!

Sorry for a noob question. Even if I include sd.sql("SET datafusion.runtime.metadata_cache_limit = '900M'").execute(), the benchmark still doesn't improve much. Is this expected at the moment (until the new version of DataFusion...?)? Maybe the difference is that DuckDB somehow holds a persistent cache among sessions while SedonaDB doesn't?

Benchmark results (seconds):

| Engine | Median | Mean | Min | Max | Runs | row_count | max_confidence |
| --- | --- | --- | --- | --- | --- | --- | --- |
| DuckDB | 0.591740 | 2.654925 | 0.575550 | 6.797485 | 3 | 7471 | 0.999544084072 |
| SedonaDB | 9.257312 | 8.987796 | 8.034394 | 9.671681 | 3 | 7471 | 0.999544084072 |

@paleolimbot (Member, Author)

If the benchmark is running a fresh process each time, then sd.sql("SET datafusion.runtime.metadata_cache_limit = '900M'").execute() won't help (the cache is in-memory only). I'm not sure exactly how DuckDB does it but having a persistent cache would be great.

We can do that if we want...it roughly involves reimplementing the default cache:

https://github.com/apache/datafusion/blob/1736fd2a40b64c6e39fb12090a2dbe8be07ac5ac/datafusion/execution/src/cache/file_metadata_cache.rs#L143-L205

...backing it with a SQLite database or files in a temporary directory. It can be overridden when we set up the runtime environment here:

https://github.com/apache/datafusion/blob/1736fd2a40b64c6e39fb12090a2dbe8be07ac5ac/datafusion/execution/src/runtime_env.rs#L379-L383

/// Build a [`RuntimeEnv`] from the current configuration.
///
/// This constructs the memory pool and disk manager based on the
/// builder settings and returns the resulting runtime environment.
pub fn build_runtime_env(&self) -> Result<Arc<RuntimeEnv>> {
    let mut rt_builder = RuntimeEnvBuilder::new();

    if let Some(memory_limit) = self.memory_limit {
        let track_capacity = NonZeroUsize::new(10).expect("track capacity must be non-zero");
        let pool: Arc<dyn MemoryPool> = match self.pool_type {
            PoolType::Fair => Arc::new(TrackConsumersPool::new(
                SedonaFairSpillPool::new(memory_limit, self.unspillable_reserve_ratio),
                track_capacity,
            )),
            PoolType::Greedy => Arc::new(TrackConsumersPool::new(
                GreedyMemoryPool::new(memory_limit),
                track_capacity,
            )),
        };
        rt_builder = rt_builder.with_memory_pool(pool);
    }

    if let Some(ref temp_dir) = self.temp_dir {
        let dm_builder = DiskManagerBuilder::default()
            .with_mode(DiskManagerMode::Directories(vec![PathBuf::from(temp_dir)]));
        rt_builder = rt_builder.with_disk_manager_builder(dm_builder);
    }

    rt_builder.build_arc()
}

@zhangfengcdt (Member) left a comment


Looks good to me! I would suggest we add some documentation to clarify:

  • What the recommended cache size is for very large datasets like Overture
  • Whether we support cache invalidation; if not, we should clarify the limitation and the risk of inconsistency (though that is probably not a main concern for slowly updating datasets)

Comment on lines -242 to -253
def test_udf_sedonadb_registry_function_to_datafusion(con):
    datafusion = pytest.importorskip("datafusion")
    udf_impl = udf.arrow_udf(pa.binary(), [udf.STRING, udf.NUMERIC])(some_udf)

    # Register with our session
    con.register_udf(udf_impl)

    # Create a datafusion session, fetch our udf and register with the other session
    datafusion_ctx = datafusion.SessionContext()
    datafusion_ctx.register_udf(
        datafusion.ScalarUDF.from_pycapsule(con._impl.scalar_udf("some_udf"))
    )
@paleolimbot (Member, Author)

I added #655 to track this...you have to try pretty hard to trigger this failing functionality so I just removed the tests for now.

@paleolimbot paleolimbot merged commit a788960 into apache:main Feb 23, 2026
17 checks passed
@paleolimbot paleolimbot deleted the all-the-metadata-caches branch February 23, 2026 22:14


Development

Successfully merging this pull request may close these issues.

Use metadata cache when fetching metadata in geoparquet reader

4 participants