You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GH-45339: [Parquet][C++] Fix statistics load logic for no row group and multiple row groups (#45350)
### Rationale for this change
Loading `arrow::ArrayStatistics` logic depends on `parquet::ColumnChunkMetaData`.
We can't get `parquet::ColumnChunkMetaData` when requested row groups are empty because no associated row group and column chunk exist.
We can't use multiple `parquet::ColumnChunkMetaData`s for now because we don't have statistics merge logic. So we can't load statistics when we use multiple row groups.
### What changes are included in this PR?
* Don't load statistics when no row groups are used
* Don't load statistics when multiple row groups are used
* Add `parquet::ArrowReaderProperties::{set_,}should_load_statistics()` to enforce loading statistics by loading row group one by one
### Are these changes tested?
Yes.
### Are there any user-facing changes?
Yes.
* GitHub Issue: #45339
Authored-by: Sutou Kouhei <kou@clear-code.com>
Signed-off-by: Sutou Kouhei <kou@clear-code.com>
0 commit comments