[C++][Parquet] Add configure to limit the row group size #48467

@wecharyu

Describe the enhancement requested

Arrow has one config, max_row_group_length_, that limits the number of rows in a row group, but nothing limits the row group's size in bytes, which is arguably more important (see the Parquet docs).

Other implementations, such as the Java one, already have such a config:
https://github.com/apache/parquet-java/blob/79977453b8cd65e6244f16316fac3a510aa87aa8/parquet-hadoop/src/main/java/parquet/hadoop/ParquetWriter.java#L35
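
To make the request concrete, here is a rough sketch of the behavior I have in mind: start a new row group when either the row count limit or a byte size limit would be exceeded. This is not Arrow code. RowGroupLimits, max_row_group_bytes and PlanRowGroups are hypothetical names for illustration, and using one size estimate per row is a simplification (a real writer would more likely estimate the size of buffered column data).

```cpp
#include <cstdint>
#include <vector>

// Hypothetical limits. Only the row-count limit exists in Arrow today;
// max_row_group_bytes is an assumed name for the requested setting.
struct RowGroupLimits {
  int64_t max_row_group_length;  // maximum rows per row group
  int64_t max_row_group_bytes;   // maximum estimated bytes per row group (assumed)
};

// Splits rows into row groups, closing the current group whenever adding the
// next row would exceed either limit. Returns the number of rows per group.
// A single row larger than the byte limit still gets a group of its own.
std::vector<int64_t> PlanRowGroups(const std::vector<int64_t>& row_sizes,
                                   const RowGroupLimits& limits) {
  std::vector<int64_t> groups;
  int64_t rows = 0;   // rows in the current group
  int64_t bytes = 0;  // estimated bytes in the current group
  for (int64_t size : row_sizes) {
    const bool full = rows > 0 &&
                      (rows >= limits.max_row_group_length ||
                       bytes + size > limits.max_row_group_bytes);
    if (full) {
      groups.push_back(rows);
      rows = 0;
      bytes = 0;
    }
    ++rows;
    bytes += size;
  }
  if (rows > 0) groups.push_back(rows);  // flush the last, partial group
  return groups;
}
```

With a limit of 3 rows and 100 bytes, rows of estimated sizes 40, 40, 40, 10, 10, 10, 10 would be split into groups of 2, 3 and 2 rows: the first group is cut by the byte limit, the second by the row limit.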

Component(s)

C++
