@friendlymatthew
Contributor

Which issue does this PR close?

Rationale for this change

This PR adds a fast path that detects single row group reads at construction time and stores the constant index value directly, avoiding `HashMap` allocation and iterator overhead.

For multiple row groups, the existing iterator-based approach is used unchanged.
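
To illustrate the idea, here is a minimal sketch, not the code in this PR. `RowGroupIndexSketch`, its variants, and the `(row_group_index, num_rows)` input are hypothetical names chosen for the example, and the multi-group branch is a simplified stand-in for the existing iterator-based path. The constructor picks the constant variant when exactly one row group is selected, so reads in that case only fill a buffer with one value.

```rust
/// Hypothetical reader that emits the row group index for each row read.
/// This is an illustrative stand-in, not the parquet crate's actual type.
enum RowGroupIndexSketch {
    /// Fast path: a single row group, so the index is a constant.
    Constant { index: i64, remaining: usize },
    /// General path: walk (row_group_index, num_rows) pairs lazily.
    Multi {
        groups: std::vec::IntoIter<(i64, usize)>,
        current: Option<(i64, usize)>,
    },
}

impl RowGroupIndexSketch {
    /// Decide once, at construction time, which path to use.
    fn new(groups: Vec<(i64, usize)>) -> Self {
        if let [(index, rows)] = groups.as_slice() {
            return Self::Constant { index: *index, remaining: *rows };
        }
        Self::Multi { groups: groups.into_iter(), current: None }
    }

    /// Return the row group index for up to `n` rows.
    fn read(&mut self, n: usize) -> Vec<i64> {
        match self {
            Self::Constant { index, remaining } => {
                // No lookup or iteration: repeat the stored constant.
                let take = n.min(*remaining);
                *remaining -= take;
                vec![*index; take]
            }
            Self::Multi { groups, current } => {
                let mut out = Vec::with_capacity(n);
                while out.len() < n {
                    // Advance to the next row group when the current one is
                    // missing or exhausted.
                    if matches!(*current, None | Some((_, 0))) {
                        match groups.next() {
                            Some(group) => *current = Some(group),
                            None => break,
                        }
                    }
                    let (index, left) = current.as_mut().unwrap();
                    let take = (n - out.len()).min(*left);
                    out.extend(std::iter::repeat(*index).take(take));
                    *left -= take;
                }
                out
            }
        }
    }
}
```

In this sketch the branch is chosen once in `new`, so the per-read cost of the single row group case does not depend on any map or iterator state.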

@github-actions github-actions bot added the parquet Changes to the parquet crate label Jan 20, 2026
@Dandandan
Contributor

@friendlymatthew do you have some performance results?

Labels

parquet Changes to the parquet crate

Development

Successfully merging this pull request may close these issues.

perf: optimize RowGroupIndexReader for single row group reads

2 participants