Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MQE: initial query planner implementation #11004

Open
wants to merge 59 commits into
base: main
Choose a base branch
from

Conversation

charleskorn
Copy link
Contributor

@charleskorn charleskorn commented Mar 26, 2025

What this PR does

This PR introduces a query planner for MQE. At present, it takes the original PromQL expression, translates it to a query plan, then converts that to operators for evaluation. No significant optimisations (eg. common subexpression elimination) are applied to the query plan as part of this PR.

The main parts of this PR are:

  • the QueryPlanner type in pkg/streamingpromql/planning.go: the main entrypoint to the query planning functionality, responsible for creating query plans from PromQL expressions
  • the QueryPlan type in pkg/streamingpromql/planning/plan.go: represents a query plan
  • the Node type in pkg/streamingpromql/planning/plan.go: represents a node in a query plan, and provides methods to examine, manipulate and convert the node to an operator
  • the EncodedQueryPlan and EncodedNode types in pkg/streamingpromql/planning/plan.proto: represent a query plan transformed into a form suitable for serialization / deserialization
  • two simple AST optimisation passes in pkg/streamingpromql/optimize/ast

I've run our existing benchmarks with query planning enabled and disabled. For non-trivial cases, the cost of running planning is noise compared to the cost of evaluating the query itself.

There are a number of improvements I'd like to introduce as follow-up PRs:

  • metrics for planning latency
  • investigate using code generation for methods like EquivalentTo(), Details(), Children(), SetChildren() and ChildrenLabels()
  • moving the responsibility for converting a node to an operator elsewhere: the current design will make passing configuration (eg. cache connection parameters) difficult

I'll also introduce the first optimisation that builds on this as a follow-up PR.

I'll add a changelog entry when this PR is nearly ready to merge.

Which issue(s) this PR fixes or relates to

(none)

Checklist

  • Tests updated.
  • [n/a] Documentation added.
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
  • [n/a] about-versioning.md updated with experimental features.

Copy link
Contributor

github-actions bot commented Mar 26, 2025

@charleskorn charleskorn force-pushed the charleskorn/mqe-query-planner branch from 00a4742 to cdcf0f7 Compare March 31, 2025 02:57
@charleskorn charleskorn marked this pull request as ready for review March 31, 2025 04:18
@charleskorn charleskorn requested review from tacole02 and a team as code owners March 31, 2025 04:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant