feat: use output config instead of relying on leaf node type to save … #139

ChenZiHong-Gavin · 2025-12-24T08:24:40Z

This pull request enhances the flexibility of output management within the graph processing engine. By introducing an explicit save_output configuration option for each node, the system no longer relies on the topological position (i.e., being a leaf node) to decide which intermediate results are preserved. This change provides developers with more precise control over data persistence, allowing them to specify exactly which node outputs are relevant for downstream use or inspection, thereby streamlining data flow and potentially optimizing resource usage.

…output

gemini-code-assist · 2025-12-24T08:24:58Z

Summary of Changes

Hello @ChenZiHong-Gavin, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request enhances the flexibility of output management within the graph processing engine. By introducing an explicit save_output configuration option for each node, the system no longer relies on the topological position (i.e., being a leaf node) to decide which intermediate results are preserved. This change provides developers with more precise control over data persistence, allowing them to specify exactly which node outputs are relevant for downstream use or inspection, thereby streamlining data flow and potentially optimizing resource usage.

Highlights

Explicit Output Saving: Introduced a save_output boolean field in the Node class to explicitly control whether a node's output should be saved, replacing the previous implicit mechanism based on leaf nodes.
Configuration Updates: Updated several example YAML configuration files to set save_output: true for their respective nodes, demonstrating the new configuration approach.
Engine Logic Refinement: Modified the Engine's execution logic to leverage the new save_output flag for determining which datasets to return, and removed the _find_leaf_nodes utility method.
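To make the highlights above concrete, here is a minimal sketch of a per-node flag read from a parsed config. Only the `save_output` field name comes from this PR. The `id` and `dependencies` fields, the `nodes_from_config` helper, the config layout, and the default of `False` are assumptions for illustration, not the project's actual code.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Node:
    """Hypothetical stand-in for the node model; only save_output is from the PR."""

    id: str
    dependencies: List[str] = field(default_factory=list)
    save_output: bool = False  # assumed default: nothing is saved unless requested


def nodes_from_config(config: Dict[str, Any]) -> List[Node]:
    """Build nodes from an already-parsed mapping (for example, loaded from YAML)."""
    return [
        Node(
            id=entry["id"],
            dependencies=list(entry.get("dependencies", [])),
            save_output=bool(entry.get("save_output", False)),
        )
        for entry in config.get("nodes", [])
    ]


# Hypothetical config: the intermediate "chunk" node opts in to being saved.
example_config = {
    "nodes": [
        {"id": "read"},
        {"id": "chunk", "dependencies": ["read"], "save_output": True},
        {"id": "generate", "dependencies": ["chunk"], "save_output": True},
    ]
}

nodes = nodes_from_config(example_config)
saved_ids = [node.id for node in nodes if node.save_output]
```

With a flag like this, a node that omits `save_output` is simply skipped when results are collected, which is why the example configs had to be updated to set it explicitly.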


¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, and Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double-check it and use code with caution.

gemini-code-assist

Code Review

This pull request introduces a save_output configuration flag for nodes in the computation graph. This provides an explicit way to specify which node outputs should be saved, replacing the previous implicit behavior of saving outputs only for leaf nodes. The changes include updating the Node data model, modifying the execution engine to use this new flag, and updating all example configurations to be compatible with the new approach. The changes are well-implemented and make the system's behavior more predictable and configurable. I have one minor suggestion regarding import ordering for better code style.
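The behavioral difference described in this review can be illustrated with a small comparison. This is a sketch under stated assumptions: the three-node graph, the node names, and both helper functions are hypothetical, and `find_leaf_nodes` only approximates what a leaf-based rule would do, not the removed `_find_leaf_nodes` method itself.

```python
from typing import Dict, List, Set

# Hypothetical pipeline: node id -> ids of the nodes it depends on.
DEPENDENCIES: Dict[str, List[str]] = {
    "read": [],
    "chunk": ["read"],
    "generate": ["chunk"],
}

# Hypothetical per-node flags, mirroring `save_output: true` in a config file.
SAVE_OUTPUT: Dict[str, bool] = {"read": False, "chunk": True, "generate": True}


def find_leaf_nodes(dependencies: Dict[str, List[str]]) -> Set[str]:
    """Implicit rule: keep nodes that no other node depends on."""
    has_dependents = {dep for deps in dependencies.values() for dep in deps}
    return set(dependencies) - has_dependents


def find_saved_nodes(save_output: Dict[str, bool]) -> Set[str]:
    """Explicit rule: keep exactly the nodes whose flag is set."""
    return {node_id for node_id, flag in save_output.items() if flag}


leaf_based = find_leaf_nodes(DEPENDENCIES)
flag_based = find_saved_nodes(SAVE_OUTPUT)
```

In this example the leaf-based rule keeps only `generate`, while the flag-based rule also keeps the intermediate `chunk` output, which is the kind of control the PR is after.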

gemini-code-assist · 2025-12-24T08:26:11Z

graphgen/engine.py

 import ray
 import ray.data
+from dotenv import load_dotenv
 from ray.data import DataContext


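PEP 8 asks for imports to be grouped (standard library, third-party, local), but it does not itself require alphabetical order. Sorting imports alphabetically by module name within a group is a common convention, applied by tools such as isort, that improves readability and makes imports easier to find.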

Suggested change

-import ray
-import ray.data
-from dotenv import load_dotenv
-from ray.data import DataContext
+from dotenv import load_dotenv
+import ray
+import ray.data
+from ray.data import DataContext
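The suggested ordering matches what you get by sorting the import lines by module name, regardless of whether a line uses `import` or `from`. The sketch below shows that rule on the four lines from this diff. The `module_name` and `sort_imports` helpers are hypothetical and handle only simple single-line imports; they are not a replacement for a real import sorter.

```python
from typing import List


def module_name(import_line: str) -> str:
    """Return the module named by an `import x` or `from x import y` line."""
    return import_line.split()[1]


def sort_imports(lines: List[str]) -> List[str]:
    """Sort import lines by module name, case-insensitively."""
    # sorted() is stable, so lines naming the same module keep their original order.
    return sorted(lines, key=lambda line: module_name(line).lower())


original = [
    "import ray",
    "import ray.data",
    "from dotenv import load_dotenv",
    "from ray.data import DataContext",
]
reordered = sort_imports(original)
```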

feat: use output config instead of relying on leaf node type to save …

c2cba57

…output

gemini-code-assist bot reviewed Dec 24, 2025


test: update e2e tests

084a9c9

ChenZiHong-Gavin merged commit 60f4d1b into main Dec 24, 2025
4 checks passed

ChenZiHong-Gavin deleted the save-output-config branch December 24, 2025 10:51
