Skip to content

Conversation

@songbell
Copy link

@songbell songbell commented Sep 17, 2025

Copilot AI review requested due to automatic review settings September 17, 2025 07:28
@github-actions github-actions bot added category: continuous batching Continuous batching category: LLM LLM pipeline (stateful, static) category: sampling Sampling / Decoding algorithms category: speculative decoding Speculative decoding category: LoRA Low rank adapters category: cmake / build Cmake scripts category: LLM samples GenAI LLM samples category: CPP API Changes in GenAI C++ public headers no-match-files labels Sep 17, 2025
@songbell songbell changed the title eagle impl with top-1 proposal eagle3 cb impl with top-1 proposal Sep 17, 2025
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR implements Eagle speculative decoding functionality for top-1 proposal generation. The implementation adds support for Eagle3 mode, which enables accelerated text generation through speculative decoding with hidden state sharing between main and draft models.

Key changes include:

  • Added Eagle decoding implementation with model transformation pipelines for hidden state extraction
  • Integrated safetensor parsing for Eagle3 configuration data (d2t mappings)
  • Extended continuous batching pipeline to support Eagle mode with hidden state management

Reviewed Changes

Copilot reviewed 24 out of 25 changed files in this pull request and generated 8 comments.

Show a summary per file
File Description
tools/continuous_batching/accuracy/continuous_batching_eagle_decoding.cpp New Eagle decoding accuracy test tool
src/cpp/src/speculative_decoding/speculative_decoding_impl.hpp Added Eagle decoding class definitions and model transformation passes
src/cpp/src/speculative_decoding/speculative_decoding_impl.cpp Core Eagle decoding implementation with model transformations
src/cpp/src/continuous_batching/pipeline.cpp Integration of Eagle mode into pipeline construction
src/cpp/src/continuous_batching/model_runner.hpp Added hidden state management functionality
samples/cpp/text_generation/eagle_speculative_lm.cpp New Eagle speculative decoding sample
src/cpp/src/safe_tensor_wrapper.hpp New safetensor parsing utilities
Comments suppressed due to low confidence (1)

src/cpp/src/continuous_batching/model_runner.hpp:1

  • This appears to be modifying the token index without bounds checking on the d2t array. Add bounds checking to prevent potential buffer overflow.
// Copyright (C) 2023-2025 Intel Corporation

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

@github-actions github-actions bot added the category: llm_bench Label for tool/llm_bench folder label Sep 18, 2025
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 28 out of 29 changed files in this pull request and generated 10 comments.


Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 3 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot AI review requested due to automatic review settings November 12, 2025 04:17
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 6 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@rkazants
Copy link
Collaborator

Please do NOT merge until we will have more-less clear vision about solution in optimum-intel side. The model preparation can affect IRs and their structure so that the proposed solution in this PR will be incompatible.

Now we have no the full architectural proposal for eagle3 and we need to pass it for model preparation in arch review.

@peterchen-intel, did you have internal code-review here? Is the code clean enough?

Best regards,
Roman

@peterchen-intel
Copy link
Collaborator

peterchen-intel commented Nov 12, 2025

Please do NOT merge until we will have more-less clear vision about solution in optimum-intel side. The model preparation can affect IRs and their structure so that the proposed solution in this PR will be incompatible.

Now we have no the full architectural proposal for eagle3 and we need to pass it for model preparation in arch review.

@peterchen-intel, did you have internal code-review here? Is the code clean enough?

Best regards, Roman

@rkazants 50 comments from all reviewers (including 12 from copilot) have been resolved, the code is clean now.

Copilot AI review requested due to automatic review settings November 12, 2025 08:36
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 6 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@peterchen-intel peterchen-intel removed this from the 2025.4 milestone Nov 12, 2025
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 9 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@MaximProshin
Copy link
Collaborator

Please do NOT merge until we will have more-less clear vision about solution in optimum-intel side. The model preparation can affect IRs and their structure so that the proposed solution in this PR will be incompatible.

Now we have no the full architectural proposal for eagle3 and we need to pass it for model preparation in arch review.

@peterchen-intel, did you have internal code-review here? Is the code clean enough?

Best regards, Roman

This is not a blocking comment and will be addressed separately.

Copilot AI review requested due to automatic review settings November 19, 2025 03:26
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 4 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot AI review requested due to automatic review settings November 19, 2025 03:30
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 4 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@songbell
Copy link
Author

replace with #3055

@songbell songbell closed this Nov 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: continuous batching Continuous batching category: CPP API Changes in GenAI C++ public headers category: GGUF GGUF file reader category: GHA CI based on Github actions category: LLM samples GenAI LLM samples category: LLM LLM pipeline (stateful, static) category: sampling Sampling / Decoding algorithms category: speculative decoding Speculative decoding do_not_merge no-match-files

Projects

None yet

Development

Successfully merging this pull request may close these issues.