Conversation

@igordayen igordayen commented Nov 29, 2025

LLM Streaming Integration with Tooling Support

Overview

This PR introduces comprehensive LLM streaming capabilities with full tooling integration, providing reactive streaming APIs that work seamlessly with @Tool-annotated methods.

User Code Usage

Basic Streaming with Tools

// Extension function approach (recommended)
val runner = ai.withLlmByRole("fastest")

// Test fixtures referenced by the callbacks below (logger is the test's SLF4J logger)
val receivedEvents = mutableListOf<String>()
var errorOccurred: Throwable? = null
var completionCalled = false

val results = runner.asStreaming()
    .withPrompt("Test integration streaming")
    .createObjectListWithThinking(SimpleItem::class.java)

results
    .doOnNext { event ->
        when {
            event.isThinking() -> {
                val content = event.getThinking()!!
                receivedEvents.add("THINKING: $content")
                logger.info("Integration test received thinking: {}", content)
            }
            event.isObject() -> {
                val obj = event.getObject()!!
                receivedEvents.add("OBJECT: ${obj.name}")
                logger.info("Integration test received object: {}", obj.name)
            }
        }
    }
    .doOnError { error ->
        errorOccurred = error
        logger.error("Integration test stream error: {}", error.message)
    }
    .doOnComplete {
        completionCalled = true
        logger.info("Integration test stream completed successfully")
    }
    .subscribe() // the doOnX callbacks only fire once the stream is subscribed

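For reference, here is a minimal sketch of the event shape implied by the calls above (isThinking/getThinking, isObject/getObject); the actual StreamingEvent&lt;T&gt; in this PR may differ:

class StreamingEvent<T>(
    private val thinking: String? = null,
    private val obj: T? = null,
) {
    // Exactly one of the two payloads is expected to be present.
    fun isThinking(): Boolean = thinking != null
    fun getThinking(): String? = thinking
    fun isObject(): Boolean = obj != null
    fun getObject(): T? = obj
}
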
Extension Functions API

// Pure casting (fast)
runner.asStreaming()

// Safe conversion with validation
runner.asStreamingWithValidation()

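A hypothetical sketch of how these two could be implemented; only the two function names come from this PR, and the receiver type PromptRunner is an assumption:

fun PromptRunner.asStreaming(): StreamingPromptRunnerOperations =
    // Pure cast: fast, but throws ClassCastException if streaming is unsupported.
    this as StreamingPromptRunnerOperations

fun PromptRunner.asStreamingWithValidation(): StreamingPromptRunnerOperations =
    // Safe conversion: fails with a descriptive error instead of a raw cast failure.
    this as? StreamingPromptRunnerOperations
        ?: error("${this::class.simpleName} does not support streaming")
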
Code Organization

New Components

  • StreamingPromptRunnerOperations - Core streaming operations interface (sketched after this list)
  • StreamingPromptRunnerOperationsImpl - Implementation bridging API to SPI
  • StreamingChatClientOperations - Spring AI integration layer
  • Extension Functions - Clean alternatives to manual casting

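As a rough guide, this is the shape of StreamingPromptRunnerOperations that the usage example above implies; the real interface likely declares more methods:

import reactor.core.publisher.Flux

interface StreamingPromptRunnerOperations {
    fun withPrompt(prompt: String): StreamingPromptRunnerOperations
    fun <T> createObjectListWithThinking(itemClass: Class<T>): Flux<StreamingEvent<T>>
}
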
Architecture Flow

  LLM Raw Chunks → Line Buffering → Content Classification → Event Generation → User Stream
      Flux<String>     LineBuffer       Thinking vs JSON      StreamingEvent<T>    Subscription

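A minimal sketch of the line-buffering stage of this pipeline, assuming raw chunks arrive as a Flux&lt;String&gt; with line breaks at arbitrary positions; the LineBuffer name comes from the diagram, but this implementation is illustrative:

import reactor.core.publisher.Flux

class LineBuffer {
    private val pending = StringBuilder()

    // Append a chunk and return any complete lines it produced.
    fun feed(chunk: String): List<String> {
        pending.append(chunk)
        val lines = mutableListOf<String>()
        var newline = pending.indexOf("\n")
        while (newline >= 0) {
            lines += pending.substring(0, newline)
            pending.delete(0, newline + 1)
            newline = pending.indexOf("\n")
        }
        return lines
    }
}

// Usage: turn a chunk stream into a line stream (single subscriber assumed,
// since the buffer is stateful).
fun toLines(chunks: Flux<String>): Flux<String> {
    val buffer = LineBuffer()
    return chunks.concatMap { Flux.fromIterable(buffer.feed(it)) }
}
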
OperationContextPromptRunner serves as the bridge between existing code and streaming features, enabling seamless transition from traditional blocking operations to reactive streaming
without requiring changes to existing business logic or tool definitions.

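To illustrate the point, here is a typical Spring AI @Tool definition of the kind that would be picked up by the streaming path without modification; the class and method are hypothetical:

import org.springframework.ai.tool.annotation.Tool

class WeatherTools {
    @Tool(description = "Look up the current temperature for a city")
    fun currentTemperature(city: String): String =
        // A real implementation would call a weather service here.
        "21°C in $city"
}
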
Challenge: Minimal Changes to Existing Artifacts

StreamingJacksonOutputConverter

Workflow

  1. LLM streams JSONL → {"name": "item1"}\n{"name": "item2"}
  2. Converter processes each line → real-time object creation (see the sketch after this list)
  3. Objects emitted → Flux for reactive consumption (possible enhancement: conceal the Flux)
  4. Tools invoked

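A minimal sketch of step 2, assuming complete JSONL lines arrive from the line-buffering stage; the real StreamingJacksonOutputConverter also has to handle thinking content and malformed lines:

import com.fasterxml.jackson.databind.ObjectMapper
import reactor.core.publisher.Flux

// For Kotlin data classes, register jackson-module-kotlin on the mapper.
private val mapper = ObjectMapper()

fun <T> convertJsonl(lines: Flux<String>, itemClass: Class<T>): Flux<T> =
    lines
        .filter { it.isNotBlank() }
        // An exception thrown here is signalled as onError on the stream.
        .map { line -> mapper.readValue(line, itemClass) }
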
Testing Coverage

  • Unit Tests - Streaming capability detection and operations
  • Integration Tests - Tool registration and streaming workflow
  • API Tests - Both traditional and extension function approaches

Remaining Major Tasks

  • Combine the WithExampleConverter format with StreamingJacksonOutputConverter for few-shot examples in streaming
  • Create a common PromptFormatBuilder utility to reduce code duplication between JSON and JSONL format instructions
  • Add tests for mixed content, error cases, and edge conditions
  • Incorporate StreamingUtility into the streaming converter
  • Explore streaming metadata (from the model DB?)
  • Discuss classification as {Object | Thinking | String}; currently, content not classified as {Object | Thinking} is swallowed
  • Implement backpressure strategies for high-volume streaming (see the sketch after this list)
  • Add retry/reconnect logic for transient OpenAI failures
  • Clean up APIs

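For the backpressure and retry items, a sketch of what plain Reactor operators offer; the buffer size and retry policy here are illustrative, not decisions made in this PR:

import java.time.Duration
import reactor.core.publisher.Flux
import reactor.util.retry.Retry

fun <T> withResilience(events: Flux<T>): Flux<T> =
    events
        // Buffer up to 256 events when the subscriber is slower than the LLM.
        .onBackpressureBuffer(256)
        // Retry transient failures with exponential backoff: up to 3 attempts,
        // starting at 1 second. Note that retryWhen resubscribes to the source,
        // which for an LLM stream means re-issuing the request.
        .retryWhen(Retry.backoff(3, Duration.ofSeconds(1)))
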
@igordayen igordayen marked this pull request as draft November 29, 2025 04:40
@igordayen igordayen requested a review from poutsma November 29, 2025 04:58
@igordayen igordayen (Contributor, Author)

Additional documentation: streaming-design

@igordayen igordayen marked this pull request as ready for review November 30, 2025 05:59
@igordayen igordayen mentioned this pull request Nov 30, 2025
@igordayen igordayen (Contributor, Author)

Please see the write-up on the introduction of the Streaming Capability interface: streaming-capability.md

@poutsma poutsma (Contributor) left a comment

Looks good, I only have a few minor suggestions and one major change that I would like to see: the method name change in StreamingPromptRunnerOperations.

That said, I would like to revisit StreamingCapabilities later, once this PR has been merged, as discussed here: embabel/embabel-common#89 (comment)

  • rebased one more time
  • generalized LLM Thinking detection
  • added more tests on reactive patterns: {onNext, onError, onComplete, ...}
  • enhanced logic on multi-chunked JSONL-based objects
  • rebased to agent-api 0.3.1
  • synchronized with test-domain
  • major update to Streaming Chat Client Operations
  • addition of LLM IT test, to be executed manually

sonarqubecloud bot commented Dec 4, 2025

@poutsma poutsma (Contributor) left a comment

This can be merged from my perspective.
