preemptive generation feature #783

simllll · 2025-10-22T13:50:18Z

Description

this allows STTs to send PREFLIGHT_TRANSCRIPT events.
"exact" same implemenation as the python library.

Changes Made

brings the SpeechEventType PREFLIGHT_TRANSCRIPT to the typescript library. but besides this it also enables preempetive generation before VAD end-of-speech or turn detection completes, to start generation early.

~~Btw: I couldn't find any actual PREFLIGHT_TRANSCRIPT events in the python version... Am I missing something?~~
I found it ;-) Deepgram STT feature support for preemptive gen is prepared here: simllll/agents-js@feat/preemtive-gen...simllll:agents-js:feat/preemptive-gen-deepgram-stt

Pre-Review Checklist

Build passes: All builds (lint, typecheck, tests) pass locally
AI-generated code reviewed: Removed unnecessary comments and ensured code quality
Changes explained: All changes are properly documented and justified above
Scope appropriate: All changes relate to the PR title, or explanations provided for why they're included

Testing

Automated tests added/updated (if applicable)
All tests pass
Make sure both restaurant_agent.ts and realtime_agent.ts work properly (for major changes)

Additional Notes

Note to reviewers: Please ensure the pre-review checklist is completed before starting your review.

changeset-bot · 2025-10-22T13:50:22Z

⚠️ No Changeset found

Latest commit: 8e46e3d

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

toubatbrian · 2025-10-24T06:34:55Z

agents/src/llm/chat_context.ts

+  isEquivalent(other: ChatContext): boolean {
+    // Same object reference
+    if (this === other) {
+      return true;
+    }
+
+    // Different lengths
+    if (this._items.length !== other._items.length) {
+      return false;
+    }
+
+    // Compare each item pair
+    for (let i = 0; i < this._items.length; i++) {
+      const a = this._items[i]!;
+      const b = other._items[i]!;
+
+      // IDs and types must match
+      if (a.id !== b.id || a.type !== b.type) {
+        return false;
+      }


This is nice, can you also add a unittest for this function? Inside chat_context.test.ts?

toubatbrian · 2025-10-24T06:48:56Z

agents/src/voice/agent_activity.ts

+    const preemptive = this.preemptiveGeneration;
+    if (preemptive) {
+      // Add the user message to the chat context for comparison
+      const validationChatCtx = this.agent.chatCtx.copy();
+      if (userMessage) {
+        validationChatCtx.insert(userMessage);
+      }
+
+      // Validate: transcript matches, context equivalent, tools unchanged, toolChoice unchanged
+      const transcriptMatches = preemptive.info.newTranscript === info.newTranscript;
+      const contextEquivalent = preemptive.chatCtx.isEquivalent(validationChatCtx);
+      const toolsUnchanged = preemptive.tools === this.agent.toolCtx;
+      const toolChoiceUnchanged = preemptive.toolChoice === this.toolChoice;
+
+      if (transcriptMatches && contextEquivalent && toolsUnchanged && toolChoiceUnchanged) {
+        // Use preemptive generation!
+        const speechHandle = preemptive.speechHandle;
+        this.preemptiveGeneration = undefined;
+
+        const leadTime = Date.now() - preemptive.createdAt;
+        this.logger.info(
+          {
+            transcript: info.newTranscript,
+            leadTimeMs: leadTime,
+            confidence: preemptive.info.transcriptConfidence,
+          },
+          'using preemptive generation',
+        );
+
+        // Schedule the preemptive speech
+        this.scheduleSpeech(speechHandle, SpeechHandle.SPEECH_PRIORITY_NORMAL);
+
+        // Emit metrics
+        const eouMetrics: EOUMetrics = {


Can we make sure we have the parity implementation as in python agent framework? https://github.com/livekit/agents/blob/a9bc03562f498f3666978ad008fc93b2cbbd22a9/livekit-agents/livekit/agents/voice/agent_activity.py#L1384-L1420

toubatbrian · 2025-10-24T06:54:06Z

agents/src/voice/audio_recognition.ts

+          // Update preflight transcript and confidence
+          this.audioPreflightTranscript = `${this.audioTranscript} ${preflightTranscript}`.trim();
+          this.preflightTranscriptConfidence = preflightConfidence;
+
+          // Trigger preemptive generation if conditions are met
+          if (
+            this.hooks.onPreemptiveGeneration &&
+            (this.turnDetectionMode !== 'manual' || this.userTurnCommitted)
+          ) {
+            // Calculate confidence including all final transcripts plus the current preflight


Let's follow the same params naming as in python agent:

# still need to increment it as it's used for turn detection, self._last_final_transcript_time = time.time() # preflight transcript includes all pre-committed transcripts (including final transcript from the previous STT run) self._audio_preflight_transcript = (self._audio_transcript + " " + transcript).lstrip() self._audio_interim_transcript = transcript if not self._vad or self._last_speaking_time == 0: # vad disabled, use stt timestamp self._last_speaking_time = time.time()

preemptive generation based on python library

8e46e3d

simllll changed the title ~~base for preemptive generation~~ feature preemptive generation Oct 22, 2025

simllll changed the title ~~feature preemptive generation~~ preemptive generation feature Oct 22, 2025

simllll mentioned this pull request Oct 22, 2025

deepgram stt PREFLIGHT_TRANSCRIPT event #784

Draft

7 tasks

toubatbrian reviewed Oct 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

preemptive generation feature #783

preemptive generation feature #783

Uh oh!

simllll commented Oct 22, 2025 •

edited

Loading

Uh oh!

changeset-bot bot commented Oct 22, 2025

Uh oh!

toubatbrian Oct 24, 2025

Uh oh!

toubatbrian Oct 24, 2025

Uh oh!

toubatbrian Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

preemptive generation feature #783

Are you sure you want to change the base?

preemptive generation feature #783

Uh oh!

Conversation

simllll commented Oct 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes Made

Pre-Review Checklist

Testing

Additional Notes

Uh oh!

changeset-bot bot commented Oct 22, 2025

⚠️ No Changeset found

Uh oh!

toubatbrian Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

toubatbrian Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

toubatbrian Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

simllll commented Oct 22, 2025 •

edited

Loading