Speech to text #941

ZhiyuAlexZhang · 2026-02-05T01:29:22Z

Context

I am adding a new speech-to-text Input Stream called Speech, using the Web Speech API.

This will enable voice input and appeal to those who can speak a language but are still in the process of learning how to read or write.

Wordplay users can configure the stream through:

reset: Clears accumulated speech when set to true
language: BCP-47 language code (e.g., en-US), defaults to en-US
limit: Max number of words to keep, defaults to infinite

Related issues

N/A, this is a new feature. With that said, the localization team will need to translate the Speech Stream descriptions.

Verification

I tested on both Chrome and Safari, and both work as intended. The testing was primarily done in English, with some additional testing in Chinese, Japanese, Korean, and Spanish.

Checklist

N/A, we are ready to ship! 💯

… for different languages.

amyjko

This is very close! I'm impressed how well you picked up most of the patterns necessary to build a stream; it's one of the more complex parts of the programming language implementation.

Here are the key things to address:

There are several error handling cases in the Speech implementation that either fail silently, or fail by printing to the console. All of these should be localized error values that become a new string value. See Webpage for a pattern for this. Essentially, when there's an error, we want that error to show up as the stream's value on stage. (I think is is more graceful then displaying an exception, since the stream already results in a text value).
I believe your branch is out of sync with main; there are several type errors in see npm run check related to the ColorJS package that are resolved in main, but aren't resolved here.
There are a few places where the we should and shouldn't be using the current locale selected for the UI. See comments for detail.

I will machine translate the strings prior to merging, once the error messages all have appropriate localization strings.

amyjko · 2026-02-07T23:45:18Z

src/components/project/ProjectView.svelte

+            // Choose the selected evaluation locale or if not selected, the user's preferred locales
+            evaluationLocale
+                ? [evaluationLocale]
+                : $locales.getPreferredLocales(),


This should not be using the locales database; projects have locales that they have embedded in them and they aren't necessarily the same as the UI locale chosen. Is there a reason this needed to be changed to use the user's current UI locale as a fallback?

amyjko · 2026-02-07T23:47:19Z

src/input/Speech.ts

+            window.SpeechRecognition || window.webkitSpeechRecognition;
+
+        if (!SpeechRecognitionAPI) {
+            console.warn('Speech recognition not supported in this browser.');


If the API isn't supported, the stream should return an error value as its value, rather than failing silently. See Webpage for an example of how to handle this.

amyjko · 2026-02-07T23:47:36Z

src/input/Speech.ts

+            this.recognition.interimResults = false; // Only final results
+            this.recognition.maxAlternatives = 1;    // Only best match
+            this.recognition.lang = this.languageCode;
+            console.log(


Drop the console messages prior to deployment.

amyjko · 2026-02-07T23:48:03Z

src/input/Speech.ts

+            case 'network':
+                // Network error - could be no internet or service unavailable
+                this.react(
+                    'Network error: Check your internet connection and try again.',


All of these messages should be localized, rather than hard coded as English.

amyjko · 2026-02-07T23:48:22Z

src/input/Speech.ts

+        if (this.retryCount < this.maxRetries && this.on) {
+            this.retryCount++;
+            const delay = this.retryDelay * this.retryCount; // Exponential backoff
+            console.log(


Remove this log before we finalize this PR.

amyjko · 2026-02-07T23:49:00Z

src/input/Speech.ts

+                    try {
+                        this.recognition?.start();
+                    } catch (e) {
+                        console.error('Failed to restart recognition:', e);


This should be a localized string value for the stream, not a console error. We want it to be visible on stage.

amyjko · 2026-02-07T23:49:16Z

src/input/Speech.ts

+            }, delay);
+        } else if (this.retryCount >= this.maxRetries) {
+            // Max retries reached
+            this.react(


This should be a localizes string value for the stream, not a console error.

amyjko · 2026-02-07T23:50:50Z

src/input/Speech.ts

+                // Get the language from the parameter, default to 'en-US'
+                const languageCode =
+                    evaluation.get(languageBind.names, TextValue)?.text ??
+                    'en-US';


I think this should default to the preferred locale in the locales argument passed to createSpeechDefinition. That way, if the creator has Spanish chosen for their UI locale, the default will be Spanish, rather than English. Call locales.getLocale() as the default.

amyjko · 2026-02-07T23:52:51Z

src/locale/en-US.json

+                "Note: Speech recognition requires microphone access and an internet connection in most browsers. Also, there might be usage limits, so don't leave it on for too long!"
+            ],
+            "names": ["🗣️", "Speech", "Voice"],
+            "reset": {


Need to run npm run schemas to update the schema to allow these input keys.

amyjko · 2026-02-07T23:53:08Z

static/locales/de-DE/de-DE.json

+        "Speech": {
+            "doc": ["$?"],
+            "names": ["$?"],
+            "reset": {


I will translate these before merging.

ZhiyuAlexZhang added 5 commits October 29, 2025 16:30

Initial commit

fdf3666

Cleaned up examples

365cfbf

HOLD: Added try-catch blocks to handle errors. But needs optimization…

7549fd3

… for different languages.

Added switching languages and placeholders

cc3afbc

Added sliding window, language support, and comments

ec3e215

amyjko self-assigned this Feb 7, 2026

amyjko requested changes Feb 7, 2026

View reviewed changes

Merge remote-tracking branch 'upstream/main' into speech-to-text

ac2f6e4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speech to text #941

Speech to text #941

Uh oh!

ZhiyuAlexZhang commented Feb 5, 2026

Uh oh!

amyjko left a comment

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

amyjko Feb 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Speech to text #941

Are you sure you want to change the base?

Speech to text #941

Uh oh!

Conversation

ZhiyuAlexZhang commented Feb 5, 2026

Context

Related issues

Verification

Checklist

Uh oh!

amyjko left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants