refactor: update tts asr api and samples #73

666ycy · 2026-01-07T03:38:50Z

Description

Checklist

Follow the CONTRIBUTING Guide.
Make your Pull Request title follow the https://www.conventionalcommits.org/ specification.
Ensure the tests pass (Run mvn clean test from the repository root)
Appropriate docs were updated (if necessary)

Fixes #<issue_number>

Add or Update API

I have added the necessary test case and all cases have passed.

tomsun28 · 2026-01-07T06:09:16Z

Please run `mvn spring-javaformat:apply` to fix the code style.

samples/src/main/ai.z.openapi.samples/AudioSpeechStreamExample.java

Copilot

Pull request overview

This PR refactors the Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) APIs by updating model identifiers, enhancing request parameters, and providing comprehensive sample implementations for both streaming and non-streaming scenarios.

Updated ASR and TTS model identifiers to newer versions (glm-asr-2512 and glm-tts)
Enhanced AudioTranscriptionRequest with new fields: fileBase64, prompt, and hotwords to support advanced transcription features
Simplified response data structures by removing unused fields from AudioTranscriptionChunk, AudioTranscriptionResult, and ChatFunction
Added four comprehensive sample files demonstrating both streaming and blocking audio operations
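To make the new transcription fields concrete, here is a minimal sketch of how inline audio might be packed into a request. Only the field names `fileBase64`, `prompt`, and `hotwords` and the model identifier `glm-asr-2512` come from this PR. The class `TranscriptionRequestSketch`, its constructor, the `fromBytes` helper, and the `List<String>` type for hotwords are hypothetical stand-ins, not the SDK's actual `AudioTranscriptionRequest` API.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Base64;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for the SDK request class; only the field names
// (fileBase64, prompt, hotwords) are taken from the PR overview.
final class TranscriptionRequestSketch {
    final String model;
    final String fileBase64;
    final String prompt;
    final List<String> hotwords; // assumed type; the real field may differ

    TranscriptionRequestSketch(String model, String fileBase64, String prompt, List<String> hotwords) {
        this.model = model;
        this.fileBase64 = fileBase64;
        this.prompt = prompt;
        this.hotwords = Collections.unmodifiableList(new ArrayList<>(hotwords));
    }

    // Encodes raw audio bytes with the standard Base64 alphabet so they can
    // travel inline in the request instead of as a file upload.
    static TranscriptionRequestSketch fromBytes(byte[] audio, String prompt, List<String> hotwords) {
        String encoded = Base64.getEncoder().encodeToString(audio);
        return new TranscriptionRequestSketch("glm-asr-2512", encoded, prompt, hotwords);
    }
}

final class TranscriptionRequestDemo {
    public static void main(String[] args) {
        // Placeholder bytes; a real caller would read an audio file instead.
        byte[] fakeAudio = "not real audio".getBytes(StandardCharsets.UTF_8);
        TranscriptionRequestSketch request = TranscriptionRequestSketch.fromBytes(
                fakeAudio, "Meeting about the release plan", Arrays.asList("GLM", "TTS"));
        System.out.println(request.model + ": " + request.fileBase64.length() + " Base64 characters");
    }
}
```

Whether the real API accepts `fileBase64` alongside a file upload or instead of one is not stated in the overview, so the sketch sets only the inline field.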

Reviewed changes

Copilot reviewed 12 out of 13 changed files in this pull request and generated 6 comments.

| File | Description |
| --- | --- |
| ChatCompletionExample.java | Added debug print statement for request object |
| AudioTranscriptionsStreamExample.java | New sample demonstrating streaming audio transcription with chunk processing |
| AudioTranscriptionsExample.java | New sample showing basic audio transcription with result processing |
| AudioSpeechStreamExample.java | New sample for streaming TTS conversion with real-time audio chunk handling |
| AudioSpeechExample.java | Enhanced with explicit stream and responseFormat configuration |
| ChatFunction.java | Removed unused `required` field (now properly located in ChatFunctionParameters) |
| AudioTranscriptionResult.java | Removed unused `segments` field to simplify response structure |
| AudioTranscriptionRequest.java | Added new fields for advanced features: fileBase64, prompt, hotwords; updated duration limit documentation from 60s to 30s |
| AudioTranscriptionChunk.java | Simplified structure by removing `choices` field, using direct delta string |
| AudioSpeechStreamingResponse.java | Changed generic type from ObjectNode to ModelData for better type safety |
| AudioServiceImpl.java | Added voice validation, updated file extension handling to use dynamic responseFormat |
| Constants.java | Updated model identifiers: `glm-asr` → `glm-asr-2512`, `cogtts` → `glm-tts` with updated documentation |
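The AudioServiceImpl.java row mentions voice validation and a file extension derived from `responseFormat`. The sketch below shows one way such checks could look. It is not the code from this PR: the class `SpeechOutputSketch`, both method names, the exception type, and the `wav` fallback are assumptions made for illustration.

```java
import java.util.Locale;

// Hypothetical helper; not the SDK's AudioServiceImpl.
final class SpeechOutputSketch {

    // Rejects a missing or blank voice before any request is sent.
    static String requireVoice(String voice) {
        if (voice == null || voice.trim().isEmpty()) {
            throw new IllegalArgumentException("voice must not be empty");
        }
        return voice.trim();
    }

    // Builds the output file name from the requested format rather than a
    // hard-coded extension. The "wav" fallback is an assumption.
    static String fileNameFor(String baseName, String responseFormat) {
        String ext = (responseFormat == null || responseFormat.trim().isEmpty())
                ? "wav"
                : responseFormat.trim().toLowerCase(Locale.ROOT);
        return baseName + "." + ext;
    }

    public static void main(String[] args) {
        System.out.println(requireVoice(" demo-voice "));
        System.out.println(fileNameFor("speech", "PCM"));
        System.out.println(fileNameFor("speech", null));
    }
}
```

Validating the voice up front turns a remote API error into an immediate local one, and deriving the extension from `responseFormat` keeps the saved file consistent with the audio encoding that was requested.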

samples/src/main/ai.z.openapi.samples/AudioSpeechStreamExample.java

core/src/main/java/ai/z/openapi/service/audio/AudioServiceImpl.java

samples/src/main/ai.z.openapi.samples/ChatCompletionExample.java

core/src/main/java/ai/z/openapi/service/audio/AudioServiceImpl.java

core/src/main/java/ai/z/openapi/service/audio/AudioSpeechStreamingResponse.java

tomsun28

LGTM!

666ycy and others added 8 commits December 31, 2025 12:52

test:ceshi

3401da2

Update README_CN.md

a19a765

Merge remote-tracking branch 'origin/test' into test

ce91ca9

fix:Improve text-to-speech functionality

25b9742

fix:Improve text-to-speech functionality

4cbaed1

fix:Improve text-to-speech and speech-to-text functionalities.

2596c0c

fix: Add supplementary annotation information

45b40c8

Update README_CN.md

34e4284

tomsun28 changed the title ~~Update samples~~ refactor: update tts asr api and samples Jan 7, 2026

tomsun28 requested a review from Copilot January 7, 2026 06:08

Copilot started reviewing on behalf of tomsun28 January 7, 2026 06:08

tomsun28 reviewed Jan 7, 2026

samples/src/main/ai.z.openapi.samples/AudioSpeechStreamExample.java (outdated)

Copilot AI reviewed Jan 7, 2026

666ycy added 4 commits January 7, 2026 15:30

fix: Modify code

f017aa9

fix: Modify code

9f261f2

fix: Modify code

7ed0bd5

fix: Modify code

5dbc718

tomsun28 approved these changes Jan 7, 2026

tomsun28 merged commit 18ca6db into main Jan 7, 2026
3 checks passed

tomsun28 deleted the update-samples branch January 7, 2026 08:18

3 participants
