Feature/headless transcribe cli (for use in OpenClaw)#952
Open
mane81 wants to merge 2 commits intocjpais:mainfrom
Open
Feature/headless transcribe cli (for use in OpenClaw)#952mane81 wants to merge 2 commits intocjpais:mainfrom
mane81 wants to merge 2 commits intocjpais:mainfrom
Conversation
|
Check out https://k2-fsa.github.io/sherpa/onnx/index.html which is more suitable for automation and has a lot of tools and features. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Before Submitting This PR
If this is a feature or change that was previously closed/rejected:
Human Written Description
I built this for my personal local automation setup with OpenClaw.
My main use case is receiving voice messages (Telegram/WhatsApp) and having OpenClaw transcribe them reliably via a deterministic CLI call, instead of UI/hotkey interaction.
This also helps avoid running multiple separate STT stacks in parallel and keeps my workflow simpler while still using Handy’s local model selection.
Related Issues/Discussions
Fixes #N/A
Discussion: N/A (happy to open one if required before merge)
Community Feedback
No dedicated discussion thread yet.
If preferred, I can open one first and collect feedback there before merge.
Testing
Tested locally on macOS (Apple Silicon):
cargo check -qpassescargo run -- --transcribe-file /path/to/audio.wav --format textreturns transcript to stdoutcargo run -- --transcribe-file /path/to/audio.wav --format jsonreturns JSON outputsettings_store.json--model-idoverride worksScreenshots/Videos (if applicable)
N/A (CLI-only change)
AI Assistance
If AI was used: