Skip to content

v1.1.0

Latest

Choose a tag to compare

@perrette perrette released this 22 May 22:56
· 2 commits to main since this release

Highlights

  • --debug flag logs one line per STT request (model, language, prompt, audio duration) so you can see exactly what's being sent. Output to stderr via the scribe logger.
  • Streaming recipes documented: Balanced (default) and Patient profiles, with the cross-chunk context trade-off explained.

New features

  • --words auto-formatting for backends that fold words into the prompt (whisper-futo, openai, groq). A bare wordlist Tierney Comet is now rendered as "Tierney, Comet." — the trailing period biases Whisper toward punctuated output. faster-whisper's hotwords channel is unchanged.
  • --stream-first-chunk-min (default 3.0 s) raises the floor for the very first chunk of a streaming thread, giving Whisper enough audio for a punctuated bootstrap transcript whose tail seeds the rolling prompt. Inactive in Patient mode (--stream-context-length 0).
  • Options → Prompt submenu in the tray with file pickers for prompt.txt / words.txt and a Reload action so editing those files in a text editor is a one-click refresh.
  • Menu presets: 15 s added to Chunk max picker, 8× added to Context reset picker.

Fixes

  • Auto-discovered prompt.txt / words.txt paths now persist into the argparse namespace at startup, so the tray menu correctly shows the loaded file instead of (none).

Docs

  • docs/backends.md — new Prompt style biases output style section (why your transcripts may come back unpunctuated and what to do about it), plus Streaming recipes — two profiles.
  • docs/cli.md — new --stream-first-chunk-min row.