WhisperSpeech web UI

Web UI for WhisperSpeech (https://github.com/collabora/WhisperSpeech)

Info

Note

Version 2.x now allows voice generation via API.

Test platforms:

Name	Info
CPU	AMD Ryzen 7900X3D (iGPU disabled in BIOS)
GPU	AMD Radeon 7900XTX
RAM	64GB DDR5 6600MHz
Motherboard	ASRock B650E PG Riptide WiFi (3.18.AS02 Beta)
OS	Ubuntu 24.04.2 LTS
Kernel	6.11.0-17-generic
ROCm	6.3.1

Name	Info
CPU	IntelCore i5-12500H
GPU	NVIDIA GeForce RTX 4050
RAM	16GB DDR4 3200MHz
Motherboard	GIGABYTE G5 MF (BIOS FB10)
OS	Ubuntu 24.04.2 LTS
Kernel	6.11.0-17-generic
NVIDIA Driver	550.120
CUDA	12.4

Instalation:

1. Install Python 3.12

2. Clone repository

3. Mount the repository directory.

3. Create and activate venv

4. For ROCm set HSA_OVERRIDE_GFX_VERSION. For the Radeon 7900XTX:

export HSA_OVERRIDE_GFX_VERSION=11.0.0

5. Install ffmpeg:

Ubuntu 24.04/24.10:

sudo apt install ffmpeg

6. Install requirements

CPU (not recommended):

pip install -r requirements.txt

CUDA 12.4:

pip install -r requrements_cuda_12.4.txt

ROCm 6.2.4

pip install -r requirements_rocm_6.2.4.txt

7. Run:

python webui.py

With -h or --help for help:

python webui.py -h

GUI tanslation:

Languages
English
Polish

1. Install PyBabel:

pip install babel==2.16.0

2. Extract messages.pot:

pybabel extract -F babel.cfg -o ./locale/messages.pot .

3. Create new:

pybabel init -i ./locale/messages.pot -d ./locale -l pl_PL
# Replace pl_PL by your language

4. Compile:

pybabel compile -d ./locale

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
locale		locale
outputs		outputs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
api_example.html		api_example.html
babel.cfg		babel.cfg
requirements.txt		requirements.txt
requirements_cuda_12.4.txt		requirements_cuda_12.4.txt
requirements_rocm_6.2.4.txt		requirements_rocm_6.2.4.txt
screenshot.png		screenshot.png
webui.py		webui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WhisperSpeech web UI

Info

Test platforms:

Instalation:

GUI tanslation:

About

Releases 7

Languages

License

Mateusz-Dera/whisperspeech-webui

Folders and files

Latest commit

History

Repository files navigation

WhisperSpeech web UI

Info

Test platforms:

Instalation:

GUI tanslation:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 7

Languages