Skip to content

Mateusz-Dera/whisperspeech-webui

Repository files navigation

WhisperSpeech web UI

Web UI for WhisperSpeech (https://github.com/collabora/WhisperSpeech)

Preview

Info

Version

Note

Version 2.x now allows voice generation via API.

Test platforms:

Name Info
CPU AMD Ryzen 7900X3D (iGPU disabled in BIOS)
GPU AMD Radeon 7900XTX
RAM 64GB DDR5 6600MHz
Motherboard ASRock B650E PG Riptide WiFi (3.18.AS02 Beta)
OS Ubuntu 24.04.2 LTS
Kernel 6.11.0-17-generic
ROCm 6.3.1
Name Info
CPU IntelCore i5-12500H
GPU NVIDIA GeForce RTX 4050
RAM 16GB DDR4 3200MHz
Motherboard GIGABYTE G5 MF (BIOS FB10)
OS Ubuntu 24.04.2 LTS
Kernel 6.11.0-17-generic
NVIDIA Driver 550.120
CUDA 12.4

Instalation:

1. Install Python 3.12

2. Clone repository

3. Mount the repository directory.

3. Create and activate venv

4. For ROCm set HSA_OVERRIDE_GFX_VERSION. For the Radeon 7900XTX:

export HSA_OVERRIDE_GFX_VERSION=11.0.0

5. Install ffmpeg:

Ubuntu 24.04/24.10:

sudo apt install ffmpeg

6. Install requirements

CPU (not recommended):

pip install -r requirements.txt

CUDA 12.4:

pip install -r requrements_cuda_12.4.txt

ROCm 6.2.4

pip install -r requirements_rocm_6.2.4.txt

7. Run:

python webui.py

With -h or --help for help:

python webui.py -h

GUI tanslation:

Languages
English
Polish

1. Install PyBabel:

pip install babel==2.16.0

2. Extract messages.pot:

pybabel extract -F babel.cfg -o ./locale/messages.pot . 

3. Create new:

pybabel init -i ./locale/messages.pot -d ./locale -l pl_PL
# Replace pl_PL by your language

4. Compile:

pybabel compile -d ./locale