A desktop application that automatically constructs a timeline using a script and audio files, combining images and subtitles to create a video (MP4).
- Script-Based Auto Alignment: Aligns audio to text using Whisper and VAD.
- Timeline Editing: Edit audio, subtitle, and image clips via drag-and-drop.
- Subtitle Auto-Completion: Generates subtitle clips with intelligent line breaks using morphological analysis.
- Batch Image Application: Automatically place images matching audio clips.
- Video Rendering: Export as MP4 using FFmpeg.
- Project Save/Load: Save and resume work via
.pbbproject files.
Narrator: Once upon a time, there was a brave rabbit.
Rabbit: Hello, world! I am going on an adventure.
Fox: Wait for me!
Each line follows Speaker Name: Dialogue. Lines not matching this format may be ignored.
- Python 3.10+, PyQt6
- faster-whisper, Silero VAD, Pydub
- KiwiPiePy (Korean NLP, LGPL v3)
- FFmpeg (system install required)
Prerequisites: Python 3.10+, FFmpeg in PATH
git clone https://github.com/kangjoseph90/PictureBookBuilder.git
cd PictureBookBuilder
pip install -r requirements.txt
python src/main.py- New Project:
File>New Project - Load Script: Load a
.txtscript file from the left panel. - Link Audio: Assign audio files to each speaker.
- Process:
Tools>Start Processing(F5) - Add Images: Load folder and use
Tools>Batch Apply Images - Edit: Adjust clips and subtitles on the timeline.
- Export:
Export>Render Video(F9)
GPL v3. See LICENSE for details.
Includes: PyQt6 (GPL v3), KiwiPiePy (LGPL v3)