Author : Cristopher Benge
Columbia University in the City of New York
Podcast Creator transforms your PDFs into podcasts and lets you customize the tone (or "mood"), the length, and the voices. Translation from English to French, German, and Spanish is built in, and the final result also includes the generated script.
Podcast Creator is a locally hosted Flask web application (Python, HTML, JavaScript, and CSS) that uses Google Vertex AI (Text-to-Speech API), the Google Cloud Translation API, and Anthropic's Claude 3.5 Sonnet to transform PDFs into interesting podcast scripts. Pydub stitches the segments of the podcast script together into a lively conversation with natural-sounding transitions between host and guest.
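To illustrate the stitching step, here is a minimal sketch of joining per-speaker audio segments with a short pause between them. It is not the application's code: it uses only the standard-library `wave` module as a dependency-free stand-in for Pydub, and the function names, the 24 kHz mono 16-bit format, and the 300 ms pause are all assumptions made for the example.

```python
import io
import wave

# Assumed audio format for every segment (hypothetical values for this sketch).
SAMPLE_RATE = 24000  # frames per second
SAMPLE_WIDTH = 2     # bytes per sample (16-bit PCM)
CHANNELS = 1         # mono


def silence(ms: int) -> bytes:
    """Return raw PCM bytes of silence lasting `ms` milliseconds."""
    frames = SAMPLE_RATE * ms // 1000
    return b"\x00" * (frames * SAMPLE_WIDTH * CHANNELS)


def read_pcm(wav_bytes: bytes) -> bytes:
    """Extract raw PCM frames from an in-memory WAV file, checking its format."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        fmt = (wav.getframerate(), wav.getsampwidth(), wav.getnchannels())
        if fmt != (SAMPLE_RATE, SAMPLE_WIDTH, CHANNELS):
            raise ValueError(f"unexpected WAV format: {fmt}")
        return wav.readframes(wav.getnframes())


def stitch(segments: list[bytes], pause_ms: int = 300) -> bytes:
    """Join WAV segments in order, inserting a pause between consecutive ones."""
    pcm = silence(pause_ms).join(read_pcm(seg) for seg in segments)
    out = io.BytesIO()
    with wave.open(out, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)
    return out.getvalue()
```

Calling `stitch([host_wav, guest_wav])` with two in-memory WAV files returns a single WAV whose length is the sum of both segments plus the pause. With Pydub the same idea is usually expressed by concatenating `AudioSegment` objects.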
I used Poetry as my dependency manager, but have also provided a Pipfile and requirements.txt to help set up your local environment. There are a few additional steps you'll need to take to make the application fully functional:
- Because the application uses Google Cloud APIs, you will need to create a GCP project and enable the Cloud Text-to-Speech and Cloud Translation APIs. You will also need to acquire an API key from Anthropic and provide it in the next step.
- Set up your `.env` file to look like this:
    ANTHROPIC_API_KEY="<YOUR ANTHROPIC API KEY>"
- Ensure you have the requisite libraries installed. I recommend `poetry` for virtual environment management; a `requirements.txt` is provided as well:
    pip install -r requirements.txt
- Once the environment is set up, you can start the app by running:
    python app.py
- Navigate to localhost:5000, and you should see the landing page.
- Clicking "Get Started" will take you to the main screen; just follow the prompts and create your podcast!
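Before starting the app, it can help to confirm that the `.env` file is in place and the key has been filled in. The following is a minimal sketch using only the standard library; the helper names `load_env` and `missing_keys` are hypothetical, and the application itself may load its environment differently (for example through a dotenv library).

```python
from pathlib import Path


def load_env(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY="value" lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(env: dict[str, str],
                 required: tuple[str, ...] = ("ANTHROPIC_API_KEY",)) -> list[str]:
    """Return required keys that are absent, empty, or still a <PLACEHOLDER>."""
    return [k for k in required if not env.get(k) or env[k].startswith("<")]
```

Running `missing_keys(load_env())` from the project directory returns an empty list when the key is set, and `["ANTHROPIC_API_KEY"]` when the placeholder from the template has not been replaced.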
Licensed under the MIT License. See LICENSE file for more details.



