🔊 Transform text into natural speech with multiple AI providers of your choice - Replicate, OpenAI, ElevenLabs
TTS Studio is a modern web application that converts text into natural-sounding speech using advanced AI models. Powered by models from leading providers like Replicate, OpenAI, and ElevenLabs, it offers an intuitive interface for generating high-quality audio from text.
- 🚀 Instant Deployment: Deploy with a single click using Vercel's seamless platform integration
- 🎯 Multiple Providers: Choose from Replicate (Kokoro), OpenAI TTS, or ElevenLabs TTS for text-to-speech conversion
- 🗣️ Rich Voice Selection: Access a variety of voices across different providers
- ⚡ Real-time Generation: Generate audio quickly with advanced AI models
- 💾 Easy Downloads: Save generated audio files directly to your device
- 🎛️ Customization: Adjust speech speed and other parameters
- 📱 Responsive Design: Works seamlessly on desktop and mobile devices
Deploy 🎙️ TTS Studio with a single click using Vercel's powerful and scalable environment.
For detailed deployment instructions, check out the Next.js deployment documentation.
- Node.js 18+ installed
- API keys from providers you want to use:
-
Clone the repository:
git clone https://github.com/Justmalhar/tts-studio.git cd tts-studio
-
Install dependencies:
npm install
-
Set up environment variables:
cp .env.example .env.local
-
Add your API keys to
.env.local
:REPLICATE_API_KEY=your_replicate_api_key OPENAI_API_KEY=your_openai_api_key ELEVENLABS_API_KEY=your_elevenlabs_api_key
-
Start the development server:
npm run dev
-
Open http://localhost:3000 in your browser 🎉
Create a .env.local
file in the root directory:
cp .env.example .env.local
Add your API keys:
REPLICATE_API_KEY=your_replicate_api_key
OPENAI_API_KEY=your_openai_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
For Vercel deployment:
- Go to your project settings
- Navigate to the Environment Variables section
- Add the same variables as above
- Redeploy your application
- 🔑 Setup your API keys for the provider you want to use in
.env.local
file or Vercel Environment Variables - 📝 Type or paste your text in the input field
- 🎤 Select a voice from the available options
- ⚙️ Adjust the speech settings (speed, etc.)
- 🎵 Click "Generate Audio" to create your speech
- 💾 Use the player controls or download the audio file
- Advanced text-to-speech model (82M params)
- Optimized for long-form content
- Multiple voice options
- Learn more about Kokoro
- State-of-the-art TTS technology
- Multiple voices and languages
- High-quality audio output
- OpenAI TTS Documentation
- Professional-grade voice synthesis
- 32+ languages supported
- Advanced customization options
- ElevenLabs Documentation
Contributions are welcome! Please feel free to submit a Pull Request. For major changes:
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- 🎯 Replicate for the Kokoro model
- 🤖 OpenAI for their TTS API
- 🎤 ElevenLabs for their voice technology
- 🎨 Tailwind CSS for the styling
- 🎭 Framer Motion for animations
- Twitter/X: @justmalhar 🛠
- LinkedIn: Malhar Ujawane 💻