Nituhimani/Voice-Text

Speech-to-Text Web Application

A web application that converts speech to text using Google's Speech Recognition API. Audio is recorded in the browser and transcribed by a Flask backend, and the result is shown in a modern, responsive interface.

Features

  • Real-time audio recording through the browser
  • Speech-to-text conversion using Google's Speech Recognition API
  • Modern and responsive user interface
  • Automatic audio format conversion for compatibility
  • Error handling and user feedback
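
As a rough illustration of the conversion and transcription features listed above, the following sketch converts a recording to WAV with pydub and passes it to Google's recognizer through the SpeechRecognition package. It is not the code in app.py: the function names `wav_target` and `transcribe_recording`, the returned dictionary shape, and the error messages are all assumptions made for this example.

```python
from pathlib import Path


def wav_target(src_path):
    """Return the path the converted WAV file would be written to (hypothetical helper)."""
    return Path(src_path).with_suffix(".wav")


def transcribe_recording(src_path):
    """Convert a recording to WAV, then send it to Google's recognizer.

    Returns a dict with either a "text" key or an "error" key (assumed shape).
    """
    # Third-party imports live inside the function so that wav_target()
    # can be used even when these packages are not installed.
    import speech_recognition as sr
    from pydub import AudioSegment

    wav_path = wav_target(src_path)
    # pydub delegates decoding to ffmpeg, so ffmpeg must be on PATH.
    AudioSegment.from_file(str(src_path)).export(str(wav_path), format="wav")

    recognizer = sr.Recognizer()
    with sr.AudioFile(str(wav_path)) as source:
        audio = recognizer.record(source)  # read the whole file into memory

    try:
        return {"text": recognizer.recognize_google(audio)}
    except sr.UnknownValueError:
        # Raised when the speech could not be understood.
        return {"error": "Could not understand the audio."}
    except sr.RequestError as exc:
        # Raised when the recognition service could not be reached.
        return {"error": f"Speech service request failed: {exc}"}
```

Returning an error entry instead of raising lets a caller hand the message straight to the user interface, which is one way to provide the user feedback mentioned in the feature list.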

Prerequisites

  • Python 3.x
  • pip (Python package installer)
  • ffmpeg (for audio processing)

Installation

  1. Clone the repository:
git clone <your-repository-url>
cd <repository-name>
  2. Install the required Python packages:
pip install -r requirements.txt
  3. Install ffmpeg (if not already installed):
  • On macOS:
brew install ffmpeg
  • On Ubuntu/Debian:
sudo apt-get install ffmpeg
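
Once the steps above are complete, a short script can confirm that ffmpeg is reachable before the app is started. This is a minimal sketch using only the Python standard library. The helper names `find_ffmpeg` and `ffmpeg_version` are made up for this example and are not part of the project.

```python
import shutil
import subprocess


def find_ffmpeg():
    """Return the full path to ffmpeg if it is on PATH, else None."""
    return shutil.which("ffmpeg")


def ffmpeg_version(executable):
    """Return the first line printed by `ffmpeg -version`."""
    result = subprocess.run(
        [executable, "-version"], capture_output=True, text=True, check=True
    )
    lines = result.stdout.splitlines()
    return lines[0] if lines else ""


if __name__ == "__main__":
    path = find_ffmpeg()
    if path is None:
        print("ffmpeg not found on PATH; install it before starting the app.")
    else:
        print(ffmpeg_version(path))
```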

Usage

  1. Start the Flask server:
python app.py
  2. Open your web browser and navigate to:
http://127.0.0.1:5000
  3. Click the "Start Recording" button and speak
  4. Click "Stop Recording" when finished
  5. The transcribed text will appear below
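
If the page does not load, it can help to check whether the server is answering at all. The sketch below polls the address from step 2 with the standard library. The function name `server_is_up` and the two-second timeout are assumptions for this example.

```python
import urllib.error
import urllib.request


def server_is_up(url="http://127.0.0.1:5000", timeout=2.0):
    """Return True if an HTTP server answers at `url`, else False."""
    # An empty ProxyHandler bypasses any configured proxy, since the
    # target is a local development server.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, even if with an error status.
        return True
    except OSError:
        # Connection refused, timeout, or another network failure.
        return False


if __name__ == "__main__":
    print("server reachable" if server_is_up() else "server not reachable")
```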

Project Structure

├── app.py              # Flask backend
├── requirements.txt    # Python dependencies
├── static/
│   └── style.css      # CSS styles
└── templates/
    └── index.html     # Frontend template
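
By default, Flask looks for templates in a templates/ folder and static assets in a static/ folder next to the application, so a misplaced file usually shows up as a missing page or missing styles. The following sketch checks a checkout against the layout above. The file list comes from the tree. The name `missing_files` is hypothetical.

```python
from pathlib import Path

# Files taken from the project tree shown above.
EXPECTED_FILES = [
    "app.py",
    "requirements.txt",
    "static/style.css",
    "templates/index.html",
]


def missing_files(root="."):
    """Return the expected files that are absent under `root`."""
    root = Path(root)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_files()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("Project layout looks complete.")
```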

Technologies Used

  • Python
  • Flask
  • SpeechRecognition
  • PyAudio
  • pydub
  • HTML5
  • CSS3
  • JavaScript

License

This project is open source and available under the MIT License.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
