Audio-to-Notes

Audio-to-Notes is a Python application that streamlines the process of transcribing audio files and generating concise notes using state-of-the-art AI models. It features a simple command-line interface for processing audio files, and leverages OpenAI Whisper for transcription and OpenAI's GPT-4o for note generation.

Features

Audio File Support: Accepts a wide range of audio formats (e.g., mp4a, wav, mp3, flac, m4a, ogg, etc.).
Automatic Processing: Monitors a folder for new audio files and processes them automatically.
Automatic Transcription: Uses OpenAI Whisper for accurate speech-to-text transcription.
AI-Powered Notes: Summarizes transcriptions into clear, concise notes using OpenAI's GPT-4o API.
Organized Output: Saves both transcription and notes with filenames containing the original title, current date/time, and a descriptive suffix.
Robust File Handling: Handles audio conversion (via ffmpeg) and long file chunking for optimal processing.

Requirements

Python 3.8+
openai-whisper (for speech-to-text transcription)
openai Python package (for GPT-4o note generation)
ffmpeg (system package, for audio conversion)
ffmpeg-python (Python wrapper for ffmpeg)

Installation

Clone the repository:
```
git clone <repo-url>
cd audio-to-notes
```
Install Python dependencies:
```
pip install -r requirements.txt
```

Install system dependencies:

Install ffmpeg:

sudo apt-get update && sudo apt-get install ffmpeg

Set your OpenAI API key:
- Obtain an API key from OpenAI.
- Set it as an environment variable:
```
export OPENAI_API_KEY=your-key-here
```

Usage

Run the application:
```
python app.py
```
Using the GUI:
- Click the button to select an audio file (any supported format).
- Enter a descriptive title in the textbox.
- Click the button to start processing.
- The app will:
  - Convert the audio to a compatible format if needed.
  - Transcribe the audio using OpenAI Whisper.
  - Save the transcription as <title>-<datetime>-transcription.txt.
  - Generate notes from the transcription using OpenAI GPT-4o.
  - Save the notes as <title>-<datetime>-notes.txt.
- All output files are saved in the current working directory.

Example

Suppose you have an audio file meeting.m4a and want to generate notes titled "Team Meeting". The app will produce files like:

Team Meeting-2025-05-18-15-30-12-transcription.txt
Team Meeting-2025-05-18-15-30-12-notes.txt

Troubleshooting

ffmpeg Not Found: Install ffmpeg using your system package manager (see above).
OpenAI API Errors: Make sure your API key is valid and has sufficient quota.
tkinter Not Installed: On some Linux systems, install with sudo apt-get install python3-tk.

License

This app: MIT License

Acknowledgments

Contributing

Pull requests and issues are welcome! Please open an issue to discuss major changes first.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
__pycache__		__pycache__
processing		processing
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Audio-to-Notes

Features

Requirements

Installation

Usage

Example

Troubleshooting

License

Acknowledgments

Contributing

About

Uh oh!

Releases

Packages

Languages

GaryGealy/audio-to-notes

Folders and files

Latest commit

History

Repository files navigation

Audio-to-Notes

Features

Requirements

Installation

Usage

Example

Troubleshooting

License

Acknowledgments

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages