This repository contains the code for Yoda, a Conversation Co-Pilot application that utilizes Gradio for audio transcription and response generation based on OpenAI models.
Yoda is a Conversation Co-Pilot that assists users in transcribing spoken audio and generating context-aware, polite, and assertive responses to facilitate workplace conversations.
- Audio transcription using OpenAI's Whisper ASR model.
- Response generation for workplace scenarios using OpenAI's GPT-based models.
- Real-time interaction through Gradio's user-friendly interface.
- Hugging Face (HugginFace) for transformer models
- Whisper for Automatic Speech Recognition (ASR)
- TogetherAI for collaborative features
- LangChain for language processing
Ensure you have the following prerequisites installed before using Yoda:
- Python
- Gradio
- Transformers library
- OpenAI API key
Clone the repository:
git clone https://github.com/fzinnah17/Yoda
Install the required dependencies:
pip install -r requirements.txt
Set your Open AI API Key:
export OPENAI_API_KEY = "your-api-key-here"
Run Yoda, the Conversation Co-Pilot:
python app.py
Yoda.-.Conversation.Copilot.mov
Launch Yoda by running the provided command. Speak into the microphone to trigger audio transcription. Experience real-time, context-aware response generation based on workplace scenarios.
The future work for Yoda, the Conversation Co-Pilot, includes:
- Implementing a vector database to store entire conversations.
- Allowing the Language Model (LLM) to reference back to the stored conversations.
- Summarizing chat history and providing access to the most recent dialogue.
If you like to contribute to Yoda, the Conversation Co-Pilot, please follow the guidelines outlined in the CONTRIBUTING.md file.
This project is licensed under the MIT License - see the LICENSE file for details.