Skip to content

Athene is a voice-controlled AI assistant that combines speech recognition, LLMs, and text-to-speech to help you with your everyday tasks.

Notifications You must be signed in to change notification settings

LatentSpaceExplorer/athene

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Athene

Athene is a voice-controlled AI assistant that can manage your tasks, read to you your clipboard content, and engage in natural conversations using LLMs.

Features

  • Voice recognition and processing with dual input methods:
    • Hold-to-record button in the application window
    • Global hotkey (Alt+Y) that works system-wide
  • Natural language task management with persistent storage
  • Text-to-speech responses using Kokoro
  • Read text from clipboard
  • General conversation
  • Modular plugin system for easy extension

Usage

To record a message, you can either:

  • Hold the "Hold to Record" button in the application window
  • Hold Alt+Y anywhere on your system (works even when Athene is not focused)

Release the button or key combination to process and receive a response.

Prerequisites

  • Python 3.12 or higher
  • NVIDIA GPU with CUDA support (recommended)
  • Groq API key for LLM access

Installation

  1. Clone the repository

  2. Install dependencies using uv (recommended) or pip:

uv sync
  1. Set up environment variables:

    • Create a .env file
    • Add your Groq API key: GROQ_API_KEY=your_key_here
  2. Download Kokoro models:

    • Create a models/kokoro directory
    • Add required model files:
      • kokoro-v1.0.onnx
      • voices-v1.0.bin
  3. Run the application:

uv run ./athene.py

Supported Commands

  • Task Management: Add, remove, and list tasks
  • Clipboard: Read your clipboard text
  • General Conversation: Natural dialogue on various topics

Planned Features

  • Add support alternative LLMs backends (e.g. OpenAI, local)
  • Voice activation by name (e.g. "Athene")
  • Recurring tasks
  • Additional plugins:
    • Reminders
    • Day planning
    • Habit tracking
    • Internet search
    • File reading

Technical Stack

  • Speech Recognition: SpeechRecognition
  • NLP: LangChain (currently only Groq is supported)
  • TTS: Kokoro ONNX

About

Athene is a voice-controlled AI assistant that combines speech recognition, LLMs, and text-to-speech to help you with your everyday tasks.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages