A real-time AI-powered musical jam session tool. Capture your melody via microphone, convert it to MIDI, send it to a Large Language Model (LLM) for a creative response, and play back the AI's musical reply through a virtual MIDI device.
- Audio-to-MIDI: Converts your live audio input into MIDI notes using pitch detection (CREPE); a short sketch of this step follows the feature list.
- LLM Musician: Sends your melody to an LLM (e.g., OpenAI GPT-4, any model via OpenRouter, or a local model via Ollama), which responds with a new, creative MIDI sequence.
- Virtual MIDI Output: Plays the AI's response through a virtual MIDI port, usable with any DAW or synth.
- Call-and-response: Designed for interactive, improvisational music sessions.
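For a rough idea of the Audio-to-MIDI step, the sketch below runs CREPE over a mono buffer and maps each confidently pitched frame to a MIDI note number. The helper name and confidence threshold are illustrative assumptions, not llmjam's exact logic:

```python
import numpy as np
import crepe

def audio_to_midi_notes(audio: np.ndarray, sample_rate: int, conf_thresh: float = 0.7):
    """Convert a mono audio buffer to per-frame MIDI note numbers (illustrative)."""
    # CREPE returns per-frame time, fundamental frequency (Hz), confidence, activation.
    _, frequency, confidence, _ = crepe.predict(audio, sample_rate, viterbi=True)

    notes = []
    for f, c in zip(frequency, confidence):
        if c < conf_thresh or f <= 0:
            continue  # skip frames where pitch detection is unsure (likely silence/noise)
        midi = int(round(69 + 12 * np.log2(f / 440.0)))  # A4 = 440 Hz = MIDI note 69
        notes.append(midi)
    return notes
```

Consecutive identical frames would still need to be merged into held notes with durations; that part is left out here.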
- Python 3.11
- macOS (tested; may work on Linux/Windows with adjustments)
- PortAudio (for `sounddevice`)
- RtMidi (for `python-rtmidi`)
- TensorFlow-macos and TensorFlow-metal (for fast pitch detection)
All dependencies are listed in `pyproject.toml`:
- sounddevice
- crepe
- openai
- requests
- python-rtmidi
- python-dotenv
- tensorflow-macos
- tensorflow-metal
- ollama (for local LLMs)
Install with:

```bash
python3.11 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt  # or use a tool like uv or pip-tools
```
- LLM Provider and API Keys: Create a `.env` file in the project root to configure your LLM provider and model. Supported providers are `openai`, `openrouter`, and `ollama` (for local models). Example configuration:

  ```
  # Choose provider: openai, openrouter, or ollama
  LLM_PROVIDER=openai

  # For OpenAI
  OPENAI_API_KEY=sk-...
  OPENAI_MODEL=gpt-4.1

  # For OpenRouter (uses https://openrouter.ai, supports many models)
  OPENROUTER_API_KEY=...
  OPENROUTER_MODEL=anthropic/claude-3.7-sonnet:thinking

  # For Ollama (local LLMs, requires Ollama running locally)
  OLLAMA_MODEL=llama3.2
  ```
- If both `OPENAI_API_KEY` and `OPENROUTER_API_KEY` are set and `LLM_PROVIDER` is not specified, OpenRouter is used by default (a sketch of this fallback follows this list).
- For Ollama, make sure you have Ollama installed and running locally, and that the model you specify is available (e.g., run `ollama pull llama3.2`).
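As a sketch of how the provider fallback described above could be implemented (illustrative only; `resolve_provider` is not necessarily llmjam's actual function, and the ordering beyond the both-keys rule is an assumption):

```python
import os
from dotenv import load_dotenv

def resolve_provider() -> str:
    """Pick the LLM provider from .env, mirroring the documented fallback rule."""
    load_dotenv()  # reads .env from the project root

    provider = os.getenv("LLM_PROVIDER", "").strip().lower()
    if provider in {"openai", "openrouter", "ollama"}:
        return provider

    # No explicit provider: if an OpenRouter key is present, prefer OpenRouter,
    # as described above. The remaining ordering is illustrative.
    if os.getenv("OPENROUTER_API_KEY"):
        return "openrouter"
    if os.getenv("OPENAI_API_KEY"):
        return "openai"
    return "ollama"  # assume a local model if no API keys are configured
```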
- Audio/MIDI Devices: Ensure your system has a working microphone and a DAW or synth that can receive MIDI from a virtual port (a quick device check follows below).
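To confirm that Python can see your microphone and MIDI ports, a quick standalone check with `sounddevice` and `python-rtmidi` (independent of llmjam itself) is:

```python
import sounddevice as sd
import rtmidi

# List audio devices; the default input is marked with '>'.
print(sd.query_devices())

# List MIDI output ports currently visible to RtMidi.
# llmjam creates its own virtual "llmjam MIDI Out" port while running,
# so it is your DAW/synth that should see that port as an input.
midi_out = rtmidi.MidiOut()
print(midi_out.get_ports())
del midi_out
```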
Run the main app:

```bash
python llmjam.py
```
You can set the tempo for the session using the `--bpm` flag:

```bash
python llmjam.py --bpm 120
```
The default is 95 BPM.
- Press Enter to start jamming.
- Play or sing a melody. Recording starts on sound and stops after 1s of silence (see the recording sketch after this list).
- The melody is converted to MIDI, sent to the LLM, and the AI's response is played back as MIDI.
- To stop, press `Ctrl+C` or `Cmd+C`.
- To set or change the playing style at any time, press `s` during a jam session and enter a short phrase describing your desired style (e.g., "jazzy", "aggressive", "like a lullaby").
- To change the tempo, you must restart the application with the `--bpm` flag.
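The "starts on sound, stops after silence" behaviour can be approximated with a simple RMS gate over short `sounddevice` blocks. This is a sketch under assumed thresholds and block sizes, not llmjam's actual implementation:

```python
import numpy as np
import sounddevice as sd

def record_until_silence(samplerate=16000, block_dur=0.05,
                         threshold=0.02, silence_dur=1.0):
    """Wait for sound, then record until roughly `silence_dur` seconds of quiet."""
    blocks, started, quiet = [], False, 0.0
    block_frames = int(block_dur * samplerate)
    with sd.InputStream(samplerate=samplerate, channels=1, dtype="float32") as stream:
        while True:
            block, _ = stream.read(block_frames)       # blocking read of one chunk
            rms = float(np.sqrt(np.mean(block ** 2)))  # crude loudness estimate
            if not started:
                if rms >= threshold:                   # first audible chunk starts the take
                    started = True
                    blocks.append(block)
                continue
            blocks.append(block)
            quiet = quiet + block_dur if rms < threshold else 0.0
            if quiet >= silence_dur:                   # ~1 s of quiet ends the take
                break
    return np.concatenate(blocks)[:, 0]                # mono float32 array
```

The returned buffer could then be passed to a pitch-detection step like the CREPE sketch shown earlier.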
- Audio Capture: Listens for sound, records until silence.
- Pitch Detection: Uses CREPE (TensorFlow) to convert audio to MIDI notes.
- LLM Response: Sends MIDI to an LLM (e.g., GPT-4, any OpenRouter-supported model, or a local Ollama model) with a musician prompt; receives a new melody as CSV.
- MIDI Playback: Plays the AI's response through a virtual MIDI port.
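To illustrate the final step, the sketch below plays a list of (MIDI note, length in beats) pairs through a virtual port named "llmjam MIDI Out" (the port name mentioned under Troubleshooting). The melody format and the `play_melody` helper are assumptions for illustration, not llmjam's actual code:

```python
import time
import rtmidi

NOTE_ON, NOTE_OFF = 0x90, 0x80

def play_melody(notes, bpm: float = 95.0, velocity: int = 100):
    """Play (midi_note, duration_in_beats) pairs through a virtual MIDI port."""
    midi_out = rtmidi.MidiOut()
    midi_out.open_virtual_port("llmjam MIDI Out")  # your DAW/synth listens to this port
    try:
        for note, beats in notes:
            midi_out.send_message([NOTE_ON, note, velocity])
            time.sleep(beats * 60.0 / bpm)           # hold the note for its length in beats
            midi_out.send_message([NOTE_OFF, note, 0])
    finally:
        midi_out.close_port()
        del midi_out

# Example: C major arpeggio, one beat per note, at the default 95 BPM.
play_melody([(60, 1), (64, 1), (67, 1), (72, 1)])
```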
- No sound detected: Check your microphone and input device settings.
- No MIDI output: Ensure your DAW/synth is listening to the "llmjam MIDI Out" port.
- API errors: Check your `.env` file and API key validity.
- TensorFlow/CREPE errors: Ensure you have the correct versions for your platform (see the TensorFlow-macos/metal docs).
MIT
- CREPE for pitch detection
- OpenAI for LLM API
- OpenRouter for multi-provider LLM API
- Ollama for local LLM API
- python-rtmidi for MIDI output