🎵 Fine-Tuning FunctionGemma for Music Control

Train Google's FunctionGemma-270M model to understand custom music functions in under 3 minutes on a CPU.

✨ Results

Improved model performance from 75% → 100% accuracy using a gradual scaling approach:

Model                Accuracy   Functions
Base FunctionGemma   75%        4
Fine-Tuned           100%       4

Live Models:

🚀 Quick Start

Prerequisites

  • Python 3.9+
  • 8GB RAM minimum
  • HuggingFace account with FunctionGemma access (get access)

Setup

# Clone the repository
git clone https://github.com/jageenshukla/functiongemma-finetuning-lora.git
cd functiongemma-finetuning-lora

# Run setup script
chmod +x setup.sh
./setup.sh

# Activate environment
source venv/bin/activate

# Login to HuggingFace
python -c "from huggingface_hub import login; login()"

Train Your Model

4-Function Model (2-3 minutes)

# Generate dataset
python scripts/generate_4func_dataset.py

# Train
python scripts/train_4func.py

# Expected: 98.9% training accuracy

Test Locally

Compare base model vs fine-tuned performance:

python scripts/local_demo.py

Output:

Base FunctionGemma: 75% (6/8 tests)
Fine-Tuned Model: 100% (8/8 tests)
Improvement: +25 percentage points

Deploy to HuggingFace

export HUGGINGFACE_TOKEN='hf_your_token_here'
python scripts/push_4func_to_hf.py

🎯 What It Does

Converts natural language to structured function calls:

Input:  "Play Bohemian Rhapsody"
Output: play_song(song_name="Bohemian Rhapsody")

Input:  "Pause the music"
Output: playback_control(action="pause")

Input:  "Search for rock songs"
Output: search_music(query="rock songs")

📊 Training Details

Approach: Gradual scaling (2→4→8→18 functions), sketched in code after the list below

  • Start small, validate, then scale
  • Prevents cognitive overload
  • Achieves 95-100% accuracy per stage
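
A minimal sketch of that staged loop follows. Here train_and_eval is a hypothetical placeholder standing in for the repo's actual per-stage training and evaluation (scripts/train_4func.py and the dataset generators); the 0.95 gate mirrors the per-stage accuracy target listed above.

# Hypothetical gradual-scaling loop; train_and_eval stands in for the
# repo's real per-stage training/eval scripts.
STAGES = [2, 4, 8, 18]

def train_and_eval(n_functions: int) -> float:
    """Fine-tune on the first n_functions functions; return eval accuracy."""
    ...  # plug the repo's LoRA training + evaluation in here
    return 1.0  # dummy value so this sketch runs end to end

for n in STAGES:
    accuracy = train_and_eval(n)
    print(f"{n} functions: {accuracy:.1%}")
    if accuracy < 0.95:  # don't scale up until the current stage passes
        print("Below the 95% gate; fix data/config before scaling further.")
        break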

Efficient Fine-Tuning:

  • Method: LoRA (Low-Rank Adaptation)
  • Trainable params: 3.8M (1.4% of base model)
  • Training time: ~2.5 minutes per stage (CPU)
  • Model size: 15MB adapter

Training Configuration (sketched in code after this list):

  • Base model: google/functiongemma-270m-it
  • Epochs: 5
  • Batch size: 2
  • Learning rate: 2e-4
  • Examples per function: 25-30
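
In code, that configuration might look roughly like the following. The seven target modules are the standard Gemma attention and MLP projections (matching the "all 7 target modules" note under Key Findings); r and lora_alpha are assumed values, not stated in this README.

import torch
from transformers import AutoModelForCausalLM, TrainingArguments
from peft import LoraConfig, get_peft_model

lora_config = LoraConfig(
    r=16,                # assumed rank; tune so trainable params land near 3.8M
    lora_alpha=32,       # assumed scaling factor
    target_modules=[     # all 7 Gemma projection modules (attention + MLP)
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
    task_type="CAUSAL_LM",
)

base = AutoModelForCausalLM.from_pretrained(
    "google/functiongemma-270m-it", torch_dtype=torch.float32
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # expect roughly 3.8M trainable (~1.4%)

training_args = TrainingArguments(
    output_dir="out-4func",
    num_train_epochs=5,
    per_device_train_batch_size=2,
    learning_rate=2e-4,
    logging_steps=10,
)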

📁 Project Structure

functiongemma-finetuning-lora/
├── scripts/
│   ├── generate_4func_dataset.py   # Generate training data
│   ├── train_4func.py              # Train the model
│   ├── local_demo.py               # Local comparison demo
│   └── push_4func_to_hf.py         # Deploy to HuggingFace
├── config/
│   └── music_functions.py          # Function definitions
├── data/
│   └── four_func_dataset/          # Training datasets
├── requirements.txt                # Dependencies
└── setup.sh                        # Setup script

🔧 Usage in Production

Python

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

# Function schemas are defined in config/music_functions.py in this repo
from config.music_functions import MUSIC_FUNCTIONS

# Load base model
base_model = AutoModelForCausalLM.from_pretrained(
    "google/functiongemma-270m-it",
    torch_dtype=torch.float32,
    device_map="cpu"
)

# Load fine-tuned adapter
model = PeftModel.from_pretrained(base_model, "Jageen/music-4func")
tokenizer = AutoTokenizer.from_pretrained("Jageen/music-4func")

# Merge for faster inference (recommended)
model = model.merge_and_unload()

# Use for inference
def predict(user_input):
    messages = [{"role": "user", "content": user_input}]
    prompt = tokenizer.apply_chat_template(
        messages,
        tools=MUSIC_FUNCTIONS,
        add_generation_prompt=True,
        tokenize=False
    )
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=128)
    return tokenizer.decode(outputs[0])

# Test
result = predict("Play Bohemian Rhapsody")
print(result)
# Output: <start_function_call>call:play_song{song_name:<escape>Bohemian Rhapsody<escape>}<end_function_call>
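
If you need the call as structured data, a minimal parser for the output format shown above could look like this. It assumes the call:name{key:<escape>value<escape>} shape holds for all outputs; adjust the patterns if the template changes.

import re

def parse_call(text):
    """Extract (function_name, args) from a FunctionGemma-style call string."""
    m = re.search(
        r"<start_function_call>call:(\w+)\{(.*?)\}<end_function_call>", text
    )
    if not m:
        return None
    name, body = m.group(1), m.group(2)
    args = dict(re.findall(r"(\w+):<escape>(.*?)<escape>", body))
    return name, args

sample = ("<start_function_call>call:play_song"
          "{song_name:<escape>Bohemian Rhapsody<escape>}<end_function_call>")
print(parse_call(sample))
# ('play_song', {'song_name': 'Bohemian Rhapsody'})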

🎯 Key Findings

What Works:

  • Gradual scaling: 2→4→8→18 functions
  • Complete LoRA configuration (all 7 target modules)
  • Proper data formatting (pass dicts, not JSON strings; see the sketch after this list)
  • 25-30 examples per function minimum
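
For illustration, a tool entry passed to apply_chat_template(tools=...) might look like the dict below. This is a hypothetical schema in the standard JSON-schema style; the real definitions live in config/music_functions.py. The key point from the finding above is to pass Python dicts directly, not json.dumps(...) strings.

# Hypothetical schema; the actual definitions are in config/music_functions.py.
play_song = {
    "type": "function",
    "function": {
        "name": "play_song",
        "description": "Play a song by name.",
        "parameters": {
            "type": "object",
            "properties": {
                "song_name": {"type": "string", "description": "Song title"},
            },
            "required": ["song_name"],
        },
    },
}

MUSIC_FUNCTIONS = [play_song]  # plus playback_control, search_music, ...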

Results by Approach:

Approach           Functions   Accuracy   Status
All 18 at once     18          0%         ❌ Failed
Gradual (2 func)   2           100%       ✅ Success
Gradual (4 func)   4           98.9%      ✅ Success

📚 Resources

Models:

Documentation:

🤝 Contributing

This project is open for contributions. Feel free to:

  • Add more function examples
  • Improve training efficiency
  • Expand to more functions
  • Share deployment experiences

📄 License

This project uses FunctionGemma, which requires accepting the Gemma license.

🙏 Acknowledgments

  • Google for FunctionGemma
  • HuggingFace for transformers, PEFT, and TRL
  • Open-source community for LoRA research

Happy Training! 🎵🤖

For detailed technical notes, troubleshooting, and development history, see DEVELOPMENT_NOTES.md.
