
LoRA Fine-Tuning

In this project, I fine-tune a small language model using knowledge distillation from a large language model, Llama 3.1-7B, with the aim of transferring aviation-specific domain knowledge to the small model.

The training data for fine-tuning is a set of 1,000 QA pairs extracted from aviation-related technical documentation.

Small language model: SmolLM-135M-Instruct
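
For illustration, here is a minimal sketch of what the distillation step could look like, assuming the teacher model is served through the Hugging Face transformers text-generation pipeline. The checkpoint id, prompt wording, and chunking are assumptions for illustration, not the exact pipeline used in this repo.

    # Hypothetical sketch: prompt a large teacher model to write QA pairs
    # from a chunk of aviation documentation (knowledge distillation).
    from transformers import pipeline

    generator = pipeline(
        "text-generation",
        model="meta-llama/Llama-3.1-8B-Instruct",  # assumed teacher checkpoint id
        device_map="auto",
    )

    def make_qa_pairs(doc_chunk, n_pairs=3):
        # Ask the teacher to produce n_pairs question-answer pairs
        # grounded in the given document excerpt.
        prompt = (
            f"Read this excerpt from an aviation technical document and write "
            f"{n_pairs} question-answer pairs about it.\n\n{doc_chunk}"
        )
        return generator(prompt, max_new_tokens=512)[0]["generated_text"]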

Fine-tuning Approach:

  • Generate training data using knowledge distillation from Llama 3.1-7B.
  • The training data consists of QA pairs extracted by the LLM from aviation-related technical documents.
  • Split the training data into training and validation sets.
  • Assess the baseline performance of SmolLM-135M-Instruct on the validation data.
  • LoRA fine-tune SmolLM-135M-Instruct (a minimal sketch follows this list).
  • Assess model performance on the validation set after fine-tuning.
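
A minimal sketch of the LoRA step, assuming Hugging Face PEFT. The rank, alpha, dropout, and target modules below are illustrative defaults, not the values from lora/loraconfig.json.

    # Sketch: wrap SmolLM-135M-Instruct with LoRA adapters via PEFT.
    # Only the low-rank adapter weights are trained; the base model is frozen.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import LoraConfig, get_peft_model

    checkpoint = "HuggingFaceTB/SmolLM-135M-Instruct"
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForCausalLM.from_pretrained(checkpoint)

    lora_cfg = LoraConfig(
        r=8,                                  # low-rank dimension (assumed)
        lora_alpha=16,                        # scaling factor (assumed)
        lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"],  # attention projections (assumed)
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(model, lora_cfg)
    model.print_trainable_parameters()  # confirms only adapters are trainable

From here the wrapped model can be trained with the usual transformers Trainer; after training, only the small adapter weights need to be saved.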

Steps to run the unit testing code:

  1. Install the dependencies from requirements.txt:

    !pip install -r requirements.txt
    
  2. Update the config variables in lora/loraconfig.json (an illustrative config sketch follows these steps):

    hf_data

    • train_data - Path to QA pairs training data

    • checkpoint - HuggingFace model name of the small language model to fine-tune

    • device - Specify GPU ID

    • lora - LoRA parameters

    • training - Training hyperparameters

  3. Run lora_unit_test.py
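
An illustrative sketch of what lora/loraconfig.json might contain, matching the variables listed in step 2. The nesting (train_data, checkpoint, and device grouped under hf_data) and every value below are assumptions for illustration, not the repo's actual defaults.

    {
      "hf_data": {
        "train_data": "data/aviation_qa_pairs.json",
        "checkpoint": "HuggingFaceTB/SmolLM-135M-Instruct",
        "device": 0
      },
      "lora": {
        "r": 8,
        "lora_alpha": 16,
        "lora_dropout": 0.05
      },
      "training": {
        "epochs": 3,
        "learning_rate": 0.0002,
        "batch_size": 8
      }
    }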
