feat(peft): add llama token classification example #525

SauravMaheshkar · 2024-05-11T00:46:02Z

Adds notebook for Token Classification example for Fine-tuning llama 2 for Named Entity Recognition.

I also created a peft/ directory since there are other notebooks related to other articles hosted on my gists that should live on wandb/examples.

Request for Review: @tcapelle @soumik12345

review-notebook-app · 2024-05-11T00:46:07Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

github-actions · 2024-05-11T00:47:15Z

Thanks for contributing to wandb/examples!
We appreciate your efforts in opening a PR for the examples repository. Our goal is to ensure a smooth and enjoyable experience for you 😎.

Guidelines

The examples repo is regularly tested against the ever-evolving ML stack. To facilitate our work, please adhere to the following guidelines:

Notebook naming: You can use a combination of snake_case and CamelCase for your notebook name. Avoid using spaces (replace them with _) and special characters (&%$?). For example:

Cool_Keras_integration_example_with_weights_and_biases.ipynb

is acceptable, but

Cool Keras Example with W&B.ipynb

is not. Avoid spaces and the & character. To refer to W&B, you can use: weights_and_biases or just wandb (it's our library, after all!)

Managing dependencies within the notebook: You may need to set up dependencies to ensure that your code works. Please avoid the following practices:
- Docker-related activities. If Docker installation is required, consider adding a full example with the corresponding Dockerfile to the wandb/examples/examples folder (where non-Colab examples reside).
- Using pip install as the primary method to install packages. When calling pip in a cell, avoid performing other tasks. We automatically filter these types of cells, and executing other actions might break the automatic testing of the notebooks. For example,
```
pip install -qU wandb transformers gpt4
```
is acceptable, but
```
pip install -qU wandb
import wandb
```
is not.
- Installing packages from a GitHub branch. Although it's acceptable 😎 to directly obtain the latest bleeding-edge libraries from GitHub, did you know that you can install them like this:
```
!pip install -q git+https://github.com/huggingface/transformers
```
You don't need to clone, then cd into the repo and install it in editable mode.
- Avoid referencing specific Colab directories. Google Colab has a /content directory where everything resides. Avoid explicitly referencing this directory because we test our notebooks with pure Jupyter (without Colab). Instead, use relative paths to make the notebook reproducible.
The Jupyter notebook file .ipynb is nothing more than a JSON file with primarily two types of cells: markdown and code. There is also a bunch of other metadata specific to Google Colab. We have a set of tools to ensure proper notebook formatting. These tools can be found at wandb/nb_helpers.

Before merging, wait for a maintainer to clean and format the notebooks you're adding. You can tag @tcapelle.

Before marking the PR as ready for review, please run your notebook one more time. Restart the Colab and run all. We will provide you with links to open the Colabs below

The following colabs were changed
-colabs/peft/llama_token_cls.ipynb

tcapelle · 2024-05-27T07:55:19Z

Hey, can you just use

import wandb
wandb.login()

instead?

SauravMaheshkar · 2024-05-27T12:59:18Z

Hey, can you just use
import wandb
wandb.login()
instead?

@tcapelle Addressed in 2fe5f12

tcapelle · 2024-05-29T10:18:34Z

Two extra small changes:

remove the entity = "saurav" and default to None (so it' uses your default entity after calling wandb.login)
Add the image to the peft folder and import it as a static image on the notebook (![](llama_image.png).

SauravMaheshkar · 2024-05-29T13:10:11Z

Two extra small changes:

remove the entity = "saurav" and default to None (so it' uses your default entity after calling wandb.login)

Add the image to the peft folder and import it as a static image on the notebook (![](llama_image.png).

@tcapelle fixed in 7c73358

tcapelle · 2024-05-29T13:38:38Z

extra detail, use a relative path to the image, so when we merge the path to your branch is not needed

SauravMaheshkar · 2024-05-29T13:41:59Z

extra detail, use a relative path to the image, so when we merge the path to your branch is not needed

The image is not in my fork of the wandb/examples repository but rather in my SauravMaheshkar/SauravMaheshkar repository. Ref

tcapelle · 2024-05-29T13:53:55Z

add it to the folder then

feat(peft): add llama token classification example

204ccd8

fix: revert to wandb.login()

2fe5f12

fix: use link for img asset

7c73358

SauravMaheshkar and others added 3 commits May 29, 2024 14:57

chore: upload img

3d0908b

feat(colabs/peft): use relative link for img

5d61a99

clean up

a263c4b

tcapelle approved these changes May 29, 2024

View reviewed changes

tcapelle merged commit f52d2ac into wandb:master May 29, 2024
2 checks passed

SauravMaheshkar deleted the saurav/peft-llama-example branch May 29, 2024 14:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(peft): add llama token classification example #525

feat(peft): add llama token classification example #525

SauravMaheshkar commented May 11, 2024 •

edited

Loading

review-notebook-app bot commented May 11, 2024

github-actions bot commented May 11, 2024 •

edited

Loading

tcapelle commented May 27, 2024

SauravMaheshkar commented May 27, 2024 •

edited

Loading

tcapelle commented May 29, 2024

SauravMaheshkar commented May 29, 2024

tcapelle commented May 29, 2024

SauravMaheshkar commented May 29, 2024 •

edited

Loading

tcapelle commented May 29, 2024

feat(peft): add llama token classification example #525

feat(peft): add llama token classification example #525

Conversation

SauravMaheshkar commented May 11, 2024 • edited Loading

review-notebook-app bot commented May 11, 2024

github-actions bot commented May 11, 2024 • edited Loading

Guidelines

Before marking the PR as ready for review, please run your notebook one more time. Restart the Colab and run all. We will provide you with links to open the Colabs below

tcapelle commented May 27, 2024

SauravMaheshkar commented May 27, 2024 • edited Loading

tcapelle commented May 29, 2024

SauravMaheshkar commented May 29, 2024

tcapelle commented May 29, 2024

SauravMaheshkar commented May 29, 2024 • edited Loading

tcapelle commented May 29, 2024

SauravMaheshkar commented May 11, 2024 •

edited

Loading

github-actions bot commented May 11, 2024 •

edited

Loading

SauravMaheshkar commented May 27, 2024 •

edited

Loading

SauravMaheshkar commented May 29, 2024 •

edited

Loading