Sharktank Model Tests

This test suite includes small-scale versions of Large Language Models (LLMs) and other Generative AI (GenAI) programs, exported using the sharktank package that is built as part of the shark-ai project.

Quickstart

Download files through git lfs as needed:

```
git lfs install
git lfs pull --include="*"

git lfs ls-files
# 37f90b4754 * sharktank_models/llama3.1/assets/toy_llama.irpa
# 7172acdf43 * sharktank_models/llama3.1/assets/toy_llama.mlir
# e997647ecc * sharktank_models/llama3.1/assets/toy_llama_tp2.irpa
# b7b2f5a206 * sharktank_models/llama3.1/assets/toy_llama_tp2.mlir
# 917845c887 * sharktank_models/llama3.1/assets/toy_llama_tp2.rank0.irpa
# 9ab51093c4 * sharktank_models/llama3.1/assets/toy_llama_tp2.rank1.irpa
```
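If `git lfs pull` has not run, the asset files on disk are small text pointers rather than real model data. The following is a minimal sketch for spotting that situation, assuming only that Git LFS pointer files begin with the standard `version https://git-lfs.github.com/spec/v1` line. The helper names `is_lfs_pointer` and `find_unpulled_assets` are hypothetical and not part of this test suite.

```python
from pathlib import Path

# First line of a Git LFS pointer file, as defined by the Git LFS pointer spec.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path: Path) -> bool:
    """Return True if `path` still holds an LFS pointer instead of real data."""
    with open(path, "rb") as f:
        return f.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


def find_unpulled_assets(root: Path, suffixes=(".irpa", ".mlir")) -> list[Path]:
    """List asset files under `root` (hypothetical helper) that are still pointers."""
    return sorted(
        p
        for p in root.rglob("*")
        if p.is_file() and p.suffix in suffixes and is_lfs_pointer(p)
    )
```

Calling `find_unpulled_assets(Path("sharktank_models"))` from the repository root would then return the assets that still need to be pulled, or an empty list if everything is present.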

Set up your virtual environment and install requirements:
```
cd sharktank_models

python -m venv .venv
source .venv/bin/activate
python -m pip install -e .
```
- To use IREE from nightly pre-release Python packages:
```
python -m pip install -r requirements-iree.txt
```
- To use a custom version of IREE, follow the instructions for building the IREE Python packages from source.

Run pytest using typical flags:
```
pytest \
  -rA \
  -m "target_cpu" \
  --timeout=300 \
  --durations=0 \
  --log-cli-level=info
```
See https://docs.pytest.org/en/stable/how-to/usage.html for other options.
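When the same invocation needs to be launched from a script (for example in CI), the flags above can be assembled programmatically. This is a rough sketch rather than part of the test suite: the function names `build_pytest_command` and `run_tests` are hypothetical, and it assumes the `pytest-timeout` plugin is installed, since that plugin provides `--timeout`.

```python
import subprocess
import sys


def build_pytest_command(marker="target_cpu", timeout_s=300, log_level="info"):
    """Assemble the quickstart pytest invocation as an argv list."""
    return [
        sys.executable, "-m", "pytest",
        "-rA",                      # print a summary line for every test outcome
        "-m", marker,               # select tests by marker expression
        f"--timeout={timeout_s}",   # per-test timeout (pytest-timeout plugin)
        "--durations=0",            # 0 = report durations for all tests
        f"--log-cli-level={log_level}",
    ]


def run_tests(**kwargs) -> int:
    """Run pytest in the current directory and return its exit code."""
    return subprocess.run(build_pytest_command(**kwargs)).returncode
```

For instance, `run_tests(marker="target_hip")` would run the same command with the marker swapped.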

Advanced pytest usage

The --log-cli-level option can also be set to debug, warning, or error. See https://docs.pytest.org/en/stable/how-to/logging.html.

Run only tests matching a name pattern:
```
pytest -k llama
```
Run tests that require an AMD GPU (https://docs.pytest.org/en/stable/example/markers.html):
```
pytest -m "target_hip"
```
Ignore xfail marks (https://docs.pytest.org/en/stable/how-to/skipping.html#ignoring-xfail):
```
pytest --runxfail
```

Run tests in parallel using https://pytest-xdist.readthedocs.io/ (note that this swallows some logging):

```
# Run with an automatic number of worker processes (usually one per CPU core).
pytest -n auto

# Run with an explicit number of worker processes.
pytest -n 4
```

Create an HTML report using https://pytest-html.readthedocs.io/en/latest/index.html:
```
pytest --html=report.html --self-contained-html --log-cli-level=info
```
See also https://docs.pytest.org/en/latest/how-to/output.html#creating-junitxml-format-files

Running quality tests

See the Quality tests README for instructions on running these tests.

Running benchmark tests

See the Benchmark tests README for instructions on running these tests.

Note: benchmark tests require compiled .vmfb files to be available.
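Before starting a benchmark run, it can help to confirm that the compiled modules are present. The sketch below is illustrative only: it assumes a hypothetical layout in which each `<name>.mlir` file has a compiled `<name>.vmfb` counterpart in a known directory. The actual locations expected by the benchmark tests are described in their README, and `missing_vmfbs` is not part of this test suite.

```python
from pathlib import Path


def missing_vmfbs(mlir_dir: Path, vmfb_dir: Path) -> list[str]:
    """Return names of .mlir files in `mlir_dir` with no matching .vmfb in `vmfb_dir`.

    Assumes (hypothetically) that toy_llama.mlir compiles to toy_llama.vmfb.
    """
    return sorted(
        mlir.name
        for mlir in mlir_dir.glob("*.mlir")
        if not (vmfb_dir / mlir.with_suffix(".vmfb").name).is_file()
    )
```

An empty result means every MLIR file found has a compiled counterpart under the assumed naming scheme.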

Generating model files using Shark AI

To generate the MLIR files that the quality tests and benchmark tests compile and run, use the commands below.

This example generates IRPA and MLIR files for Llama. See Shark AI Models for the other models you can generate.

```
python3 -m pip install sharktank

# For Sharktank nightly releases, please use this installation command
python3 -m pip install sharktank -f https://github.com/nod-ai/shark-ai/releases/expanded_assets/dev-wheels --pre

# Generate the IRPA files:
python3 -m sharktank.models.llama.toy_llama --output toy_llama.irpa

# Generate the MLIR files:
python3 -m sharktank.examples.export_paged_llm_v1 --bs=1 \
    --irpa-file toy_llama.irpa --output-mlir toy_llama.mlir
```
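The two generation steps depend on each other, because the MLIR export reads the IRPA file written by the first command. One possible way to script them is sketched below. It uses only the module names and flags shown above. The wrapper functions `generation_commands` and `generate` are hypothetical, and the sketch assumes sharktank is installed in the active Python environment.

```python
import subprocess
import sys


def generation_commands(name="toy_llama", batch_size=1):
    """Return the two commands that produce <name>.irpa and then <name>.mlir."""
    irpa, mlir = f"{name}.irpa", f"{name}.mlir"
    return [
        # Step 1: write the toy model parameters to an IRPA file.
        [sys.executable, "-m", "sharktank.models.llama.toy_llama",
         "--output", irpa],
        # Step 2: export MLIR using the IRPA file from step 1.
        [sys.executable, "-m", "sharktank.examples.export_paged_llm_v1",
         f"--bs={batch_size}", "--irpa-file", irpa, "--output-mlir", mlir],
    ]


def generate(name="toy_llama", batch_size=1):
    """Run both steps in order, stopping at the first failure."""
    for cmd in generation_commands(name, batch_size):
        subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
```

Using `check=True` keeps the export from running against a missing or partial IRPA file if the first step fails.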
