Skip to content

Commit

Permalink
chore: update tutorials to match the latest llama.cpp version
Browse files Browse the repository at this point in the history
  • Loading branch information
mcharytoniuk committed Jul 1, 2024
1 parent b033a9e commit b3a0600
Show file tree
Hide file tree
Showing 4 changed files with 9 additions and 14 deletions.
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
resource "aws_imagebuilder_component" "apt_nvidia_driver_555" {
resource "aws_imagebuilder_component" "apt_nvidia_driver_550" {
data = yamlencode({
phases = [{
name = "build"
Expand All @@ -8,10 +8,10 @@ resource "aws_imagebuilder_component" "apt_nvidia_driver_555" {
inputs = {
commands = [
"sudo apt-get update",
"DEBIAN_FRONTEND=noninteractive sudo apt-get install -yq nvidia-driver-555",
"DEBIAN_FRONTEND=noninteractive sudo apt-get install -yq nvidia-driver-550",
]
}
name = "apt_nvidia_driver_555"
name = "apt_nvidia_driver_550"
onFailure = "Abort"
timeoutSeconds = 180
},
Expand All @@ -23,7 +23,7 @@ resource "aws_imagebuilder_component" "apt_nvidia_driver_555" {
}]
schemaVersion = 1.0
})
name = "apt_nvidia_driver_555"
name = "apt_nvidia_driver_550"
platform = "Linux"
supported_os_versions = [
"Ubuntu 22"
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,10 +7,8 @@ resource "aws_imagebuilder_component" "cuda_toolkit_12" {
action = "ExecuteBash"
inputs = {
commands = [
"wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb",
"sudo dpkg -i cuda-keyring_1.1-1_all.deb",
"sudo apt-get update",
"DEBIAN_FRONTEND=noninteractive sudo apt-get -yq install cuda-toolkit-12-5"
"DEBIAN_FRONTEND=noninteractive sudo apt-get -yq install nvidia-cuda-toolkit"
]
}
name = "apt_cuda_toolkit_12"
Expand Down
2 changes: 1 addition & 1 deletion infra/tutorial-installing-llamacpp-aws-cuda.md
Original file line number Diff line number Diff line change
Expand Up @@ -143,5 +143,5 @@ sudo reboot
For example, with `Mistral Instruct 7B` what worked for me is:

```shell
./batched-bench ../mistral-7b-instruct-v0.2.Q4_K_M.gguf 2048 2048 512 0 999 128,256,512 128,256 1,2,4,8,16,32
./llama-batched-bench -m ../mistral-7b-instruct-v0.2.Q4_K_M.gguf 2048 2048 512 0 999 128,256,512 128,256 1,2,4,8,16,32
```
9 changes: 3 additions & 6 deletions infra/tutorial-installing-llamacpp-aws-ec2-image-builder.md
Original file line number Diff line number Diff line change
Expand Up @@ -70,7 +70,7 @@ You can complete the tutorial steps either manually or by automating the setup w
**Component name: apt_nvidia_driver_555**
```yaml
name: apt_nvidia_driver_555
description: "Component to install NVIDIA driver 555 on Ubuntu"
description: "Component to install NVIDIA driver 550 on Ubuntu"
schemaVersion: '1.0'
phases:
- name: build
Expand All @@ -80,7 +80,7 @@ You can complete the tutorial steps either manually or by automating the setup w
inputs:
commands:
- sudo apt-get update
- DEBIAN_FRONTEND=noninteractive sudo apt-get install -yq nvidia-driver-555
- DEBIAN_FRONTEND=noninteractive sudo apt-get install -yq nvidia-driver-550
onFailure: Abort
timeoutSeconds: 180
- name: reboot
Expand All @@ -100,10 +100,7 @@ You can complete the tutorial steps either manually or by automating the setup w
action: ExecuteBash
inputs:
commands:
- wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
- sudo dpkg -i cuda-keyring_1.1-1_all.deb
- sudo apt-get update
- DEBIAN_FRONTEND=noninteractive sudo apt-get -yq install cuda-toolkit-12-5
- DEBIAN_FRONTEND=noninteractive sudo apt-get -yq install nvidia-cuda-toolkit
onFailure: Abort
timeoutSeconds: 600
- name: reboot
Expand Down

0 comments on commit b3a0600

Please sign in to comment.