mistral-rs: fix cuda support #343254

GaetanLepage · 2024-09-20T12:27:16Z

Description of changes

cc @SomeoneSerge

Things done

Add a 👍 reaction to pull requests you find important.

GaetanLepage · 2024-09-20T12:40:52Z

Result of nixpkgs-review pr 343254 run on aarch64-linux 1

1 package built:

mistral-rs

GaetanLepage · 2024-09-20T12:42:04Z

Result of nixpkgs-review pr 343254 run on x86_64-linux 1

1 package built:

mistral-rs

Aleksanaa · 2024-09-22T09:02:41Z

pkgs/by-name/mi/mistral-rs/package.nix

+    let
+      patchelfCommand = binaryName: ''
+        patchelf \
+          --add-rpath ${


Can we use autoPatchelfHook with runtimeDependencies (Since there are only this two executables)?

https://nixos.org/manual/nixpkgs/unstable/#setup-hook-autopatchelfhook

Thanks for the tip. I went and try it and it doesn't work weirdly...
I'm pushing the changes.

The ldd output is the same, but I guess that the rpath is not about that.

This is the result of patchelf --print-rpath for the two variants.

Aleksanaa · 2024-09-23T00:46:49Z

pkgs/by-name/mi/mistral-rs/package.nix

+  runtimeDependencies = lib.optionals cudaSupport [
+    cudaPackages.libcublas
+    cudaPackages.libcurand
+  ];


Suggested change

runtimeDependencies = lib.optionals cudaSupport [

cudaPackages.libcublas

cudaPackages.libcurand

];

Try this. I guess it's because autoPatchelfHook overrides rpath (--set-rpath) instead of prepending/appending (--add-rpath) it.

nixpkgs/pkgs/build-support/setup-hooks/auto-patchelf.py

Lines 303 to 307 in 122d20b

if rpath:

print("setting RPATH to:", rpath_str)

subprocess.run(

["patchelf", "--set-rpath", rpath_str, path.as_posix()] + extra_args,

check=True)

But you already have these two in buildInputs, and it seems unneeded to add them to runtimeDependencies again, see torch as an example:

nixpkgs/pkgs/development/python-modules/torch/bin.nix

Lines 45 to 77 in 122d20b

nativeBuildInputs = lib.optionals stdenv.isLinux [

addDriverRunpath

autoPatchelfHook

autoAddDriverRunpath

];

buildInputs = lib.optionals stdenv.isLinux (

with cudaPackages;

[

# $out/${sitePackages}/nvfuser/_C*.so wants libnvToolsExt.so.1 but torch/lib only ships

# libnvToolsExt-$hash.so.1

cuda_nvtx

cuda_cudart

cuda_cupti

cuda_nvrtc

cudnn

libcublas

libcufft

libcurand

libcusolver

libcusparse

nccl

]

);

autoPatchelfIgnoreMissingDeps = lib.optionals stdenv.isLinux [

# This is the hardware-dependent userspace driver that comes from

# nvidia_x11 package. It must be deployed at runtime in

# /run/opengl-driver/lib or pointed at by LD_LIBRARY_PATH variable, rather

# than pinned in runpath

"libcuda.so.1"

];

~~I guess we have to add something like extraRuntimeDependencies to autoPatchelfHook then.~~

Also try this:

nixpkgs/pkgs/development/python-modules/torch/bin.nix

Lines 71 to 77 in 122d20b

autoPatchelfIgnoreMissingDeps = lib.optionals stdenv.isLinux [

# This is the hardware-dependent userspace driver that comes from

# nvidia_x11 package. It must be deployed at runtime in

# /run/opengl-driver/lib or pointed at by LD_LIBRARY_PATH variable, rather

# than pinned in runpath

"libcuda.so.1"

];

I guess runtimeDependencies is still not necessary, unless cuda libraries are loaded by dlopen

Thank you for your suggestions !
I tried removing the

runtimeDependencies = lib.optionals cudaSupport [ cudaPackages.libcublas cudaPackages.libcurand ];

block but it doesn't work better.
Those libs are loaded at runtime (dlopen I guess) and thus are not seen as required by patchelf`.

I guess runtimeDependencies is still not necessary, unless cuda libraries are loaded by dlopen

This works for the cuda lib but it still complains about cublas...

I guess we have to add something like extraRuntimeDependencies to autoPatchelfHook then.

appendRunpathsArray exists, used for adding $ORIGIN in a few places

Aleksanaa · 2024-10-29T10:09:21Z

@GaetanLepage Could you fix this to a usable state? I suppose it's at least better than nothing

GaetanLepage · 2024-10-29T10:28:14Z

@GaetanLepage Could you fix this to a usable state? I suppose it's at least better than nothing

Sure. Maybe let's merge the update first and then I'll rebase this branch.
#352064

Aleksanaa · 2024-10-30T03:55:37Z

Done

GaetanLepage · 2024-10-30T08:42:36Z

Currently fails to build with cudaSupport enabled:

error[E0063]: missing field `seed_value` in initializer of `cuda_backend::device::CudaDevice`
   --> /build/cargo-vendor-dir/candle-core-0.7.2/src/cuda_backend/device.rs:203:12
    |
203 |         Ok(Self {
    |            ^^^^ missing `seed_value`

GaetanLepage requested a review from ConnorBaker September 20, 2024 12:31

GaetanLepage force-pushed the mistral-rs branch from bc57c89 to 2970b76 Compare September 20, 2024 12:32

ofborg bot added 11.by: package-maintainer This PR was created by the maintainer of the package it changes 10.rebuild-darwin: 1-10 10.rebuild-darwin: 1 10.rebuild-linux: 1-10 10.rebuild-linux: 1 labels Sep 20, 2024

GaetanLepage requested a review from SomeoneSerge September 20, 2024 15:07

GaetanLepage force-pushed the mistral-rs branch from 2970b76 to a85c226 Compare September 20, 2024 15:31

Aleksanaa reviewed Sep 22, 2024

View reviewed changes

GaetanLepage force-pushed the mistral-rs branch from a85c226 to 3eddcf9 Compare September 22, 2024 21:24

Aleksanaa reviewed Sep 23, 2024

View reviewed changes

FliegendeWurst added the 6.topic: cuda Parallel computing platform and API label Oct 29, 2024

mistral-rs: fix cuda support

6c3b603

GaetanLepage force-pushed the mistral-rs branch from 3eddcf9 to 6c3b603 Compare October 30, 2024 08:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mistral-rs: fix cuda support #343254

mistral-rs: fix cuda support #343254

GaetanLepage commented Sep 20, 2024 •

edited

Loading

GaetanLepage commented Sep 20, 2024

GaetanLepage commented Sep 20, 2024

Aleksanaa Sep 22, 2024

GaetanLepage Sep 22, 2024

GaetanLepage Sep 22, 2024

GaetanLepage Sep 22, 2024

Aleksanaa Sep 23, 2024

Aleksanaa Sep 23, 2024 •

edited

Loading

Aleksanaa Sep 23, 2024 •

edited

Loading

GaetanLepage Sep 24, 2024

GaetanLepage Sep 26, 2024

SomeoneSerge Oct 29, 2024

Aleksanaa commented Oct 29, 2024

GaetanLepage commented Oct 29, 2024

Aleksanaa commented Oct 30, 2024

GaetanLepage commented Oct 30, 2024

	if rpath:
	print("setting RPATH to:", rpath_str)
	subprocess.run(
	["patchelf", "--set-rpath", rpath_str, path.as_posix()] + extra_args,
	check=True)

	nativeBuildInputs = lib.optionals stdenv.isLinux [
	addDriverRunpath
	autoPatchelfHook
	autoAddDriverRunpath
	];

	buildInputs = lib.optionals stdenv.isLinux (
	with cudaPackages;
	[
	# $out/${sitePackages}/nvfuser/_C*.so wants libnvToolsExt.so.1 but torch/lib only ships
	# libnvToolsExt-$hash.so.1
	cuda_nvtx

	cuda_cudart
	cuda_cupti
	cuda_nvrtc
	cudnn
	libcublas
	libcufft
	libcurand
	libcusolver
	libcusparse
	nccl
	]
	);

	autoPatchelfIgnoreMissingDeps = lib.optionals stdenv.isLinux [
	# This is the hardware-dependent userspace driver that comes from
	# nvidia_x11 package. It must be deployed at runtime in
	# /run/opengl-driver/lib or pointed at by LD_LIBRARY_PATH variable, rather
	# than pinned in runpath
	"libcuda.so.1"
	];

mistral-rs: fix cuda support #343254

Are you sure you want to change the base?

mistral-rs: fix cuda support #343254

Conversation

GaetanLepage commented Sep 20, 2024 • edited Loading

Description of changes

Things done

GaetanLepage commented Sep 20, 2024

GaetanLepage commented Sep 20, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Aleksanaa Sep 23, 2024 • edited Loading

Choose a reason for hiding this comment

Aleksanaa Sep 23, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Aleksanaa commented Oct 29, 2024

GaetanLepage commented Oct 29, 2024

Aleksanaa commented Oct 30, 2024

GaetanLepage commented Oct 30, 2024

GaetanLepage commented Sep 20, 2024 •

edited

Loading

Aleksanaa Sep 23, 2024 •

edited

Loading

Aleksanaa Sep 23, 2024 •

edited

Loading