Replies: 8 comments
-
+1 to this, it would be nice to get performance comparable to TensorRT without having to export models to ONNX etc. first!
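For reference, the extra hop being alluded to is roughly the following (a sketch only, not from any particular project; the toy model, shapes, and file names are placeholders):

```python
# Sketch of today's PyTorch -> ONNX -> TensorRT path that a native backend
# would avoid. The tiny model and shapes below are illustrative placeholders.
import torch

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
example_input = torch.randn(1, 3, 224, 224)

# Step 1: export the PyTorch model to ONNX
torch.onnx.export(
    model,
    (example_input,),
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    opset_version=17,
)

# Step 2: hand model.onnx to TensorRT tooling to build a device-specific
# engine, e.g. `trtexec --onnx=model.onnx --saveEngine=model.plan`.
```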
-
@mindbeast @bionictoucan @hietalajulius Hi, thanks for the comment. Yes, that makes sense in general. Right now we are integrating Vulkan into ExecuTorch, since it is a suitable solution for mobile GPUs; enabling mobile use-cases is our primary goal at the moment. We will revisit CUDA, but perhaps in the second half of the year. Curious, what are your current product needs?
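For anyone wanting to try the Vulkan path in the meantime, lowering follows the usual ExecuTorch export flow, roughly as sketched below. Module paths and entry points have shifted between ExecuTorch releases, so treat this as illustrative rather than exact:

```python
# Illustrative sketch of lowering a model to the ExecuTorch Vulkan backend.
# Import paths are an assumption and may differ across ExecuTorch versions.
import torch
from torch.export import export
from executorch.exir import to_edge_transform_and_lower
from executorch.backends.vulkan.partitioner.vulkan_partitioner import VulkanPartitioner

model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()
sample_inputs = (torch.randn(1, 16),)

exported = export(model, sample_inputs)
et_program = to_edge_transform_and_lower(
    exported,
    partitioner=[VulkanPartitioner()],  # delegate supported ops to Vulkan
).to_executorch()

# The resulting .pte file is what the on-device ExecuTorch runtime loads.
with open("model.pte", "wb") as f:
    f.write(et_program.buffer)
```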
-
Apologies for opening a similar feature request in #5263.
@mergennachin We want to deploy LLMs in cars, but Python-based inference frameworks like vLLM and SGLang are not suitable for edge devices.
Nearly five months have passed; is there any progress on this?
-
Thank you for following up @DzAvril.
I guess this is using a platform similar to Jetson?
No update on a CUDA backend for ET at the moment. We will get back to you here once we plan something.
-
@digantdesai Yes, Jetson Orin for now, and possibly Thor in the future. Looking forward to your update.
-
For mobile cuda backend, does …
-
@DuinoDu My expectation is that compatibility with torch_tensorrt is poor. I expect a more compliant backend in executorch would help a lot of developers.
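For comparison, the torch_tensorrt route looks roughly like this (a sketch only, assuming a CUDA-capable torch_tensorrt install; the toy model is a placeholder). Unsupported operators fall back to PyTorch, which is where the compatibility concerns tend to show up:

```python
# Rough sketch of compiling a module directly with torch_tensorrt,
# the alternative whose op coverage is being questioned above.
import torch
import torch_tensorrt

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval().cuda()
example = torch.randn(1, 3, 224, 224, device="cuda")

# Ops with TensorRT converters run as TensorRT engines; the rest stay in PyTorch.
trt_model = torch_tensorrt.compile(
    model,
    inputs=[example],                   # example input fixes the (static) shape
    enabled_precisions={torch.float},
)
print(trt_model(example).shape)
```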
-
Does it make sense for executorch to have a mobile CUDA backend? There are many edge devices in NVIDIA's Jetson lineup that have a CUDA GPU but would benefit from not having to link an enormous libtorch dependency.