remove gguf from mlx-swift build #111

davidkoski · 2024-07-10T18:07:57Z

Hmm, we should build without GGUF support in MLX Swift / MLX C (MLX_BUILD_GGUF=OFF). We don't support it in the API anyway so it doesn't make sense to include it right?

aPaleBlueDot · 2024-08-26T09:00:18Z

Is there any way to bring in a Q4_1 quantized GGUF into MLX Swift? Or to specify q4_1 in mlx 's own quantization?

aPaleBlueDot · 2024-08-26T09:13:20Z

More broadly speaking, my use case requires quantizations other than the 2/4/8 bit options currently provided, especially Q3, Q5 and Q6 flavors. Are there any future plans to add more quantization options?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove gguf from mlx-swift build #111

remove gguf from mlx-swift build #111

davidkoski commented Jul 10, 2024

aPaleBlueDot commented Aug 26, 2024 •

edited

Loading

aPaleBlueDot commented Aug 26, 2024 •

edited

Loading

remove gguf from mlx-swift build #111

remove gguf from mlx-swift build #111

Comments

davidkoski commented Jul 10, 2024

aPaleBlueDot commented Aug 26, 2024 • edited Loading

aPaleBlueDot commented Aug 26, 2024 • edited Loading

aPaleBlueDot commented Aug 26, 2024 •

edited

Loading

aPaleBlueDot commented Aug 26, 2024 •

edited

Loading