
feat: add support for llama.cpp, ollama, and deepseek completions #22

Open
wants to merge 3 commits into main

Conversation


@hxueh hxueh commented Feb 1, 2025

  • Add integration for llama.cpp to enable new completion workflows.
  • Add integration for Ollama to enable new completion workflows.
  • Add integration for DeepSeek to enable new completion workflows.

hxueh added 2 commits February 1, 2025 17:35
* Add integration for [llama.cpp](https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md#post-infill-for-code-infilling) to enable new completion workflows.
* Add integration for [Ollama](https://github.com/ollama/ollama/blob/main/docs/api.md#generate-a-completion) to enable new completion workflows.
* Add integration for [DeepSeek](https://api-docs.deepseek.com/api/create-completion) to enable new completion workflows.
feat: add support for llama.cpp, ollama, and deepseek completions
@mohankumarelec
Contributor

Hi @hxueh,

Thank you so much for taking the time to create this PR - I really appreciate your effort! 🙌

I was going through the changes, and I realized that llama.cpp, Ollama, and DeepSeek completions follow the OpenAI-compatible API. Since we can simply use the existing OpenAI provider and configure the base URL accordingly, that should be sufficient to support these models, right?

Just wanted to share these references:

Let me know what you think! Again, I really appreciate your contribution.
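The suggestion above could be sketched as follows: the same OpenAI-style provider code can serve multiple backends purely by swapping the base URL. The endpoint path follows the OpenAI-compatible completion APIs; the helper name `completion_request`, the model names, and the local server address are illustrative assumptions, not code from this PR.

```python
# Minimal sketch: one OpenAI-style completion builder, many backends.
# Only the base URL changes; the request shape stays the same.

def completion_request(base_url: str, model: str, prompt: str) -> tuple[str, dict]:
    """Build the URL and JSON body for an OpenAI-style completion call."""
    url = f"{base_url.rstrip('/')}/completions"
    body = {"model": model, "prompt": prompt, "max_tokens": 64}
    return url, body

# Hypothetical usage: the identical provider logic reused per backend.
openai_url, _ = completion_request(
    "https://api.openai.com/v1", "gpt-3.5-turbo-instruct", "def add(a, b):")
ollama_url, _ = completion_request(
    "http://localhost:11434/v1", "codellama", "def add(a, b):")
```

This is the design point being made in the review: no per-backend completion code is needed, only configuration.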

@hxueh
Author

hxueh commented Feb 1, 2025

Hi @mohankumarelec, I did miss the Ollama and llama.cpp docs. Thanks for the references.

I implemented DeepSeek as OpenAI-compatible. For Ollama, I had only read the docs I linked in the PR and missed the compatibility section.

For llama.cpp, however, there is a dedicated API for FIM, which the author of llama.cpp uses in his own VSCode extension, so I think that is the better solution.
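The dedicated endpoint referred to here is the llama.cpp server's `/infill` route, which takes the code before and after the cursor as separate fields rather than a single flattened prompt. A minimal sketch, assuming a locally running server on port 8080 (the helper name `infill_request` and the parameter values are illustrative, not code from this PR):

```python
# Sketch of a request to llama.cpp's /infill endpoint for code infilling.
# The prefix/suffix split is what distinguishes it from a plain completion.

def infill_request(server: str, prefix: str, suffix: str) -> tuple[str, dict]:
    """Build the URL and JSON body for a llama.cpp code-infilling call."""
    url = f"{server.rstrip('/')}/infill"
    body = {
        "input_prefix": prefix,  # code before the cursor
        "input_suffix": suffix,  # code after the cursor
        "n_predict": 64,         # cap on generated tokens (assumed value)
    }
    return url, body

url, body = infill_request(
    "http://localhost:8080",
    "def add(a, b):\n    return ",
    "\n\nprint(add(1, 2))",
)
```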

Thanks for creating such a great VSCode extension. I really love it. Hope my contribution makes it better.

@mohankumarelec
Contributor

Hi @hxueh,

Thank you once again for your valuable contribution! The changes look great. If you could remove the Deepseek and Ollama implementations from the PR (since we can utilize the OpenAI provider for these), I’d be happy to merge the PR with the Llama.cpp provider updates.

Looking forward to your update!

Ollama completion could be done with OpenAI completion.
@hxueh
Author

hxueh commented Feb 1, 2025

Hi @mohankumarelec, I have removed the Ollama implementation. However, the DeepSeek implementation cannot be removed.

According to the list-models docs, DeepSeek uses https://api.deepseek.com or https://api.deepseek.com/v1 as the base URL, but FIM requests must use https://api.deepseek.com/beta as the base URL.

So a single base URL cannot serve both: if it is not set to /beta, FIM requests fail; if it is, fetching the model list fails.
