
Performance issues with Ollama on first message with disabled MCP servers #82

nicolaskrier opened this issue Mar 25, 2025 · 1 comment

@nicolaskrier (Contributor) commented Mar 25, 2025

Hello,

As discussed in this issue, I am experiencing performance issues while running Dive locally with all configured MCP servers disabled on my MacBook Pro M4 with 48 GB RAM. As @kevinwatt suggested, the slowness occurs only after sending the first message; the rest of the conversation is fine.

Tests Conducted

I performed some tests by typing only a greeting message: "Hello!". Here are the results:

1. Test with Ollama only:

ollama run mistral-small:latest

Result: The response appears quickly.

2. Test with OpenWeb UI and Ollama:

ollama serve

Result: The response appears after 7-8 seconds.

3. Test with Dive and Ollama:

ollama serve

Result: The response appears after 11-12 seconds.
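To isolate frontend overhead entirely, one could also time a request against Ollama's HTTP API directly, with no UI in between. Below is a minimal sketch, assuming Ollama's default endpoint at `http://localhost:11434` and the same model as above; the duration fields come from Ollama's `/api/generate` response, which reports them in nanoseconds:

```python
import json
import time
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default port

def summarize(resp: dict) -> dict:
    """Convert the nanosecond duration fields of an Ollama
    /api/generate response into seconds."""
    ns = 1e9
    return {
        "total_s": resp.get("total_duration", 0) / ns,
        "load_s": resp.get("load_duration", 0) / ns,
        "prompt_eval_s": resp.get("prompt_eval_duration", 0) / ns,
        "eval_s": resp.get("eval_duration", 0) / ns,
    }

payload = json.dumps({
    "model": "mistral-small:latest",
    "prompt": "Hello!",
    "stream": False,
}).encode()

try:
    start = time.monotonic()
    req = urllib.request.Request(
        OLLAMA_URL, data=payload,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req, timeout=120) as r:
        resp = json.load(r)
    print(f"wall clock: {time.monotonic() - start:.1f}s")
    print(summarize(resp))
except OSError as e:
    # Server not running, or request timed out
    print("Ollama not reachable:", e)
```

Comparing `load_s` (model load) and `prompt_eval_s` (prompt processing) against the wall-clock numbers from the frontends should show where the extra seconds go.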

Logs

Here are the logs for the last two tests:

Thank you in advance for your assistance.

@kevinwatt (Contributor) commented Mar 26, 2025

Hi Nico,

Thanks for the testing. I double-checked the log files. They look the same, except for the run times.

Based on our internal discussions, this issue is likely related to Dive Desktop's default system prompt. Like Claude Desktop, Dive ships a system prompt of about 1,000 tokens, which is enabled by default.

https://github.com/OpenAgentPlatform/Dive/blob/main/services/prompt/system.ts
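A back-of-envelope estimate shows why a ~1,000-token system prompt mainly hurts the first message. This is an illustrative sketch; the throughput figure is an assumption, not a measured value for this hardware:

```python
def first_token_delay(prompt_tokens: int, prompt_eval_tps: float) -> float:
    """Seconds spent evaluating the prompt before the first
    output token can be generated."""
    return prompt_tokens / prompt_eval_tps

# A ~1000-token system prompt at an assumed 200 tokens/s of
# prompt evaluation adds roughly 5 s before the first token:
print(f"{first_token_delay(1000, 200.0):.1f}s")  # → 5.0s

# Later turns reuse the cached prompt prefix, so only the new
# user message needs evaluating, which is why the rest of the
# conversation feels fast.
```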

Following those discussions, we will add an option to the system settings in version 0.7.3 to disable the system prompt. Note that turning it off may produce worse results with some less capable models.
Nevertheless, the ability to turn the prompt off is useful for basic tests like this one, or when users simply want to talk to the model directly.

We greatly appreciate your bringing this issue to our attention. Your feedback is instrumental in helping us improve Dive.

@kevinwatt kevinwatt added the todo what needs to be done label Mar 27, 2025