feat: Adding support for Groq API, Cerebras SDK, compatibility to have much higher response #54

noahvandal · 2026-01-18T01:47:53Z

I added support for the Groq API and the Cerebras SDK, to enable a much higher throughput of RLM inference.

I did note that the gpt-oss series models did not perform that well, as there were errors on the manner in which it was trying to call tools, instead of using the REPL. The other models such as llama-3.3-70b-versatile, et. al., performed fine.
{NOTE}: trying out the gpt-oss on Cerebras worked just fine; this seems to be a Groq issue.

…e speeds

Adding support for Groq API compatibility to have much higher respons…

59c9085

…e speeds

noahvandal changed the title ~~Adding support for Groq API compatibility to have much higher response~~ feat: Adding support for Groq API compatibility to have much higher response Jan 18, 2026

Added support for cerebras sdk also

5c33625

noahvandal changed the title ~~feat: Adding support for Groq API compatibility to have much higher response~~ feat: Adding support for Groq API, Cerebras SDK, compatibility to have much higher response Jan 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Adding support for Groq API, Cerebras SDK, compatibility to have much higher response #54

feat: Adding support for Groq API, Cerebras SDK, compatibility to have much higher response #54

noahvandal commented Jan 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: Adding support for Groq API, Cerebras SDK, compatibility to have much higher response #54

Are you sure you want to change the base?

feat: Adding support for Groq API, Cerebras SDK, compatibility to have much higher response #54

Conversation

noahvandal commented Jan 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

noahvandal commented Jan 18, 2026 •

edited

Loading