Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat add json plugin and support low, medium, high reasoning efforts for thinking models #163

Merged
merged 64 commits into from
Feb 27, 2025

Conversation

codelion
Copy link
Owner

@codelion codelion commented Feb 5, 2025

  • Add support to generate JSON from any model using OpenAI compatible pydantic types or jsonschema.
  • Add support for reasoning_effort, you can use it with deepseek r1 models.
  • Simplify local inference and support flash-atten 2 the local inference now works with very little degradation in performance when compared to vllm for requests that are not batched.

@codelion codelion marked this pull request as draft February 5, 2025 13:59
@codelion codelion changed the title Feat add json plugin Feat add json plugin and support low, medium, high reasoning efforts for thinking models Feb 27, 2025
@codelion codelion marked this pull request as ready for review February 27, 2025 00:46
@codelion codelion merged commit 7be1f50 into main Feb 27, 2025
1 check passed
@codelion codelion deleted the feat-add-json-plugin branch February 27, 2025 00:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant