It would be great if we could use batch mode in LLM providers such as Vertex AI, OpenAI, and Claude.
Working with LLMs at a production level often means processing large volumes of data continuously, which makes cost a major factor.
Both Vertex AI and OpenAI advertise roughly a 50% cost reduction for batch mode. Right now, we would have to switch our services over to each provider's native SDK to use it (see the sketch below).
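For context, this is roughly what the workaround looks like today with OpenAI's native Python SDK. This is a minimal sketch: the model name, prompts, and file path are illustrative, and Vertex AI's batch prediction API follows a similar upload-then-poll pattern.

```python
# Sketch of the current workaround: calling OpenAI's Batch API directly
# through the native SDK instead of going through this library.
import json
from openai import OpenAI

client = OpenAI()

# 1. Write the requests as JSONL, one chat-completion request per line.
#    The prompts and model name here are illustrative.
requests = [
    {
        "custom_id": f"request-{i}",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4o-mini",
            "messages": [{"role": "user", "content": prompt}],
        },
    }
    for i, prompt in enumerate(["Summarize doc A", "Summarize doc B"])
]
with open("batch_input.jsonl", "w") as f:
    f.write("\n".join(json.dumps(r) for r in requests))

# 2. Upload the file and create the batch job (completes within 24h,
#    billed at roughly half the synchronous price).
batch_file = client.files.create(
    file=open("batch_input.jsonl", "rb"), purpose="batch"
)
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)

# 3. Poll for completion, then download the results file.
batch = client.batches.retrieve(batch.id)
if batch.status == "completed":
    results_jsonl = client.files.content(batch.output_file_id).text
```

Having this file-upload/poll/download lifecycle wrapped behind the library's existing provider abstraction would let us keep one integration and still get the batch pricing.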