LiteLLM Cache is a proxy server designed to cache your LLM requests, helping to reduce costs and improve efficiency.
Requirements:

- Docker Compose
- Docker
- Configure Settings:
  - Navigate to `./config.yaml` and update the configuration as per your requirements. For more information, visit the LiteLLM Documentation. (An example config sketch follows this list.)
- Prepare Environment Variables:
  - Create a `.env` file from the `.env.sample` file. Adjust the details in `.env` to match your `config.yaml` settings. (A sample `.env` sketch also follows this list.)
- Start the Docker Container:

  ```sh
  docker-compose up -d
  ```
- Update Your LLM Server URL:
  - Change the LLM calling server URL in your application to `http://0.0.0.0:4000`. For example, using the OpenAI Python SDK:

    ```python
    from openai import OpenAI

    llm = OpenAI(base_url='http://0.0.0.0:4000')
    ```

    (A fuller request example follows this list.)
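As a reference for the configuration step, here is a minimal sketch of what a caching-enabled LiteLLM `config.yaml` can look like. The model name, API key reference, and Redis settings are placeholders, and the fields in this repository's `./config.yaml` may differ; the LiteLLM Documentation is the authoritative source for the available options.

```yaml
# Sketch only -- adjust model names, keys, and cache backend to your setup.
model_list:
  - model_name: gpt-3.5-turbo             # name exposed to clients through the proxy
    litellm_params:
      model: openai/gpt-3.5-turbo         # underlying provider/model
      api_key: os.environ/OPENAI_API_KEY  # read from the environment (.env)

litellm_settings:
  cache: True                             # enable response caching
  cache_params:
    type: redis                           # assumed Redis backend; others are possible
    host: os.environ/REDIS_HOST
    port: os.environ/REDIS_PORT
```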
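A hypothetical `.env` matching the sketch above might look like the following; the real variable names should mirror whatever your `.env.sample` and `config.yaml` actually reference.

```
# Hypothetical variable names -- copy .env.sample and fill in real values.
OPENAI_API_KEY=sk-your-key-here
REDIS_HOST=redis
REDIS_PORT=6379
```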
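To go one step beyond client construction, here is a sketch of a full request through the proxy using the OpenAI Python SDK. The `api_key` value and the model name are assumptions that depend on how your proxy is configured.

```python
from openai import OpenAI

# Point the client at the LiteLLM Cache proxy instead of the default OpenAI endpoint.
# The api_key depends on your proxy setup (e.g. a LiteLLM master key); this is a placeholder.
llm = OpenAI(base_url="http://0.0.0.0:4000", api_key="anything")

# The model name must match one exposed by the proxy's config.yaml.
response = llm.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "What is LiteLLM Cache?"}],
)
print(response.choices[0].message.content)

# Sending the identical request again should be answered from the proxy's cache.
```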
With these steps, your LLM requests will be routed through the LiteLLM Cache proxy server, optimizing performance and reducing costs.