feat: Enhance ai-cache Plugin with Vector Similarity-Based LLM Cache Recall and Multi-DB Support #1248

EnableAsync · 2024-08-25T06:46:25Z

Ⅰ. Describe what this PR did

This PR extends the functionality of the ai-cache plugin, enabling more efficient AI application development by introducing vector similarity-based caching and recall mechanisms.

Ⅱ. Does this pull request fix one issue?

Please refer to issue #1040 and #1041.

Ⅲ. Why don't you add test cases (unit test/integration test)?

Test cases will be added later.

Ⅳ. Describe how to verify it

After filling in the apikey and ChromaCollectionID in docker-compose-test/envoy.yaml, execute the following code:

cd docker-compose-test/
docker compose up

Then test it by accessing the LLM via cURL:

curl http://172.17.0.1:10000/v1/chat/completions -X POST -d '{"model":"gpt-4o-mini","messages":[{"content":"今天中午吃什么","role":"user"}]}' -H "Content-Type: application/json"

Ⅴ. Special notes for reviews

update update: 注意在使用http协议的时候不要用tls update: add lobechat add: makefile for ai-proxy fix bugs fix bugs fix: redis connection fix: dashvector and dashscope cluster fix: change vdb collection feat: add chroma logic docs: 增加 api 说明 update: no callback version fix: change to callback fix: finish chrome remove: key update: gitignore

CLAassistant · 2024-08-25T06:46:32Z

All committers have signed the CLA.

codecov-commenter · 2024-09-06T03:34:11Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 43.52%. Comparing base (ef31e09) to head (a1a7eef).
Report is 173 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1248      +/-   ##
==========================================
+ Coverage   35.91%   43.52%   +7.61%     
==========================================
  Files          69       76       +7     
  Lines       11576    12320     +744     
==========================================
+ Hits         4157     5362    +1205     
+ Misses       7104     6622     -482     
- Partials      315      336      +21

see 69 files with indirect coverage changes

fix: remove key

…to feat/chroma

CH3CHO

LGTM

CH3CHO · 2024-11-08T04:22:56Z

plugins/wasm-go/extensions/ai-cache/vector/weaviate.go

+		return err
+	}
+
+	err = d.client.Post(


SaaS 版本的 Weaviate 似乎需要一个 API Key 才能访问。

johnlanni and others added 9 commits August 1, 2024 15:09

fix bugs

4f7bfbd

fix bugs

0f9e816

fix bugs

ff1bce6

fix conflict

f2a9ff6

Merge branch 'alibaba:main' into main

5cbae03

alter some errors

27b2f71

fix: embedding error

130f2ee

fix bugs && update interface design

56314d7

EnableAsync changed the title ~~feat: Add Chroma vector database support to ai-cache WASM Plugin~~ feat: Add ai-cache WASM Plugin Aug 25, 2024

EnableAsync changed the title ~~feat: Add ai-cache WASM Plugin~~ feat: Enhance ai-cache Plugin with Vector Similarity-Based LLM Cache Recall and Multi-DB Support Aug 25, 2024

EnableAsync and others added 7 commits August 25, 2024 17:20

feat: add elasticsearch

3d7e85c

fix bugs && refine the variable names

85549d0

update design for cache to support extension

8444f5e

Merge branch 'alibaba:main' into main

a655bc4

Merge branch 'alibaba:main' into feat/chroma

57bc863

Refined the code; README.md content needs to be updated.

d68fa88

add: makefile for weaviate

d6c643f

EnableAsync and others added 10 commits September 6, 2024 15:32

feat: add weaviate

3f3a1bc

feat: add pinecone

71cc25b

fix: remove key

fix bugs, README.md to be updated

5179392

fix bugs, refine variable name, update README.md

ece7e2f

Merge branch 'alibaba:main' into main

e868a1a

delete folder

138a526

Merge branch 'feat/chroma' of https://github.com/Suchun-sv/higress in…

65aafbd

…to feat/chroma

fix: format

bfaed4c

fix typos

e8ad550

update

a40f5e9

Suchun-sv and others added 22 commits October 20, 2024 18:46

update test

4caf9be

fix bugs

d04d78a

update

81bde6d

fix: bugs

ea34f4a

Merge branch 'main' into main

784740f

add support for skip-cache

f5b50fd

update README.md and change to FQDNCluster

a1fe701

change to FQDNCluster

730d951

provide support for the legacy configuration

335c04c

simplify resp func, add func name when debug

59bddf6

Merge branch 'alibaba:main' into main

e4901d9

change *.typ to *

36f0d77

add support for legacy config

009a1b1

update content_type in stream resp

4515f43

fix bugs

c048280

add support for legacy configuration

0ec24f3

fix bugs

a658bfe

handle the data: [DONE] and return in escaped string

a199144

dont read resp when ERROR_PARTIAL_MESSAGE_KEY not nil

77f05d6

Update redis_wrapper.go

28c629c

merge

bd84cd0

merge

d9ce358

EnableAsync requested review from Xunzhuo and 2456868764 as code owners October 29, 2024 04:18

EnableAsync and others added 4 commits October 29, 2024 12:19

update: README.md

4a95557

merge

04f288c

fix: READMME.md

902d810

Update README.md

a1a7eef

CH3CHO approved these changes Nov 8, 2024

View reviewed changes

CH3CHO requested changes Nov 8, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Enhance ai-cache Plugin with Vector Similarity-Based LLM Cache Recall and Multi-DB Support #1248

feat: Enhance ai-cache Plugin with Vector Similarity-Based LLM Cache Recall and Multi-DB Support #1248

EnableAsync commented Aug 25, 2024 •

edited

Loading

CLAassistant commented Aug 25, 2024 •

edited

Loading

codecov-commenter commented Sep 6, 2024 •

edited

Loading

CH3CHO left a comment

CH3CHO Nov 8, 2024

feat: Enhance ai-cache Plugin with Vector Similarity-Based LLM Cache Recall and Multi-DB Support #1248

Are you sure you want to change the base?

feat: Enhance ai-cache Plugin with Vector Similarity-Based LLM Cache Recall and Multi-DB Support #1248

Conversation

EnableAsync commented Aug 25, 2024 • edited Loading

Ⅰ. Describe what this PR did

Ⅱ. Does this pull request fix one issue?

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

Ⅴ. Special notes for reviews

CLAassistant commented Aug 25, 2024 • edited Loading

codecov-commenter commented Sep 6, 2024 • edited Loading

Codecov Report

CH3CHO left a comment

Choose a reason for hiding this comment

CH3CHO Nov 8, 2024

Choose a reason for hiding this comment

EnableAsync commented Aug 25, 2024 •

edited

Loading

CLAassistant commented Aug 25, 2024 •

edited

Loading

codecov-commenter commented Sep 6, 2024 •

edited

Loading