Description
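PowerMem queries are extremely slow, taking 6-7 seconds, even though the database holds fewer than 10 records. Both the LLM (Qwen/Qwen3-4B-Instruct-2507-FP8) and the embedding model (Qwen/Qwen3-Embedding-0.6B) are very small models, and the vector store is OceanBase (1024 dimensions). When I test the three components separately with scripts, each shows normal, millisecond-level latency, but once they are combined the delay becomes this large, and I can't tell where the problem lies.
Running the example from https://www.powermem.ai/docs/examples/scenario_10_sparse_vector gives the following result: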
✓ Configuration updated from YAML successfully
✓ Current Mode: UserMemory (Profile Enabled)
✓ Memory initialized successfully
Adding test memories...
Search query: 'AI algorithms'
Search time: 4.074261426925659s
Found 5 results:
- Score: 0.0164
  Content: Machine learning is a branch of artificial intelli...
- Score: 0.0081
  Content: Deep learning uses multi-layer neural networks for...
- Score: 0.0079
  Content: Natural language processing is an interdisciplinar...
- Score: 0.0078
  Content: Knowledge graphs are graph-structured data used to...
- Score: 0.0077
  Content: Vector search is an important technology for infor...
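
To narrow down which stage accounts for the delay, each step could be timed separately inside a single run. The following is only a minimal sketch: `embed_query`, `vector_search`, and `llm_call` are hypothetical stand-ins (not PowerMem functions) to be replaced with the real embedding, OceanBase, and LLM calls, and whether a search invokes the LLM at all is an assumption.

```python
import time
from contextlib import contextmanager

# Accumulated wall-clock seconds per stage label.
timings = {}


@contextmanager
def timed(label):
    """Add the wall-clock time spent inside the with-block to timings[label]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[label] = timings.get(label, 0.0) + time.perf_counter() - start


# Hypothetical stand-ins for the three components; swap in the real calls.
def embed_query(text):
    time.sleep(0.01)  # placeholder for the embedding request
    return [0.0] * 1024


def vector_search(vector, limit=5):
    time.sleep(0.01)  # placeholder for the vector store query
    return [{"score": 0.0, "content": ""} for _ in range(limit)]


def llm_call(prompt):
    time.sleep(0.01)  # placeholder for any LLM request made during search
    return "ok"


def profile_search(query):
    """Run the three stages in sequence, timing each one separately."""
    with timed("embedding"):
        vector = embed_query(query)
    with timed("vector_search"):
        hits = vector_search(vector)
    with timed("llm"):
        llm_call(query)
    return hits


if __name__ == "__main__":
    profile_search("AI algorithms")
    # Print the slowest stage first.
    for label, seconds in sorted(timings.items(), key=lambda kv: -kv[1]):
        print(f"{label}: {seconds:.3f}s")
```

If the per-stage numbers sum to far less than the total search time, the remaining time is being spent outside these three calls, for example in connection setup or extra requests made by the combined pipeline.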