Add save/load to Embeddings #8818

TomeHirata · 2025-09-17T06:59:43Z

Copilot

Pull Request Overview

Adds save/load functionality to the Embeddings class, enabling persistence of embeddings indices to disk for fast loading without recomputing embeddings.

Implements save(), load(), and from_saved() methods for the Embeddings class
Adds comprehensive test coverage for save/load functionality including error handling
Removes the TODO comment about adding save/load methods

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
dspy/retrievers/embeddings.py	Implements save/load methods with pickle for config, numpy for embeddings, and FAISS index persistence
tests/retrievers/test_embeddings.py	Adds comprehensive tests for save/load functionality and error cases
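To make the persistence layout described above concrete, here is a minimal stdlib-only sketch, not the PR's code. It keeps the config in a pickle file and the vectors in a separate binary file. The names `save_index`, `load_index`, `config.pkl`, and `embeddings.bin` are hypothetical, a flat float32 buffer stands in for the numpy array, and FAISS index persistence is left out entirely.

```python
import pickle
from array import array
from pathlib import Path


def save_index(path, config, vectors):
    """Write config and vectors as two separate files under `path` (hypothetical layout)."""
    root = Path(path)
    root.mkdir(parents=True, exist_ok=True)
    with open(root / "config.pkl", "wb") as f:
        pickle.dump(config, f)
    # Flatten all vectors into one float32 buffer (stand-in for a numpy file).
    flat = array("f", [x for vec in vectors for x in vec])
    with open(root / "embeddings.bin", "wb") as f:
        flat.tofile(f)


def load_index(path):
    """Read back what save_index wrote; fail loudly if nothing was saved."""
    root = Path(path)
    if not (root / "config.pkl").exists():
        raise FileNotFoundError(f"no saved index at {root}")
    with open(root / "config.pkl", "rb") as f:
        config = pickle.load(f)
    flat = array("f")
    with open(root / "embeddings.bin", "rb") as f:
        flat.frombytes(f.read())
    # Rebuild rows using the dimension stored in the config.
    dim = config["dim"]
    vectors = [list(flat[i:i + dim]) for i in range(0, len(flat), dim)]
    return config, vectors
```

Splitting the small config from the large numeric payload means loading never has to recompute embeddings. Note that unpickling is only safe for files from a trusted source.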



chenmoneygithub · 2025-09-18T19:34:06Z

dspy/retrievers/embeddings.py

+            self.index = None
+
+        # Reinitialize the search function
+        self.search_fn = Unbatchify(self._batch_forward)


Do we need to reinitialize search_fn?

TomeHirata · 2025-09-19

Yup, this is because from_saved bypasses __init__.
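The point about bypassing `__init__` can be illustrated with a toy class. This is a sketch under assumptions, not the dspy implementation: `ToyIndex` and its attributes are hypothetical, and using `cls.__new__` is just one common way a classmethod can skip `__init__`; the thread does not say which mechanism the PR uses.

```python
class ToyIndex:
    """Hypothetical stand-in for a retriever; not the dspy Embeddings class."""

    def __init__(self, corpus):
        self.corpus = list(corpus)
        # Derived attribute that only __init__ sets up.
        self.search_fn = self._search

    def _search(self, query):
        return [doc for doc in self.corpus if query in doc]

    def load(self, state):
        self.corpus = list(state["corpus"])
        # Required when the instance was created without running __init__.
        self.search_fn = self._search
        return self

    @classmethod
    def from_saved(cls, state):
        # cls.__new__ allocates the object without calling __init__,
        # so search_fn does not exist until load() assigns it.
        instance = cls.__new__(cls)
        return instance.load(state)
```

Without the assignment inside `load`, an object built through `from_saved` would raise `AttributeError` on the first call to `search_fn`.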

chenmoneygithub · 2025-09-18T19:36:42Z

dspy/retrievers/embeddings.py

+                # but we can still save the embeddings for brute force search
+                pass
+
+    def load(self, path: str, embedder):


How do we want load to be called? Do we want users to first create an Embeddings instance and then call embedding.load(), or should we make it a class method that returns a loaded Embeddings object?

TomeHirata · 2025-09-19

This is actually only for consistency with other APIs like module.load. I guess most people will use from_saved. Do you think we should make load a classmethod?

Gotcha. I am asking because of the line self.search_fn = Unbatchify(self._batch_forward). If we do:

    embedder = dspy.Embeddings(...)
    embedder.load(...)

do we still need this self.search_fn = Unbatchify(self._batch_forward)?
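One way to reason about this question is through Python's bound-method semantics. The sketch below uses a hypothetical `ToyIndex` class, not dspy code, and shows that a bound method stored in `__init__` keeps working after `load` replaces the instance's data. Whether `Unbatchify` holds additional state that would need resetting is not settled by this thread, so treat this as an illustration only.

```python
class ToyIndex:
    """Hypothetical stand-in; only demonstrates bound-method behavior."""

    def __init__(self, corpus):
        self.corpus = list(corpus)
        self.search_fn = self._search  # bound to this instance

    def _search(self, query):
        return [doc for doc in self.corpus if query in doc]

    def load(self, state):
        # Only the data is replaced; search_fn is left untouched.
        self.corpus = list(state["corpus"])
        return self


index = ToyIndex(["old document"])
fn_before = index.search_fn  # reference captured before load()
index.load({"corpus": ["new document"]})
```

Because `fn_before` looks up `self.corpus` at call time, it sees the loaded data. In this toy setting, reassigning `search_fn` inside `load` would be redundant but harmless for instances created through `__init__`.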

chenmoneygithub · 2025-09-22T20:24:41Z

LGTM after #8818 (comment) is resolved, thank you!

Add save/load to Embeddings

6bba980

TomeHirata requested review from Copilot, okhat and chenmoneygithub September 17, 2025 06:59

TomeHirata mentioned this pull request Sep 17, 2025

[Feature] Add .save and .load methods in embeddings.py #8807

Open

Copilot AI reviewed Sep 17, 2025

chenmoneygithub reviewed Sep 18, 2025

comment

409e007
