Automatic adjustment of itopk size according to filtering rate #509

anaruse · 2024-12-04T06:40:23Z

This PR is based on #492.

The new multi-CTA algorithm proposed in #492 can be used to obtain good recall even with high filtering rates. However, good recall cannot be obtained unless the number of search iterations, or itopk size, one of CAGRA's search parameters, is appropriately increased according to the filtering rate. Therefore, users need to find the appropriate itopk size according to the filtering rate by trial and error, which is a pain.

This PR is intended to alleviate this problem by internally calculating the filtering rate and automatically adjusting the itopk size accordingly.

copy-pr-bot · 2024-12-04T06:40:26Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

…when the number of results is large Fix some issues Fix lower recall issue with new multi-cta algo Removing redundant code and changing some parameters Update cpp/src/neighbors/detail/cagra/search_plan.cuh Co-authored-by: Tamas Bela Feher <[email protected]> Remove an unnecessary line and satisfy clang-format

cjnolet · 2024-12-05T21:40:50Z

/ok to test

Handle the case when the search result contains invalid indices when building the updated graph in add_nodes. For debugging purposes, fail if any invalid indices found; in future, we can replace RAFT_FAIL with RAFT_LOG_WARN to make the add_nodes routine more robust.

…of result_buffer

Co-authored-by: Artem M. Chirkin <[email protected]>

cjnolet · 2025-01-23T18:05:54Z

cpp/include/cuvs/neighbors/cagra.hpp

+   * The value must be equal to or greater than 0.0 and less than 1.0. Default value is
+   * negative, in which case the filtering rate is automatically set.
+   */
+  float filtering_rate = -1.0;


This is interesting. So is specifying this value an alternative to explicitly computing the sparsity of the pre-filtering bitset (which would introduce additional compute into the search)?

It would be clearer to write "automatically computed" rather than "automatically set" here. I will fix it.

I like that. "automatically compted" is perfect.

cjnolet · 2025-01-23T18:14:56Z

cpp/src/neighbors/cagra.cuh

@@ -349,9 +351,18 @@ void search(raft::resources const& res,
    auto& sample_filter =
      dynamic_cast<const cuvs::neighbors::filtering::bitset_filter<uint32_t, int64_t>&>(
        sample_filter_ref);
+    search_params params_copy = params;
+    if (params.filtering_rate < 0.0) {


Oh I see- the filtering rate set to "auto" will automatically calculate the sparsity of the bitset while a user can skip having to compute this if they set the filtering rate directly. I like this.

That is correct. By default, it calculates the filtering rate, but if the user specifies the filtering rate, that calculation is skipped.

cjnolet · 2025-01-23T18:40:05Z

/ok to test

cjnolet · 2025-01-23T18:48:17Z

@anaruse it looks like there's some failed style checkers in this PR. Can you run the git pre-commit hooks to fix them? Here's the docs to enable them.

cjnolet · 2025-01-29T16:33:42Z

/ok to test

cjnolet · 2025-01-30T14:27:43Z

/ok to test

cjnolet · 2025-01-30T22:58:48Z

@anaruse I think the changes look great. Can you also add the new search param to the main indexing docs for completeness? That would be the file in docs/source/indexes/cagra.rst

anaruse · 2025-01-31T08:25:11Z

@anaruse I think the changes look great. Can you also add the new search param to the main indexing docs for completeness? That would be the file in docs/source/indexes/cagra.rst

I've updated text in the filtering considerations section of docs/source/indexes/cagra.rst. What do you think of this?

cjnolet · 2025-02-01T02:53:42Z

/ok to test

cjnolet · 2025-02-01T02:58:19Z

/ok to test

cjnolet · 2025-02-01T05:50:11Z

/merge

anaruse requested a review from a team as a code owner December 4, 2024 06:40

github-actions bot added the cpp label Dec 4, 2024

anaruse marked this pull request as draft December 4, 2024 06:40

anaruse added 2 commits December 5, 2024 15:18

Merge branch 'branch-24.12' into improved_multi_cta_algo

8ff6991

anaruse force-pushed the adjust_itopk branch 2 times, most recently from d2e8d4c to aa7cfde Compare December 5, 2024 09:52

tfeher and others added 2 commits December 5, 2024 07:51

fix style

37e26c1

Merge branch 'branch-24.12' into improved_multi_cta_algo

3665d45

cjnolet assigned anaruse Dec 5, 2024

cjnolet added improvement Improves an existing functionality non-breaking Introduces a non-breaking change vector search labels Dec 5, 2024

achirkin and others added 14 commits December 9, 2024 11:13

Merge branch 'branch-25.02' into improved_multi_cta_algo

018e792

Resolving various issues with the new multi-CTA algorithm

bedd224

Add comments in add_nodes.cuh

ea8c273

Limit tht number of warnings output

5025481

Avoid invalid results in search results as much as possible

b61126a

Improve the accuracy of the new multi-CTA algo by revising the usase …

588bd0c

…of result_buffer

Reduce the number of shared memory access

228a1ae

Remove unused code

776f2f5

Merge branch 'branch-25.02' into improved_multi_cta_algo

9d262f7

Update cpp/src/neighbors/detail/cagra/device_common.hpp

192c0a9

Co-authored-by: Artem M. Chirkin <[email protected]>

Merge branch 'branch-25.02' into improved_multi_cta_algo

d19a6c4

Merge branch 'branch-25.02' into improved_multi_cta_algo

b5c31b3

Fixed data type issues

81e4b39

cjnolet reviewed Jan 23, 2025

View reviewed changes

cjnolet changed the base branch from branch-24.12 to branch-25.02 January 23, 2025 18:39

cjnolet marked this pull request as ready for review January 23, 2025 18:39

cjnolet and others added 5 commits January 23, 2025 13:49

Merge branch 'branch-25.02' into improved_multi_cta_algo

cdc4bc4

Merge branch 'branch-25.02' into improved_multi_cta_algo

dd371dc

Merge branch 'branch-25.02' into improved_multi_cta_algo

e769ca7

Merge branch 'branch-25.02' into improved_multi_cta_algo

ce93427

Merge branch 'branch-25.02' into improved_multi_cta_algo

c133c8b

anaruse added 2 commits January 30, 2025 02:10

Fixed problem of infinite loop when graph degree is small

baa3c0c

Adjust itopk size according to filtering rate

06d404c

anaruse force-pushed the adjust_itopk branch from 622ec68 to 06d404c Compare January 30, 2025 06:37

anaruse added 2 commits January 30, 2025 20:34

Merge branch 'branch-25.02' into adjust_itopk

23852af

Fix merge mistake

7e7ff1e

Updated text in the filtering considerration of cagra.rst

b05f174

anaruse requested a review from a team as a code owner January 31, 2025 08:23

Merge branch 'branch-25.02' into adjust_itopk

33eda3a

Merge branch 'branch-25.02' into adjust_itopk

6566a08

cjnolet approved these changes Feb 1, 2025

View reviewed changes

rapids-bot bot merged commit 888a34f into rapidsai:branch-25.02 Feb 1, 2025
61 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatic adjustment of itopk size according to filtering rate #509

Automatic adjustment of itopk size according to filtering rate #509

anaruse commented Dec 4, 2024

copy-pr-bot bot commented Dec 4, 2024

cjnolet commented Dec 5, 2024

cjnolet Jan 23, 2025

anaruse Jan 24, 2025

cjnolet Jan 30, 2025

cjnolet Jan 23, 2025

anaruse Jan 24, 2025

cjnolet commented Jan 23, 2025

cjnolet commented Jan 23, 2025

cjnolet commented Jan 29, 2025

cjnolet commented Jan 30, 2025

cjnolet commented Jan 30, 2025

anaruse commented Jan 31, 2025

cjnolet commented Feb 1, 2025

cjnolet commented Feb 1, 2025

cjnolet commented Feb 1, 2025

Automatic adjustment of itopk size according to filtering rate #509

Automatic adjustment of itopk size according to filtering rate #509

Conversation

anaruse commented Dec 4, 2024

copy-pr-bot bot commented Dec 4, 2024

cjnolet commented Dec 5, 2024

cjnolet Jan 23, 2025

Choose a reason for hiding this comment

anaruse Jan 24, 2025

Choose a reason for hiding this comment

cjnolet Jan 30, 2025

Choose a reason for hiding this comment

cjnolet Jan 23, 2025

Choose a reason for hiding this comment

anaruse Jan 24, 2025

Choose a reason for hiding this comment

cjnolet commented Jan 23, 2025

cjnolet commented Jan 23, 2025

cjnolet commented Jan 29, 2025

cjnolet commented Jan 30, 2025

cjnolet commented Jan 30, 2025

anaruse commented Jan 31, 2025

cjnolet commented Feb 1, 2025

cjnolet commented Feb 1, 2025

cjnolet commented Feb 1, 2025