
feat: add reranker training#1449

Draft
adil-a wants to merge 17 commits into main from adil/retriever-feat

Conversation

@adil-a
Collaborator

@adil-a adil-a commented Mar 4, 2026

  • Add retriever train recipe, retriever collator, and retriever data loader
  • Remove lm_p
  • Change Biencoder -> Encoder

Signed-off-by: adil-a <adil.asif2000@hotmail.com>
@copy-pr-bot

copy-pr-bot bot commented Mar 4, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

adil-a added 5 commits March 4, 2026 07:49
@adil-a changed the title from feat: add retriever training to feat: add reranker training Mar 5, 2026
adil-a added 7 commits March 5, 2026 04:42
@rnyak mentioned this pull request Mar 5, 2026
adil-a added 2 commits March 6, 2026 16:26
)
with train_ctx:
outputs = model(**batch, return_dict=True)
logits = outputs.logits.view(-1, self.train_n_passages)
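The `view(-1, self.train_n_passages)` call regroups the flat per-(query, passage) logits into one row per query. A pure-Python sketch of that grouping (`group_passage_scores` is a hypothetical helper, not from this PR; the real code uses `torch.Tensor.view`):

```python
def group_passage_scores(flat_scores, n_passages):
    """Mimic logits.view(-1, n_passages): regroup a flat list of
    per-(query, passage) scores into one row per query."""
    if len(flat_scores) % n_passages != 0:
        raise ValueError("score count must be divisible by n_passages")
    return [flat_scores[i:i + n_passages]
            for i in range(0, len(flat_scores), n_passages)]

# Two queries, train_n_passages = 3 (1 positive + 2 negatives each).
scores = [0.9, 0.1, 0.2, 0.8, 0.3, 0.05]
print(group_passage_scores(scores, 3))  # [[0.9, 0.1, 0.2], [0.8, 0.3, 0.05]]
```

If the flat batch length is not a multiple of `n_passages`, the reshape cannot be done cleanly, which is one reason the train and validation passage counts need to match the data layout.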
Collaborator


Do we basically want to use the same value of train_n_passages for both training and validation? In the yaml file, train_n_passages is defined twice, under dataloader and validation_dataloader. What if the user puts a different value in each? According to the line above, we'd still be using the train_n_passages defined under the dataloader field in the yaml, right?

# model_type: crossencoder
# data_dir_list: training_datasets/validation.json
# data_type: eval
# train_n_passages: 5
Collaborator


I think we'd better name this val_n_passages, because in the current case it has to be identical to train_n_passages. I wonder what happens if we set it to a different number than in training. I left a related comment about this below.


seed: 42

train_n_passages: 5
Collaborator


Why do we duplicate this in three different places? Isn't that risky? Why not define it only once?

seed: 42

train_n_passages: 5
eval_negative_size: 4
Collaborator


I think we are overcomplicating the yaml. Why do we need eval_negative_size?

Option 1:

  • set train_n_passages only once in the yaml and use it everywhere, for both training and evaluation.

Option 2:

  • set both train_n_passages and val_n_passages, and use the first for the training process and the second for the validation process.

Collaborator Author


I will make the change to support the second option.
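A minimal sketch of that second option (the key name val_n_passages and the helper resolve_n_passages are hypothetical, not taken from this PR): training always reads train_n_passages, and validation falls back to it when val_n_passages is unset, so the two stay consistent by default:

```python
def resolve_n_passages(cfg, split):
    """Pick the passage count for a dataloader split.

    Training always uses train_n_passages; validation uses
    val_n_passages when given, otherwise falls back to the
    training value so the defaults stay consistent.
    """
    if split == "train":
        return cfg["train_n_passages"]
    return cfg.get("val_n_passages", cfg["train_n_passages"])

cfg = {"train_n_passages": 5}
print(resolve_n_passages(cfg, "train"))  # 5
print(resolve_n_passages(cfg, "val"))    # 5 (fallback to train value)
cfg["val_n_passages"] = 3
print(resolve_n_passages(cfg, "val"))    # 3 (explicit override)
```

The fallback means a user who sets only train_n_passages gets identical behavior to option 1, while still being able to override the validation count explicitly.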

q_reps = model(query)
p_reps = model(passage)

n_passages = self.eval_negative_size + 1
Collaborator


A bit unnecessary; related to my comment above. If we go with option 1, then n_passages is basically the same as self.train_n_passages.
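Either way, the grouping the snippet implies works like this pure-Python sketch (the dot helper and rank_positive_first are hypothetical stand-ins for the tensor ops; the positive passage is assumed at index 0 of each group):

```python
def dot(u, v):
    """Dot product of two vectors given as lists."""
    return sum(a * b for a, b in zip(u, v))

def rank_positive_first(q_reps, p_reps, n_passages):
    """For each query, score its n_passages candidate passages and
    report whether the positive (index 0 of each group) scores highest."""
    hits = []
    for i, q in enumerate(q_reps):
        group = p_reps[i * n_passages:(i + 1) * n_passages]
        scores = [dot(q, p) for p in group]
        hits.append(scores[0] == max(scores))
    return hits

eval_negative_size = 2
n_passages = eval_negative_size + 1  # 1 positive + negatives per query
q_reps = [[1.0, 0.0]]
p_reps = [[0.9, 0.1], [0.2, 0.8], [0.1, 0.3]]  # positive first
print(rank_positive_first(q_reps, p_reps, n_passages))  # [True]
```

The only role of eval_negative_size here is to recover the group width, which is why a single n_passages value (per the discussion above) would be simpler.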

