
fix: biencoder PEFT adapter key remapping for merge_lora#1478

Open
adil-a wants to merge 3 commits into r0.3.0 from fix/biencoder-peft-key-remapping-r030

Conversation

adil-a (Collaborator) commented Mar 7, 2026

Summary

  • Fix PEFT adapter key remapping in BiencoderStateDictAdapter so that merge_lora.py works against the standalone base model (LlamaBidirectionalModel)
  • The biencoder wraps the base model as lm_q, so adapter weights and target modules were saved with incorrect prefixes (lm_q. was being remapped to model. instead of being stripped to bare names)
  • LlamaBidirectionalModel extends LlamaModel (not LlamaForCausalLM), so its modules are layers.* not model.layers.*
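The prefix mismatch above can be sketched with plain key strings. This is an illustration only; the key names are representative examples, not taken from an actual checkpoint:

```python
# Illustration only: module paths as they appear in state_dict keys.
# LlamaForCausalLM nests the decoder under a .model attribute, while
# LlamaModel (which LlamaBidirectionalModel extends) holds layers directly.
causal_lm_key = "model.layers.0.self_attn.q_proj.weight"  # LlamaForCausalLM-style
base_model_key = "layers.0.self_attn.q_proj.weight"       # LlamaModel-style (bare)

# The biencoder additionally wraps the base model as lm_q, so its
# adapter keys carry an extra prefix the standalone model never sees:
biencoder_key = "lm_q." + base_model_key
```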

Changes

  • state_dict_adapter.py: Fix PEFT key path in to_hf (base_model.model.lm_q.X → base_model.model.X; it was incorrectly producing base_model.model.model.X). Fix from_hf to handle the corrected format.
  • addons.py: Strip lm_q. prefix from target modules in _extract_target_modules for biencoder, so adapter_config.json has module names matching the standalone HF base model.
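A minimal sketch of the corrected remapping as simple prefix manipulation; the helper names (to_hf_key, from_hf_key) and exact prefix constants are illustrative assumptions, not the actual BiencoderStateDictAdapter API:

```python
# Hedged sketch of the to_hf/from_hf key remapping described above.
BIENCODER_PREFIX = "base_model.model.lm_q."
HF_PREFIX = "base_model.model."

def to_hf_key(key: str) -> str:
    """Strip the biencoder's lm_q. wrapper without adding model.

    base_model.model.lm_q.layers.0.self_attn.q_proj.lora_A.weight
      -> base_model.model.layers.0.self_attn.q_proj.lora_A.weight
    """
    if key.startswith(BIENCODER_PREFIX):
        return HF_PREFIX + key[len(BIENCODER_PREFIX):]
    return key

def from_hf_key(key: str) -> str:
    """Inverse direction: re-insert the lm_q. wrapper under the HF prefix."""
    if key.startswith(HF_PREFIX):
        return BIENCODER_PREFIX + key[len(HF_PREFIX):]
    return key
```

Under this sketch, from_hf_key(to_hf_key(k)) == k for any biencoder-prefixed key, which is what the roundtrip unit tests in the test plan verify.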

Test plan

  • Unit tests for PEFT key roundtrip (to_hf → from_hf)
  • Unit test for target module extraction with biencoder
  • All existing test_addons.py and test_state_dict_adapter.py tests pass (31 total)
  • End-to-end: run biencoder training → merge_lora.py against nvidia/llama-nemotron-embed-1b-v2

🤖 Generated with Claude Code

The biencoder wraps the base model as `lm_q`, so PEFT adapter weights
and target modules were saved with `lm_q.` prefix. However, the
standalone base model (LlamaBidirectionalModel extends LlamaModel) uses
bare module names like `layers.0.self_attn.q_proj` — no `model.` prefix.

This caused merge_lora.py to fail because PEFT could not match the
target modules or weight keys against the base model.

Changes:
- state_dict_adapter: fix PEFT key path in to_hf to strip `lm_q.`
  without adding `model.` (base_model.model.lm_q.X → base_model.model.X)
- state_dict_adapter: fix from_hf to handle new PEFT key format
  (base_model.model.X → base_model.model.lm_q.X)
- addons: strip `lm_q.` prefix from target modules for biencoder so
  adapter_config.json is compatible with the standalone HF base model

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
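The target-module part of the fix can be sketched similarly; strip_lm_q and its list-of-paths input format are hypothetical stand-ins for the logic in _extract_target_modules:

```python
def strip_lm_q(target_modules: list[str]) -> list[str]:
    """Drop the biencoder's lm_q. wrapper so module names written to
    adapter_config.json match the standalone LlamaBidirectionalModel,
    whose paths are bare (layers.*, with no lm_q. or model. prefix)."""
    return [m.removeprefix("lm_q.") for m in target_modules]
```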
@adil-a adil-a requested review from a team, HuiyingLi, akoumpa and hemildesai as code owners March 7, 2026 00:31

copy-pr-bot bot commented Mar 7, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@akoumpa akoumpa added the r0.3.0 Add for cherry-pick into release branch r0.3.0 label Mar 7, 2026
@adil-a adil-a removed the r0.3.0 Add for cherry-pick into release branch r0.3.0 label Mar 7, 2026
adil-a (Collaborator, Author) commented Mar 7, 2026

/ok to test b6c94c1

Signed-off-by: adil-a <adil.asif2000@hotmail.com>
adil-a (Collaborator, Author) commented Mar 7, 2026

/ok to test 019acfc

