[ENH] Implements DoRA #790


Open · wants to merge 60 commits into main

Conversation

julian-fong (Contributor) commented Feb 4, 2025

julian-fong (Contributor, Author):

@calpt I've written out the compose function, using https://github.com/NVlabs/DoRA/blob/7ee989695252cb0f4f7579182c81581aa75139a7/commonsense_reasoning/peft/src/peft/tuners/dora.py#L411 as a reference point.

Using the first notebook as a test bed, it does train; however, the performance is quite poor (it does not outperform LoRA by a meaningful margin). Maybe there is something wrong with my implementation (I'm also curious why they compute norm_scale - 1).

It also does not seem straightforward to write out the delta_w methods and the inv_com method, as I believe they would require the pretrained weights W_o as an input for the calculation.
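For reference, the weight-level formulation from the DoRA paper is roughly W' = m * (W_0 + BA) / ||W_0 + BA||, where m is the learned magnitude vector and the norm is taken over each output unit's weight vector (the column-wise norm in the paper's notation; dim=1 on a PyTorch nn.Linear weight). A minimal sketch of that formula, with placeholder names and not the code in this PR:

import torch

def dora_merged_weight(W_0, lora_A, lora_B, magnitude):
    # W' = m * (W_0 + B A) / ||W_0 + B A||, one norm per output row
    directional = W_0 + lora_B @ lora_A                 # frozen weight plus low-rank update
    norm = directional.norm(p=2, dim=1, keepdim=True)   # per-output-row norm, shape (out, 1)
    return magnitude.view(-1, 1) * directional / norm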

An initial review is appreciated :)

calpt (Member) left a comment:


This is in a very promising state; the changes look good to me from a functional perspective. What I think could still be improved is the readability of the implementation, to make sure it is as easily understandable as possible for developers. I left a few comments, mainly in this regard.

Comment on lines 469 to 473
norm_scale = m.weight.view(-1) / norm  # magnitude vector m divided by the norm of (W_0 + BA)
scaled_weights = (norm_scale - 1) * weights  # "- 1" because `com` re-adds the frozen layer output afterwards
scaled_lora = norm_scale * added
result = scaled_weights + scaled_lora
return result
calpt (Member):

I believe this computation is due to us calling the com method afterwards, which re-adds the layer output? While this technically works, I think we should refactor it to make the code overall more readable (even if that makes it less concise). The formulation from the paper should ideally be clearly readable in our code to make it as understandable as possible for developers.
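One possible shape for that refactor (just a sketch, reusing the names from the snippet above and still assuming that com re-adds the frozen layer output, weights, afterwards):

norm_scale = m.weight.view(-1) / norm         # m / ||W_0 + BA||
dora_output = norm_scale * (weights + added)  # the paper's m * (W_0 x + BAx) / ||W_0 + BA||
result = dora_output - weights                # subtract W_0 x once, since `com` adds it back
return result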

julian-fong (Contributor, Author):

That's right. The idea here for the method compute_dora_deltaw was to formulate the 'BAx' that goes into the Vera or LoRA layers. I tried to formulate it this way so that we could still utilize the original com method and keep as much of the original code intact as possible. The method also isn't particularly extensible, since it's designed specifically for the "W_oX + BAx" com method.

"""This function returns the required weights necessary
to compute the inverse composition where `composition_mode` == add.
"""
result = weights - weights * norm.unsqueeze(1) / m.weight - added
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar to the above, I think we should try to make this more readable.
