Pull Request: Adding HiRA integration into PEFT library #2668

hqsiswiliam · 2025-07-24T10:32:51Z

Feature request

This request proposes integrating HiRA (Hadamard High-Rank Adaptation) as described in the ICLR 2025 oral paper (https://openreview.net/pdf?id=TwJrTz9cRS) (https://iclr.cc/virtual/2025/oral/31839) and implemented in the hqsiswiliam/hira repository into the core PEFT library. This will enable users to apply HiRA through the familiar get_peft_model API and benefit from its high-rank updates without adding any inference overhead.

Motivation

General Motivation

PEFT methods like LoRA achieve parameter-efficient fine-tuning by injecting low-rank updates into pre-trained weights. While effective, purely low-rank adaptation can struggle to capture complex patterns in large language models.

1. Expressiveness grows with the rank

Empirically, increasing the LoRA rank in LLM training yields better downstream performance:

Higher LoRA rank correlates with improved task accuracy.

2. HiRA: Hadamard high-rank updates without extra parameters

HiRA sidesteps the expressiveness constraint by computing a Hadamard-enhanced update:

$$ \Delta W = W_0 \odot (A B) $$

HiRA uses the Hadamard product to inject high-rank structure into the frozen weight matrix $W_0$ via low-rank matrix $A$ and $B$.

3. Singular-value patterns

After training, HiRA exhibits a rich singular-value pattern, akin to full-rank fine-tuning (FFT), indicating its ability to model complex transformations without the expensive computational overhead:

HiRA’s singular-value distribution closely mirrors that of FFT.

4. Performance gains

Across commonsense reasoning benchmarks, HiRA outperforms LoRA and other PEFT baselines:

HiRA delivers notable accuracy improvements over baseline adapters.

5. No extra parameter or compute cost

Despite its high-rank behaviour, HiRA introduces no additional trainable parameters compared to LoRA:

HiRA matches LoRA’s GRAM usage and training hours.

6. Complementary with LoRA (HiLoRA)

Combining HiRA and LoRA into a hybrid “HiLoRA” setup yields even stronger results than either method alone:

HiLoRA leverages both low-rank and Hadamard high-rank updates for better expressiveness.

By integrating HiRA into PEFT, users gain richer adaptation capability without sacrificing the parameter efficiency and usability that PEFT provides.

Your contribution

We would be pleased to submit a pull request to integrate HiRA class implementation into the PEFT framework. We welcome any suggestions for alternative integration approaches and appreciate any guidance on best practices.

BenjaminBossan

Thanks for this PR to add HiRA to PEFT. The method looks promising and the provided code is already quite mature.

When I started reading the paper, I was at first reminded of FedPara, aka LoHa, which is already integrated into PEFT, as that method also relies on the Hadamard product. However, IIUC, the two methods are still distinct: HiRA basically corresponds to LoRA, but instead of adding dW, we multiply it. In that way, it is much closer to LoRA than to LoHa. Still, I wanted to flag this, as I'm not sure you are aware (your paper doesn't seem to be reference FedPara).

At the moment, I haven't done a full in-depth review, but I think that makes more sense once we have completed the next step.

I noticed that you have formatted some unrelated files in method_comparison, could you please undo those changes? Usually, when you run make style, that directory should not be included.

I think a good next step is to add HiRA to the testing matrix we have in PEFT. For now, let's add some entries similar to the ones you can find here:

peft/tests/test_custom_models.py

Lines 70 to 72 in 92d65ca

    
           ("Vanilla MLP 1 LoRA", "MLP", LoraConfig, {"target_modules": "lin0"}), 
        
           ("Vanilla MLP 2 LoRA", "MLP", LoraConfig, {"target_modules": ["lin0"]}), 
        
           ("Vanilla MLP 3 LoRA", "MLP", LoraConfig, {"target_modules": ["lin1"]}),

Since you also support embedding and conv layers, please make sure to include examples with those layers as well (basically, copy the relevant examples from LoRA and adjust them).

Then, please run pytest tests/test_custom_models.py -k "hira and not shira" -v and see if those tests pass. Once we get there, we can discuss the best next steps.

src/peft/tuners/hira/__init__.py

src/peft/tuners/hira/config.py

src/peft/utils/constants.py

tests/test_hira.py

github-actions · 2025-08-23T15:03:36Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

BenjaminBossan · 2025-08-25T09:29:31Z

@hqsiswiliam Do you still plan on working on this PR?

hqsiswiliam · 2025-08-25T15:02:37Z

@hqsiswiliam Do you still plan on working on this PR?

Hi, BenjaminBossan. Thanks for checking in! I’ll continue working on this PR over the next few days.

kwargs = locals().copy() del kwargs["self"]

- rename HiRAConfig-> HiraConfig - rename HiRAModel -> HiraModel - remove _enable_peft_forward_hooks - rename HiRALayer -> HiraLayer

…ed-on-lora/model.py Simplify HiRA model to match latest LoRA structure

…h-latest-lora/model.py Align HiRA model with LoRA model

BenjaminBossan · 2025-11-27T10:42:35Z

@hqsiswiliam Once you're done with your changes, please ping me so that I know the PR is ready for review.

- remove _custom_modules - fix NaN error when unmerging

- fix unmerge behaviours, only replace 0 with tiny numbers for numeric stable.

hqsiswiliam · 2025-12-07T15:54:53Z

@hqsiswiliam Once you're done with your changes, please ping me so that I know the PR is ready for review.

Hi @BenjaminBossan, thanks for the heads-up. I’ve completed the updates, and the PR is ready for review whenever you have time.

BenjaminBossan

Thanks a lot for all the updates. I have checked them and did another review of the PR. There is still a bit of work to do, but the rest should be relatively simple.

There is now a merge conflict, which is fortunately easy to resolve. Could you please merge with/rebase on the latest main and fix it?

Also, as we're nearing completion, let's add an example to the examples/ folder. You can add a new one, perhaps something from the paper, or copy an existing one and modify it to use HiRA.

Regarding testing, the existing tests are quite comprehensive but there are still uncovered cases. I added some comments about this, but there is more: HiRA tests should be added for decoder models, encoder-decoder models, etc. This is quite straightforward. Check for instance how this was added for another PEFT method and take the same approach.

BenjaminBossan · 2025-12-08T11:17:19Z

src/peft/tuners/hira/__init__.py

+        return Linear4bit
+
+
+#


Let's remove this EETQ code.

BenjaminBossan · 2025-12-08T11:19:30Z

docs/source/package_reference/hira.md

@@ -0,0 +1,90 @@
+# HiRA


Let's also add an entry to the _toctree.yml or else this won't appear in the docs.

BenjaminBossan · 2025-12-08T11:20:34Z

src/peft/tuners/hira/config.py

+        layers_pattern (`Optional[Union[List[str], str]]`):
+            The layer pattern name, used only if `layers_to_transform` is different from `None`. This should target the
+            `nn.ModuleList` of the model, which is often called `'layers'` or `'h'`.
+        r_pattern (`dict`):


For consistency with other PEFT methods, let's call this rank_pattern.

BenjaminBossan · 2025-12-08T11:23:21Z

src/peft/tuners/hira/model.py

+    def _check_merge_allowed(self):
+        """Verify that the configuration supports merging.
+
+        Currently gptq quantization and replicated layers do not support merging.


There are no GPTQ layers for HiRA, so let's completely remove this method.

BenjaminBossan · 2025-12-08T11:30:41Z

src/peft/tuners/hira/model.py

+        return new_module
+
+    @contextmanager
+    def _enable_peft_forward_hooks(self, *args, **kwargs):


Just so we're on the same boat, by providing this method, we want HiRA to allow mixed adapter batches. If this is the idea, please also add tests for this. That means adding HiRA to the test matrix here:

peft/tests/test_custom_models.py

Line 5675 in 4731379

MIXED_ADAPTER_TEST_CASES = [

BenjaminBossan · 2025-12-08T11:32:54Z

tests/test_custom_models.py

Let's also add HiRA to the test matrix for testing multiple adapters:

peft/tests/test_custom_models.py

Line 945 in 4731379

MULTIPLE_ACTIVE_ADAPTERS_TEST_CASES = [

BenjaminBossan · 2025-12-08T11:33:45Z

tests/test_custom_models.py


        lr = 0.5
-        if config_kwargs.get("use_dora"):
+        if config_kwargs.get("use_dora") or config.__class__ == HiraConfig:


Suggested change

if config_kwargs.get("use_dora") or config.__class__ == HiraConfig:

if config_kwargs.get("use_dora") or (config_cls == HiraConfig):

BenjaminBossan · 2025-12-08T11:36:47Z

tests/test_hira.py

+from peft.tuners.hira import Linear
+
+
+def test_manual_hira_linear_equivalence():


I think this test (and thus the whole file) can be removed. We're really just re-implementing the forward method and checking if the results are the same. I don't think it adds any real value and we don't have this for any other PEFT method, I don't believe HiRA is special in this regard.

- modify config.__class__ to config_cls - remove _check_merge_allowed in hira model.

- adding HiRA cases to `MULTIPLE_ACTIVE_ADAPTERS_TEST_CASES`.

hqsiswiliam added 25 commits June 8, 2025 21:13

- initial commit for hira adapter

bc16e34

- This initial modification of HiRA's config

3c27937

- update HiRA Model

aeb3d54

- update HiRA Layer

d290008

- update HiRA Layer partially

dcdbe27

- update HiRA Layer partially (Embedding Layer)

8f48e2c

- update HiRA Layer partially (ConvNd Layer)

86e5195

- update HiRA Layer partially (ConvNd Layer)

da12aab

- update HiRA Layer partially (Conv1/2/3d Layer)

69ace05

- update HiRA Layer partially (MultiheadAttention)

2c53c8d

- remove HiRA Layer partially (MultiheadAttention)

32f6a4d

- update HiRA layer, model, and config

f86c9a9

- add bnb implementation and __init__.py

54c8de7

- add HiRA's Linear8bitLt implementation

ef18d9f

- update HiRA's layer comment

7c4718b

- add HiRA's Linear4bit

8506413

- complete HiRA's Linear4bit

9e8c017

- add test_hira

71907b4

- HiRA: updates to peft init, tuners, types, and GPU tests

ce782b6

Merge remote-tracking branch 'upstream/main'

d20332e

- HiRA: updates to HiRA layer, and HiRA testing

d76e328

- HiRA: formatting hira

e933f2a

- HiRA: formatting hira

0a4b3aa

- HiRA: add document

6b4092a

- apply merge

aab9204

hqsiswiliam mentioned this pull request Jul 24, 2025

Integrate HiRA (Hadamard High-Rank Adaptation) #2534

Closed

BenjaminBossan requested changes Jul 25, 2025

View reviewed changes

hqsiswiliam added 15 commits November 26, 2025 22:06

- remove unnecessary assert

83d8fe4

- remove _mixed_batch_forward

e99f9b1

- remove _check_forward_args and _mixed_batch_forward

02ab80d

- remove quant_methods = ["gptq", "aqlm", "awq"]

d86e199

- remove _cache_store and _cache_pop

69f1d48

- remove

a65e542

kwargs = locals().copy() del kwargs["self"]

- remove _register_custom_module

c1ea186

- remove layer_replication

2d5dfb9

- remove add_weighted_adapter, _check_add_weighted_adapter

f3326e5

- rename HiRAConfig-> HiraConfig - rename HiRAModel -> HiraModel - remove _enable_peft_forward_hooks - rename HiRALayer -> HiraLayer

- remove init_hira_weights

7631652

Simplify HiRA model to match latest LoRA structure

476152c

Merge pull request #1 from hqsiswiliam/codex/update-hira/model.py-bas…

8816ab0

…ed-on-lora/model.py Simplify HiRA model to match latest LoRA structure

- apply newest fork to main branch

b221675

Align HiRA forward hook setup

b43da11

Merge pull request #3 from hqsiswiliam/codex/update-hira/model.py-wit…

d84a7cc

…h-latest-lora/model.py Align HiRA model with LoRA model

hqsiswiliam added 5 commits December 7, 2025 22:53

- remove layer_replication

9241655

- remove _custom_modules - fix NaN error when unmerging

- fix too large learning rate when testing HiRA to result NaN

8899af9

- fix unmerge behaviours, only replace 0 with tiny numbers for numeric stable.

- set init_weight=True by default avoid randomness when not set.

58456fd

Merge branch 'huggingface:main' into main

e0d41ff

- do make style after all changes

789f0d3

BenjaminBossan requested changes Dec 8, 2025

View reviewed changes

hqsiswiliam added 7 commits December 9, 2025 16:07

- remove EETQ code

9b2d4f7

- update _toctree.yml, adding HiRA to sections.

7ce1321

- rename rank_pattern for naming consistency.

2b7693f

- remove test_hira.py

25115a7

- modify config.__class__ to config_cls - remove _check_merge_allowed in hira model.

- resolving an issue when mulit-HiRA to one module.

bb62397

- adding HiRA cases to `MULTIPLE_ACTIVE_ADAPTERS_TEST_CASES`.

- add encoder-decoder test cases.

15789e0

- add example for hira

075aaa0

	("Vanilla MLP 1 LoRA", "MLP", LoraConfig, {"target_modules": "lin0"}),
	("Vanilla MLP 2 LoRA", "MLP", LoraConfig, {"target_modules": ["lin0"]}),
	("Vanilla MLP 3 LoRA", "MLP", LoraConfig, {"target_modules": ["lin1"]}),

	if config_kwargs.get("use_dora") or config.__class__ == HiraConfig:
	if config_kwargs.get("use_dora") or (config_cls == HiraConfig):

		from peft.tuners.hira import Linear


		def test_manual_hira_linear_equivalence():

Pull Request: Adding HiRA integration into PEFT library #2668

Are you sure you want to change the base?

Pull Request: Adding HiRA integration into PEFT library #2668

Uh oh!

Conversation

hqsiswiliam commented Jul 24, 2025

Feature request

Motivation

General Motivation

1. Expressiveness grows with the rank

2. HiRA: Hadamard high-rank updates without extra parameters

3. Singular-value patterns

4. Performance gains

5. No extra parameter or compute cost

6. Complementary with LoRA (HiLoRA)

Your contribution

Uh oh!

BenjaminBossan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Aug 23, 2025

Uh oh!

BenjaminBossan commented Aug 25, 2025

Uh oh!

hqsiswiliam commented Aug 25, 2025

Uh oh!

BenjaminBossan commented Nov 27, 2025

Uh oh!

hqsiswiliam commented Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BenjaminBossan left a comment

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hqsiswiliam commented Dec 7, 2025 •

edited

Loading

BenjaminBossan Dec 8, 2025 •

edited

Loading