
Conversation

Contributor

@Kaihui-intel Kaihui-intel commented Oct 15, 2025

User description

Type of Change

bug fix

Description

Fix `int_weight` being returned as its zero initialization, `torch.zeros(weight.shape).to(device)`, instead of the quantized values.

Expected Behavior & Potential Risk

`int_weight` now holds the quantized NF4/FP4 values; without the fix it was returned as all zeros.

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed


PR Type

Bug fix


Description

  • Fix incorrect assignment of quantized weights for NF4 & FP4

  • Ensure int_weight is correctly updated with quantized values


Diagram Walkthrough

```mermaid
flowchart LR
  A["Initialize int_weight"] -- "Loop through groups" --> B["Quantize group"]
  B -- "Update int_weight for NF4/FP4" --> C["Copy quantized values"]
  C -- "Handle tail group" --> D["Quantize tail group"]
  D -- "Update int_weight for NF4/FP4" --> E["Copy tail quantized values"]
```
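The walkthrough above can be sketched as a minimal group-wise quantization loop. This is a hedged illustration, not the actual `utility.py` code: `quantize_group` is a hypothetical stand-in for the real NF4/FP4 quantizer (here it simply rounds), and only the copy-back pattern mirrors the fix described in this PR.

```python
import torch

def quantize_group(block: torch.Tensor) -> torch.Tensor:
    # Stand-in quantizer: round to nearest value (placeholder for the
    # real NF4/FP4 lookup-table quantization).
    return torch.round(block)

def quant_weight_groupwise(weight: torch.Tensor, group_size: int) -> torch.Tensor:
    out_ch, in_ch = weight.shape
    int_weight = torch.zeros_like(weight)  # starts as all zeros
    leng = in_ch // group_size
    for i in range(leng):
        # Quantize one full group of columns...
        block = weight[:, i * group_size : (i + 1) * group_size]
        # ...and copy the result back into the matching slice, so the
        # zero initialization is actually overwritten (the fix).
        int_weight[:, i * group_size : (i + 1) * group_size].copy_(quantize_group(block))
    if in_ch % group_size:
        # Tail group: the columns beyond the last full group.
        tail = weight[:, leng * group_size :]
        int_weight[:, leng * group_size :].copy_(quantize_group(tail))
    return int_weight
```

Without the two `copy_` calls, `int_weight` would be returned unchanged from its `torch.zeros` initialization, which is the bug this PR addresses.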

File Walkthrough

Relevant files

- Bug fix: `neural_compressor/torch/algorithms/weight_only/utility.py` (+2/-0)
  - Correct weight assignment in `quant_weight_w_scale`
  - Added correct assignment of quantized weights to `int_weight` for NF4 & FP4
  - Ensured `int_weight` is updated with quantized values in both loop and tail handling

Signed-off-by: Kaihui-intel <[email protected]>
@PRAgent4INC
Collaborator

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Incorrect Indexing

The indexing in the new lines seems incorrect. The slice int_weight[:, leng * group_size :] is used twice, which will always point to the tail part of the tensor, not the current group being processed.

int_weight[:, leng * group_size :].copy_(int_weight_tmp)
Incorrect Indexing

Same as the first issue: the second new line reuses the slice int_weight[:, leng * group_size :], which always points to the tail of the tensor rather than the group being processed.

int_weight[:, leng * group_size :].copy_(int_weight_tmp)
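To make the reviewer's point concrete, here is a hedged sketch with simplified shapes and arbitrary values; `int_weight_tmp` stands in for one group's quantized output. It contrasts the flagged always-tail slice with a destination slice that tracks the loop index:

```python
import torch

# Simplified setup: 1 output channel, 6 input channels, groups of 2 (no tail).
out_ch, in_ch, group_size = 1, 6, 2
leng = in_ch // group_size
int_weight = torch.zeros(out_ch, in_ch)

for i in range(leng):
    # Stand-in for one group's quantized output (values are arbitrary).
    int_weight_tmp = torch.full((out_ch, group_size), float(i + 1))
    # Flagged pattern: int_weight[:, leng * group_size :] always addresses the
    # tail columns (an empty slice here), never the group being processed.
    # A per-group destination tracks the loop index instead:
    int_weight[:, i * group_size : (i + 1) * group_size].copy_(int_weight_tmp)

# Each group's values land in its own columns: [[1., 1., 2., 2., 3., 3.]]
```

The tail-only slice `int_weight[:, leng * group_size :]` is appropriate once, after the loop, for the columns beyond the last full group.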

@PRAgent4INC
Collaborator

PR Code Suggestions ✨

Signed-off-by: Kaihui-intel <[email protected]>


3 participants