Skip to content

Enable TorchAO int4wo quantization tests on XPU#13537

Open
jiqing-feng wants to merge 4 commits intohuggingface:mainfrom
jiqing-feng:torchao-xpu-int4wo
Open

Enable TorchAO int4wo quantization tests on XPU#13537
jiqing-feng wants to merge 4 commits intohuggingface:mainfrom
jiqing-feng:torchao-xpu-int4wo

Conversation

@jiqing-feng
Copy link
Copy Markdown
Contributor

What does this PR do?

Enables TorchAO int4wo (Int4WeightOnlyConfig) quantization tests on Intel XPU devices.

Changes

  1. Remove _int4wo_skip marker: The int4wo tests were previously restricted to CUDA only. Removed this skip marker so they run on all accelerator backends.

  2. XPU-specific int4 packing format: Added XPU-specific handling in _get_quant_config() — Intel XPU requires int4_packing_format="plain_int32" for Int4WeightOnlyConfig.

- Remove _int4wo_skip marker that restricted int4wo tests to CUDA only
- Add XPU-specific int4_packing_format='plain_int32' for Int4WeightOnlyConfig
@github-actions github-actions Bot added tests size/S PR with diff < 50 LOC labels Apr 22, 2026
@github-actions github-actions Bot added size/S PR with diff < 50 LOC and removed size/S PR with diff < 50 LOC labels Apr 22, 2026
@github-actions github-actions Bot added size/S PR with diff < 50 LOC and removed size/S PR with diff < 50 LOC labels Apr 22, 2026
@github-actions github-actions Bot added size/S PR with diff < 50 LOC and removed size/S PR with diff < 50 LOC labels Apr 28, 2026
@jiqing-feng
Copy link
Copy Markdown
Contributor Author

Hi @sayakpaul . Would you please review this PR? And the rests listed in there

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size/S PR with diff < 50 LOC tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant