fix(qonnx): handle scalar zpt for per-tensor bias quantization by narutozxp · Pull Request #238 · fastmachinelearning/qonnx

narutozxp · 2026-05-28T12:58:29Z

When Conv bias quantization is enabled with per-tensor quantization, exported QONNX models may not always use the same shape representation for scale and zero-point. The scale may be stored as (1,), while the zero-point may be stored as ().

The previous logic relied on shape comparison against (1,), which could fail to catch this scalar zero-point representation. If this case is not handled explicitly, the subsequent reshape operation can fail because the zero-point shape is not normalized to the expected per-tensor form.

This PR makes the check semantically correct by treating both forms as valid per-tensor quantization parameters.

fix(qonnx): handle scalar zpt for per-tensor bias quantization

d398b12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(qonnx): handle scalar zpt for per-tensor bias quantization#238

fix(qonnx): handle scalar zpt for per-tensor bias quantization#238
narutozxp wants to merge 1 commit into
fastmachinelearning:mainfrom
narutozxp:main

narutozxp commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

narutozxp commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant