Use specialization constants for conv2d params in q8ta shaders#18178
Use specialization constants for conv2d params in q8ta shaders#18178SS-JIA wants to merge 1 commit intopytorch:mainfrom
Conversation
Summary: Replace Conv2DParams UBO with specialization constants in q8ta_conv2d, q8ta_conv2d_dw, and q8ta_im2col shaders. This allows the GPU compiler to treat conv params (kernel size, stride, padding, dilation, groups) as compile-time constants, enabling loop unrolling, dead code elimination, and constant folding. Differential Revision: D96503584
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18178
Note: Links to docs will display an error until the docs builds have been completed. ❌ 3 New Failures, 4 Pending, 1 Unrelated FailureAs of commit b194df5 with merge base 4f900b2 ( NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
Summary:
Replace Conv2DParams UBO with specialization constants in q8ta_conv2d,
q8ta_conv2d_dw, and q8ta_im2col shaders. This allows the GPU compiler to
treat conv params (kernel size, stride, padding, dilation, groups) as
compile-time constants, enabling loop unrolling, dead code elimination,
and constant folding.
Differential Revision: D96503584