gemma3 e2e runner on cuda #15323

pytorchbot · 2025-10-21T20:24:51Z

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #15282 by @Gasoonjia
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/gasoonjia/61/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/gasoonjia/61/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/gasoonjia/60/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/gasoonjia/61/orig
Differential Revision: D85087532
@diff-train-skip-merge

Pull Request resolved: #15228

This diff modifies the `aoti_torch_empty_strided` function to support creating non-contiguous tensors. To achieve this, the diff:
1. Updates the memory size calculation to use both the tensor sizes and the strides.
2. Skips the stride check in ETensor by adding and checking the CMake macro `USE_CUDA_BACKEND` when building with CUDA backend support.

We will soon bring the ETensor check back for every backend after migrating to slimtensor.

ghstack-source-id: 317688814
@exported-using-ghexport
Differential Revision: [D84938258](https://our.internmc.facebook.com/intern/diff/D84938258/)
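To illustrate why the memory size has to account for strides, here is a minimal sketch of a stride-aware size calculation. It is not the code from this diff: `strided_storage_nbytes` and `contiguous_nbytes` are hypothetical helper names, strides are assumed to be expressed in elements, and sizes and strides are assumed to be non-negative.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not the ExecuTorch API): bytes needed to back a tensor
// with the given sizes and strides. Strides are in elements, not bytes.
// The largest reachable element offset is sum((size_i - 1) * stride_i),
// so the storage must hold that offset plus one element.
std::size_t strided_storage_nbytes(
    const std::vector<std::int64_t>& sizes,
    const std::vector<std::int64_t>& strides,
    std::size_t element_size) {
  assert(sizes.size() == strides.size());
  std::int64_t max_offset = 0;
  for (std::size_t i = 0; i < sizes.size(); ++i) {
    if (sizes[i] == 0) {
      return 0;  // An empty tensor needs no storage.
    }
    max_offset += (sizes[i] - 1) * strides[i];
  }
  return static_cast<std::size_t>(max_offset + 1) * element_size;
}

// For comparison: the size computed from the element count alone, which is
// only correct when the tensor is contiguous.
std::size_t contiguous_nbytes(
    const std::vector<std::int64_t>& sizes,
    std::size_t element_size) {
  std::int64_t numel = 1;
  for (std::int64_t s : sizes) {
    numel *= s;
  }
  return static_cast<std::size_t>(numel) * element_size;
}
```

For a 2x3 float tensor with strides {4, 1} (each row padded to four elements), the element count alone gives 24 bytes, while the stride-aware calculation gives 28 bytes, which is what the last element's offset actually requires.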

Pull Request resolved: #15241

This diff adds a module-level benchmark for the gemma3 model. It also introduces `multimodal_benchmark.cpp`, which replaces the original `voxtral_runner.cpp` and benchmarks both the gemma3 and voxtral models at the module level.

ghstack-source-id: 317688813
Differential Revision: [D84958564](https://our.internmc.facebook.com/intern/diff/D84958564/)
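As a rough idea of what a module-level benchmark loop can look like, the sketch below times repeated calls to a single forward pass after a warmup phase. It is not taken from the benchmark file in this diff: `BenchResult`, `run_benchmark`, and the `forward` callable are hypothetical stand-ins for whatever loads and runs the model.

```cpp
#include <chrono>
#include <cstddef>
#include <functional>

// Hypothetical result type for this sketch.
struct BenchResult {
  std::size_t iterations;
  double total_ms;
  double mean_ms;
};

// Runs `forward` `warmup` times untimed, then `iterations` times under a
// monotonic clock. `forward` stands in for one module-level inference call.
// Note: CUDA kernel launches are asynchronous, so for GPU work `forward`
// is assumed to synchronize the device before returning; otherwise the
// timer only measures launch overhead.
BenchResult run_benchmark(
    const std::function<void()>& forward,
    std::size_t warmup,
    std::size_t iterations) {
  for (std::size_t i = 0; i < warmup; ++i) {
    forward();
  }
  const auto start = std::chrono::steady_clock::now();
  for (std::size_t i = 0; i < iterations; ++i) {
    forward();
  }
  const auto end = std::chrono::steady_clock::now();
  const double total_ms =
      std::chrono::duration<double, std::milli>(end - start).count();
  const double mean_ms =
      iterations > 0 ? total_ms / static_cast<double>(iterations) : 0.0;
  return BenchResult{iterations, total_ms, mean_ms};
}
```

Taking the forward pass as a callable keeps the timing loop independent of the model, so the same harness could in principle be reused for both gemma3 and voxtral.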

Pull Request resolved: #15282

This diff introduces an e2e runner for the gemma3 model on CUDA, delegated through the AOTI library and guarded by CI. It also includes the other infrastructure updates needed to build and run the `gemma3 e2e runner` on CUDA devices.

ghstack-source-id: 317688815
Differential Revision: [D85087532](https://our.internmc.facebook.com/intern/diff/D85087532/)

pytorch-bot · 2025-10-21T20:24:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15323

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-10-21T20:51:24Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@Gasoonjia

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: pytorch#15282 by @Gasoonjia
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/gasoonjia/61/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/gasoonjia/61/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/gasoonjia/60/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/gasoonjia/61/orig
Differential Revision: [D85087532](https://our.internmc.facebook.com/intern/diff/D85087532/)
@diff-train-skip-merge

---------

Co-authored-by: gasoonjia <gasoonjia@icloud.com>
Co-authored-by: Gasoonjia <gasoonjia@meta.com>

Gasoonjia added 3 commits October 21, 2025 10:23

pytorchbot requested review from jackzhxng, kirklandsign, larryliu0820, lucylq, mergennachin and swolchok as code owners October 21, 2025 20:24

meta-cla bot added the CLA Signed label Oct 21, 2025

Gasoonjia approved these changes Oct 21, 2025

Base automatically changed from gh/gasoonjia/60/orig to main October 21, 2025 20:47

Merge branch 'main' into gh/gasoonjia/61/orig

012bd15

Gasoonjia merged commit ff6deb2 into main Oct 21, 2025
121 of 124 checks passed

Gasoonjia deleted the gh/gasoonjia/61/orig branch October 21, 2025 20:58
