Skip to content

Conversation

@HectorSVC
Copy link
Contributor

Description

Remove ep_weight_sharing_ctx_gen tool from QNN EP python wheel

@HectorSVC HectorSVC added the ep:QNN issues related to QNN exeution provider label May 29, 2025
@HectorSVC HectorSVC merged commit 665922d into main Jun 3, 2025
88 checks passed
@HectorSVC HectorSVC deleted the qnn_py_remove_ctx_gen branch June 3, 2025 16:34
javier-intel pushed a commit to intel/onnxruntime that referenced this pull request Jun 15, 2025
…soft#24895)

### Description
Remove ep_weight_sharing_ctx_gen tool from QNN EP python wheel
adrianlizarraga pushed a commit that referenced this pull request Aug 1, 2025
### Description
Remove ep_weight_sharing_ctx_gen tool from QNN EP python wheel
adrianlizarraga pushed a commit that referenced this pull request Aug 5, 2025
### Description
Remove ep_weight_sharing_ctx_gen tool from QNN EP python wheel
adrianlizarraga added a commit that referenced this pull request Aug 11, 2025
### Description
- #24265
- #24616
- #24640
- #24707
- #24646
- #24750
- #24809
- #24895
- #24820
- #25002
- #25171
- #25283
- #24818
- #25351
- #25361
- #25388
- #25520
- #25158




### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

---------

Co-authored-by: quic-zhaoxul <quic_zhaoxul@quicinc.com>
Co-authored-by: Yuduo Wu <6426433+1duo@users.noreply.github.com>
Co-authored-by: Hector Li <hecli@microsoft.com>
Co-authored-by: chenweng-quic <168707118+chenweng-quic@users.noreply.github.com>
Co-authored-by: qti-yuduo <yuduow@qti.qualcomm.com>
Co-authored-by: Akupadhye <aupadhye@qti.qualcomm.com>
Co-authored-by: Jeff Kilpatrick <jkilpatrick@qti.qualcomm.com>
Co-authored-by: Jeff Kilpatrick <jkilpat@qti.qualcomm.com>
Co-authored-by: George Wu <jywu@microsoft.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: quic-calvnguy <quic_calvnguy@quicinc.com>
Co-authored-by: Changming Sun <chasun@microsoft.com>
Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>
adrianlizarraga added a commit that referenced this pull request Nov 25, 2025
### Description
- Updates the `ep_weight_sharing_ctx_gen` tool to support specifying a
plugin EP configuration (via JSON).
- Mark the `ep_weight_sharing_ctx_gen` tool as deprecated and add
notification to README that recommends the use the public Python ORT
APIs instead.
- Note we no longer publish a binary for this tool [as of ORT
1.22.2](#24895).
- Added an example Python script in the README.
- Added a Python unit test that tests compiling models with weight
sharing using an example plugin EP.

#### Tool usage
Create a JSON file that contains information about the plugin EP to
load/use (e.g., `example_plugin_ep_config.json`):
```json
{
    "ep_library_registration_name": "example_plugin_ep",
    "ep_library_path": "example_plugin_ep.dll",
    "selected_ep_name": "example_plugin_ep",
    "default_ep_options": { "option_key": "option_value" }
}
```

Call the `ep_weight_sharing_ctx_gen` tool with the `-p` command-line
option to specify the location of the above configuration file:

```console
$ ep_weight_sharing_ctx_gen.exe -p example_plugin_ep_config.json model_1.onnx,model_2.onnx
```

### Motivation and Context
Close the functionality gap between traditional provider-bridge EPs and
plugin EPs. This PR allows using plugin EPs with the tool that compiles
models with weight sharing.
quic-ankus pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Nov 25, 2025
…soft#24895)

### Description
Remove ep_weight_sharing_ctx_gen tool from QNN EP python wheel
Rohanjames1997 pushed a commit to Rohanjames1997/onnxruntime that referenced this pull request Dec 4, 2025
### Description
- Updates the `ep_weight_sharing_ctx_gen` tool to support specifying a
plugin EP configuration (via JSON).
- Mark the `ep_weight_sharing_ctx_gen` tool as deprecated and add
notification to README that recommends the use the public Python ORT
APIs instead.
- Note we no longer publish a binary for this tool [as of ORT
1.22.2](microsoft#24895).
- Added an example Python script in the README.
- Added a Python unit test that tests compiling models with weight
sharing using an example plugin EP.

#### Tool usage
Create a JSON file that contains information about the plugin EP to
load/use (e.g., `example_plugin_ep_config.json`):
```json
{
    "ep_library_registration_name": "example_plugin_ep",
    "ep_library_path": "example_plugin_ep.dll",
    "selected_ep_name": "example_plugin_ep",
    "default_ep_options": { "option_key": "option_value" }
}
```

Call the `ep_weight_sharing_ctx_gen` tool with the `-p` command-line
option to specify the location of the above configuration file:

```console
$ ep_weight_sharing_ctx_gen.exe -p example_plugin_ep_config.json model_1.onnx,model_2.onnx
```

### Motivation and Context
Close the functionality gap between traditional provider-bridge EPs and
plugin EPs. This PR allows using plugin EPs with the tool that compiles
models with weight sharing.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ep:QNN issues related to QNN exeution provider

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants