Remove dangling layers. #2686

ggorman · 2025-07-25T10:36:48Z

No description provided.

codecov · 2025-07-25T10:51:42Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.07%. Comparing base (5f399da) to head (6e3ad7e).
⚠️ Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2686      +/-   ##
==========================================
+ Coverage   91.31%   92.07%   +0.76%     
==========================================
  Files         248      248              
  Lines       49387    49387              
  Branches     4355     4355              
==========================================
+ Hits        45096    45473     +377     
+ Misses       3559     3212     -347     
+ Partials      732      702      -30

Flag	Coverage Δ
pytest-gpu-aomp-amdgpuX	`72.87% <ø> (+0.49%)`	⬆️
pytest-gpu-nvc-nvidiaX	`73.99% <ø> (?)`

Copilot

Pull Request Overview

This PR removes dangling Docker layers and improves CI pipeline efficiency by adding cleanup mechanisms and updating build processes. The changes focus on better resource management in GPU-enabled workflows.

* Removes unused environment variables and simplifies matrix configuration
* Switches from docker build to docker buildx for enhanced build capabilities
* Adds comprehensive cleanup steps to prevent accumulation of dangling Docker layers
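
As a rough illustration of the build-then-prune pattern summarized above, here is a minimal sketch. It is not the workflow's actual script: the image tag `example-image:ci`, the Dockerfile path `docker/Dockerfile`, and the `run` and `build_and_cleanup` helpers are all hypothetical. It defaults to a dry run that only prints the commands.

```shell
#!/usr/bin/env bash
# Sketch only: the tag and Dockerfile path are hypothetical, not taken from the PR.
DRY_RUN="${DRY_RUN:-1}"   # 1 = print commands instead of executing them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

build_and_cleanup() {
    local tag="$1"
    # Build with buildx and load the result into the local image store.
    run docker buildx build --load --tag "$tag" --file docker/Dockerfile .
    # ... the test suite would run against "$tag" here ...
    # Remove dangling (untagged) image layers left behind by rebuilds.
    run docker image prune --force
}

build_and_cleanup "example-image:ci"
```

Setting `DRY_RUN=0` would execute the commands for real; running the prune step even when earlier steps fail is a design choice left to the surrounding CI configuration.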

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
.github/workflows/pytest-gpu.yml	Major refactoring to use buildx, remove unused variables, add cleanup steps, and enhance test configuration with better logging
.github/workflows/docker-devito.yml	Removes conditional check that was preventing cleanup on nvidia GPU runners

.github/workflows/pytest-gpu.yml

…tainer

Background
----------
Each self-hosted runner is pinned to a specific GPU via a host-level CUDA_VISIBLE_DEVICES and we forward that mask to Docker:

    docker run --gpus "device=$CUDA_VISIBLE_DEVICES" …

That flag alone is sufficient: Docker restricts the visible devices for the container.

Problem
-------
We also injected the same variable into the container’s environment (-e CUDA_VISIBLE_DEVICES). Inside the container the CUDA/OpenACC runtime renumbers the visible GPUs to 0…N-1, so a value like “1” or “2,3” is suddenly invalid and the first kernel call aborts (`exit 1`) when multiple runners share the host.

Fix
---
* Drop the `-e CUDA_VISIBLE_DEVICES` export from `${{ matrix.flags }}`. The device list is still enforced by `--gpus`, but the runtime now starts counting at 0 as expected.

Verified on:
* Two concurrent nvidiagpu runners on a 4-V100 host: full test suite passes.
* amdgpu runner: unchanged.
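
The renumbering described in this commit message can be illustrated with a small shell sketch. The helper `container_device_indices` is hypothetical and only mimics the 0…N-1 numbering that the message attributes to the runtime inside the container; it does not query any GPU or call Docker.

```shell
#!/usr/bin/env bash
# Hypothetical helper: given a host-level device mask such as "2,3",
# print the indices that would be valid inside the container (0..N-1).
container_device_indices() {
    local host_mask="$1"
    local -a host_ids
    # Split the comma-separated mask into an array of host device IDs.
    IFS=',' read -r -a host_ids <<< "$host_mask"
    local i out=""
    # The array positions (0, 1, ...) stand in for the renumbered devices.
    for i in "${!host_ids[@]}"; do
        out+="${out:+,}${i}"
    done
    printf '%s\n' "$out"
}

container_device_indices "2,3"   # prints 0,1
container_device_indices "1"     # prints 0
```

A host mask of "2,3" therefore corresponds to "0,1" inside the container, which is why forwarding the host value unchanged names devices that do not exist there.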

ggorman requested a review from Copilot July 25, 2025 10:36

This comment was marked as outdated.

mloubout added the CI continuous integration label Jul 25, 2025

mloubout approved these changes Jul 25, 2025

ggorman force-pushed the prune-dangling branch from fa23e5f to 91a2383 Compare August 1, 2025 09:11

ggorman requested a review from Copilot August 1, 2025 09:29

Copilot AI reviewed Aug 1, 2025

.github/workflows/pytest-gpu.yml (resolved)

.github/workflows/pytest-gpu.yml (outdated, resolved)

.github/workflows/pytest-gpu.yml (outdated, resolved)

mloubout reviewed Aug 1, 2025

.github/workflows/pytest-gpu.yml (outdated, resolved)

ggorman requested review from FabioLuporini and mloubout August 1, 2025 13:50

mloubout approved these changes Aug 2, 2025

ggorman added 4 commits August 2, 2025 14:46

ci: Remove dangling layers from CI docker builds.

adf3dd1

ci: bug fixes

7d20234

CI: Change ${RUNNER_NAME} to ${RUNNER_NAME// /_} because runner name …

6e3ad7e

…may contain spaces.
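
The `${RUNNER_NAME// /_}` form in this commit title is bash pattern substitution: the double slash replaces every occurrence of a space with an underscore. A minimal sketch, assuming a made-up runner name (the value below is hypothetical, and how the workflow uses the sanitized name is not shown in this page):

```shell
#!/usr/bin/env bash
# Hypothetical runner name containing spaces.
RUNNER_NAME="gpu runner 01"

# ${var// /_} replaces ALL spaces; a single slash (${var/ /_}) would
# replace only the first one.
sanitized="${RUNNER_NAME// /_}"
first_only="${RUNNER_NAME/ /_}"

echo "$sanitized"    # prints gpu_runner_01
echo "$first_only"   # prints gpu_runner 01
```

A name without spaces is usually safer wherever the value ends up in an identifier or an unquoted argument.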

mloubout force-pushed the prune-dangling branch from ec0f3b7 to 6e3ad7e Compare August 2, 2025 18:48

mloubout merged commit 1c6040a into main Aug 3, 2025
36 checks passed

mloubout deleted the prune-dangling branch August 3, 2025 01:49
