Fix persistent CI failures. #1205

Merged
katiewasnothere merged 2 commits into apple:main from jglogan:disable-realhostdns
Feb 13, 2026

@jglogan
Contributor

@jglogan jglogan commented Feb 13, 2026

Type of Change

  • Bug fix
  • New feature
  • Breaking change
  • Documentation update

Motivation and Context

  • The CI build is failing because the runners don't have an /etc/resolver directory, which causes the directory monitor to fail. This happens while the install-kernel make target is running, so it looks as though the kernel download is failing when in fact the API server is dying. Filed "[Bug]: Directory monitor for localhost DNS fails if no resolver directory." (#1207) for this issue.
  • Fixing the initial scan for the directory just moves the problem to the filesystem watch loop, likely because we're not testing the result of open() for errors.
  • Right now the priority is getting CI running and PRs merged, so this change just comments out the realhost DNS server setup.
  • Under some conditions launchd can take quite a while (8 seconds or more) to start the helper for the default network. With the 10 second health check timeout after API server registration, some CI runs can fail on the health check. Added a --timeout option to SystemStart and set a 60 second timeout for the install-kernel and integration Makefile targets.
  • Fixed a bug where --debug was being placed in the wrong location in the API server startup args.
  • Disabled all network CLI tests due to container bootstrap errors when trying to run the container immediately after creating the network. The slow network helper launch could be the reason behind the failures that drove us to serialize these tests. Filed "[chore]: Fix CLINetwork tests." (#1206) for this issue.
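
For illustration only, here is a minimal Swift sketch of what testing the result of open() could look like before a watch is set up. It is not the project's directory monitor: `DirectoryWatchError` and `openDirectoryForWatching` are hypothetical names, and the use of plain O_RDONLY is an assumption (a real macOS watcher may well use different flags).

```swift
import Foundation

// Hypothetical error type for this sketch; not taken from the project.
enum DirectoryWatchError: Error {
    case openFailed(path: String, code: Int32)
}

/// Opens `path` and returns the descriptor, throwing instead of handing
/// back -1 so a missing directory is reported before any watch is created.
func openDirectoryForWatching(_ path: String) throws -> Int32 {
    let fd = open(path, O_RDONLY)
    guard fd >= 0 else {
        // Capture errno right away; later calls may overwrite it.
        throw DirectoryWatchError.openFailed(path: path, code: errno)
    }
    return fd
}

// Usage: treat a missing directory as "nothing to watch" rather than a crash.
do {
    let fd = try openDirectoryForWatching("/etc/resolver")
    defer { close(fd) }
    // A real monitor would hand `fd` to its filesystem watch here.
} catch {
    print("skipping resolver watch: \(error)")
}
```

With a guard like this, a runner without the directory would surface a clear error at the open step instead of a failure later in the watch loop.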

Testing

  • Tested locally
  • Added/updated tests
  • Added/updated docs

@github-actions github-actions bot added the cli label Feb 13, 2026
- CI build is failing because runners don't have
  an `/etc/resolver` directory, causing the directory
  monitor to fail. This occurs while the
  `install-kernel` make target is running, so it
  appears that kernel download is failing when it's
  just that the API server is dying.
- Fixing the initial scan for the directory just
  moves the problem to the filesystem watch loop,
  likely because we're not testing the result of
  `open()` for errors.
- Right now the priority is getting CI running
  and PRs merged, so just commenting out the
  realhost DNS server setup.
- Also seeing that under some conditions it can
  take quite a while for launchd to start the
  helper for the default network (8 seconds or
  more). With the 10 second health check timeout
  after API server registration, this means that
  some CI runs can exhibit this failure mode.
  Added a `--timeout` option to SystemStart and
  set a 60 second timeout for install-kernel and
  integration Makefile targets.
- Disabled `testNetworkCreateAndUse()` due to
  container bootstrap errors when trying to run
  the container immediately after creating the
  network. The slow network helper launch could
  be the reason behind the failures that drove
  us to serialize these tests.
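
The interaction between a slow helper start and a fixed health check timeout can be pictured with a small deadline-based poll. This is a hedged sketch, not the SystemStart implementation: `waitUntilHealthy`, `pollInterval`, and the simulated helper are all hypothetical, and the delays are scaled down so the example runs quickly.

```swift
import Foundation

/// Polls `check` until it returns true or `timeout` seconds have elapsed.
/// Returns whether the check succeeded before the deadline.
func waitUntilHealthy(
    timeout: TimeInterval,
    pollInterval: TimeInterval = 0.5,
    check: () -> Bool
) -> Bool {
    // systemUptime does not follow wall-clock adjustments.
    let deadline = ProcessInfo.processInfo.systemUptime + timeout
    while true {
        if check() { return true }
        let remaining = deadline - ProcessInfo.processInfo.systemUptime
        if remaining <= 0 { return false }
        Thread.sleep(forTimeInterval: min(pollInterval, remaining))
    }
}

// Simulated helper that only becomes ready 0.2 s after start,
// standing in for a slow launchd launch.
let start = ProcessInfo.processInfo.systemUptime
let helperIsUp = { ProcessInfo.processInfo.systemUptime - start >= 0.2 }

// Too short: gives up before the helper is ready.
let shortWait = waitUntilHealthy(timeout: 0.05, pollInterval: 0.01, check: helperIsUp)
// Generous: keeps polling until the helper comes up.
let longWait = waitUntilHealthy(timeout: 2.0, pollInterval: 0.01, check: helperIsUp)
print(shortWait, longWait)
```

A longer timeout costs nothing when the helper starts quickly, since the poll returns as soon as the check passes.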
@jglogan jglogan force-pushed the disable-realhostdns branch from 288eb32 to 24e8325 Compare February 13, 2026 03:10
@jglogan jglogan mentioned this pull request Feb 13, 2026
1 task
@jglogan jglogan force-pushed the disable-realhostdns branch from 43e3b19 to b1db7b7 Compare February 13, 2026 04:29
@JaewonHur
Contributor

This resolves #1184.

@katiewasnothere katiewasnothere merged commit 4c800db into apple:main Feb 13, 2026
4 of 6 checks passed
3 participants