Moved tests to using testcontainers #232

SubhadityaMukherjee · 2026-01-23T12:23:35Z

Moved tests to using testcontainers. It does seem to work but it might be good to verify if this is what was intended. :)

coderabbitai · 2026-01-23T12:23:56Z

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

Walkthrough

This pull request configures MySQL test container integration for the project. Three new development dependencies are added to pyproject.toml: testcontainers, pymysql, and cryptography. The tests/conftest.py file is updated to include MySQL test container scaffolding with session-scoped fixtures. An autouse fixture is introduced to override the MySqlContainer connection logic with a custom wait strategy. The expdb_test fixture is updated to use the new MySqlContainer fixture, with the connection managed within a begin/rollback context, replacing the previous implementation based on expdb_database().

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 16.67% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'Moved tests to using testcontainers' accurately summarizes the main change - migrating the test infrastructure to use testcontainers for MySQL testing instead of a previous approach.
Description check	✅ Passed	The description is directly related to the changeset, explaining the migration to testcontainers and appropriately acknowledging uncertainty about the implementation approach.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

sourcery-ai

Hey - I've found 1 issue, and left some high level feedback:

The new expdb_test fixture no longer uses automatic_rollback, so test cases may now leak state into each other and depend on execution order; consider wrapping the Testcontainers-backed connection in the same rollback helper to preserve isolation.
Overriding MySqlContainer._connect globally in an autouse fixture tightly couples tests to a private implementation detail of testcontainers; if possible, prefer configuring an official wait strategy on the container instance instead of monkey-patching a private method at module scope.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The new `expdb_test` fixture no longer uses `automatic_rollback`, so test cases may now leak state into each other and depend on execution order; consider wrapping the Testcontainers-backed connection in the same rollback helper to preserve isolation.
- Overriding `MySqlContainer._connect` globally in an autouse fixture tightly couples tests to a private implementation detail of testcontainers; if possible, prefer configuring an official wait strategy on the container instance instead of monkey-patching a private method at module scope.

## Individual Comments

### Comment 1
<location> `tests/conftest.py:67-74` </location>
<code_context>
+        container.stop()
+
+
+@pytest.fixture
+def expdb_test(mysql_container: MySqlContainer) -> Connection:
+    url = mysql_container.get_connection_url()
+    url = url.replace("mysql://", "mysql+pymysql://")
+
+    engine = sqlalchemy.create_engine(url)
+    with engine.begin() as connection:
+        yield connection
+
+
</code_context>

<issue_to_address>
**issue (testing):** The new `expdb_test` fixture no longer wraps connections in `automatic_rollback`, so test data will persist across tests and may break isolation.

With `engine.begin()` the transaction is committed on context exit, so writes from one test will persist into the next and can cause order-dependent, flaky tests when sharing the same database. To preserve isolation, either wrap the dynamic `engine` with `automatic_rollback`, explicitly manage a transaction and roll it back per test, or add a fixture that truncates/cleans the DB between tests while still using Testcontainers.
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2026-01-23T12:24:25Z

tests/conftest.py

+@pytest.fixture
+def expdb_test(mysql_container: MySqlContainer) -> Connection:
+    url = mysql_container.get_connection_url()
+    url = url.replace("mysql://", "mysql+pymysql://")
+
+    engine = sqlalchemy.create_engine(url)
+    with engine.begin() as connection:
+        yield connection


issue (testing): The new expdb_test fixture no longer wraps connections in automatic_rollback, so test data will persist across tests and may break isolation.

With engine.begin() the transaction is committed on context exit, so writes from one test will persist into the next and can cause order-dependent, flaky tests when sharing the same database. To preserve isolation, either wrap the dynamic engine with automatic_rollback, explicitly manage a transaction and roll it back per test, or add a fixture that truncates/cleans the DB between tests while still using Testcontainers.

coderabbitai

Actionable comments posted: 3

🤖 Fix all issues with AI agents

In `@tests/conftest.py`:
- Line 45: Remove the unused lint suppression by deleting the trailing "  #
noqa: SLF001" on the assignment MySqlContainer._connect = _connect so the line
reads only the assignment; this drops the unnecessary directive and resolves the
Ruff unused-directive warning while leaving the monkeypatch intact.
- Around line 48-64: The mysql_container fixture currently calls
container.start() unguarded and will hard-fail when Docker isn't reachable; wrap
the container.start() call in a try/except that catches
docker.errors.DockerException and calls pytest.skip(...) to skip the session
tests when Docker is unavailable (ensure you import pytest and docker.errors if
not present), then proceed to yield container and keep the existing finally
container.stop() behavior so cleanup still runs if start succeeded.
- Around line 29-44: The fixture override_testcontainers_connect is
monkey-patching the private MySqlContainer._connect; remove that override and
instead use the public waiting utilities—either call wait_for_logs(container,
r"ready for connections", timeout=60, interval=1) after container startup (use
the testcontainers.core.waiting_utils.wait_for_logs function) or create a
subclass (e.g., MySqlContainerWithWait) and implement _connect decorated with
`@wait_container_is_ready` to handle retry logic; in short: stop overriding
_connect directly in override_testcontainers_connect and switch to wait_for_logs
or a decorated subclass of MySqlContainer for log-based readiness.

♻️ Duplicate comments (1)

tests/conftest.py (1)
67-74: engine.begin() commits on exit; tests may leak data across cases.

Line 73 uses engine.begin(), which commits on context exit. That breaks isolation and can make tests order-dependent. Prefer automatic_rollback(engine) or explicit rollback per test.
✅ Safer rollback-based fixture
     engine = sqlalchemy.create_engine(url)
-    with engine.begin() as connection:
+    with automatic_rollback(engine) as connection:
         yield connection
In SQLAlchemy, does `engine.begin()` commit on context exit, and what is the recommended pattern for rollback-based test fixtures?

🧹 Nitpick comments (1)

pyproject.toml (1)
35-37: Confirm and constrain new dev dependency versions.

Lines 35–37 add unpinned dev deps; to keep CI reproducible and avoid breaking changes, consider adding compatible version ranges (or a constraints file) after confirming they support Python 3.12 and your SQLAlchemy/MySQL stack.
What are the latest stable versions of testcontainers, pymysql, and cryptography, and do they support Python 3.12? Any breaking changes that could affect SQLAlchemy or PyMySQL usage?

coderabbitai · 2026-01-23T12:33:24Z

tests/conftest.py

+@pytest.fixture(scope="session", autouse=True)
+def override_testcontainers_connect() -> None:
+    """
+    Override MySqlContainer._connect once per test session.
+    Applied automatically everywhere.
+    """
+
+    def _connect(self: MySqlContainer) -> None:
+        wait_strategy = LogMessageWaitStrategy(
+            re.compile(
+                r".*: ready for connections",
+                flags=re.DOTALL | re.MULTILINE,
+            )
+        )
+        wait_strategy.wait_until_ready(self)
+


🛠️ Refactor suggestion | 🟠 Major

🧩 Analysis chain

🌐 Web query:

testcontainers-python MySqlContainer wait strategy customize public API

💡 Result:

In testcontainers-python, MySqlContainer doesn’t expose a Java-style “wait strategy” hook on the class itself; the public API for customizing readiness is via the waiting utilities (log-based waiting) and/or by subclassing and overriding the connection check.

1) Log-based waiting (public API)

Use testcontainers.core.waiting_utils.wait_for_logs(container, predicate, timeout=None, interval=1) after the container is started (or inside a custom container’s startup flow). The predicate can be a regex string or a callable. The function supports timeout and interval. [1]

from testcontainers.mysql import MySqlContainer from testcontainers.core.waiting_utils import wait_for_logs with MySqlContainer("mysql:8.0") as mysql: wait_for_logs(mysql, r"ready for connections", timeout=60, interval=1)

MySqlContainer itself is the standard MySQL wrapper you’re already using. [2]

2) Connection-based waiting via @wait_container_is_ready (public API)

If you want “wait until I can connect” semantics (common for DBs), subclass MySqlContainer and decorate your _connect() with wait_container_is_ready(...) so failures are retried until success/timeout. [3]

import mysql.connector from testcontainers.mysql import MySqlContainer from testcontainers.core.waiting_utils import wait_container_is_ready class MyReadyMySql(MySqlContainer): `@wait_container_is_ready`(mysql.connector.Error) def _connect(self): conn = mysql.connector.connect( host=self.get_container_host_ip(), port=int(self.get_exposed_port(3306)), user=self.username, password=self.password, database=self.dbname, ) conn.close() with MyReadyMySql("mysql:8.0") as mysql: ...

3) Tuning retry/timeout behavior

Retry limits/sleep/timeout for readiness waiting are controlled via testcontainers_config (global configuration). [3]

from testcontainers.core.config import testcontainers_config testcontainers_config.timeout = 180 testcontainers_config.max_tries = 180 testcontainers_config.sleep_time = 1

Sources: [1] [2] [3]

Use wait_for_logs() or subclass with @wait_container_is_ready() instead of monkey-patching _connect.

Line 36 overrides a private method, which breaks on testcontainers-python upgrades if internals change. testcontainers-python provides public APIs:

wait_for_logs(): Call after container startup with a regex predicate for log-based readiness

Subclass + @wait_container_is_ready() decorator: Override _connect in a subclass with the decorator to handle retry logic safely

Example using wait_for_logs:

from testcontainers.mysql import MySqlContainer from testcontainers.core.waiting_utils import wait_for_logs with MySqlContainer("mysql:8.0") as mysql: wait_for_logs(mysql, r"ready for connections", timeout=60, interval=1)

🤖 Prompt for AI Agents

In `@tests/conftest.py` around lines 29 - 44, The fixture override_testcontainers_connect is monkey-patching the private MySqlContainer._connect; remove that override and instead use the public waiting utilities—either call wait_for_logs(container, r"ready for connections", timeout=60, interval=1) after container startup (use the testcontainers.core.waiting_utils.wait_for_logs function) or create a subclass (e.g., MySqlContainerWithWait) and implement _connect decorated with `@wait_container_is_ready` to handle retry logic; in short: stop overriding _connect directly in override_testcontainers_connect and switch to wait_for_logs or a decorated subclass of MySqlContainer for log-based readiness.

coderabbitai · 2026-01-23T12:33:24Z

tests/conftest.py

+        )
+        wait_strategy.wait_until_ready(self)
+
+    MySqlContainer._connect = _connect  # noqa: SLF001


⚠️ Potential issue | 🟡 Minor

Remove unused # noqa: SLF001.

Ruff reports this directive as unused at Line 45; drop it to keep lint clean.

🧹 Proposed cleanup

- MySqlContainer._connect = _connect # noqa: SLF001 + MySqlContainer._connect = _connect

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

MySqlContainer._connect = _connect # noqa: SLF001

MySqlContainer._connect = _connect

🧰 Tools

🪛 Ruff (0.14.13)

45-45: Unused noqa directive (non-enabled: SLF001)

Remove unused noqa directive

(RUF100)

🤖 Prompt for AI Agents

In `@tests/conftest.py` at line 45, Remove the unused lint suppression by deleting the trailing " # noqa: SLF001" on the assignment MySqlContainer._connect = _connect so the line reads only the assignment; this drops the unnecessary directive and resolves the Ruff unused-directive warning while leaving the monkeypatch intact.

coderabbitai · 2026-01-23T12:33:24Z

tests/conftest.py

+@pytest.fixture(scope="session")
+def mysql_container() -> MySqlContainer:
+    container = MySqlContainer(
+        os.environ.get(
+            "OPENML_DATABASES_OPENML_URL",
+            "openml/test-database:20240105",
+        ),
+        username=os.environ.get("OPENML_DATABASES_OPENML_USERNAME", ""),
+        password=os.environ.get("OPENML_DATABASES_OPENML_PASSWORD", ""),
+        dbname="openml_expdb",
+    )
+
+    container.start()
+    try:
+        yield container
+    finally:
+        container.stop()


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🌐 Web query:

testcontainers-python MySqlContainer.start() Docker unavailable exception type

💡 Result:

In testcontainers-python, MySqlContainer.start() ultimately initializes the Docker SDK client (docker / “docker-py”). If Docker isn’t reachable (daemon not running, wrong DOCKER_HOST, etc.), the failure you’ll see is typically:

docker.errors.DockerException (often with a message like “Error while fetching server API version …”) [1][2]

So, the exception type to catch for “Docker unavailable” is generally docker.errors.DockerException. [1][2]

🏁 Script executed:

# Check if there are existing patterns for Docker error handling in the codebase rg -i "docker|container" tests/conftest.py -A 3 -B 3

Repository: openml/server-api

Length of output: 1397

🏁 Script executed:

# Search for any existing pytest.skip usage patterns related to Docker or external services rg "pytest.skip" --type py -B 2 -A 2

Repository: openml/server-api

Length of output: 2038

🏁 Script executed:

# Check imports in conftest.py to understand what's currently imported head -30 tests/conftest.py

Repository: openml/server-api

Length of output: 840

🏁 Script executed:

# Check if there are any other exception handling patterns for testcontainers in the codebase rg "testcontainers|MySqlContainer" --type py

Repository: openml/server-api

Length of output: 622

🏁 Script executed:

# Verify the fixture is used as a dependency in tests to understand impact scope rg "mysql_container" tests/ --type py

Repository: openml/server-api

Length of output: 265

Skip tests gracefully when Docker isn't available (CI currently fails).

The pipeline reports a Docker connection error; container.start() will hard-fail the suite when Docker isn't reachable. Wrap the start call in a try-except to catch docker.errors.DockerException and skip tests in non-Docker environments.

🛠️ Proposed fix

+from docker.errors import DockerException + `@pytest.fixture`(scope="session") def mysql_container() -> MySqlContainer: container = MySqlContainer( os.environ.get( "OPENML_DATABASES_OPENML_URL", "openml/test-database:20240105", ), username=os.environ.get("OPENML_DATABASES_OPENML_USERNAME", ""), password=os.environ.get("OPENML_DATABASES_OPENML_PASSWORD", ""), dbname="openml_expdb", ) - container.start() + started = False + try: + container.start() + started = True + except DockerException as exc: + pytest.skip(f"Docker unavailable: {exc}") try: yield container finally: - container.stop() + if started: + container.stop()

🧰 Tools

🪛 GitHub Actions: Tests

[error] 50-50: Failed to initialize MySqlContainer due to docker connection error

🤖 Prompt for AI Agents

In `@tests/conftest.py` around lines 48 - 64, The mysql_container fixture currently calls container.start() unguarded and will hard-fail when Docker isn't reachable; wrap the container.start() call in a try/except that catches docker.errors.DockerException and calls pytest.skip(...) to skip the session tests when Docker is unavailable (ensure you import pytest and docker.errors if not present), then proceed to yield container and keep the existing finally container.stop() behavior so cleanup still runs if start succeeded.

SubhadityaMukherjee and others added 3 commits January 23, 2026 13:17

moved tests to testcontainers

1629117

Merge branch 'openml:main' into tests_to_testcontainers

f90cbf1

cleanup pointless changes

467b614

sourcery-ai bot reviewed Jan 23, 2026

View reviewed changes

Oops a message got deleted by mistake

078fc07

coderabbitai bot reviewed Jan 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Moved tests to using testcontainers #232

Moved tests to using testcontainers #232

SubhadityaMukherjee commented Jan 23, 2026

Uh oh!

coderabbitai bot commented Jan 23, 2026 •

edited

Loading

Other AI code review bot(s) detected

Uh oh!

sourcery-ai bot left a comment

Uh oh!

sourcery-ai bot Jan 23, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Jan 23, 2026

Uh oh!

coderabbitai bot Jan 23, 2026

Uh oh!

coderabbitai bot Jan 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	MySqlContainer._connect = _connect # noqa: SLF001
	MySqlContainer._connect = _connect

Uh oh!

Moved tests to using testcontainers #232

Are you sure you want to change the base?

Moved tests to using testcontainers #232

Conversation

SubhadityaMukherjee commented Jan 23, 2026

Uh oh!

coderabbitai bot commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Other AI code review bot(s) detected

Walkthrough

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

sourcery-ai bot Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jan 23, 2026

Choose a reason for hiding this comment

1) Log-based waiting (public API)

2) Connection-based waiting via @wait_container_is_ready (public API)

3) Tuning retry/timeout behavior

Uh oh!

coderabbitai bot Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

coderabbitai bot commented Jan 23, 2026 •

edited

Loading

2) Connection-based waiting via `@wait_container_is_ready` (public API)