Skip to content

Conversation

@jeremyeder
Copy link
Contributor

@jeremyeder jeremyeder commented Dec 10, 2025

Summary

Add comprehensive Harbor integration to empirically measure Claude Code performance impact of agent files using Harbor's Terminal-Bench.

Key Features:

  • 🔬 A/B testing framework (with/without any agent file)
  • 📊 Statistical significance testing (t-tests, Cohen's d effect sizes)
  • 📄 Multiple output formats (JSON, Markdown, HTML)
  • 📈 Interactive dashboard with Chart.js visualizations (self-contained)
  • 🛠️ CLI commands: compare, list, view

Example Usage:

# Install Harbor
uv tool install harbor

# Compare any agent file (default: .claude/agents/doubleagent.md)
agentready harbor compare \
  -t adaptive-rejection-sampler \
  -t async-http-client \
  -t terminal-file-browser \
  --verbose \
  --open-dashboard

# Or specify a different agent file
agentready harbor compare \
  -t task1 -t task2 \
  --agent-file .claude/agents/my-custom-agent.md \
  --verbose

Architecture

Data Models (src/agentready/models/harbor.py):

  • HarborTaskResult - Single task result from result.json
  • HarborRunMetrics - Aggregated metrics per run
  • HarborComparison - Complete comparison with deltas and statistical tests

Services (src/agentready/services/harbor/):

  • HarborRunner - Execute Harbor CLI via subprocess
  • AgentFileToggler - Safely enable/disable agent files using context managers
  • ResultParser - Parse Harbor result.json files
  • HarborComparer - Calculate deltas and statistical significance
  • DashboardGenerator - Generate HTML reports with inlined Chart.js

Reporters (src/agentready/reporters/):

  • HarborMarkdownReporter - GitHub-Flavored Markdown reports
  • DashboardGenerator - Interactive HTML dashboard

Code Quality

Agent Reviews:

  • feature-dev:code-reviewer - No high-priority issues, excellent architecture
  • pr-review-toolkit:code-simplifier - Clean separation of concerns
  • doubleagent.md - 95% score, exemplary library-first architecture

Critical Fixes Applied:

  1. Fixed division by zero in delta calculations (returns None instead of 0.0)
  2. Removed manual toggler.enable() call (context manager handles restoration)
  3. Inlined Chart.js library (maintains self-contained HTML report principle)

Test Coverage:

  • 24/24 tests passing ✅
  • 98% models coverage
  • 95%+ services coverage
  • Comprehensive edge case testing (exceptions, idempotency, invalid data)

Linters:

  • black - Code formatting
  • isort - Import sorting
  • ruff - Linting (all checks passed)
  • ✅ Pre-commit hooks - All passing

Output Files

Results stored in .agentready/harbor_comparisons/ (gitignored):

  • JSON: Machine-readable comparison data
  • Markdown: GitHub-friendly report (commit this for PRs)
  • HTML: Interactive dashboard with Chart.js visualizations (self-contained, ~210KB)

Symlinks (for quick access):

  • comparison_latest.json
  • comparison_latest.md
  • comparison_latest.html

Statistical Methods

Significance Criteria (both required):

  • P-value < 0.05: 95% confidence (two-sample t-test)
  • Cohen's d effect size:
    • Small: 0.2 ≤ |d| < 0.5
    • Medium: 0.5 ≤ |d| < 0.8
    • Large: |d| ≥ 0.8

Sample Size Recommendations:

  • Minimum: 3 tasks (for statistical tests)
  • Recommended: 5-10 tasks (reliable results)
  • Comprehensive: 20+ tasks (production validation)

Use Cases

1. Validate Agent File Effectiveness

agentready harbor compare \
  --agent-file .claude/agents/my-agent.md \
  -t task1 -t task2 -t task3

2. Compare Different Agents
Run twice with different --agent-file values, then compare the JSON results.

3. A/B Test Agent Modifications
Before/after comparison when iterating on agent designs.

4. Benchmark Agent Performance
Quantify agent impact on success rate, duration, and task completion.

Documentation

  • ✅ User guide: docs/harbor-comparison-guide.md (comprehensive, 400 lines)
  • ✅ Developer guide: CLAUDE.md (Harbor section added, 100 lines)
  • ✅ Code comments: Extensive docstrings and inline comments

Test Plan

  • Run Harbor comparison with 3 tasks (verified all metrics calculate correctly)
  • Test with/without agent file (toggler context manager works)
  • Verify statistical calculations (t-tests, Cohen's d, significance flags)
  • Generate all output formats (JSON, Markdown, HTML)
  • Test symlink creation (cross-platform compatibility)
  • Verify error handling (missing Harbor, invalid JSON, failed benchmarks)
  • Test edge cases (zero duration, all failures, partial data)
  • Run unit tests (24/24 passing)
  • Run linters (black, isort, ruff - all passing)

Related Issues

Provides empirical validation for any agent file's impact on Claude Code performance.

🤖 Generated with Claude Code

Co-Authored-By: Claude Sonnet 4.5 noreply@anthropic.com

Add comprehensive Harbor integration to empirically measure Claude Code
performance impact of .claude/agents/doubleagent.md using Terminal-Bench.

Features:
- A/B testing framework (with/without agent file)
- Statistical significance testing (t-tests, Cohen's d)
- Multiple output formats (JSON, Markdown, HTML)
- Interactive dashboard with Chart.js visualizations
- CLI commands: compare, list, view

Data Models:
- HarborTaskResult: Single task result from result.json
- HarborRunMetrics: Aggregated metrics per run
- HarborComparison: Complete comparison with deltas

Services:
- HarborRunner: Execute Harbor CLI via subprocess
- AgentFileToggler: Safely enable/disable agent files
- ResultParser: Parse Harbor result.json files
- HarborComparer: Calculate deltas and statistical significance
- DashboardGenerator: Generate HTML reports

Reporters:
- HarborMarkdownReporter: GitHub-Flavored Markdown
- DashboardGenerator: Interactive HTML with inlined Chart.js

Critical fixes applied based on code review:
- Fixed division by zero in delta calculations (returns None)
- Removed manual toggler.enable() call (context manager handles it)
- Inlined Chart.js library (self-contained HTML principle)

Test Coverage: 24/24 tests passing, 98% models coverage, 95%+ services

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
@jeremyeder jeremyeder merged commit a56e318 into ambient-code:main Dec 10, 2025
5 of 12 checks passed
github-actions bot pushed a commit that referenced this pull request Dec 10, 2025
# [2.18.0](v2.17.0...v2.18.0) (2025-12-10)

### Features

* add Harbor Terminal-Bench comparison for agent effectiveness ([#199](#199)) ([a56e318](a56e318))
@github-actions
Copy link
Contributor

🎉 This PR is included in version 2.18.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 11, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-11)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
jeremyeder pushed a commit to jeremyeder/agentready that referenced this pull request Dec 12, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-11)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
jeremyeder pushed a commit that referenced this pull request Dec 12, 2025
# [2.10.0](jeremyeder/agentready@v2.9.0...v2.10.0) (2025-12-11)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](jeremyeder@6ecb786)), closes [#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](jeremyeder@a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* leaderboard workflow and SSH URL support ([#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](jeremyeder@de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](jeremyeder@27e87e5)), closes [#104](https://github.com/jeremyeder/agentready/issues/104) [#192](https://github.com/jeremyeder/agentready/issues/192)
* resolve all test suite failures - achieve zero failures ([#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](jeremyeder@990fa2d)), closes [#148](https://github.com/jeremyeder/agentready/issues/148) [#147](https://github.com/jeremyeder/agentready/issues/147) [#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](jeremyeder@fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](jeremyeder@3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](jeremyeder@97b06af))
* skip PR comments for external forks to prevent permission errors ([#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](jeremyeder@2a29fb8))

### Features

* add ambient-code/agentready to leaderboard ([#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](jeremyeder@621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](jeremyeder@a56e318))
* add Memory MCP server allow list to repository settings ([#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](jeremyeder@41d87bb))
* add quay/quay to leaderboard ([#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](jeremyeder@d6e8df0))
* Add weekly research update skill and automation ([#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](jeremyeder@7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](jeremyeder@71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* container support ([#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](jeremyeder@c6874ea))
* convert AgentReady assessment to on-demand workflow ([#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](jeremyeder@b5a1ce0)), closes [#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](jeremyeder@85712f2)), closes [#10](jeremyeder#10)
* Harbor framework integration for Terminal-Bench evaluations ([#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](jeremyeder@d73a8c8)), closes [#4](jeremyeder#4) [#178](https://github.com/jeremyeder/agentready/issues/178) [#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](jeremyeder@570087d)), closes [#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](jeremyeder@f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](jeremyeder@d06bab4)), closes [#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](jeremyeder@6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 12, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-12)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 12, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-12)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
jeremyeder added a commit that referenced this pull request Dec 15, 2025
* chore: improve lychee link checker retry handling

- Increase max_retries from 3 to 5 for better transient error handling
- Reduce retry_wait_time from 30s to 2s for faster retries
- Remove implementation-status exclusions (file was removed)
- Total retry time: 10s (5 × 2s) vs previous 90s (3 × 30s)

More attempts, faster response, better handling of transient failures.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

* fix: simplify PR review workflow and remove duplicate trigger

- Remove pull_request trigger (doesn't have secret access)
- Keep only pull_request_target (has secret access)
- Remove broken output parsing (claude-code-action doesn't support custom outputs)
- Simplify to just run /review-agentready command
- Fixes duplicate workflow runs and ANTHROPIC_API_KEY missing errors

The workflow was running twice - once without secrets (failing)
and once with secrets (succeeding). Now runs once with secrets.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

* feat: consolidate GitHub Actions workflows by purpose

PHASE 1 (completed earlier):
- Delete 5 redundant workflows (tests, publish-pypi, docs-lint, 2× leaderboard)
- Rename 3 workflows (ci, agentready-dev, update-docs-manual)
- Add 3 new workflows (stale-issues, leaderboard, update-docs)
- Optimize triggers for cost savings

PHASE 2 (this commit):
- Merge coverage-report.yml into ci.yml as new job
- Create docs.yml combining link-check + future docs jobs
- Fix actionlint issues (proper quoting, combined redirects)
- Rename agentready-dev workflow
- Add explicit @agentready-dev agent invocation in prompt
- Update all GitHub Actions to latest versions (v6)
- Delete update-docs-manual.yml (redundant with automated update-docs.yml)
- Add GitHub Actions guidelines to CLAUDE.md
- Reorganize README.md with TOC, research citations, expanded CLI reference

NET RESULT:
- 16 workflows → 13 workflows (-3 total)
- Clear purpose-driven organization
- 100% actionlint compliance for modified workflows
- Improved maintainability

All modified workflows validated with actionlint.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

* fix: correct default weights to sum to 1.0

The default weights were summing to 0.99 instead of 1.0, causing validation errors:
"Default weights must sum to 1.0 (got 0.9900, difference: -0.0100)"

Root cause: Tier distribution was:
- Tier 1: 54% (0.54)
- Tier 2: 27% (0.27)
- Tier 3: 15% (0.15)
- Tier 4:  3% (0.03)
Total: 99% (0.99) ❌

Fix: Increased dependency_security from 0.04 to 0.05
- Tier 1: 55% (0.55)
- Tier 2: 27% (0.27)
- Tier 3: 15% (0.15)
- Tier 4:  3% (0.03)
Total: 100% (1.00) ✅

This permanently fixes the floating-point validation error that appeared frequently in CI.

🤖 Generated with Claude Code

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

* chore(release): 2.10.0 [skip ci]

# [2.10.0](jeremyeder/agentready@v2.9.0...v2.10.0) (2025-12-11)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](jeremyeder@6ecb786)), closes [#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](jeremyeder@a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* leaderboard workflow and SSH URL support ([#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](jeremyeder@de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](jeremyeder@27e87e5)), closes [#104](https://github.com/jeremyeder/agentready/issues/104) [#192](https://github.com/jeremyeder/agentready/issues/192)
* resolve all test suite failures - achieve zero failures ([#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](jeremyeder@990fa2d)), closes [#148](https://github.com/jeremyeder/agentready/issues/148) [#147](https://github.com/jeremyeder/agentready/issues/147) [#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](jeremyeder@fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](jeremyeder@3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](jeremyeder@97b06af))
* skip PR comments for external forks to prevent permission errors ([#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](jeremyeder@2a29fb8))

### Features

* add ambient-code/agentready to leaderboard ([#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](jeremyeder@621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](jeremyeder@a56e318))
* add Memory MCP server allow list to repository settings ([#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](jeremyeder@41d87bb))
* add quay/quay to leaderboard ([#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](jeremyeder@d6e8df0))
* Add weekly research update skill and automation ([#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](jeremyeder@7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](jeremyeder@71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* container support ([#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](jeremyeder@c6874ea))
* convert AgentReady assessment to on-demand workflow ([#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](jeremyeder@b5a1ce0)), closes [#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](jeremyeder@85712f2)), closes [#10](jeremyeder#10)
* Harbor framework integration for Terminal-Bench evaluations ([#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](jeremyeder@d73a8c8)), closes [#4](jeremyeder#4) [#178](https://github.com/jeremyeder/agentready/issues/178) [#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](jeremyeder@570087d)), closes [#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](jeremyeder@f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](jeremyeder@d06bab4)), closes [#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](jeremyeder@6a7cd4e))

* fix: update CHANGELOG.md links to use ambient-code/agentready

* fix: update version flag test to match new format

The --version output format was updated in PR #221 to show:
'AgentReady v2.20.2\nResearch Report: 2025-12-08'

Updated test assertions to match the new format.

* fix: make link checker simple and stable

Changes:
- Accept 403 status codes (sites that block bots)
- Exclude academic publishers (ACM, IEEE, Springer)
- Exclude research sites that commonly block scrapers (Anthropic, Microsoft Research)
- Exclude placeholder/future research URLs (arxiv.org/abs/25xx)
- Exclude Claude AI URLs (blocks automated scrapers)
- Reduce timeout from 20s to 10s for faster failures
- Reduce retries from 5 to 2 (fail fast)
- Make link-check non-blocking with continue-on-error
- Consolidate into single step (remove duplicate checks)
- Remove verbose flag to reduce noise

This makes CI more reliable by accepting that some research/academic
sites will always block automated checks, and focusing only on critical
infrastructure links that we can actually verify.

* fix: remove continue-on-error from link checker

The comprehensive exclusions should make it pass reliably.
If it fails, that's a real issue we should fix.

* fix: add missing v2.14.1 and v2.14.0 changelog entries

Restored missing changelog entries between v2.15.0 and v2.13.0:
- v2.14.1 (2025-12-05): YAML syntax error fix
- v2.14.0 (2025-12-05): Container support feature

Retrieved from git history (commits f67072e and 8bb403f).

---------

Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
Co-authored-by: semantic-release-bot <semantic-release-bot@martynus.net>
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** ensure post-comment step runs after Claude Code Action ([b087e5c](b087e5c))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** ensure post-comment step runs after Claude Code Action ([b087e5c](b087e5c))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))
* **workflows:** simplify post-comment step condition ([1bbf40a](1bbf40a))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** ensure post-comment step runs after Claude Code Action ([b087e5c](b087e5c))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))
* **workflows:** remove if:always() to test step execution ([ff0bb12](ff0bb12))
* **workflows:** simplify post-comment step condition ([1bbf40a](1bbf40a))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** ensure post-comment step runs after Claude Code Action ([b087e5c](b087e5c))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))
* **workflows:** remove if:always() to test step execution ([ff0bb12](ff0bb12))
* **workflows:** simplify post-comment step condition ([1bbf40a](1bbf40a))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** ensure post-comment step runs after Claude Code Action ([b087e5c](b087e5c))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))
* **workflows:** remove if:always() to test step execution ([ff0bb12](ff0bb12))
* **workflows:** simplify post-comment step condition ([1bbf40a](1bbf40a))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
github-actions bot pushed a commit to jeremyeder/agentready that referenced this pull request Dec 16, 2025
# [2.10.0](v2.9.0...v2.10.0) (2025-12-16)

### Bug Fixes

* add bounded retry logic for LLM rate limit handling ([ambient-code#205](https://github.com/jeremyeder/agentready/issues/205)) ([6ecb786](6ecb786)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104)
* disable attestations for Test PyPI to avoid conflict ([ambient-code#155](https://github.com/jeremyeder/agentready/issues/155)) ([a33e3cd](a33e3cd)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* downgrade docker/metadata-action to v5 and fix shellcheck warnings ([12f5509](12f5509))
* enable Harbor task filtering for smoketest support ([ambient-code#222](https://github.com/jeremyeder/agentready/issues/222)) ([f780188](f780188))
* leaderboard workflow and SSH URL support ([ambient-code#147](https://github.com/jeremyeder/agentready/issues/147)) ([de28cd0](de28cd0))
* make E2E test timeouts configurable and add sensitive directory test ([ambient-code#206](https://github.com/jeremyeder/agentready/issues/206)) ([27e87e5](27e87e5)), closes [ambient-code#104](https://github.com/jeremyeder/agentready/issues/104) [ambient-code#192](https://github.com/jeremyeder/agentready/issues/192)
* rename research report in data directory ([b8ddfdc](b8ddfdc))
* resolve all test suite failures - achieve zero failures ([ambient-code#180](https://github.com/jeremyeder/agentready/issues/180)) ([990fa2d](990fa2d)), closes [ambient-code#148](https://github.com/jeremyeder/agentready/issues/148) [ambient-code#147](https://github.com/jeremyeder/agentready/issues/147) [ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)
* resolve broken links and workflow failures ([ambient-code#160](https://github.com/jeremyeder/agentready/issues/160)) ([fbf5cf7](fbf5cf7))
* resolve YAML syntax error in continuous-learning workflow ([ambient-code#172](https://github.com/jeremyeder/agentready/issues/172)) ([3d40fcc](3d40fcc))
* resolve YAML syntax error in update-docs workflow and add actionlint ([ambient-code#173](https://github.com/jeremyeder/agentready/issues/173)) ([97b06af](97b06af))
* skip PR comments for external forks to prevent permission errors ([ambient-code#163](https://github.com/jeremyeder/agentready/issues/163)) ([2a29fb8](2a29fb8))
* update --version flag to show correct version and research report date ([ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)) ([5a85abb](5a85abb))
* **workflows:** ensure post-comment step runs after Claude Code Action ([b087e5c](b087e5c))
* **workflows:** handle all event types in agentready-dev workflow ([9b942bf](9b942bf))
* **workflows:** improve error handling and logging for comment posting ([9ea1e6b](9ea1e6b))
* **workflows:** improve issue number extraction and add debug step ([ecd896b](ecd896b))
* **workflows:** remove if:always() to test step execution ([ff0bb12](ff0bb12))
* **workflows:** simplify post-comment step condition ([1bbf40a](1bbf40a))

### Features

* add ambient-code/agentready to leaderboard ([ambient-code#148](https://github.com/jeremyeder/agentready/issues/148)) ([621152e](621152e))
* add Harbor Terminal-Bench comparison for agent effectiveness ([ambient-code#199](https://github.com/jeremyeder/agentready/issues/199)) ([a56e318](a56e318))
* add Memory MCP server allow list to repository settings ([ambient-code#203](https://github.com/jeremyeder/agentready/issues/203)) ([41d87bb](41d87bb))
* add quay/quay to leaderboard ([ambient-code#162](https://github.com/jeremyeder/agentready/issues/162)) ([d6e8df0](d6e8df0))
* Add weekly research update skill and automation ([ambient-code#145](https://github.com/jeremyeder/agentready/issues/145)) ([7ba17a6](7ba17a6))
* automate PyPI publishing with trusted publishing (OIDC) ([ambient-code#154](https://github.com/jeremyeder/agentready/issues/154)) ([71f4632](71f4632)), closes [pypa/#action-pypi-publish](https://github.com/jeremyeder/agentready/issues/action-pypi-publish)
* consolidate GitHub Actions workflows by purpose ([ambient-code#217](https://github.com/jeremyeder/agentready/issues/217)) ([717ca6b](717ca6b)), closes [ambient-code#221](https://github.com/jeremyeder/agentready/issues/221)
* container support ([ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)) ([c6874ea](c6874ea))
* convert AgentReady assessment to on-demand workflow ([ambient-code#213](https://github.com/jeremyeder/agentready/issues/213)) ([b5a1ce0](b5a1ce0)), closes [ambient-code#191](https://github.com/jeremyeder/agentready/issues/191)
* enhance assessors with multi-language support and security ([ambient-code#200](https://github.com/jeremyeder/agentready/issues/200)) ([85712f2](85712f2)), closes [#10](#10)
* Harbor framework integration for Terminal-Bench evaluations ([ambient-code#202](https://github.com/jeremyeder/agentready/issues/202)) ([d73a8c8](d73a8c8)), closes [#4](#4) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178) [ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)
* Redesign homepage features with two-column layout and research links ([ambient-code#189](https://github.com/jeremyeder/agentready/issues/189)) ([570087d](570087d)), closes [ambient-code#187](https://github.com/jeremyeder/agentready/issues/187)
* replace markdown-link-check with lychee for link validation ([ambient-code#177](https://github.com/jeremyeder/agentready/issues/177)) ([f1a4545](f1a4545))
* Terminal-Bench eval harness (MVP Phase 1) ([ambient-code#178](https://github.com/jeremyeder/agentready/issues/178)) ([d06bab4](d06bab4)), closes [ambient-code#171](https://github.com/jeremyeder/agentready/issues/171)
* **workflows:** add comment posting for [@agentready-dev](https://github.com/agentready-dev) agent ([5dff614](5dff614))

### Performance Improvements

* implement lazy loading for heavy CLI commands ([ambient-code#151](https://github.com/jeremyeder/agentready/issues/151)) ([6a7cd4e](6a7cd4e))
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant