Update agent display names to include model and scaffold by Chesars · Pull Request #412 · SWE-bench/experiments

Chesars · 2026-02-09T18:51:07Z

Summary

Updates names for 16 entries that were missing model or scaffold

Changes

Entry	Before	After
Augment Agent v1	Augment Agent v1	Augment Agent v1 + Claude Sonnet 4
Augment Agent v0	Augment Agent v0	Augment Agent v0 + Sonnet 3.7 + O1
OpenHands + 4x Scaled	OpenHands + 4x Scaled (2024-02-03)	OpenHands + 4x Scaled + Claude 3.5 Sonnet + o3-mini (2024-02-03)
AppMap Navie v2	AppMap Navie v2	AppMap Navie v2 + Claude 3.5 Sonnet + GPT-4o
PatchPilot-v1.1	PatchPilot-v1.1	PatchPilot-v1.1 + o4-mini
SWE-Exp	SWE-Exp	SWE-Exp + DeepSeek-V3-0324
SWE-Rizzo	SWE-Rizzo	SWE-Rizzo + Claude 3.7
Nemotron-CORTEXA	Nemotron-CORTEXA	Nemotron-CORTEXA + NV-EmbedCode + Claude 3.5 Sonnet + DeepSeek-V3 + o3-mini + GPT-4o + GPT-4-turbo + Qwen2.5-72B + Llama-3.1-405B + Llama-3.3-70B
GLM-4.5	GLM-4.5	OpenHands + GLM-4.5
Skywork-SWE-32B	Skywork-SWE-32B	OpenHands + Skywork-SWE-32B
Skywork-SWE-32B + TTS(Bo8)	Skywork-SWE-32B + TTS(Bo8)	OpenHands + Skywork-SWE-32B + TTS(Bo8)
MCTS-Refine-7B	MCTS-Refine-7B	Agentless + MCTS-Refine-7B
DeepSWE-Preview	DeepSWE-Preview	R2E-Agent + DeepSWE-Preview
DeepSWE-Preview + TTS(Bo16)	DeepSWE-Preview + TTS(Bo16)	R2E-Agent + DeepSWE-Preview + TTS(Bo16)
FrogBoss-32B-2510	FrogBoss-32B-2510	debug-gym + FrogBoss-32B-2510
FrogMini-14B-2510	FrogMini-14B-2510	debug-gym + FrogMini-14B-2510

Related: #406
Closes: SWE-bench/swe-bench.github.io#40

- Augment Agent v1 → + Claude Sonnet 4 - Augment Agent v0 → + Sonnet 3.7 + O1 - OpenHands + 4x Scaled → + Claude 3.5 Sonnet + o3-mini - AppMap Navie v2 → + Claude 3.5 Sonnet + GPT-4o - PatchPilot-v1.1 → + o4-mini - SWE-Exp → + DeepSeek-V3-0324 - SWE-Rizzo → + Claude 3.7 - Nemotron-CORTEXA → + all 9 models used - GLM-4.5 → OpenHands + GLM-4.5 - Skywork-SWE-32B → OpenHands + Skywork-SWE-32B - Skywork-SWE-32B + TTS(Bo8) → OpenHands + Skywork-SWE-32B + TTS(Bo8) - MCTS-Refine-7B → Agentless + MCTS-Refine-7B - DeepSWE-Preview → R2E-Agent + DeepSWE-Preview - DeepSWE-Preview + TTS(Bo16) → R2E-Agent + DeepSWE-Preview + TTS(Bo16) - FrogBoss-32B-2510 → debug-gym + FrogBoss-32B-2510 - FrogMini-14B-2510 → debug-gym + FrogMini-14B-2510

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update agent display names to include model and scaffold#412

Update agent display names to include model and scaffold#412
Chesars wants to merge 1 commit intoSWE-bench:mainfrom
Chesars:update-agent-display-names-v2

Chesars commented Feb 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Chesars commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Chesars commented Feb 9, 2026 •

edited

Loading