Add CoreThink Agent v1.0 SWE-bench Lite submission #334
JayVaghasiya-ai wants to merge 3 commits into SWE-bench:main
- 62.33% success rate (187/300 resolved instances)
- Neuro-symbolic approach with Claude Sonnet-4 + DeepSeek-R1
- Complete reasoning traces and evaluation artifacts included
- Technical report: https://arxiv.org/pdf/2509.00971
- Organization: CoreThink.ai

- Remove unnecessary eval.sh files from logs/ (299 files)
- Remove hook_traces directories from trajs/ (299 dirs)
- Remove .pred and .trace.log files from trajs/ (598 files)

- Merged content from 299 .info.log files into their corresponding .traj files
- Removed all .info.log files to consolidate trajectory data
- Each .traj file now contains complete trajectory information
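
The consolidation described in this commit could be sketched roughly as follows. This is a minimal illustration, not the script actually used for the submission: it assumes that each `<instance_id>.info.log` sits next to its `<instance_id>.traj` directly inside `trajs/`, and that merging means appending the log text under a separator line. The function name `merge_info_logs` and the separator format are hypothetical.

```python
from pathlib import Path

INFO_SUFFIX = ".info.log"  # assumed naming convention for the log files


def merge_info_logs(trajs_dir: str) -> int:
    """Append each <id>.info.log to <id>.traj, then delete the log.

    Returns the number of logs merged. Logs without a matching .traj
    file are left untouched so nothing is lost silently.
    """
    root = Path(trajs_dir)
    merged = 0
    for log_path in sorted(root.glob("*" + INFO_SUFFIX)):
        instance_id = log_path.name[: -len(INFO_SUFFIX)]
        traj_path = root / (instance_id + ".traj")
        if not traj_path.exists():
            continue  # orphan log: keep it for manual inspection
        log_text = log_path.read_text(encoding="utf-8")
        with traj_path.open("a", encoding="utf-8") as traj:
            # Hypothetical separator so the merged section is easy to find.
            traj.write("\n--- merged from " + log_path.name + " ---\n")
            traj.write(log_text)
        log_path.unlink()
        merged += 1
    return merged
```

Running `merge_info_logs("trajs")` on a copy of the directory first is the safer choice, since the function deletes each log once it has been appended.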

Thank you for the feedback! I've already addressed this concern and consolidated the trajectory files. Here's what we've updated:

What We Changed

Previously, we had two files per task instance in the trajs/ directory:

Alignment with SWE-bench Lite Guidelines

The logs/ directory contains 3 files per instance (patch.diff, report.json, test_output.txt), which are the standard evaluation artifacts generated by the SWE-bench harness. These are kept separate as they represent the evaluation results rather than the agent's internal reasoning process.

Is this structure now aligned with your requirements, or would you like us to further consolidate any other files?

Best regards,
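
For reviewers who want to confirm the logs/ layout described above, a check along these lines may help. It is only a sketch: it assumes one subdirectory per instance under `logs/` (the comment does not spell out the exact nesting), and the function name `find_incomplete_instances` is hypothetical.

```python
from pathlib import Path

# The three per-instance artifacts named in the comment above.
EXPECTED_FILES = {"patch.diff", "report.json", "test_output.txt"}


def find_incomplete_instances(logs_dir: str) -> dict[str, set[str]]:
    """Map each instance directory name to the expected files it lacks.

    Assumes logs/<instance_id>/ holds that instance's artifacts.
    Instances with all three files are omitted from the result.
    """
    missing: dict[str, set[str]] = {}
    for instance_dir in sorted(Path(logs_dir).iterdir()):
        if not instance_dir.is_dir():
            continue  # ignore stray files at the top level
        present = {p.name for p in instance_dir.iterdir() if p.is_file()}
        absent = EXPECTED_FILES - present
        if absent:
            missing[instance_dir.name] = absent
    return missing
```

An empty result from `find_incomplete_instances("logs")` would mean every instance directory carries all three artifacts.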
