Dynamic Programming Back to Back SWE

Bito's AI Architect Achieves Highest Success Rate of 60.8% on SWE-Bench Pro

The evaluation used identical Claude Sonnet 4.5 agents under two conditions. In the baseline condition, the agent relied on native file search and tool-driven exploration to infer repository structure ...
