A month later Augment's Open Source Agent used with Claude 3.7 and o1 still tops the SWE-Bench Verified leaderboard. Sure, it's only a month, but in the rapidly changing world of AI that is something.
With all the great models released in the past month this scaffolding may be even more useful.