SWE-agent

Name: SWE-agent
Rating: 82.8 (200 reviews)

Coding & Software Engineering

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

SWE-agent scored 82.8 on the Agentic Leaderboard, ranking #56 out of 200 evaluated agents, helped by its strong performance in Reliability (97.4%).

Rank: #56
Score: 82.8