devin
links
The “First AI Software Engineer” Is Bungling the Vast Majority of Tasks It’s Asked to Do - It took longer than a human, and failed at the vast majority of tasks.
Out of 20 tasks we attempted, we saw 14 failures, three inconclusive results, and just three successes,” the researchers found — a meager success rate of just 15 percent.