Evaluating Agent-Based Program Repair at Google (arxiv.org)
moqizhengz 11 days ago [-]
In short, Google selected 178 relatively easy issues out of their 80K-bug database and found that Gemini 1.5 was kind of good at dealing with machine-detected bugs.

Maybe it's time to build some post-UT automated patch generation CI pipeline?

And I think the other ongoing experiment mentioned in the paper is more interesting: "investigating the ability of an agent to generate bug-reproducing tests".
