NHacker Next
SWE-bench Verified no longer measures frontier coding capabilities (openai.com)
339 points by kmdupree 1 day ago | 179 comments