Seems like a good benchmark for AGI. Start with things that are easy for humans ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		robryan 82 days ago \| parent \| context \| favorite \| on: GPT-5 Seems like a good benchmark for AGI. Start with things that are easy for humans but hard for LLMs currently.

mustaphah 81 days ago [–]

But they have access to tools (though I'm not sure why they're not using them in this case).

Ask it to count using a coding tool, and it will always give you the right answer. Just as humans use tools to overcome their limits, LLMs should do the same.

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact