Sample size of 1 but GPT-5 seems horrendous at coding? My go to benchmark is a 3...

M4v3R · 2025-08-08T13:14:20 1754658860

This is what I got from your prompt in one shot with GPT-5 Thinking:

Game: https://chatgpt.com/canvas/shared/6895f722f2708191ac4a6d1645...

Conversation: https://chatgpt.com/share/6895f74a-0c5c-8004-b349-69da096531...

The controls are inverted for some reason and it could be a bit faster, but I fixed both of these easily with one prompt and here's the corrected version: https://chatgpt.com/canvas/shared/6895f82759f88191ba41c9fcd5...

raducu · 2025-08-11T06:57:42 1754895462

Thanks, the issue was indeed not using explicitly the thinking model or they changed something over the weekend -- it's at least on par with Claude now.

EDIT: clearly better than Claude or any other model that I tried before. I had a bonus benchmark -- add a narrow triangle on the head of the snake that indicates the direction of movement, after a single iteration GPT-5 fixed it whereas Claude could never get the rotation of the triangle right, nor could o3 the last time I tried.

Frieren · 2025-08-08T07:54:30 1754639670

> My go to benchmark is a 3d snake game Claude does almost flawlessly (or at least in 3-4 iterations)

If you need to know how the snake game should look to get the code then Claude is not doing the work you are.