

codetiger on Feb 24, 2024 | on: GPT in 500 Lines of SQL

Games like Go, as played by AlphaGo, have a very limited (or known) end state, so reinforcement learning or similar methods work great. However, I wonder how AI will train itself to learn human languages without being judged by humans. It's just a matter of time before someone figures it out.

erwincoumans on Feb 24, 2024

Right, a rich simulator with humans for feedback: an evolved version of online worlds with a mix of AI NPCs and real people, where the task is to find the NPCs. The NPCs can train in rooms containing only NPCs or in rooms mixed with people, without knowing which.
