Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The Monte Carlo analysis AlphaZero used functioned as a sort of multi-step reasoning for it. GPT can use its token buffer for some multi-step reasoning but that sort of interferes with providing a conversation with the user so it's much less effective.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: