That kinda defeats the purpose. Of course you can use AlphaGo, but the question here is – can a generative AI teach itself to play chess (and do a million other similar generic tasks) when given no specific training for it.
How about, can a generative AI teach itself how to use a chess AI to beat chess? Give GPT4 the ability to make REST API calls and also access to FFI, and put a chess-bot library somewhere. Train it how to use these but not necessarily how to use the chess API specifically. If you ask GPT4 to play chess, can it call into that library and use the requests/responses?
This has bigger ramifications too: if GPT4 learns how to use RPCs effectively, it can do accurate complex mathematical computations and simulations, do IoT/home automation tasks, and even control heavy machinery.
This is exactly what I'm hyped for in the next-gen GPT-7. Imagine it having the ability to self-teach, just like a child. I may not know how to whip up some cheesy goodness, but with external resources like YouTube vids, I can improve. And if GPT-7 can store this knowledge, it can access it for future tasks! That's some next-level stuff, and I'm stoked to see where it goes.
GPT is a language model. It doesn't call APIs. You can take the output of GPT and decide to call a specific API based on it, and sure you can bake that into ChatGPT or some independent tooling, but again that's not the purpose of the core project. I'm sure plenty of people/companies are working on "chess engine interface using ChatGPT" already.
Isn't it very likely chess games were part of the training data? If so, chess sequences seem like an amazing matched filter of sorts. It doesn't seem surprising that it can guess a next move.
Is it teaching itself anything here? Is the model being updated in any way? Or are you talking more generally, looking into the future sort of statement?
You can put every chess game ever played in its database and yet every time it plays a new game it will immediately come across a completely unique move. So it still needs to learn the rules and know how to play independently. Whether it can do that or not is yet undermined.
I think they meant “teach itself” in the course of a conversation, within the token limit. ChatGPT will forget what it has “learned” once the conversation is over.