Comment by janalsncm
8 months ago
We know the model is approximating the results of a search. We don’t know whether it is actually searching.
At the most basic level, the model is just giving probabilities for the next moves, or in the case of value approximation, guessing which bucket the value falls into.
No comments yet
Contribute on Hacker News ↗