Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Are you talking about Monte Carlo tree search? I consider it part of the algorithm in AlphaZero's case. But agreed that RL is a lot harder in real-life setting than in a board game setting.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: