Paperback
Stochastic Optimization Methods for Policy Evaluation in Reinforcement Learning
Yi Zhou, Shaocong Ma
This monograph introduces various value-based approaches to the policy evaluation problem in online reinforcement learning, where the goal is to learn the value function associated with a specific policy…
Available to order, ships in 7-14 days