Solving Large Markov Decision Processes Yilan Gu Dept. of Computer Science University of Toronto April 12, 2004.
PEGASUS: A policy search method for large MDP’s and POMDP’s
Solving Large Markov Decision Processes