UCT (Upper Confidence based Tree Search) An efficient game tree search algorithm for the game of Go by Levente Kocsis and Csaba Szepesvari [1]. The UCB1.
UAV Route Planning in Delay Tolerant Networks
Off-Policy Temporal-Difference Learning with Function Approximation Doina Precup McGill University Rich Sutton Sanjoy Dasgupta AT&T Labs.
POMDPs: 5 Reward Shaping: 4 Intrinsic RL: 4 Function Approximation: 3
The ideals reality of science