Goprez sg
Reinforcement Learning: A Tutorial Rich Sutton AT&T Labs -- Research sutton.
Sense - Data Driven NYC // June 2014
1 RL for Large State Spaces: Value Function Approximation Alan Fern * Based in part on slides by Daniel Weld.
Institute of Computer Science University of Wroclaw Geometric Aspects of Online Packet Buffering An Optimal Randomized Algorithm for Two Buffers Marcin.
5/11/2015 Mahdi Naser-Moghadasi Texas Tech University.
Some Networking Aspects of Multiple Access Muriel Medard EECS MIT.
4/3. (FO)MDPs: The plan General model has no initial state; complex cost and reward functions, and finite/infinite/indefinite horizons Standard algorithms.
Concurrent Markov Decision Processes Mausam, Daniel S. Weld University of Washington Seattle.
1 Can Internet Video-on-Demand be Profitable? Cheng Huang, Jin Li (Microsoft Research Redmond), Keith W. Ross (Polytechnic University) ACM SIGCOMM 2007.
R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction 1 From Sutton & Barto Reinforcement Learning An Introduction.
Reinforcement Learning: Learning algorithms Yishay Mansour Tel-Aviv University.