Reinforcement Learning : A Beginners Tutorial
Between MDPs and Semi-MDPs: Learning, Planning and Representing Knowledge at Multiple Temporal Scales Richard S. Sutton Doina Precup University of Massachusetts.
Richard S. Sutton Doina Precup University of Massachusetts Satinder Singh University of Colorado