Reinforcement Learning. Overview Introduction Q-learning Exploration Exploitation Evaluating RL algorithms On-Policy learning: SARSA Model-based.
1 Efficiently Learning the Accuracy of Labeling Sources for Selective Sampling by Pinar Donmez, Jaime Carbonell, Jeff Schneider School of Computer Science,