General Atari 2600 Game Playing - GeNeura...

Post on 26-May-2018

220 views 0 download

Transcript of General Atari 2600 Game Playing - GeNeura...

General Atari 2600 Game Playing

Michael Bowling

Work with:Joel Veness, Marc Bellemare, Anna Koop, Mostafa Vafadoost

http://www.arcadelearningenvironment.org

Friday, September 14, 2012

Friday, September 14, 2012

http://www.arcadelearningenvironment.org

Friday, September 14, 2012

Varied

ManyIndependent

Interesting

Friday, September 14, 2012

Friday, September 14, 2012

Friday, September 14, 2012

A.I.

0100010101...00001110101

Reinforcement Learning

Friday, September 14, 2012

A.I.

0100010101...00001110101

Planning

Friday, September 14, 2012

A.I.

Model

Model Learning

Friday, September 14, 2012

A.I.Expert

Imitation/Apprenticeship Learning

Friday, September 14, 2012

A.I.

.

.

.

Transfer Learning

Pitfall!

Pitfall II

Friday, September 14, 2012

A.I.

Intrinsic Motivation

Friday, September 14, 2012

Friday, September 14, 2012

TrainingGames

TestingGames

Friday, September 14, 2012

Friday, September 14, 2012

Contingency Awareness: knowing what you control

(Bellemare et al., AAAI 2012)

Friday, September 14, 2012

(Bellemare et al., AAAI 2012)

Contingency Awareness: knowing what you control

UnawareContingency Aware

Friday, September 14, 2012

(Bellemare et al., AAAI 2012)

Contingency Awareness: knowing what you control

1.0

0.5

0.0 00.20.40.60.81.0

Inter-Algorithm Score

Frac

tion

of G

ames

Inter-Algorithm Score Distribution

MaxColExtended MaxCol

Basic

Extended

Friday, September 14, 2012

Sketch-Based Hashing:tug-of-war vs. standard hashing

(Bellemare et al., NIPS 2012)

1.0

0.5

0.0 00.20.40.60.81.0

Frac

tion

of g

ames

Inter-algorithm score

Tug-of-War

Standard

Hash Table Size: 10001.0

0.5

0.0 00.20.40.60.81.0

Frac

tion

of g

ames

Inter-algorithm score

Tug-of-War

Standard

Hash Table Size: 50001.0

0.5

0.0 00.20.40.60.81.0

Frac

tion

of g

ames

Inter-algorithm score

Tug-of-War

Standard

Hash Table Size: 20,000

55 Testing Games

Friday, September 14, 2012

Model Learning:pixels, probabilities, and priors

(Bellemare et al., In Prep)

Friday, September 14, 2012

Model Learning:pixels, probabilities, and priors

(Bellemare et al., In Prep)

Friday, September 14, 2012

http://www.arcadelearningenvironment.org

Questions?

Source code for ALE and all agents available!

Friday, September 14, 2012

Friday, September 14, 2012

Will there be a competition?

No

vs.

Friday, September 14, 2012