SteerBench: a benchmark suite for evaluating steering behaviors
Authors: Singh, Kapadia, Faloutsos, Reinman
Presented by: Jessica Siewert
Content of presentation
• Introduction
• Previous work
• The Method
• Assessment
Introduction – Context and motivation
– Steering of agents
– Objective comparison
– Standard?
– Test cases and scoring, user evaluation
– Metric scoring
– Demonstration
Introduction – Previous work
• Nothing comparable existed yet (as of Nov ‘08)
Introduction – Promises
• Evaluate objectively
• Help researchers
• Working towards a standard for evaluation
• Take into account:
  – Cognitive decisions
  – Situation-specific aspects
The test cases
– Simple validation scenarios
– Basic one-on-one interactions
– Agent interactions including obstacles
– Group interactions
– Large-scale scenarios
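To make these categories concrete, here is a minimal sketch of how one such scenario could be encoded (hypothetical Python representation; SteerBench defines its own test-case specification, which this code does not reproduce):

```python
from dataclasses import dataclass, field

@dataclass
class Agent:
    """Initial conditions for one agent: the test cases specify little
    more than a position, a facing direction, and a goal."""
    position: tuple[float, float]
    direction: tuple[float, float]
    goal: tuple[float, float]

@dataclass
class TestCase:
    """One benchmark scenario: agents plus optional obstacles."""
    name: str
    agents: list[Agent]
    # Axis-aligned boxes as (xmin, ymin, xmax, ymax); an assumption here.
    obstacles: list[tuple[float, float, float, float]] = field(default_factory=list)

# A basic one-on-one interaction: two agents walking head-on.
head_on = TestCase(
    name="oncoming-agents",
    agents=[
        Agent(position=(-5.0, 0.0), direction=(1.0, 0.0), goal=(5.0, 0.0)),
        Agent(position=(5.0, 0.0), direction=(-1.0, 0.0), goal=(-5.0, 0.0)),
    ],
)
```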
The user’s opinion
• Rank on overall score across test cases (for comparison)
• Rank algorithms based on
  – a single case, or
  – one agent’s behavior
• Pass/fail
• Visually inspect results
• Examine detailed metrics of the performance
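As an illustration of the first two ranking options, a sketch in Python (the function names and the higher-is-better convention are assumptions, not taken from the paper):

```python
def rank_overall(scores: dict[str, dict[str, float]]) -> list[tuple[str, float]]:
    """Rank algorithms by mean score across all test cases (higher = better)."""
    means = {algo: sum(per_case.values()) / len(per_case)
             for algo, per_case in scores.items()}
    return sorted(means.items(), key=lambda kv: kv[1], reverse=True)

def rank_on_case(scores: dict[str, dict[str, float]], case: str) -> list[tuple[str, float]]:
    """Rank algorithms on a single test case only."""
    return sorted(((algo, per_case[case]) for algo, per_case in scores.items()),
                  key=lambda kv: kv[1], reverse=True)

# Made-up scores for two algorithms on two cases:
scores = {"A": {"oncoming": 0.9, "crossing": 0.4},
          "B": {"oncoming": 0.7, "crossing": 0.7}}
print(rank_overall(scores))              # B first (mean 0.70), then A (mean 0.65)
print(rank_on_case(scores, "oncoming"))  # A first (0.9), then B (0.7)
```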
The metric
• Number of collisions
• Time efficiency
• Effort efficiency
• Penalties?
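A hedged sketch of how these metrics might be combined into a single per-agent score (illustrative Python; the weights, the kinetic-energy proxy for effort, and the penalty handling are assumptions the paper leaves open):

```python
def score_agent(trajectory, dt, collisions,
                w_coll=50.0, w_time=1.0, w_effort=1.0):
    """Composite score for one agent on one test case (lower = better).
    `trajectory` is a list of (x, y) positions sampled every `dt` seconds;
    `collisions` is the observed collision count. Weights are illustrative."""
    total_time = (len(trajectory) - 1) * dt  # time efficiency
    # Effort efficiency approximated by integrated squared speed,
    # a proxy for the kinetic energy spent along the path.
    effort = 0.0
    for (x0, y0), (x1, y1) in zip(trajectory, trajectory[1:]):
        vx, vy = (x1 - x0) / dt, (y1 - y0) / dt
        effort += (vx * vx + vy * vy) * dt
    return w_coll * collisions + w_time * total_time + w_effort * effort
```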
Movies…
Developments since then
• Ioannis Karamouzas, Peter Heil, Pascal van Beek, Mark H. Overmars. A Predictive Collision Avoidance Model for Pedestrian Simulation. Proceedings of the 2nd International Workshop on Motion in Games, November 21-24, 2009, Zeist, The Netherlands.
• Shawn Singh, Mubbasir Kapadia, Billy Hewlett, Glenn Reinman, Petros Faloutsos. A modular framework for adaptive agent-based steering. Symposium on Interactive 3D Graphics and Games, February 18-20, 2011, San Francisco, California.
• Suiping Zhou, Dan Chen, Wentong Cai, Linbo Luo, Malcolm Yoke Hean Low, Feng Tian, Victor Su-Han Tay, Darren Wee Sze Ong, Benjamin D. Hamilton. Crowd modeling and simulation technologies. ACM Transactions on Modeling and Computer Simulation (TOMACS), v.20 n.4, p.1-35, October 2010.
Experiments – Claim recall
• Evaluate objectively
• Help researchers
• Working towards a standard for evaluation
Assessment – good things
• All the measured variables seem logical (too logical?)
• Extensive variable set, with the option to expand
• Customized evaluation
• Cheating not allowed:
  – collision penalties
  – fail constraint
  – goal constraint
• Layered set of test cases
Assessment
• The measurements all seem to be approximately the same
• Does the user test make the difference?
• Who are these users?
• “Examine”, “inspect”: all vague terms
• What about the objective of objectivity?
Assessment
• How good is it to be general?
• How general/specific is this method?
• Time efficiency vs. effort efficiency
• Should it be blind to the algorithm itself?
• Penalties, fail and goal constraints are not specified!
Assessment – scoring (1/2)
• The test cases are clearly specified, but HOW a good agent SHOULD react is not, even though the authors say such a specification exists
• How can you get cognitive decisions out of only position, direction and a goal?
Assessment – scoring (2/2)
• “Scoring not intended to be a proof of an algorithm’s effectiveness.”
• How do you interpret scores, and who wins?
  – “B is slightly better on average, but A has the highest scores.”
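A tiny worked example (made-up numbers) of how the choice of aggregation decides the winner:

```python
a = [0.95, 0.40, 0.45]  # algorithm A: one outstanding case, two weak ones
b = [0.70, 0.70, 0.70]  # algorithm B: consistently decent

print(sum(a) / len(a), sum(b) / len(b))  # means ~0.60 vs 0.70: B wins on average
print(max(a), max(b))                    # bests 0.95 vs 0.70: A has the highest score
```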
Assessment – final questions
• Can this method become a standard?
• What if someone claims to be so innovative that this standard does not apply to them?
• A nice first try, though!
Image credits: Getty Images