
Adaptive Robot Assisted Therapy using Interactive Reinforcement Learning

Konstantinos Tsiakas1,2, Maria Dagioglou2, Vangelis Karkaletsis2, Fillia Makedon1

1 HERACLEIA - Human Centered Computing Lab, Department of Computer Science

University of Texas at Arlington, USA

2 Software and Knowledge Engineering Lab, Institute of Informatics and Telecommunications

National Center for Scientific Research "Demokritos", Greece

ICSR 2016

Outline

Introduction

Use Case
Robot Learning

Policy Transfer Experiments
Experimental Results

Interactive Learning and Adaptation Framework
Description
Preliminary Experiments

Ongoing Work

Introduction
Robot Assisted Therapy

Robot Assisted Therapy has been extensively applied to assist users with cognitive and/or physical impairments

• Robotic Assistive Systems can establish a productive interaction with the user, improving the effects of a therapy session

• Two significant attributes of such systems (or agents) are personalization and safety

• Reinforcement Learning is an appropriate framework for modeling and optimizing the behavior of an agent that interacts with a user

Related Work

• A robotic seal that interacted with patients to explore the role of the robot in improving conversation and interaction skills (Kidd et al. 2006)

• A Socially Assistive Robot (SAR) motivator to increase user enjoyment and performance on a physical/cognitive task (Fasola & Mataric 2010)

• A social robot for personalized Robot-Child Tutoring (Ramachandran & Scassellati 2014)

These works validate that a personalized and adaptive robotic system can establish a tailored and productive interaction with the user

Adaptive Robot Assisted Therapy
Adaptive Training Task

• Adaptive Training Task example: a training task designed to improve specific cognitive or physical aspects of a participant

• The user needs to complete a set of three tasks

• The robot models the difficulty level based on the user's performance during the interaction, helping the user finish the session while keeping them engaged

Adaptive Cognitive Task
Robot Behavior Modelling as an MDP
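The transcript does not preserve the MDP diagram from this slide. As a rough illustration only, here is a minimal Python sketch of how such an adaptive task could be framed as an MDP; the state variables, action set, and reward shaping below are assumptions for illustration, not the paper's exact formulation.

```python
# Hypothetical MDP framing of the adaptive training task (illustration only;
# states, actions, and reward here are assumptions, not the paper's design).
from dataclasses import dataclass

@dataclass(frozen=True)
class State:
    task: int         # which of the three tasks the user is currently on
    difficulty: int   # current difficulty level
    performance: int  # discretized user performance so far (0=low .. 2=high)

# Actions the robot could take between task steps
ACTIONS = ("decrease_difficulty", "keep_difficulty", "increase_difficulty")

def reward(state: State, task_completed: bool) -> float:
    """Toy reward: favor completing tasks, with a small shaping term
    that stands in for keeping the user engaged."""
    return (1.0 if task_completed else 0.0) + 0.1 * state.performance
```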

Learning Experiments

• The agent needs to learn the optimal policy: the mapping from states to actions that maximizes the accumulated reward (or total return) during each interaction

• In order to evaluate this modeling, we apply ε-greedy Q-learning for four different user models (a minimal learning loop is sketched after this list)

• These user models depict different user skills under different game parameters (task difficulty and duration)
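As a concrete reference for the setup above, the following is a minimal tabular ε-greedy Q-learning loop run against a simulated user model. The env interface (reset, step, actions) and all hyperparameter values are assumptions for illustration.

```python
import random
from collections import defaultdict

def q_learning(env, episodes=5000, alpha=0.1, gamma=0.95, epsilon=0.1):
    """Tabular epsilon-greedy Q-learning against a simulated user model.

    env is assumed to expose reset() -> state, step(action) -> (state,
    reward, done), and a sequence env.actions; this interface is illustrative."""
    Q = defaultdict(float)
    for _ in range(episodes):
        s, done = env.reset(), False
        while not done:
            # epsilon-greedy action selection
            if random.random() < epsilon:
                a = random.choice(env.actions)
            else:
                a = max(env.actions, key=lambda x: Q[(s, x)])
            s2, r, done = env.step(a)
            # standard one-step Q-learning update
            best_next = 0.0 if done else max(Q[(s2, b)] for b in env.actions)
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2
    return Q
```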

Learning Experiments
User Models



Policy Transfer Experiments

We make the following assumptions:

• A user-specific policy (USP) is the optimal policy for the corresponding user model; the one that maximizes total return, and thus user performance

• Applying a learned policy to a different user model may not be efficient, but it is still better than learning from scratch

We apply the four different USPs to the four different user models, following an exploitation-only approach (minimize risks and harmful actions)
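A sketch of this transfer step under the stated assumptions: the USP learned on one user model is executed greedily (exploitation only, no exploration) on another user model, and the mean total return is recorded. The function names and env interface carry over from the learning sketch above and are illustrative.

```python
def evaluate_policy(Q, env, episodes=100):
    """Run a learned policy greedily (exploitation only) on a user model
    and return the mean total return across episodes."""
    returns = []
    for _ in range(episodes):
        s, done, total = env.reset(), False, 0.0
        while not done:
            a = max(env.actions, key=lambda x: Q[(s, x)])  # no exploration
            s, r, done = env.step(a)
            total += r
        returns.append(total)
    return sum(returns) / len(returns)

# e.g., learn a USP on user model A, then apply it to user model B:
# Q_A = q_learning(user_model_A)
# transfer_return = evaluate_policy(Q_A, user_model_B)
```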

Policy Transfer Experiments
Results

Interactive Learning and Adaptation Framework
Description

• A framework that utilizes Interactive Reinforcement Learning for dynamic adaptation

• Learning from Feedback treats the human input as a reward after the executed action

• Learning from Guidance allows human intervention on the selected action before execution, proposing (corrective) actions

• Integration of user feedback and expert guidance towards a lifelong learning setup

Interactive Learning and Adaptation Framework

Interactive Learning and Adaptation Framework
Learning from Feedback

Strong involvement in a task occurs when the skills of an individual meet the challenge of the task (Chanel et al. 2008)

feedback = −|diff − diff′| / N_diff ∈ [−1, 0]

Q(s, a) = Q(s, a) + β · feedback
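In code, the update could look as follows, reading diff as the selected difficulty, diff′ as the difficulty matching the user's current skill, and N_diff as the number of difficulty levels; these readings, and the sign convention implied by the [−1, 0] range, are inferred from the slide.

```python
def feedback_signal(diff, diff_prime, n_diff):
    """Feedback in [-1, 0]: zero when the chosen difficulty matches the
    skill-appropriate one, increasingly negative as they diverge."""
    return -abs(diff - diff_prime) / n_diff

def feedback_update(Q, state, action, diff, diff_prime, n_diff, beta=0.5):
    """Shift Q(s, a) by beta * feedback after the executed action.
    The value of beta here is illustrative."""
    Q[(state, action)] += beta * feedback_signal(diff, diff_prime, n_diff)
```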

Interactive Learning and Adaptation Framework
Learning from Guidance

Combination of two techniques:

• Human-guided exploration - SPARC (Senft et al. 2016)
• Teaching on a budget - mistake correcting (Torrey et al. 2013)

Assumption: an almost perfect supervisor, which follows the corresponding USP with probability p
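A sketch of how this guidance step could be simulated: the agent proposes an ε-greedy action, and before execution the supervisor intervenes only on mistakes (mistake correcting), itself following the USP with probability p. The function and its default values are illustrative, not the paper's implementation.

```python
import random

def select_with_guidance(Q, usp, state, actions, p=0.9, epsilon=0.1):
    """Agent proposes an epsilon-greedy action on Q; a simulated 'almost
    perfect' supervisor intervenes only when the proposal deviates from
    the user-specific policy (USP), correcting it with probability p."""
    if random.random() < epsilon:
        proposed = random.choice(actions)
    else:
        proposed = max(actions, key=lambda a: Q[(state, a)])
    if proposed != usp[state] and random.random() < p:
        return usp[state]  # corrective action from the USP
    return proposed
```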

Preliminary Adaptation Experiments

Ongoing Work

Thank you...

Contact: konstantinos.tsiakas@mavs.uta.edu