Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue...

58
Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang Li (Huawei) 6th Joint Conference on Lexical and Computational Semantics (*SEM 2017)

Transcript of Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue...

Page 1: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Deep Active Learning for Dialogue GenerationNabiha Asghar

Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang Li (Huawei)

6th Joint Conference on Lexical and Computational Semantics (*SEM 2017)

Page 2: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Source: http://blogs.aspect.com/2016-the-year-of-the-chatbot/2

Page 3: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Source: https://venturebeat.com/2016/08/11/introducing-the-bots-landscape-170-companies-4-billion-in-funding-thousands-of-bots/

3

Page 4: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Retrieval-based vs. Generative

Source: https://hackernoon.com/chatbot-architecture-496f5bf820ed

4

Page 5: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Premature Rise of Chatbots

Facebook Inc's Chatbots Hit a 70% Failure Rate (The Motley Fool)

The problem with chatbots: How to make them more human? (Chatbotsmagazine.com)

Bots are stumbling (McKinsey&Company)

5

Page 6: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Generative Models: Seq2Seq (Google)

Source: https://indico.io/blog/sequence-modeling-neuralnets-part1/6

Page 7: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Generative Models (Seq2Seq)

me: Hello?bot: Hi. • me: How are you?bot: I'm fine.• me: What's your name?bot: It's hard to describe. •me: How so?bot: I'm not sure.• me: Are you intelligent?bot: Yes, well... • me: What are you? bot: I'm not sure.• me: Do you plan on taking over the world?bot: No, i don't.

- Short, dull & generic responses

- Cannot keep the users engaged

- Infinite loops of repetition

- Mutually inconsistent responses

7

Page 8: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Idea: Reinforcement/Active Learning

Goal: Incorporate user feedback into the conversation: Human In The Loop

Research question: how do we construct a comprehensive and effective feedback mechanism

for generative dialogue models?

8

Page 9: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Existing DRL Models for CAs

- Task-oriented CAs: reward function is often obvious

- Open-domain CAs: hand-crafted reward functions (ease of answering, informativity,

diversity, etc)

9

Page 10: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Existing DRL Models for CAs

- Task-oriented CAs: reward function is often obvious

- Open-domain CAs: hand-crafted reward functions (ease of answering, informativity,

diversity, etc)

- Our work:- Builds on top of a trained Seq2Seq model- A simple, all-encompassing, user-centric feedback mechanism- Online, incremental learning at each turn in the conversation- No need to hand-craft the reward function

10

Page 11: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Training Pipeline

+ Reply

11

Page 12: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.

12

Page 13: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.

13

Page 14: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3

14

Page 15: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.

15

Page 16: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?

16

Page 17: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?human feedback: Good idea. May I join you?

17

Page 18: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?human feedback: Good idea. May I join you?

Human user: I don't want to work today.

18

Page 19: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?human feedback: Good idea. May I join you?

Human user: I don't want to work today.bot: Why not?bot: Aww: ).bot: Ok?bot: I wish i could robot.bot: No, you've just got to be quick.

19

Page 20: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?human feedback: Good idea. May I join you?

Human user: I don't want to work today.bot: Why not?bot: Aww: ).bot: Ok?bot: I wish i could robot.bot: No, you've just got to be quick.human feedback: Human user: I just want to lie in my bed all day and have pizza.

20

Page 21: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?human feedback: Good idea. May I join you?

Human user: I don't want to work today.bot: Why not?bot: Aww: ).bot: Ok?bot: I wish i could robot.bot: No, you've just got to be quick.human feedback: Human user: I just want to lie in my bed all day and have pizza.bot: Good idea. May I join you?bot: Does it has been to the?bot: I can’t do that again!bot: So good.bot: Where good?

21

Page 22: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Example TranscriptHuman user: I don't want to work today.bot: Ok.bot: I wish i could robot.bot: Why not?bot: Aww: ).bot: No, you've just got to be quick.human feedback: 3Human user: I just want to lie in my bed all day and have pizza.bot: I can't do that again!bot: Does he _ _ _.bot: Good idea.bot: Ok.bot: Why not?human feedback: Good idea. May I join you?

Human user: I don't want to work today.bot: Why not?bot: Aww: ).bot: Ok?bot: I wish i could robot.bot: No, you've just got to be quick.human feedback: Human user: I just want to lie in my bed all day and have pizza.bot: Good idea. May I join you?bot: Does it has been to the?bot: I can’t do that again!bot: So good.bot: Where good?human feedback: Human user: Haha! Which pizza toppings are your favourite?

22

Page 23: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Active Learning at each step

For the message-response pair (x,y), use cross-entropy loss to maximize the likelihood of y given x.

23

Page 24: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Active Learning at each step

For the message-response pair (x,y), use cross-entropy loss to maximize the likelihood of y given x.

Use optimizer’s learning rate to determine how quickly you want the bot to learn.

24

Page 25: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Active Learning at each step

For the message-response pair (x,y), use cross-entropy loss to maximize the likelihood of y given x.

Use optimizer’s learning rate to determine how quickly you want the bot to learn.

Feedback is optional (i.e. model/policy does not change).

25

Page 26: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Active Learning at each step

For the message-response pair (x,y), use cross-entropy loss to maximize the likelihood of y given x.

Use optimizer’s learning rate to determine how quickly you want the bot to learn.

Feedback is optional (i.e. model/policy does not change).

Can learn from multiple users simultaneously.

26

Page 27: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Active Learning at each step

For the message-response pair (x,y), use cross-entropy loss to maximize the likelihood of y given x.

Use optimizer’s learning rate to determine how quickly you want the bot to learn.

Feedback is optional (i.e. model/policy does not change).

Can learn from multiple users simultaneously.

Goal: generate interesting, relevant and diverse responses.27

Page 28: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Training Pipeline

+ Reply

28

Page 29: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

29

Page 30: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

BEAM SEARCH

30

Page 31: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

BEAM SEARCH

31

Page 32: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

BEAM SEARCH

32

Page 33: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

BEAM SEARCH

33

Page 34: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH

34

Page 35: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH

35

Page 36: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH

36I don’t care! vs. I don’t care.

Page 37: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH DIVERSE BEAM SEARCH (ICLR 2017)

37I don’t care! vs. I don’t care.

Page 38: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH DIVERSE BEAM SEARCH (ICLR 2017)

w0

w1b w1

c w1dw1

a

38I don’t care! vs. I don’t care.

Page 39: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH DIVERSE BEAM SEARCH (ICLR 2017)

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

39I don’t care! vs. I don’t care.

Page 40: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Heuristic Response Generation

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

BEAM SEARCH DIVERSE BEAM SEARCH (ICLR 2017)

w0

w1b w1

c w1dw1

a

w2c w2

dw2a w2

b w2d w2

a w2b

w2c

40I don’t care! vs. I don’t care. I don’t care! vs. Who cares.

Page 41: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Training Pipeline

+ Reply

41

Page 42: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Experimental Setup

BLEU, NIST, ROUGE, WER: not suitable for dialogue quality evaluation

User study

42

Page 43: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

User Study

Online Training: one human trained the model with 200 prompts of his choice

43

Page 44: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

User Study

Online Training: one human trained the model with 200 prompts of his choice

Test set of prompts: We chose 100 of those, linguistically rephrased them

“How’s it going” → “How are you today?”

“I hate you.” → “I really don’t like you!”

44

Page 45: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

User Study

Online Training: one human trained the model with 200 prompts of his choice

Test set of prompts: We chose 100 of those, linguistically rephrased them

“How’s it going” → “How are you doing?”

“I hate you.” → “I don’t like you!”

Test pairs: Collected responses of SL1, SL2 and SL2+oAL to test prompts

45

Page 46: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

User Study

Online Training: one human trained the model with 200 prompts of his choice

Test set of prompts: We chose 100 of those, linguistically rephrased them

“How’s it going” → “How are you doing?”

“I hate you.” → “I don’t like you!”

Test pairs: Collected responses of SL1, SL2 and SL2+oAL to test prompts

Evaluation: Asked 5 human judges to rate the test pairs on 4 axes:

Syntactic Coherence Relevance to Prompt Interesting-ness User Engagement46

Page 47: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Experiments

47

Page 48: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Experiments

48

Page 49: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Experiments

49

Page 50: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Experiments

50

Page 51: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Experiments

51

Page 52: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

52

Page 53: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

53

Page 54: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

AI Assistant for Insurance

54

Page 55: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

AI Assistant for Insurance

Automated Psychotherapy

55

Page 56: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

AI Assistant for Insurance

Automated Psychotherapy

Chatbot for Employee Onboarding

56

Page 57: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

AI Assistant for Insurance

Automated Psychotherapy

Chatbot for Employee Onboarding

Platform for Human-in-the-loop Dialogue Training

57

Page 58: Nabiha Asghar Deep Active Learning for ... - GitHub Pages · Deep Active Learning for Dialogue Generation Nabiha Asghar Joint work with: Pascal Poupart, Xin Jiang (Huawei) and Hang

Ongoing/Future Work

Based on confidence values, ask for feedback only when necessary

Devise new loss functions that measure and incorporate ‘human affect’ into the responses

58