
Page 1: BUSINESS CONVERSATION ANALYSIS DEEP …on-demand.gputechconf.com/.../s7360...business-conversation-analy… · deep learning in business conversation analysis anthony scodary, gridspace

DEEP LEARNING IN BUSINESS CONVERSATION ANALYSIS

ANTHONY SCODARY, GRIDSPACE
WONKYUM LEE, GRIDSPACE

Page 2

INTRO

“Which translation speech recognition so and so forth I mean there's a whole bunch of amazing applications that are made possible by deep learning and so internet service providers are using it for internal application development.

And then lastly what you mentioned as cloud service providers and basically because of the adoption of gp use and because of the success of kuta and so many applications are now able to be accelerate on gp use so that we can extend the capabilities of moore's law so that we can continue.

You'd have the benefits of of computing acceleration, which which in the cloud means reducing cost.

And that's on the serve cloud service provider side of of the Internet company so that would be amazon web services as the Google compute cloud.”

Page 3

OVERVIEW

1. Business Conversations
2. Recognition
3. Analysis

Page 4

1. Business Conversations

DEEP LEARNING IN BUSINESS CONVERSATION ANALYSIS

Page 5

PROTOCOLS

SIGNAL PROCESSING

Page 6

PROTOCOLS

- Symbol Set (Lexicon)

- Rules (Syntax)

- Meaning (Semantics)

Page 7

TYPES OF PROTOCOLS

SOURCE, MEDIUM, SINK

Page 8

TYPES OF PROTOCOLS: ENDPOINTS

          NATURE           MACHINE       HUMAN
NATURE    BIRDCALL         SEISMOGRAPH   GROWLING
MACHINE   ELECTRIC FENCE   TCP           FIRE ALARM
HUMAN     “SIT”            SIRI          SPEECH

Page 9

TYPES OF PROTOCOLS: H2H MEDIA

Axes: BANDWIDTH, INFORMATION DENSITY

Media plotted: EMAIL, SMS, VOICEMAIL, CHAT, MISSED CALL, POSTCARD, WAVING, SPEECH

Page 10

WHY DO WE STILL TALK?

- Fast

- Innate

- Layered

- Synchronous

- Dense in meaning

Page 11

ORGANIZATIONS

INTERNAL COMMUNICATION

- Calls, Meetings, Hallway Chats
- Documents, Email, Chat, SMS

EXTERNAL COMMUNICATION

- Support Calls, In-Person Sales
- Chat Support, Social Media, Email

Page 12

ORGANIZATIONS

INTERNAL COMMUNICATION

- Calls, Meetings, Hallway Chats
- Documents, Email, Chat, SMS

EXTERNAL COMMUNICATION

- Support Calls, In-Person Sales
- Chat Support, Social Media, Email

Mostly lost today

Page 13

THIS DATA MATTERS

Page 14

THIS DATA MATTERS

Page 15

2. Recognition

DEEP LEARNING IN BUSINESS CONVERSATION ANALYSIS

Page 16

REAL-TIME CALL ANALYSIS

ASR, DSP, SCANNER, CLASSIFIER

Page 17

Feature Extraction (MFCC)

Acoustic Model (GMM)

Lexicon

Language Model

“hello”

Conventional ASR: a combination of blocks, each designed by experts in a separate field

GMM-HMM: 1980-2010

ASR

Page 18

Feature Extraction (MFCC)

Acoustic Model (GMM)

Lexicon

Language Model

“hello”

Lots of tuning to improve accuracy

Robust features, speaker adaptation, application-specific LM

ASR

Page 19

Feature Extraction (MFCC)

Acoustic Model

Lexicon

Language Model

“hello”

Replacing acoustic model with deep neural net

DNN-HMM: 30%-40% improvement (2011-2017)

ASR

Page 20

All-in-one Deep Learning Model

“hello”

Someday in the near future: replacing all of the models with one neural net

End-to-End ASR: active research in progress

ASR

Page 21

ASR error rate over the decades (in academia), WER on a log scale

Chart labels:

- Simple Linear model (GMM)
- Advanced Linear model (GMM-SAT-DT)
- Deep Learning Model
- End-to-End Deep Learning (under development)
- “Human parity”

ASR HISTORY
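
The chart on this slide tracks word error rate (WER). As a minimal sketch of how that metric is usually computed, the function below takes the word-level edit distance (substitutions, insertions, deletions) between a reference transcript and a recognizer hypothesis and divides it by the number of reference words. The function name and the sample sentences are illustrative and not taken from the talk.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution ("your" -> "you") in seven words.
wer = word_error_rate(
    "please hold while i transfer your call",
    "please hold while i transfer you call",
)
print(f"WER = {wer:.3f}")
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.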

Page 22

“However, it’s still NOT easy for real-world business conversational speech”

Language Challenge

- Domain-specific terminology (company name, product name, …)
- Spontaneous speech (natural conversation)
- Accent, dialect, mispronunciation

Acoustic Challenge

- Noise (background, channel)
- Acoustic effects (reverberation, Lombard effect)
- Variability from speakers
- Microphone displacement (near/far field)

ASR CHALLENGES

Page 23

Data is King!

- General conversational data + in-domain data (training with in-domain data improves accuracy by 15-30%)

- Simulated data with a variety of noise helps! (improves accuracy by 10-15%)

- Data collection with semi-supervised training helps

LARGE-SCALE DATA PROCESSING
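
One common way to produce simulated noisy data like the kind mentioned on this slide is to mix a noise recording into clean speech at a chosen signal-to-noise ratio. The sketch below assumes NumPy and mono signals at the same sample rate; `mix_at_snr` and the synthetic signals are illustrative names and data, not the pipeline used in the talk.

```python
import numpy as np


def mix_at_snr(speech: np.ndarray, noise: np.ndarray, snr_db: float) -> np.ndarray:
    """Add noise to speech so the mixture has the requested SNR in dB."""
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    if len(speech) == 0 or len(noise) == 0:
        raise ValueError("speech and noise must be non-empty")

    # Loop the noise if it is shorter than the speech, then trim to length.
    repeats = int(np.ceil(len(speech) / len(noise)))
    noise = np.tile(noise, repeats)[: len(speech)]

    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    if speech_power == 0 or noise_power == 0:
        raise ValueError("speech and noise must not be silent")

    # SNR(dB) = 10 * log10(speech_power / scaled_noise_power)
    target_noise_power = speech_power / (10 ** (snr_db / 10))
    scale = np.sqrt(target_noise_power / noise_power)
    return speech + scale * noise


# Synthetic stand-ins for real recordings (assumption: 8 kHz telephone audio).
rng = np.random.default_rng(0)
t = np.arange(8000) / 8000.0
clean = np.sin(2 * np.pi * 220 * t)
babble = rng.standard_normal(3000)

# One clean utterance becomes several training examples at different SNRs.
augmented = {snr: mix_at_snr(clean, babble, snr) for snr in (20, 10, 5, 0)}
```

Sampling the SNR and the noise type per utterance is a typical way to get the variety the slide refers to.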

Page 24

Multi-GPU Training

- 4x Titan X with parallel training
- One week for full training on 25k hours of audio
- 80x faster than a 32-core CPU machine

LARGE-SCALE DATA PROCESSING
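
A quick back-of-the-envelope check of what these figures imply, assuming "one week" means wall-clock time for the complete training run and that the 80x speedup applies to the same job on the CPU machine. The variable names are illustrative.

```python
HOURS_PER_WEEK = 7 * 24  # 168 wall-clock hours

audio_hours = 25_000       # training audio, from the slide
gpu_training_weeks = 1     # full training on 4x Titan X, from the slide
speedup_vs_cpu = 80        # versus a 32-core CPU machine, from the slide

# Hours of audio consumed per wall-clock hour of training,
# averaged over the whole run.
audio_hours_per_wall_hour = audio_hours / (gpu_training_weeks * HOURS_PER_WEEK)

# Time the same run would take on the CPU machine.
cpu_training_weeks = gpu_training_weeks * speedup_vs_cpu
cpu_training_years = cpu_training_weeks * 7 / 365.25

print(f"{audio_hours_per_wall_hour:.0f} audio hours per wall-clock hour")  # 149
print(f"{cpu_training_weeks} weeks on CPU, about {cpu_training_years:.1f} years")  # 80 weeks, about 1.5 years
```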

Page 25

Real-time adaptive processing

- Online i-vector adaptation (5-10% improvement)
  - Speaker characteristics
  - Environmental noise
  - Accent & dialect

- Context-based grammar adaptation (recognizes domain-specific terms)

REAL-TIME ADAPTIVE PROCESSING

Page 26

State-of-the-art deep learning model

- Time-delay neural network
- Computation optimization (subsampling, bi-phone, etc.)
- WFST framework for search

“Purely sequence-trained neural networks for ASR based on lattice-free MMI”, Interspeech 2016

WER:

- 5~6% Capital Market Model
- 12~15% Customer Intelligence Model

Real-Time Factor: 0.3-0.35

STATE OF THE ART DEEP LEARNING MODEL
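
The real-time factor (RTF) quoted on this slide is conventionally the processing time divided by the audio duration, so values below 1 mean the recognizer runs faster than real time. The sketch below shows that ratio and a rough capacity estimate derived from it; the helper names and the 5-minute call are illustrative assumptions.

```python
import math


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent decoding / duration of the audio decoded."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def max_realtime_streams(rtf: float) -> int:
    """Rough upper bound on live streams one decoder can keep up with.

    Ignores scheduling overhead and latency requirements.
    """
    if rtf <= 0:
        raise ValueError("rtf must be positive")
    return math.floor(1 / rtf)


call_seconds = 5 * 60  # hypothetical 5-minute call
for rtf in (0.30, 0.35):
    print(
        f"RTF {rtf:.2f}: {rtf * call_seconds:.0f} s to decode a 5-minute call, "
        f"up to {max_realtime_streams(rtf)} concurrent streams"
    )
```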

Page 27

DEEP LEARNING IN BUSINESS CONVERSATION ANALYSIS

3. Analysis

Page 28

IS TRANSCRIPTION REALLY WHAT YOU WANT ANYWAY?

Page 29

STUFF WITH ACTUAL USE TO COMPANIES

- Prediction

- Classification

- Summarization

- Entity Extraction

- Anomaly Detection

Page 30

“ARTIFICIAL INTELLIGENCE”

Page 31

“ARTIFICIAL INTELLIGENCE”

ARITHMETIC

GRAPH SEARCH

CHESS

IMAGE RECOGNITION

CONVERSATION

EMOTION

CONSCIOUSNESS

ABOVE THIS LINE, THIS SURELY IS “REAL” INTELLIGENCE

Page 32

“ARTIFICIAL INTELLIGENCE”

TECHNOLOGY REVOLUTION

WASTE OF MONEY AND TIME

Page 33

“ARTIFICIAL INTELLIGENCE”

We focus on the industry needs as an engineering task.

Page 34

ANALYSIS

1. Speech is complex.

Let models decide what features matter for a task or application.

Page 35

ANALYSIS

2. Speech is high dimensional.

Datasets must be large enough to train large models to match.

Page 36

ANALYSIS

3. Conversational speech is noisy.

Large, well-augmented datasets are necessary to be robust.

Page 37

ANALYSIS

Page 38

ANALYSIS

Page 39

ANALYSIS

Page 40

ANALYSIS

...

Page 41

ANALYSIS

Page 42

ANALYSIS

aardvark

zebra

One-hot (D dimensions), ℝ^300, ℝ^40
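
The slide contrasts a D-dimensional one-hot word vector with much smaller dense vectors. As a minimal sketch, assuming NumPy and randomly initialized (untrained) matrices, the code below shows that multiplying a one-hot vector by an embedding matrix is the same as looking up one row of that matrix. The three-word vocabulary is a toy, and reading ℝ^40 as a second linear projection is an assumption; the slide does not say how the 40-dimensional vector is produced.

```python
import numpy as np

# Toy vocabulary; a real D would be tens or hundreds of thousands of words.
vocab = ["aardvark", "apple", "zebra"]
word_to_index = {word: i for i, word in enumerate(vocab)}
D = len(vocab)

rng = np.random.default_rng(0)
embedding = rng.standard_normal((D, 300))    # one 300-d row per word
projection = rng.standard_normal((300, 40))  # assumed 300-d to 40-d mapping


def one_hot(word: str) -> np.ndarray:
    """D-dimensional vector with a single 1 at the word's index."""
    vec = np.zeros(D)
    vec[word_to_index[word]] = 1.0
    return vec


def embed(word: str) -> np.ndarray:
    """Row lookup, equivalent to one_hot(word) @ embedding but much cheaper."""
    return embedding[word_to_index[word]]


dense_300 = one_hot("zebra") @ embedding
assert np.allclose(dense_300, embed("zebra"))

dense_40 = dense_300 @ projection
print(one_hot("zebra").shape, dense_300.shape, dense_40.shape)
```

In a trained model the rows of `embedding` are learned, so that words used in similar contexts end up with similar vectors.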

Page 43

ANALYSIS

KING

QUEEN

BROTHER

SISTER

MAN

WOMAN
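
The word pairs on this slide are the classic illustration of analogy arithmetic in an embedding space: the offset from MAN to WOMAN is roughly the same as the offset from KING to QUEEN. The sketch below uses hand-built three-dimensional vectors (royalty, sibling, gender) that are purely hypothetical; learned embeddings only satisfy these relations approximately.

```python
import numpy as np

# Hand-made toy vectors: (royalty, sibling, gender). Not learned from data.
vectors = {
    "king":    np.array([1.0, 0.0,  1.0]),
    "queen":   np.array([1.0, 0.0, -1.0]),
    "brother": np.array([0.0, 1.0,  1.0]),
    "sister":  np.array([0.0, 1.0, -1.0]),
    "man":     np.array([0.0, 0.0,  1.0]),
    "woman":   np.array([0.0, 0.0, -1.0]),
}


def cosine(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two non-zero vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def analogy(a: str, b: str, c: str) -> str:
    """Word closest to vec(a) - vec(b) + vec(c), excluding the three inputs."""
    target = vectors[a] - vectors[b] + vectors[c]
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(vectors[w], target))


print(analogy("king", "man", "woman"))     # queen
print(analogy("brother", "man", "woman"))  # sister
```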

Page 44

ANALYSIS

i have no political party actually

~~~‘democrat’


Page 45

ANALYSIS

Page 46

API

gridspace.com

Page 47

QUESTIONS?