Humix community #2 - TTS, Speech Recognition and Natural Language Processing

Humix COMMUNITY MEETUP #2 2015/11/26

Transcript of Humix community #2 - TTS, Speech Recognition and Natural Language Processing

Page 1: Humix community #2 -  TTS, Speech Recognition and Natural Language Processing

Humix COMMUNITY MEETUP #2 2015/11/26

Page 2

Part 1

Overview

Page 3

TTS and Speech Recognition: what is it?

Page 4

Google Speech Recognition and TTS

REQUEST

curl -X POST \
  --data-binary @'audio/hello (16bit PCM).wav' \
  --header 'Content-Type: audio/l16; rate=16000;' \
  'https://www.google.com/speech-api/v2/recognize?output=json&lang=en-us&key=yourkey'

RESPONSE

Try TTS

http://translate.google.com/translate_tts?ie=utf-8&tl=zh&q=Hello
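The curl request and the TTS URL on this slide can be sketched in Python. This is a minimal illustration that only builds the request objects and does not send anything; the function names and default values are assumptions for the example, the API key is the slide's placeholder, and the endpoints are taken from the slide as-is and may no longer be served.

```python
import urllib.parse
import urllib.request

# Endpoints copied from the slide; availability today is not guaranteed.
SPEECH_URL = "https://www.google.com/speech-api/v2/recognize"
TTS_URL = "http://translate.google.com/translate_tts"


def build_recognize_request(audio: bytes, key: str, lang: str = "en-us",
                            rate: int = 16000) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl command."""
    query = urllib.parse.urlencode({"output": "json", "lang": lang, "key": key})
    return urllib.request.Request(
        f"{SPEECH_URL}?{query}",
        data=audio,  # raw 16-bit PCM samples, as in the slide's WAV example
        headers={"Content-Type": f"audio/l16; rate={rate};"},
        method="POST",
    )


def build_tts_url(text: str, lang: str = "zh") -> str:
    """Build the text-to-speech URL shown on the slide."""
    query = urllib.parse.urlencode({"ie": "utf-8", "tl": lang, "q": text})
    return f"{TTS_URL}?{query}"


if __name__ == "__main__":
    request = build_recognize_request(b"\x00\x00" * 16000, key="yourkey")
    print(request.get_method(), request.full_url)
    print(build_tts_url("Hello"))
```

Passing the returned request to `urllib.request.urlopen` would perform the actual call, given a valid key and a reachable service.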

Page 5

MS Speech Recognition Service

https://www.projectoxford.ai/doc/speech/overview

Page 6

MS TTS Service

https://speech.platform.bing.com/synthesize

Page 7

Speech Services available today

http://www.nuance.com/for-developers/dragon/index.htm

http://www.ispeech.org/#/home

http://tts.itri.org.tw

Page 8

Part 2

Watson TTS & Speech

Page 9

https://youtu.be/JWnLgZ58zsw

1.3 min / hour

Page 10

http://www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/speech-to-text/api/v1/

http://www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/speech-to-text/api/v1/#introduction

Documentation

Access Endpoint

Page 11

23 char / min

Page 12

Documentation

http://www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/doc/text-to-speech/

Access Endpoint

https://stream.watsonplatform.net/text-to-speech/api

Page 13

Part 3 Wit.ai for Natural Language Processing

Page 14

https://wit.ai

Page 15

WIT.AI DEMO

Page 16

Part 4 Humix TTS Module

Page 17

Humix In Action

Page 18

Join Us

https://www.facebook.com/groups/1513552855636723/

Page 19

Humix Architecture Reference

Page 20

MicroService Architecture

“An approach to developing a single application as a suite of small, independently deployable services”

[Diagram: an Application made up of four Services; Messaging Infrastructure; pub / sub]
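The pub/sub pattern named on this slide can be sketched with a small in-memory message bus. This is only an illustration of the idea, not the Humix messaging layer: the `MessageBus` class and the topic names are hypothetical, and a real deployment would use a message broker rather than a Python dictionary.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, List

Handler = Callable[[str], None]


class MessageBus:
    """Hypothetical in-memory stand-in for a messaging infrastructure."""

    def __init__(self) -> None:
        self._subscribers: DefaultDict[str, List[Handler]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Handler) -> None:
        """Register a handler that is called for every message on the topic."""
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, message: str) -> int:
        """Deliver the message to each subscriber; return how many received it."""
        handlers = list(self._subscribers.get(topic, []))
        for handler in handlers:
            handler(message)
        return len(handlers)


if __name__ == "__main__":
    bus = MessageBus()
    spoken: List[str] = []

    # A "TTS" service subscribes; a "speech" service publishes.
    # Neither service knows about the other, only about the topic.
    bus.subscribe("speech.text", spoken.append)
    bus.publish("speech.text", "hello")
    print(spoken)
```

Because services share only topic names, each one can be deployed or replaced independently, which is the property the quoted definition emphasizes.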

Page 21

Humix Architecture

[Diagram: HUMIX SENSE (ModuleA, ModuleB, ModuleC, NATS) on RPI2; WebSocket; HUMIX THINK (Dashboard, Flow Engine, APIs) in the Cloud]

Page 22

Module Registration

[Diagram: HUMIX SENSE (ModuleA, ModuleB, ModuleC, NATS); WebSocket; HUMIX THINK (Dashboard, Flow Engine)]

{ moduleName: 'Temperature', commands: ['cmd1', 'cmd2'], events: ['event1', 'event2'] }

humix.sense.mgmt.cmd.register
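The registration message and topic on this slide can be sketched as follows. The field names and the topic come from the slide; encoding the message as JSON and the `publish` callable are assumptions made for illustration, standing in for whatever messaging client the module actually uses.

```python
import json
from typing import Callable, List, Sequence, Tuple

# Topic name taken from the slide.
REGISTER_TOPIC = "humix.sense.mgmt.cmd.register"


def build_registration(module_name: str, commands: Sequence[str],
                       events: Sequence[str]) -> str:
    """Serialize the module description; JSON on the wire is an assumption."""
    return json.dumps({
        "moduleName": module_name,
        "commands": list(commands),
        "events": list(events),
    })


def register(publish: Callable[[str, str], None], module_name: str,
             commands: Sequence[str], events: Sequence[str]) -> str:
    """Publish the registration message through a caller-supplied function."""
    payload = build_registration(module_name, commands, events)
    publish(REGISTER_TOPIC, payload)
    return payload


if __name__ == "__main__":
    sent: List[Tuple[str, str]] = []
    # Hypothetical publisher that only records what would be sent.
    register(lambda topic, body: sent.append((topic, body)),
             "Temperature", ["cmd1", "cmd2"], ["event1", "event2"])
    print(sent)
```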

Page 23

Module Status Check

[Diagram: HUMIX SENSE (ModuleA, ModuleB, ModuleC); HUMIX THINK (Dashboard, Flow Engine); PING / PONG; Device Module Status]
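The PING/PONG exchange on this slide can be sketched as a simple health check. This is a hedged illustration only: the `ping` callable, the status strings, and the treatment of a timeout as "no PONG" are assumptions, not the documented Humix behaviour.

```python
from typing import Callable, Dict, Iterable


def check_modules(modules: Iterable[str],
                  ping: Callable[[str], bool]) -> Dict[str, str]:
    """Ping each module and record whether it answered with a PONG.

    `ping` is a hypothetical callable that returns True when the module
    replies and may raise TimeoutError when it does not.
    """
    status: Dict[str, str] = {}
    for name in modules:
        try:
            alive = ping(name)
        except TimeoutError:
            alive = False  # no PONG within the allowed time
        status[name] = "connected" if alive else "disconnected"
    return status


if __name__ == "__main__":
    def fake_ping(name: str) -> bool:
        # Simulated transport: ModuleC never answers.
        if name == "ModuleC":
            raise TimeoutError(name)
        return True

    print(check_modules(["ModuleA", "ModuleB", "ModuleC"], fake_ping))
```

A dashboard could render the returned mapping directly as the per-module status view.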

Page 24

Humix Architecture

[Diagram: HUMIX SENSE (ModuleA, ModuleB, ModuleC); HUMIX THINK (Dashboard, Flow Engine, APIs); Community; Foundation; Community]

Page 25

Programming Model

1. Require Humix-Sense and provide module config

Page 26

Programming Model

2. Connect to “Think”

Page 27

Programming Model

3. Process “commands”, send “events” on device
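The three programming-model steps can be sketched end to end. The slides do not show the actual code, so the `SenseModule` class below is a hypothetical stand-in written for this example and is not the real Humix-Sense API; only the config fields and the three-step shape come from the slides.

```python
from typing import Callable, Dict, List, Tuple


class SenseModule:
    """Hypothetical stand-in for a module handle; not the real Humix-Sense API."""

    def __init__(self, config: Dict[str, object]) -> None:
        self.config = config
        self.connected = False
        self.sent_events: List[Tuple[str, str]] = []
        self._handlers: Dict[str, Callable[[str], None]] = {}

    def connect(self) -> None:
        """Step 2: pretend to connect to "Think"."""
        self.connected = True

    def on_command(self, name: str, handler: Callable[[str], None]) -> None:
        """Step 3a: register a handler for a command declared in the config."""
        if name not in self.config["commands"]:
            raise ValueError(f"undeclared command: {name}")
        self._handlers[name] = handler

    def send_event(self, name: str, data: str) -> None:
        """Step 3b: record an event declared in the config as sent."""
        if name not in self.config["events"]:
            raise ValueError(f"undeclared event: {name}")
        self.sent_events.append((name, data))

    def dispatch(self, name: str, data: str) -> None:
        """Simulate "Think" delivering a command to this module."""
        self._handlers[name](data)


def run_demo() -> List[Tuple[str, str]]:
    # Step 1: provide the module config (same fields as the registration slide).
    config = {"moduleName": "Temperature",
              "commands": ["cmd1", "cmd2"],
              "events": ["event1", "event2"]}
    module = SenseModule(config)
    # Step 2: connect to "Think".
    module.connect()
    # Step 3: process a command and answer with an event.
    module.on_command("cmd1", lambda data: module.send_event("event1", f"handled {data}"))
    module.dispatch("cmd1", "read")
    return module.sent_events


if __name__ == "__main__":
    print(run_demo())
```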