Data Ninja Services - Data Science Summit talk 2016

26
DATA NINJA Services Pero Subasic Open Service Innovation Group DOCOMO Innovations, Inc. July 13, 2016 Copyright © DOCOMO Innovations, Inc. All Rights Reserved. @DataNinjaAPI dataninja.net

Transcript of Data Ninja Services - Data Science Summit talk 2016

Page 1: Data Ninja Services - Data Science Summit talk 2016

DATA NINJA ServicesPero SubasicOpen Service Innovation GroupDOCOMO Innovations, Inc.

July 13, 2016

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

@DataNinjaAPIdataninja.net

Page 2: Data Ninja Services - Data Science Summit talk 2016

NTT DOCOMO, Inc.

• Japan’s largest mobile phone operator • 67M subscribers in Japan• 46% are smart phones• DOCOMO Innovations is a subsidiary

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

4G subscribers 3G subscribers 2G subscribersNTT DOCOMO mobile market share in Japan

Page 3: Data Ninja Services - Data Science Summit talk 2016

Data Ninja Team

• Part of Open Service Innovation Group at DII• Formed in 2012 with 10+ researchers and engineers

– 7 members with Ph.D. degree – 80+ years of combined experience with more than 50 patents and 120

peer-reviewed papers– Diverse and extensive international large company and startup

experience – Experts in data science and text analysis

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 4: Data Ninja Services - Data Science Summit talk 2016

Data Ninja Technologies, Applications and Customers

• Technologies– Natural Language Processing (NLP) and Machine Learning– Large-scale Data Analytics and Cloud Computing

• Applications– Personal voice assistants, car navigation assistants– Personalization and recommendation systems– Data management platforms and online advertising– Automated text categorization system

• Customers– Large enterprises including NTT DOCOMO, Toyota, Pioneer, Nissan

– Companies utilizing analytics to enhance service offerings and effectively provide relevant, appropriate and targeted content

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 5: Data Ninja Services - Data Science Summit talk 2016

Our mission is to enable companies of all sizes to build smart services with content intelligence without having in-house advanced data science and machine learning teams.

Data Ninja Mission

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Smart Content Smart Sentiment

Smart Data Smart Learning

Page 6: Data Ninja Services - Data Science Summit talk 2016

Content Intelligence Platform

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 7: Data Ninja Services - Data Science Summit talk 2016

Data Ninja API

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Smart Content

Smart Data

Smart Learning

Smart Sentiment

REST APIpublic cloud or technology licensing API Endpoints

Smart Applications and Services

Unstructured Data (text)

Structured Data (Concepts, Categories and Entities with Sentiments)

http://dataninja.net

Page 8: Data Ninja Services - Data Science Summit talk 2016

Smart Data Service

Smart Data service provides access to our knowledge graphs to complement the Smart Content service allowing development of sophisticated data-science applications.

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Concept-category hierarchy Concept relationship graph

Page 9: Data Ninja Services - Data Science Summit talk 2016

Semantic Interpretation

DOCOMO Innovations 、 Inc. All Rights Reserved.

9

Pets

Animals with feathersFarm animals

Small domestic animalsFarm animals - Concept nodes act as network sensors

- Category inference- Concept inference- Learning

- Interaction with content- Communication

- Communication participants’ knowledge bases are different: there is no common grounding -> customization is necessary

- Knowledge bases are continually updated

Page 10: Data Ninja Services - Data Science Summit talk 2016

Knowledge Base Updates

DOCOMO Innovations 、 Inc. All Rights Reserved.

10

Real World

Timely Updates

Knowledge Base

Interpretation Engine

Environment and customization• General background knowledge• Vertical customization

Automated, timely KB updates• New concepts, categories and

relationships• Custom entities, concepts,

taxonomiesNew, improved interpretations• Finer resolution, increased

accuracy • Enriched interpretation

Page 11: Data Ninja Services - Data Science Summit talk 2016

Smart Data Demo

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 12: Data Ninja Services - Data Science Summit talk 2016

Smart Data Use Cases

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

• Concept & Category Contextual Search: Keyword suggestion & expansion

• Related Concept Contextual Search– Personalization by building user profiles based on usage history– Recommendation based on similar concepts (e.g. during cold

starts) • Concept Popularity Lookup

– Smart search using concepts in addition to keywords to increase accuracy

– Popularity-based disambiguation– Trending visualization by finding trends of concepts and categories

for better decision making• Smart Graph

– Generation of linguistics resources for domain-specific applications• Induction Engine

– Discovery of hidden relationships among concepts to better reason with text

Page 13: Data Ninja Services - Data Science Summit talk 2016

Smart Content Overview

Smart Content service extracts meaningful categories, concepts, entities and keywords from unstructured text for broad use in analytics and data science applications.

Smart Content collects relevant data continually to add to its knowledge-base.

Smart Content knowledge-base is extendable through its configurable resource repository with custom user-defined taxonomies and entity dictionaries.

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 14: Data Ninja Services - Data Science Summit talk 2016

Smart Content Use Cases

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Example Use Cases:• Topic Detection and Tracking • Concept-based Retrieval • Image Recommendation for Online Publishing• Contextual Music Recommendation• Semantic Analysis of News Articles - demo

Page 15: Data Ninja Services - Data Science Summit talk 2016

Smart Sentiment Service

Smart Sentiment assigns a positive, negative, neutral, or “none” sentiment value to the content of a natural language text document.

Pre-defined, custom trained models are available for three domains: product reviews, social networks and news articles.

Sentiments are assigned to each extracted entity and keyword.

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 16: Data Ninja Services - Data Science Summit talk 2016

Smart Sentiment Use Cases

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Example Use Cases:• Brand reputation analysis/monitoring

• Product sentiment around release date

• Product reviews

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4 Sentiment for “Volkswagen” in Sep. ‘15

Date

Nor

mal

ized

sen

timen

t sc

ore

Page 17: Data Ninja Services - Data Science Summit talk 2016

The U.S. Environmental Protection Agency said Friday that Volkswagen intentionally skirted clean air laws by using a piece of software that enabled about 500,000 of its diesel cars to emit fewer smog-causing pollutants during testing than in real-world driving conditions.The agency ordered VW to fix the cars at its own expense.

201509

01

201509

02

201509

03

201509

04

201509

05

201509

06

201509

07

201509

08

201509

09

201509

10

201509

11

201509

12

201509

13

201509

14

201509

15

201509

16

201509

17

201509

18

201509

19

201509

20

201509

21

201509

22

201509

23

201509

24

201509

25

201509

26

201509

27

201509

28

201509

29

201509

30

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

Sentiment for entity Volkswagen

Date

Sent

imen

t

Toyota sales fell 9 percent, Honda was down 7 percent and Volkswagen brand vehicles were down 8 percent.

German auto giant Volkswagen posted a 2.8 per cent decline …

Volkswagen’s finance chief Hans Dieter Poetsch is set to become its next chairman, putting Europe’s biggest car maker on course for calmer waters after rival factions including ousted patriarch Ferdinand Piech united to back him.

Page 18: Data Ninja Services - Data Science Summit talk 2016

Newsbot Ninja

18

https://newsbot.dataninja.net

Page 19: Data Ninja Services - Data Science Summit talk 2016

19

https://newsbot.dataninja.net

Newsbot Ninja

Page 20: Data Ninja Services - Data Science Summit talk 2016

Smart Learning Overview

Smart Learning service identifies the intent of a short piece of text, such as natural-language request to perform some action.

The service recognizes 30+ intent categories such as call request, email, news, information seeking, play music, transportation, schedule, shopping, take a photo, and similar.

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 21: Data Ninja Services - Data Science Summit talk 2016

Smart Learning Use Cases

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Use Cases:• Intelligent Personal Assistants such as Siri, Cortana,

and Google Now

• Car Navigation Assistants

• Query and sentence classification

• Intent identification and query extraction in mobile apps

Example: “Find me Italian restaurant in Palo Alto.”Task: Restaurant search Target: Italian restaurant Place: Palo Alto

Page 22: Data Ninja Services - Data Science Summit talk 2016

Thank You!

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

- Web sites: dataninja.net,demo.dataninja.net. newsbot.dataninja.net

- Visit us at our booth in the expo area

- Sign up for a demo or API

We are hiring!

Page 23: Data Ninja Services - Data Science Summit talk 2016

Backup

23Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 24: Data Ninja Services - Data Science Summit talk 2016

DOCOMO Innovations, Inc.

• DOCOMO Innovations is a subsidiary of NTT DOCOMO• We collaborate with business partners, research laboratories, start-

ups and engineers to develop innovative products and services

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.

Page 25: Data Ninja Services - Data Science Summit talk 2016

Smart Content Advantages

• More entities extracted than competitors• Broader concept tagging to provide higher recall• Exclusive access to knowledge-graph hierarchy and

induction engine to facilitate custom advanced development• More categories in taxonomy classification for broader

coverage• Other signals including concept similarity, ranking, and

popularity for further disambiguation• Outstanding price/performance

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.25

Page 26: Data Ninja Services - Data Science Summit talk 2016

Smart Content Pricing

Copyright © DOCOMO Innovations, Inc. All Rights Reserved.26