BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness...

75
BIG DATA ANALYTICS By Assoc. Prof. Dr. Tiranee Achalakul Ministry of Digital Economy and Society

Transcript of BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness...

Page 1: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BIG DATA ANALYTICS

By Assoc. Prof. Dr. Tiranee Achalakul Ministry of Digital Economy and Society

Page 2: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

PAGE 2

Gartner’s hype cycle for Emerging Technologies 2017

Artificial General Intelligence

IOT Platform Machine learning

5G

Deep learning

Page 3: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Pre-Crime Intelligence System

Beware by

DAS by

internet crawling, searching, data aggregation, data analysis, data visualization, data extraction and image and VDO analysis.

Page 4: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

What has changed to make AI so

useful today?

NEW

ALGORITHMS

COMPUTING

Page 5: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

1969

Page 6: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

The rapid development of technology led to

the explosive growth of data in almost every

industry and business area.

DATA IS THE NEW OIL

Data of great

Volume, Variety, and Velocity

Page 7: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BIG DATA An umbrella term for all sorts of data

Criminal records, Citizen data, Court cases

Digital data to the government archived paperwork

Social media (needs, trends, and opinions)

Sensor data (heath, weather, geo-location, etc.)

Structured Unstructured

Page 8: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

DATA ON RECIDIVISM

• Race ad Ethnicity

• Personality

• Victims of violent crimes previously

• Major depression

• Personality disorder

• Brain function

• Psychological disorder

• Parental relations

• Education

• Peer influence

• Drugs and alcohol

• Easy access to weapon

Page 9: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

WHERE TO LOOK FOR DATA

Page 10: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Data

Warehouse

Extract, Transform & Load (ETL)

Acct Sales Supply CRM HR etc.

Business Intelligence Tools (BI)

Descriptive Analytics Predictive Analytics

Data Lake

File Copy

Developer Environment

What happened; How many; how often; where

Project what will happen; possible outcome indication

Page 11: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous
Page 12: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

DATA STUDIO

Page 13: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BIG DATA ANALYTICS

Page 14: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

MACHINE LEARNING and AI Learn from data and make predictions about data by

using statistics to develop self learning algorithm

Page 15: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous
Page 16: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

PUTTING AIs TO WORK

Credit: nicolamattina.it

Page 17: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

HOW IS MACHINE PERCEPTION DONE?

Image Vision features Detection

Images/video

Audio Audio features Speaker ID

Audio

Text

Text Text features

Text classification,

Machine translation,

Information retrieval, ....

Slide courtesy of Andrew Ng, Stanford University

Page 18: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Real-time Face Recognition

Page 19: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

City Surveillance

Page 20: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Chinese Surveillance System

Page 21: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Credit: vaaas.kaisquare.com

Video Analytics for Profiling

Page 22: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Three Levels of Intelligence

Artificial

Narrow

Intelligence

Artificial

General

Intelligence

Artificial

Super

Intelligence

Specialized in

one specific

area.

Specialized in

all areas.

Smarter than

human in every

way.

Page 23: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Three Levels of Intelligence

Artificial

Narrow

Intelligence

Artificial

General

Intelligence

Artificial

Super

Intelligence

Specialized in

one specific

area.

Specialized in

all areas.

Smarter than

human in

every way.

Page 24: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

ALPHA GO

Page 25: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

ALPHA GO

Page 26: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

ALPHA GO ZERO

Page 27: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

DEEP LEARNING

Deep Learning is the Most Exciting Breakthrough in

Modern Machine Learning

• Learn complex features from big images

• Learn complex action plans for reinforcement

learning

• Learn to create and generate

• Learn complex unstructured data such as

language, videos, etc.

Page 28: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

State of the Art Deep Learning Generate text from images

Page 29: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Generate high-resolution images based on text

State of the Art Deep Learning

Page 30: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Deep Neural Networks

Page 31: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

HOW DOES AI LEARN

Page 32: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

TEXT MINING AND NLP Deriving high-quality information from text by devising of patterns

and trends through means such as statistical pattern learning.

Page 33: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

SENTIMENT ANALYSIS

• Use text analytics to key citizen concerns and

sentiments

• Assess current situations to create a set of keywords

to gather social media post related to crimes.

• Determine the sentiment with respect to topics or the

overall contextual polarity of a post/comment related

to crimes.

Page 34: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Happiness Index

• Use Social media to create a happiness index

• Pilot project: Collected 15,000 text records for the thirty most populous cities in the US

• Parsed text was utilized to calculate happiness scores with happiness index dictionary

• Examine the relationships between the index and real world phenomena including population, crime rate, and climate

http://onlinelibrary.wiley.com/doi/10.1002/meet.14505001167/pdf

Page 35: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

INFLUENCER ANALYSIS

Social Network Analysis (SNA)

• SNA investigates social structures through the use of network and graph theory.

• Friends network can be analyzed

• An influencer is an individual who has above-average impact on a specific niche process.

• On the social network, an influencer can referred to the most shaping a discussion about a topic.

Page 36: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

TOPIC DISCOVERY

• Characterizes document according to topics Discover topics mentioned about “crime” on the social network

Page 37: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

“Brexit Impact”

https://quid.com/feed/brexit-immediate-impacts

Page 38: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

VOICE OF CUSTOMERS

Knows feedback sentiment, Keep track of behavioral trend, Real-time

anomaly detection, Knows where your target audiences are, Find out who are the influencers,

Call Center, Social Network, Front Offices

Page 39: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

CHATBOT Customer Services

Page 40: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Chatbots have revolutionized the customer service space

Chatbot = Conversational interface powered by AI

ML NLP Chat

Platform + + =

Page 41: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

HUMAN HANDOVER

https://disruptive.asia/smbc-ntt-com-ai-chatbot/

Page 42: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BIG DATA PROCESS

Page 43: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Problem Analysis

Exploratory Data

Analysis

Predictive Analytic

Implementation &

Deployment

Project Inception

Visualization

Dashboard

Data Scientists และคณะท างาน Big Data ของแต่ละกลุ่มงาน ร่วมก าหนดโจทย์ท่ีเหมาะสมและตั้งโครงการพัฒนาและทดสอบโมเดลคณิตศาสตร์น าร่องท่ีเหมาะสม

ทีม Data Scientists ส ารวจข้อมูลท่ีมีอยู่ในปัจจุบันตามโจทย์น าร่องท่ีก าหนดเพื่อประเมินความพร้อมและแปลงจากความต้องการในเชิงปัญหาให้เป็นข้อก าหนดในเชิงข้อมูลและระบบ

ส ารวจการกระจายตัวของข้อมูลเพื่อท าความเข้าใจข้อมูล และหาความสัมพันธ์ระหว่างตัวแปรในข้อมูล ในขั้นตอนนี้ทีมงานจะต้องการตัวอย่างข้อมูลจริง ระบุข้อมูลที่ต้องการเพิ่มเติมและเร่ิมเตรียมข้อมูลจริง

ทีมงานน าข้อมลูท่ีจดัเตรียมในเบือ้งต้นมาใช้ในการสร้างแบบจ าลองหรือโมเดลทางคณิตศาสตร์เพ่ือการท านาย โดยใช้เทคนิคและอลักอริธ่ึมต่างๆ และทดสอบความแม่นย าของโมเดลคณิตศาสตร์

ออกแบบวิธีการแสดงผลโดยเลือกมิติของข้อมลูท่ีเหมาะสมบน Interactive Dashboard เพื้อให้คณะท างานทดลองใช้และสื่อสารกับทีมผู้บริหาร และ ผู้ปฏิบัติ ให้สามารถน าเอาความเข้าใจดังกล่าวไปแปลงเป็นแผนการพัฒนาต่อยอด

หลงัจากผลลพัธ์เป็นท่ีพอใจแล้ว นกัพฒันาระบบเร่ิมพฒันาโปรแกรมตามรูปแบบของโมเดลคณิตศาสตร์ท่ีวางไว้ และตัง้ค่าให้โปรแกรมให้ประมวลผลโมเดลแบบอตัโนมตัิตามความถ่ีท่ีวางแผนไว้ จากนัน้ติดตัง้ระบบซอฟท์แวร์เพ่ือการใช้งานจริง

Page 44: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

สร้างความเข้าใจพืน้ฐานดา้นData

gathering, interpreting, and

visualize อธบิายกระบวนการในการพฒันาโครงการ Big

Data ยกตวัอยา่ง

กรณศึีกษาของการใช้บิก๊ดาต้าเพือ่พฒันางานเฉพาะ

ด้าน

Educate

Brain Storm คุยถงึปญัหาและความต้องการ

ยกตวัอยา่งข้อมูล งานเฉพาะด้าน

(ทอ่งเทีย่ว สาธารณสชุ, ศึกษา)

ก าหนดโจทย์ปญัหาทีช่ดัเจนและเป็นไปได้ในกรอบเวลา/งบประมาณ

Data Landscape & Workflow

ส ารวจข้อมูล เครือ่งมอืทีใ่ช้และกระบวนการ

ท างานในปจัจุบนั (การบ้าน)

Discuss and Finalize project

Present and Verify Collected

Information

น าเสนอข้อมูลความพร้อม ซกัถาม และตรวจทาน

workflow เดมิ

น าเสนอ Workflow และเครือ่งมอืด้านบิก๊ดาต้าใหมท่ีน่่าจะน ามา

ประยุกตใ์ช้เพือ่ช่วยในการท างานให้ที่ประชุมพจิรณาและปรบัแก้

PROJECT INCEPTION: PROBLEM SELECTION

WORKSHOP & BASIC BI

DAY 1

DAY 2

Page 45: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Exploratory Data Analysis

Evaluate Data

Quality & Quantity

Select, Clean, and

Filter Data

Acquire Sample

Data

Excel, csv, images, etc.

Prepare Tools and

Platform for

Prototyping

yes no

Explore Data

Distribution

Derive Insight

Check Model

Feasibility

Good

Page 46: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Data Mining The Computational process of discovering patterns in large data sets involving

methods at the intersection of statistics, machine learning, and database systems.

Text Analytics The process of deriving high-quality information from text. High-quality information is

typically derived through the devising of patterns and trends through means such as

statistical pattern learning.

Machine Learning / Deep Learning The science of getting computers to learn from data without having to be explicitly

programmed by humans. Machine model can teach themselves to grow and change

when exposed to new data.

Big Data Technology Technology designed to manage and process extremely large data sets that may be

analyzed computationally to reveal patterns, trends, and associations, especially

relating to human behavior and interactions.

PREDICTIVE ANALYTICS

Page 47: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

• Regression produce a model that, given an individual, estimates the value of the particular

variable specific to that individual.

How much will a jail facility needs for operation next year ?

• Clustering group individuals in a population by their similarity (not driven by any specific

purpose).

Do offenders form natural groups or segments?

• Co-occurrence grouping find associations between entities based on transactions involving them.

What crimes are commonly committed by the same offenders?

• Profiling characterize the typical behavior of an individual, group, or population. Information can be used to establish behavior norms for anomaly detection

What is the typical behavior of serial killers ?

EXAMPLE TASKS

Page 48: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

• First Union Bank deployed a value predicting system that assigns

green / yellow / red flag to each customer, based on their predicted

lifetime value.

• Service representatives were instructed to waive fee for green

customers, and not waive for red customers. For yellow customers,

they can make their own judgement.

• This strategy generated over $100 million in incremental revenue.

CREDIT SCORING

Page 49: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

ANSWERING

QUESTIONS

• What are the number of repeated offenders in 2018?

– A straightforward database query, if records are kept properly.

• Is there really a profile difference between the repeated offenders and one time offender?

– Statistical Hypothesis testing

• But who really are these repeated offenders? Can I characterize them?

– Automated pattern finding

• Will some new convicted felons become repeated offenders ? How many can we expect?

– Predictive model of profitability

Page 50: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Problem Analysis

Exploratory Data

Analysis

Predictive Analytic

Implementation &

Deployment

Project Inception

Visualization

Dashboard

Data Scientists และคณะท างาน Big Data ของแต่ละกลุ่มงาน ร่วมก าหนดโจทย์ท่ีเหมาะสมและตั้งโครงการพัฒนาและทดสอบโมเดลคณิตศาสตร์น าร่องท่ีเหมาะสม

ทีม Data Scientists ส ารวจข้อมูลท่ีมีอยู่ในปัจจุบันตามโจทย์น าร่องท่ีก าหนดเพื่อประเมินความพร้อมและแปลงจากความต้องการในเชิงปัญหาให้เป็นข้อก าหนดในเชิงข้อมูลและระบบ

ส ารวจการกระจายตัวของข้อมูลเพื่อท าความเข้าใจข้อมูล และหาความสัมพันธ์ระหว่างตัวแปรในข้อมูล ในขั้นตอนนี้ทีมงานจะต้องการตัวอย่างข้อมูลจริง ระบุข้อมูลที่ต้องการเพิ่มเติมและเร่ิมเตรียมข้อมูลจริง

ทีมงานน าข้อมลูท่ีจดัเตรียมในเบือ้งต้นมาใช้ในการสร้างแบบจ าลองหรือโมเดลทางคณิตศาสตร์เพ่ือการท านาย โดยใช้เทคนิคและอลักอริธ่ึมต่างๆ และทดสอบความแม่นย าของโมเดลคณิตศาสตร์

ออกแบบวิธีการแสดงผลโดยเลือกมิติของข้อมลูท่ีเหมาะสมบน Interactive Dashboard เพื้อให้คณะท างานทดลองใช้และสื่อสารกับทีมผู้บริหาร และ ผู้ปฏิบัติ ให้สามารถน าเอาความเข้าใจดังกล่าวไปแปลงเป็นแผนการพัฒนาต่อยอด

หลงัจากผลลพัธ์เป็นท่ีพอใจแล้ว นกัพฒันาระบบเร่ิมพฒันาโปรแกรมตามรูปแบบของโมเดลคณิตศาสตร์ท่ีวางไว้ และตัง้ค่าให้โปรแกรมให้ประมวลผลโมเดลแบบอตัโนมตัิตามความถ่ีท่ีวางแผนไว้ จากนัน้ติดตัง้ระบบซอฟท์แวร์เพ่ือการใช้งานจริง

Page 51: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Visualization Dashboard

Page 52: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

PAGE 52

Messaging, and Web Services

EDW, OLAP

Social Media, Weblogs

Machine Devices, Sensors

Visualization

Predictor Software

IT Infrastructure

Data Input

Implementation and Deployment

Page 53: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

THE ANALYTIC CAPABILITY

Data science provides fact-based, math-based decision support (Data-Driven Decision)

Descriptive Analytics

tell you what happened;

how many; how often;

where critical events occur.

Predictive Analytics

project what will happen

next; provide indications

of possible outcomes if

trends continue.

Prescriptive Analytics

Synthesizes big data to

make predictions and then

suggests decision options to

take advantage of the

predictions

1 2 3

Big decisions need better analytics

Page 54: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

The 4 Personas for Data Analytics

https://namitkabra.wordpress.com/2016/12/05/the-4-personas-for-data-analytics/

Page 55: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

The 4 Personas for Data Analytics

https://namitkabra.wordpress.com/2016/12/05/the-4-personas-for-data-analytics/

Page 56: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Statistics

Machine learning

Information retrieval

Signal processing

Data visualization

Databases

Big data platform and tools

Data modeling and ETL tools

Data warehousing solutions

Data APIs

Clouds

High performance computing

Page 57: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BIG DATA EXAMPLES

Some examples of data-driven projects

Page 58: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Pre-crime Programs

• CCTV is on every street corner, shopping mall, and

liquor store.

• Previous arrest records combining with real-time IoT

data (such as cameras designed to detect gunshots),

Police can pinpoint problem locations and understand

the crime conditions.

• Predict risk in specific locations across the city.

• At-risk areas are highlighted with recommendations for

evasive actions.

• LA : burglaries by 33 percent, violent crimes by 21

percent and property crimes by 12 percent respectively.

Page 59: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

MEMEX by DARPA

• An internet tool for internet crawling, searching, data aggregation, data analysis, data visualization, data extraction and image analysis.

• Intelligence Algorithm can be developed to – Anticipate specific incident – Query an information about a suspicious person on

social networks, shopping sites, and entertainment sites

– Websites promoting unlawful activities – Trending offensive videos/content specific to a person,

organizations, geography

Page 60: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

CRIME RISK ASSESSMENT

WITH CREDIT SCORING

• Predictive analytics to identify the offenders most likely to commit new crimes

• An analytics engine that classifies offenders as low-, medium-, and high-risk and makes targeted sentencing recommendations based on a host of case-specific factors. – Static factors:: offense type, current age, criminal history, age at first arrest.

– Dynamic factors (criminogenic factors) : attitude, associates, substance use, and antisocial personality patterns

– Real-time data : offender’s behavior and location.

Page 61: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

ROOT CAUSE ANALYSIS

• A method used for identifying the factors that are root

causes of crime in each area.

• A factor is considered a root cause if removal thereof from

the problem-fault-sequence prevents the final undesirable

outcome from recurring.

• Factors: Crime log, arrests, lighting condition, weather data,

• The identified factors can be utilized in

– Designing intervention plan

– Monitoring of factors

Page 62: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

PERSONALIZED REHABILITATION PROGRAM RECOMMENDATION

• Places offenders in specific rehabilitation programs based on predictors

• Past offense history

• Home life environment

• Gang affiliation

• Peer associations

• Creates the Management and Performance Hub

• Personalized solutions for individuals based on risk assessments.

Select an effective combination of interventions for offenders targeting individual needs

Page 63: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BEYOND THE BARS

• Check-in sessions

• Training and education

• Mental health support

• Drug relapse prevention

• Estimate blood alcohol content

• Predict the onset of depression

• Contact with peer support

groups

• Push notifications from case

managers

Use mobile technology and electronic monitoring

device to replace physical incarceration.

Page 64: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

GEO-SPATIAL ANALYTICS

• Risk based algorithm : manager can monitor behavior and

movement patterns of their cases on an interactive

dashboard

• An automated monitoring system capable of

– Tracking offenders’ movements

– Notifying offenders when they have impending appointments

– Notifying officers when offenders enter high-crime zones

– Notifying officers when movements indicate that offenders are

becoming more likely to commit a crime.

• Parole officers access a dashboard tracking the movement

and activities of offenders under their supervision

• Track offenders’ location on a map, and assess their

activities.

Page 65: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Vehicle Allocation Optimization

• Goal: Maximize availability and minimize transition • Predict quantity requirements by locations • Factors:

– Previous month vehicle locations – Previous months requirements of vehicle – Location constraints – Age of vehicle – Scheduled maintenance

• Is it better to keep vehicles in central storage or spread among operating locations

• 30 % increase in tasks being met and 40% decrease in transitions

Page 66: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

PREDICTIVE MAINTENANCE (PdM)

• Determine the condition of in-service equipment in order

to predict when the maintenance should be performed

• Cost savings over routine or time-based preventive

maintenance

• Fault Tree Analysis,

• Time Series Analysis

Page 67: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

PEOPLE ANALYTIC

• Predictive analytics is used for talent

acquisition, retention, placement,

promotion, compensation, or workforce

and succession planning.

• Analyzing the skills and attributes of high

performers in the present, then build a

template with quality hiring factors for

future hires.

• Non-traditional data gathering sources

– Social media channels where prospective

candidates usually leave their digital ‘thought

prints’.

• Statistical analysis of productivity and

turnover

– The data showed that old indicators (such as

GPA and education) were far less critical to

performance and retention. Factors like

experience is much more important. Ref: Forbe

Page 68: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

To Provide Data Services …

National Data Center, Cloud, and Big Data Platforms

Training Programs • Data Scientists • Data Engineers • Business Analysts

Services • ID verification • Access control • Data distribution • Transaction logging

Data Committees

Data Cataloging and directory services

1

3

2

5

4

Dat

a ex

chan

ge

Dat

a C

atal

og

Peo

ple

war

e

Data Committees &

Operating team

Infrastructure

Use cases in government

3

6

Showcases • Healthcare • Tourism • Traffic • Etc.

กรรมการก ากบัดแูลข้อมลู กฎหมายข้อมลู

ข้อมูล สารสนเทศพืน้ฐาน

บุคลากร

Page 69: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

GET READY FOR THE WORLD OF DATA

Page 70: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

THE OBSTACLES

• The absence of data and data gathering tools

• Existing data quality (consistency, accuracy,

completeness, conformity)

• Lack of concept understanding in business

problem formulation

• Lack of data scientists / analysts

• Data sharing within and across organizations

• Maintainability after initiatives

Page 71: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

DATA, HOW TO OBTAIN MORE

• Investment in mobile, AI, and conversational platforms

• Investment in the use of IoTs

• Create a seamless customer cross-channel experience

• Explore social media sentiment and other unstructured

data

• Integrate data silos (Administration, CRM, billing,

compliances, etc.)

Page 72: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

BIG DATA SANDBOX

• Sandbox allows an organization to realize its

actual investment value in big data.

• It is a developmental platform used to explore an

organization's information sets.

– Encourage collaboration and interaction

– Encourage learning by doing

– Build a data pool

• Build or rent one ?

Page 73: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

HR PREPERATION

• HR should identify current and future human

resources needs for an organization to become a

data-driven one.

• HR should come up with a plan to create

– A new salary model for data scientist hiring

– An incentive program and new opportunities for

existing staffs

– A multi-disciplinary task force

Page 74: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

THE ROADMAP

o Solution Architecture

Design (Vendor)

o Initial Investment on

Infrastructure (rent)

o Understand big data

o Ensure management buy-in

o Create a multi-disciplinary

team

o Define a list of use cases

o Explore Data Landscape

o Select proof of

concepts

o Collect data

o Create use case

prototypes

o Share the results

o Define the ROI

o Invest in infrastructure

o Expand the team and

start implementation

o Organization wide

training

o Improve overall

organization

VALUE

Page 75: BIG DATA ANALYTICS · 2019-03-22 · Happiness Index • Use Social media to create a happiness index • Pilot project: Collected 15,000 text records for the thirty most populous

Analytics is the

process of capturing,

interpreting and

communicating useful

information for better

decision making

Connect insights back to operations

People, Process, Environment