IOD 2013 - Crunch Big Data in the Cloud with IBM BigInsights and Hadoop


Crunch Big Data in the Cloud with IBM BigInsights and Hadoop IBD-3475

Leons Petrazickis, IBM Canada

@leonsp

© 2013 IBM Corporation

Please note

IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion.

Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision.

The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion.

Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user’s job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here.

First step

Request a lab environment

http://bit.ly/requestLab

BigDataUniversity.com

Hadoop Architecture

Agenda

• Terminology review

• Hadoop architecture

– HDFS

– Blocks

– MapReduce

– Types of nodes

– Topology awareness

– Writing a file to HDFS

6

7

Terminology review

(Diagram: a Hadoop cluster consists of racks, Rack 1 through Rack n; each rack contains multiple nodes, Node 1 through Node n.)

Hadoop architecture

8

• Two main components:

– Hadoop Distributed File System (HDFS)

– MapReduce engine

Hadoop distributed file system (HDFS)

9

• The Hadoop file system, which runs on top of the existing OS file system

• Designed to handle very large files with streaming data access patterns

• Uses blocks to store a file or parts of a file

HDFS - Blocks

10

• File Blocks

– 64MB (default), 128MB (recommended) – compare to 4KB in UNIX

– Behind the scenes, 1 HDFS block is supported by multiple operating system (OS) blocks

• Advantages of blocks:

– Fixed size – easy to calculate how many fit on a disk

– A file can be larger than any single disk in the network

– If a file or a chunk of the file is smaller than the block size, only the needed space is used. Eg: a 420MB file is split as 128MB + 128MB + 128MB + 36MB

• Fits well with replication to provide fault tolerance and availability

(Diagram: behind the scenes, one 128MB HDFS block is backed by multiple OS blocks.)

HDFS - Replication

• Blocks with data are replicated to multiple nodes

• Allows for node failure without data loss

11

(Diagram: each block is replicated across Node 1, Node 2, and Node 3.)

MapReduce engine

12

• Technology from Google

• A MapReduce program consists of map and reduce functions

• A MapReduce job is broken into tasks that run in parallel

Types of nodes - Overview

13

• HDFS nodes

– NameNode

– DataNode

• MapReduce nodes

– JobTracker

– TaskTracker

• There are other nodes not discussed in this course

Types of nodes - Overview

14

Types of nodes - NameNode

15

• NameNode

– Only one per Hadoop cluster

– Manages the filesystem namespace and metadata

– Single point of failure, but mitigated by writing state to multiple filesystems

– Because it is a single point of failure, don't use inexpensive commodity hardware for this node; it also has large memory requirements

Types of nodes - DataNode

16

• DataNode

– Many per Hadoop cluster

– Manages blocks with data and serves them to clients

– Periodically reports to the NameNode the list of blocks it stores

– Use inexpensive commodity hardware for this node

Types of nodes - JobTracker

17

• JobTracker node

– One per Hadoop cluster

– Receives job requests submitted by clients

– Schedules and monitors MapReduce jobs on TaskTrackers

Types of nodes - TaskTracker

18

• TaskTracker node

– Many per Hadoop cluster

– Executes MapReduce operations

– Reads blocks from DataNodes


Topology awareness

20

Bandwidth becomes progressively smaller in the following scenarios:

1. Process on the same node

2. Different nodes on the same rack

3. Nodes on different racks in the same data center

4. Nodes in different data centers

Writing a file to HDFS

25

(Slides 25-35: step-by-step animation of writing a file to HDFS.)

Thank You

What is Hadoop?

Agenda

38

• What is Hadoop?

• What is Big Data?

• Hadoop-related open source projects

• Examples of Hadoop in action

• Big Data solutions and the Cloud

What is Hadoop?

39

(Slides 39-46: diagrams contrasting a relational database handling 1GB, 10GB, 100GB of data with data volumes growing to 1TB, 10TB, 100TB and new sources such as RFIDs, sensors, Facebook, and Twitter.)

What is Hadoop?

47

• Open source project

• Written in Java

• Optimized to handle:

– Massive amounts of data through parallelism

– A variety of data (structured, unstructured, semi-structured)

– Using inexpensive commodity hardware

• Reliability provided through replication

• Not for OLTP, not for OLAP/DSS; good for Big Data

• Great performance

• Current version: 0.20.2

What is Big Data?

48

RFID Readers

What is Big Data?

49

2 Billion internet users

What is Big Data?

50

4.6 Billion mobile phones

What is Big Data?

51

7TB of data processed by Twitter every day


What is Big Data?

52

10TB of data processed by Facebook every day


What is Big Data?

53

About 80% of this data is unstructured

Examples of Hadoop in action – IBM Watson

55

Examples of Hadoop in action

56

• In the telecommunication industry

• In the media

• In the technology industry

Hadoop is not for all types of work

57

• Not to process transactions (random access)

• Not good when work cannot be parallelized

• Not good for low latency data access

• Not good for processing lots of small files

• Not good for intensive calculations with little data

Big Data solutions and the Cloud

58

• Big Data solutions are more than just Hadoop

– Add business intelligence/analytics functionality

– Derive information from data in motion

• Big Data solutions and the Cloud are a perfect fit.

– The Cloud allows you to set up a cluster of systems in minutes and it’s relatively inexpensive.

Thank You

HDFS – Command Line

Agenda

• HDFS Command Line Interface

• Examples

61

HDFS Command line interface

62

• File System Shell (fs)

• Invoked as follows:

hadoop fs <args>

• Example:

Listing the current directory in HDFS:

hadoop fs -ls .

HDFS Command line interface

63

• FS shell commands take path URIs as arguments

• URI format:

scheme://authority/path

• Scheme:

• For the local filesystem, the scheme is file

• For HDFS, the scheme is hdfs

hadoop fs -copyFromLocal file://myfile.txt hdfs://localhost/user/keith/myfile.txt

• Scheme and authority are optional

• Defaults are taken from configuration file core-site.xml

HDFS Command line interface

64

• Many POSIX-like commands

• cat, chgrp, chmod, chown, cp, du, ls, mkdir, mv, rm, stat, tail

• Some HDFS-specific commands

• copyFromLocal, copyToLocal, get, getmerge, put, setrep

HDFS – Specific commands

65

• copyFromLocal / put

• Copy files from the local file system into fs

hadoop fs -copyFromLocal <localsrc> ... <dst>

or

hadoop fs -put <localsrc> ... <dst>

HDFS – Specific commands

66

• copyToLocal / get

• Copy files from fs into the local file system

hadoop fs -copyToLocal [-ignorecrc] [-crc] <src> <localdst>

or

hadoop fs -get [-ignorecrc] [-crc] <src> <localdst>

HDFS – Specific commands

67

• getMerge

• Get all the files in the directories that match the source file pattern

• Merge and sort them to only one file on local fs

• <src> is kept

hadoop fs -getmerge <src> <localdst>

HDFS – Specific commands

68

• setRep

• Set the replication level of a file.

• The -R flag requests a recursive change of replication level for an entire tree.

• If -w is specified, waits until new replication level is achieved.

hadoop fs -setrep [-R] [-w] <rep> <path/file>
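These shell commands also have programmatic equivalents in the Java FileSystem API. The following is a minimal, illustrative sketch (not from the original slides); the paths reuse the /user/keith example from earlier, and the configuration is picked up from core-site.xml:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsShellEquivalents {
  public static void main(String[] args) throws Exception {
    // Reads fs.default.name and other settings from core-site.xml on the classpath
    Configuration conf = new Configuration();
    FileSystem fs = FileSystem.get(conf);

    // hadoop fs -copyFromLocal myfile.txt /user/keith/myfile.txt
    fs.copyFromLocalFile(new Path("myfile.txt"), new Path("/user/keith/myfile.txt"));

    // hadoop fs -ls .
    for (FileStatus status : fs.listStatus(fs.getWorkingDirectory())) {
      System.out.println(status.getPath());
    }

    // hadoop fs -setrep 2 /user/keith/myfile.txt
    fs.setReplication(new Path("/user/keith/myfile.txt"), (short) 2);
  }
}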

Thank You

Hadoop MapReduce

Agenda

71

• Map operations

• Reduce operations

• Submitting a MapReduce job

• Distributed Mergesort Engine

• Two fundamental data types

• Fault tolerance

• Scheduling

• Task execution

What is a Map operation?

72

• Doing something to every element in an array is a common operation:

var a = [1,2,3];

for (i = 0; i < a.length; i++)

a[i] = a[i] * 2;

What is a Map operation?

73

• Doing something to every element in an array is a common operation:

var a = [1,2,3];

for (i = 0; i < a.length; i++)

a[i] = a[i] * 2;

• New value for variable a would be:

var a = [2,4,6];

What is a Map operation?

74

• Doing something to every element in an array is a common operation:

var a = [1,2,3];

for (i = 0; i < a.length; i++)

a[i] = a[i] * 2;

• New value for variable a would be:

var a = [2,4,6];

The statement a[i] = a[i] * 2; can be written as a function.

What is a Map operation?

75

• Doing something to every element in an array is a common operation:

var a = [1,2,3];

for (i = 0; i < a.length; i++)

a[i] = fn(a[i]);

• New value for variable a would be:

var a = [2,4,6];

Here the loop body a[i] = a[i] * 2; has been replaced with a[i] = fn(a[i]);, where fn is a function defined as:

function fn(x) { return x*2; }

What is a Map operation?

76

• Doing something to every element in an array is a common operation:

var a = [1,2,3];

for (i = 0; i < a.length; i++)

a[i] = fn(a[i]);

Now, all of this can also be converted into a “map” function…

What is a Map operation?

77

• …like this, where fn is a function passed as an argument:

function map(fn, a) {

for (i = 0; i < a.length; i++)

a[i] = fn(a[i]);

}

What is a Map operation?

78

• …like this, where fn is a function passed as an argument:

function map(fn, a) {

for (i = 0; i < a.length; i++)

a[i] = fn(a[i]);

}

• You can invoke this map function like this:

map(function(x){return x*2;}, a);

What is a Map operation?

79

• …like this, where fn is a function passed as an argument:

function map(fn, a) {

for (i = 0; i < a.length; i++)

a[i] = fn(a[i]);

}

• You can invoke this map function like this:

map(function(x){return x*2;}, a);

This is function fn whose definition is included in the call

What is a Map operation?

80

• In summary, now you can rewrite:

for (i = 0; i < a.length; i++)

a[i] = a[i] * 2;

as a map operation:

map(function(x){return x*2;}, a);

What is a Reduce operation?

81

• Another common operation on arrays is to combine all their values:

function sum(a) {

var s = 0;

for (i = 0; i < a.length; i++)

s += a[i];

return s;

}

What is a Reduce operation?

82

• Another common operation on arrays is to combine all their values:

function sum(a) {

var s = 0;

for (i = 0; i < a.length; i++)

s += a[i];

return s;

}

This can be written as a function.

What is a Reduce operation?

83

• Another common operation on arrays is to combine all their values:

function sum(a) {

var s = 0;

for (i = 0; i < a.length; i++)

s = fn(s,a[i]);

return s;

}

Like this, where function fn is defined so it adds its arguments: function fn(a,b){ return a+b; }

What is a Reduce operation?

84

• Another common operation on arrays is to combine all their values:

function sum(a) {

var s = 0;

for (i = 0; i < a.length; i++)

s = fn(s, a[i]);

return s;

}

The whole function sum can also be rewritten so that fn is passed as an argument.

What is a Reduce operation?

85

• Another common operation on arrays is to combine all their values:

function reduce(fn, a, init) {

var s = init;

for (i = 0; i < a.length; i++)

s = fn(s, a[i]);

return s;

}

Like this… The function name was changed to reduce, and now it takes three arguments: a function, an array, and an initial value.

What is a Reduce operation?

86

• Another common operation on arrays is to combine all their values:

function sum(a) {

var s = 0;

for (i = 0; i < a.length; i++)

s += a[i];

return s;

}

This can be rewritten as a reduce operation:

reduce(function(a,b){return a+b;},a,0);


Submitting a MapReduce job

88

(Slides 88-97: step-by-step animation of submitting a MapReduce job.)
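The diagrams are not reproduced here, but the client side of job submission is compact. A rough sketch of a classic Hadoop 0.20-era driver (illustrative only; WordCountMapper and WordCountReducer are the hypothetical classes sketched later in this section):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCountDriver {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = new Job(conf, "word count");        // Job.getInstance(conf, ...) in later releases
    job.setJarByClass(WordCountDriver.class);
    job.setMapperClass(WordCountMapper.class);    // hypothetical map class
    job.setReducerClass(WordCountReducer.class);  // hypothetical reduce class
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    // Submits the job to the JobTracker and waits for it to finish
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}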


MapReduce – Distributed Mergesort Engine

99

(Slides 99-109: step-by-step animation of MapReduce as a distributed mergesort engine.)


Two fundamental data types

111

• Key/value pairs

• Lists

Input and output types:

map: <k1, v1> -> list(<k2, v2>)

reduce: <k2, list(v2)> -> list(<k3, v3>)
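To make these types concrete, here is a minimal, illustrative word-count sketch (not from the original slides). The map receives <k1, v1> = <byte offset, line of text> and emits list(<k2, v2>) = list(<word, 1>); the reduce receives <k2, list(v2)> = <word, list of counts> and emits list(<k3, v3>) = list(<word, total>):

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

// map: <byte offset, line> -> list(<word, 1>)
public class WordCountMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
  private static final IntWritable ONE = new IntWritable(1);
  private final Text word = new Text();

  @Override
  protected void map(LongWritable key, Text value, Context context)
      throws IOException, InterruptedException {
    for (String token : value.toString().split("\\s+")) {
      if (!token.isEmpty()) {
        word.set(token);
        context.write(word, ONE);
      }
    }
  }
}

// reduce: <word, list of counts> -> list(<word, total>)
class WordCountReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
  @Override
  protected void reduce(Text key, Iterable<IntWritable> values, Context context)
      throws IOException, InterruptedException {
    int sum = 0;
    for (IntWritable v : values) {
      sum += v.get();
    }
    context.write(key, new IntWritable(sum));
  }
}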

Simple data flow example

116

(Slides 116-120: step-by-step animation of a simple MapReduce data flow.)
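As an illustrative walk-through (not from the original slides), counting words in two input lines would flow like this:

map(0, "the quick fox") emits (the,1), (quick,1), (fox,1)
map(14, "the lazy dog") emits (the,1), (lazy,1), (dog,1)

The framework then sorts and groups the intermediate pairs by key:

(dog,[1]) (fox,[1]) (lazy,[1]) (quick,[1]) (the,[1,1])

and each reduce call sums one group: reduce(the, [1,1]) emits (the,2), and so on for every word.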


Fault tolerance

122

• Task failure

– If a child task fails, the child JVM reports to the TaskTracker before it exits. The attempt is marked as failed, freeing up a slot for another task.

– If the child task hangs, it is killed. The JobTracker reschedules the task on another machine.

– If a task continues to fail, the job is failed.

• TaskTracker failure

– The JobTracker receives no heartbeat.

– It removes the TaskTracker from the pool of TaskTrackers to schedule tasks on.

• JobTracker failure

– Single point of failure. The job fails.


Scheduling

133

• FIFO scheduler (with priorities)

– Each job uses the whole cluster, so jobs wait their turn.

• Fair scheduler

– Jobs are placed in pools. A user who submits more jobs than another user will not, on average, get more cluster resources than the other user. Custom pools with a guaranteed minimum capacity can be defined.

• Capacity scheduler

– Allows Hadoop to simulate, for each user, a separate MapReduce cluster with FIFO scheduling.

Task execution

140

• Speculative execution

– Job execution time is sensitive to slow-running tasks. Hadoop detects slow-running tasks and launches another, equivalent task as a backup. The output from whichever of the two finishes first is used.

• Task JVM reuse

– Tasks run in their own JVMs for isolation. Jobs that have a large number of short-lived tasks, or tasks with lengthy initialization, can benefit from sequential JVM reuse through configuration (see the sketch below).
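Both behaviors are controlled per job through configuration properties. A hedged sketch using the Hadoop 1.x / 0.20-era property names (later releases renamed them):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class TaskExecutionTuning {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();

    // Speculative execution can be toggled separately for map and reduce tasks
    conf.setBoolean("mapred.map.tasks.speculative.execution", true);
    conf.setBoolean("mapred.reduce.tasks.speculative.execution", true);

    // Task JVM reuse: -1 reuses one JVM for an unlimited number of a job's tasks
    conf.setInt("mapred.job.reuse.jvm.num.tasks", -1);

    Job job = new Job(conf, "tuned job");
    // ... set mapper, reducer, and input/output paths as usual ...
  }
}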

Thank You

Pig, Hive, and JAQL

Agenda

147

• Overview

• Pig

• Hive

• Jaql


Similarities of Pig, Hive and Jaql

149

All translate their respective high-level languages to MapReduce jobs

All offer significant reductions in program size over Java

All provide points of extension to cover gaps in functionality

All provide interoperability with other languages

None support random reads/writes or low-latency queries

Comparing Pig, Hive, and Jaql

150

Pig: developed by Yahoo!; language: Pig Latin (data flow); operates on complex data structures; schema optional: yes; Turing complete when extended with Java UDFs.

Hive: developed by Facebook; language: HiveQL (declarative, SQL dialect); geared towards structured data; schema optional: no, but data can have many schemas; Turing complete when extended with Java UDFs.

Jaql: developed by IBM; language: Jaql (data flow); operates on loosely structured data, JSON; schema optional: yes; Turing complete: yes.

Agenda

151

• Overview

• Pig

• Hive

• Jaql

Pig components

• Two Components

Language (called Pig Latin)

Compiler

• Two execution environments

Local (Single JVM)

pig -x local

Distributed (Hadoop cluster)

pig -x mapreduce, or simply pig

152

Running Pig

Script

pig scriptfile.pig

Grunt (command line)

pig (to launch command line tool)

Embedded

Call in to Pig from Java

153

Pig Latin sample code

154

#pig

grunt> records = LOAD 'econ_assist.csv'
       USING PigStorage(',')
       AS (country:chararray, sum:long);

grunt> grouped = GROUP records BY country;

grunt> thesum = FOREACH grouped
       GENERATE group, SUM(records.sum);

grunt> DUMP thesum;

Pig Latin – Statements, operations & commands

155

(Diagram: a Pig Latin program is a sequence of statements. An operation such as LOAD 'input.txt' or DUMP is a statement, and a command such as ls *.txt is also a statement. The program is compiled into a logical plan, then a physical plan, and then executed.)

Pig Latin statements

• UDF statements: REGISTER, DEFINE

• Commands:

– Hadoop filesystem (cat, ls, etc.)

– Hadoop MapReduce (kill)

– Utility (exec, help, quit, run, set)

• Operators:

– Diagnostic: DESCRIBE, EXPLAIN, ILLUSTRATE

– Relational: LOAD, STORE, DUMP, FILTER, etc.

156

Pig Latin – Relational operators

Loading and storing

Eg: LOAD (into a program), STORE (to disk), DUMP (to the screen)

Filtering Eg: FILTER, DISTINCT, FOREACH...GENERATE, STREAM, SAMPLE

Grouping and joining Eg: JOIN, COGROUP, GROUP, CROSS

Sorting Eg: ORDER, LIMIT

Combining and splitting Eg: UNION, SPLIT

157

Pig Latin – Relations and schema

Result of a relational operator is a relation

A relation is a set of tuples

Relations can be named using an alias (Eg: “x”)

158

x = LOAD 'sample.txt' AS (id: int, year: int);

DUMP x;

Output is a tuple. Eg: (1,1987)

Pig Latin – Relations and schema

Structure of a relation is a schema

Use the DESCRIBE operator to see the schema. Eg:

DESCRIBE x;

The output is the schema:

x: {id: int, year: int}

159

Pig Latin expressions

Statements that contain relational operators may also contain expressions.

Kinds of expressions:

Constant, Field, Projection, Map lookup, Cast, Arithmetic, Conditional, Boolean, Comparison, Functional, Flatten

160

Pig Latin – Data types

• Simple types: int, long, float, double, chararray, bytearray

• Complex types:

– Tuple: sequence of fields of any type

– Bag: unordered collection of tuples

– Map: set of key-value pairs; keys must be chararray

161

Pig Latin – Function types

• Eval

– Input: one or more expressions

– Output: an expression

– Example: MAX

• Filter

– Input: bag or map

– Output: boolean

– Example: IsEmpty

162

Pig Latin – Function types

163

• Load

– Input: data from external storage

– Output: a relation

– Example: PigStorage

• Store

– Input: a relation

– Output: data to external storage

– Example: PigStorage

Pig Latin – User-Defined Functions

• Written in Java

Packaged in a JAR file

Register JAR file using the REGISTER statement

Optionally, alias it with DEFINE statement

164
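As a minimal, illustrative sketch (hypothetical class name, using the org.apache.pig.EvalFunc API), an eval UDF that upper-cases a chararray might look like this:

import java.io.IOException;
import org.apache.pig.EvalFunc;
import org.apache.pig.data.Tuple;

// Hypothetical eval UDF: returns its chararray argument in upper case
public class ToUpper extends EvalFunc<String> {
  @Override
  public String exec(Tuple input) throws IOException {
    if (input == null || input.size() == 0 || input.get(0) == null) {
      return null;
    }
    return ((String) input.get(0)).toUpperCase();
  }
}

Packaged in a hypothetical myudfs.jar, it could then be used from Grunt roughly as: REGISTER myudfs.jar; followed by upper = FOREACH records GENERATE ToUpper(country);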

Agenda

165

• Overview

• Pig

• Hive

• Jaql

Hive architecture

166

(Diagram: DDL and queries arrive via JDBC/ODBC, the CLI, or the web interface; they pass through the parser, planner, and optimizer; metadata lives in the metastore, a relational database; execution runs on Hadoop.)

Running Hive

Hive shell

– Interactive: hive

– Script: hive -f myscript

– Inline: hive -e 'SELECT * FROM mytable'

167

Hive services

hive --service servicename

where servicename can be:

– hiveserver: server for Thrift, JDBC, ODBC clients

– hwi: web interface

– jar: hadoop jar with Hive JARs in the classpath

– metastore: out-of-process metastore

168

Hive - Metastore

Stores Hive metadata

Configurations:

– Embedded: in-process metastore, in-process database

– Local: in-process metastore, out-of-process database

– Remote: out-of-process metastore, out-of-process database

169

Hive – Schema-On-Read

Faster loads into the database (simply copy or move)

Slower queries

Flexibility – multiple schemas for the same data

170

Hive - Configuration

• Three ways to configure Hive:

– hive-site.xml

• fs.default.name

• mapred.job.tracker

• Metastore configuration settings

– hive --hiveconf on the command line

– SET command in the Hive shell

171

Hive Query Language (HiveQL)

SQL dialect

Does not support full SQL92 specification

No support for:

HAVING clause in SELECT

Correlated subqueries

Subqueries outside FROM clauses

Updateable or materialized views

Stored procedures

172

Sample code

173

#hive

hive> CREATE TABLE foreign_aid (country STRING, sum BIGINT)
      ROW FORMAT DELIMITED
      FIELDS TERMINATED BY ','
      STORED AS TEXTFILE;

hive> SHOW TABLES;

hive> DESCRIBE foreign_aid;

hive> LOAD DATA INPATH 'econ_assist.csv'
      OVERWRITE INTO TABLE foreign_aid;

hive> SELECT * FROM foreign_aid LIMIT 10;

hive> SELECT country, SUM(sum) FROM foreign_aid GROUP BY country;

Hive Query Language (HiveQL)

Extensions

– MySQL-like extensions

– MapReduce extensions: multi-table insert; MAP, REDUCE, TRANSFORM clauses

Data types

– Simple: TINYINT, SMALLINT, INT, BIGINT, FLOAT, DOUBLE, BOOLEAN, STRING

– Complex: ARRAY, MAP, STRUCT

174

Hive Query Language (HiveQL)

Built-in functions

– SHOW FUNCTIONS

– DESCRIBE FUNCTION

175

Hive – User-Defined Functions

Written in Java

Three UDF types:

UDF

Input: single row, output: single row

UDAF

Input: multiple rows, output: single row

UDTF

Input: single row, output: multiple rows

Register UDF using ADD JAR

Create alias using CREATE TEMPORARY FUNCTION

176
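As a minimal, illustrative sketch (hypothetical class name, using the org.apache.hadoop.hive.ql.exec.UDF base class of that era), a simple UDF of the first kind might look like this:

import org.apache.hadoop.hive.ql.exec.UDF;
import org.apache.hadoop.io.Text;

// Hypothetical UDF: single row in, single row out (upper-cases a string)
public class ToUpper extends UDF {
  public Text evaluate(Text input) {
    if (input == null) {
      return null;
    }
    return new Text(input.toString().toUpperCase());
  }
}

Packaged in a hypothetical myudfs.jar, it would be registered and used roughly as: ADD JAR myudfs.jar; CREATE TEMPORARY FUNCTION to_upper AS 'ToUpper'; SELECT to_upper(country) FROM foreign_aid;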

Agenda

177

• Overview

• Pig

• Hive

• Jaql

Jaql architecture

178

(Diagram: the interactive shell, applications, and scripts feed the Jaql compiler/parser/rewriter; below that sit the I/O layer and the storage layer: file systems (HDFS, GPFS, local), databases (DBMS, HBase), and streams (web, pipes).)

Jaql data model: JSON

JSON = JavaScript Object Notation

Flexible (Schema is optional)

Powerful modeling for semi-structured data

Popular exchange format

179

JSON example

180

[
  {ACCT_NUM: 18, AUTH_DATE: "2011-01-29",
   AUTH_AMT: "111.11", ZIP: 98765, MERCH_NAME: "Acme"},
  {ACCT_NUM: 19, AUTH_DATE: "2011-01-29",
   AUTH_AMT: "222.22", ZIP: 98765, MERCH_NAME: "Exxme",
   NICKNAME: "Xyz"},
  {ACCT_NUM: 20, AUTH_DATE: "2011-01-30",
   AUTH_AMT: "3.33", ZIP: 12345, MERCH_NAME: "Acme",
   ROUTE: ["68.86.85.188", "64.215.26.111"]},
  …
]

Running Jaql

Jaql shell

– Interactive: jaqlshell

– Batch: jaqlshell -b myscript.jaql

– Inline: jaqlshell -e jaqlstatement

Modes

– Cluster: jaqlshell -c

– Minicluster: jaqlshell

181

Jaql query language

• Sources and sinks

Eg: copy data from a local file (source) to a new file on HDFS (sink):

read(file("input.json")) -> write(hdfs("output"))

(Diagram: source -> operator -> operator -> … -> sink)

• Core operators: Filter, Transform, Expand, Group, Join, Union, Tee, Sort, Top

182

Jaql query language

• Variables

Equal operator (=) binds source output to a variable

e.g. $tweets = read(hdfs("twitterfeed"))

• Pipes, streams, and consumers

Pipe operator (->) streams data to a consumer

Pipe expects an array as input

e.g. $tweets -> filter $.from_src == 'tweetdeck';

$ – implicit variable referencing the current array value

183

Jaql query language

• Categories of built-in functions:

system, core, hadoop, io, array, index, schema, xml, regex, binary, date, nil, agg, number, string, function, random, record

184

Jaql – Data Storage

Data store examples: Amazon S3, DB2, HBase, HDFS, HTTP, JDBC, local FS

Data format examples: JSON, Avro, CSV, XML

185

Jaql sample code

186

#jaqlshell -c

jaql> $foreignaid = read(del("econ_assist.csv",
          {schema: schema {country: string, sum: long}}));

jaql> $foreignaid
      -> group by $country = ($.country)
         into {$country.country, sum($[*].sum)};

Hadoop core lab – Part 3

BigDataUniversity.com

Acknowledgements and Disclaimers

Availability. References in this presentation to IBM products, programs, or services do not imply that they will be available in all countries in which IBM operates.

The workshops, sessions and materials have been prepared by IBM or the session speakers and reflect their own views. They are provided for informational purposes only, and are neither intended to, nor shall have the effect of being, legal or other guidance or advice to any participant. While efforts were made to verify the completeness and accuracy of the information contained in this presentation, it is provided AS-IS without warranty of any kind, express or implied. IBM shall not be responsible for any damages arising out of the use of, or otherwise related to, this presentation or any other materials. Nothing contained in this presentation is intended to, nor shall have the effect of, creating any warranties or representations from IBM or its suppliers or licensors, or altering the terms and conditions of the applicable license agreement governing the use of IBM software.

All customer examples described are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics may vary by customer. Nothing contained in these materials is intended to, nor shall have the effect of, stating or implying that any activities undertaken by you will result in any specific sales, revenue growth or other results.

© Copyright IBM Corporation 2013. All rights reserved.

• U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

IBM, the IBM logo, ibm.com, InfoSphere and BigInsights, Streams, and DB2 are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the Web at “Copyright and trademark information” at www.ibm.com/legal/copytrade.shtml

Other company, product, or service names may be trademarks or service marks of others.

Communities

• On-line communities, User Groups, Technical Forums, Blogs, Social networks, and more

o Find the community that interests you …

• Information Management bit.ly/InfoMgmtCommunity

• Business Analytics bit.ly/AnalyticsCommunity

• Enterprise Content Management bit.ly/ECMCommunity

• IBM Champions

o Recognizing individuals who have made the most outstanding contributions to Information Management, Business Analytics, and Enterprise Content Management communities

• ibm.com/champion

Thank You – Your feedback is important!

• Access the Conference Agenda Builder to complete your session surveys

o Any web or mobile browser at http://iod13surveys.com/surveys.html

o Any Agenda Builder kiosk onsite