Presentation-Big data optomisation

16
DATA OPTIMISATION •Improving Information consistency for better big data management AJAY RATHI

Transcript of Presentation-Big data optomisation

DATA OPTIMISATION• Improving Information consistency for better big data management

AJAY RATHI

New Natural resource

Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere:

What is BIG DATA

sensors used to gather information.

Posts to social media sites

Digital pictures and videos

Purchase transaction records

Cell phone GPS signals to name a few.

GARTNER HYPE CYCLE OF BIG DATA

GARTNER HYPE CYCLE OF IOT

The 4 V’s of Big data

Variability

Visualization

Data is not IMPORTANT, it is

what you do with it is IMPORTANTVALUE

Traditional Data Vs Big Data.

DATA LAKE

OUTCOME OF DATA LAKE

BIG DATA GOVERNANCE TRUTH Vs TRUST.

In God We trust all other bring data

TRADITIONAL Vs BIG DATA GOVERNANCE

WHY IS BIG DATA TRUST IMPORTANT

What Should be done --- Look for answers

Where did the data come

from?

Is the data from a transaction that can be audited and proven?

Is the data truth or opinion?

is it behavioral data from a data aggregator?

Is the data an intentional fabrication?

Is the profile information supplied by a registered user reliable?

What Should be done –Three principals to start with•Has the source (system, person or data stream) proved to be reliable on previous occasions? Chances are he/she/it will be again.

•Do other sources point in the same direction? Although it is unlikely that you'll get exactly the same results when you triangulate, you may be able to confirm a trend.

•Have other users or uses found the data to be useful? Try counting the effective popularity of data being used, or its "likes," if you will.