The Path to Truly Understanding Your MongoDB Data

#MDBlocal DATA ANALYTICS / UNDERSTANDING YOUR DATA SEATTLE

Transcript of The Path to Truly Understanding Your MongoDB Data

Page 1: The Path to Truly Understanding Your MongoDB Data

#MDBlocal

DATA ANALYTICS / UNDERSTANDING YOUR DATA

SEATTLE

Page 4: The Path to Truly Understanding Your MongoDB Data

SAM WEAVER

PRODUCT MANAGER, MONGODB

@samuel_weaver

Page 5: The Path to Truly Understanding Your MongoDB Data

AGENDA

1. Background
2. The importance of data visualization
3. Methods for data visualization in MongoDB
4. Lunch

Page 6: The Path to Truly Understanding Your MongoDB Data

WHERE ARE WE NOW?

Page 7: The Path to Truly Understanding Your MongoDB Data

TERMINOLOGY

“Business Intelligence”

“Business Analytics”

ANALYTICS

Page 8: The Path to Truly Understanding Your MongoDB Data

DATA GROWTH IS EXPLOSIVE

• More data was created in the last 2 years than in the entire previous history of the human race

• By 2020: 1.7 MB of new data per person every second

Page 9: The Path to Truly Understanding Your MongoDB Data

THE STATE OF ANALYTICS

• Analytics is big $!

• $130B in 2016

• $200B+ in 2020

• Less than 0.5% of data is analyzed and used – imagine the potential!

Page 10: The Path to Truly Understanding Your MongoDB Data

EVOLUTION OF ANALYTICS

Timeline: 2012, 2015, 2016, Today (2018)

2012

• Dedicated reporting team
• Desktop access
• Hadoop
• Batch analytics
• On-prem only
• Monthly reports

Today (2018)

• Self service
• Mobile access
• Spark
• Real-time analytics
• On-prem and cloud
• On-demand reporting

Page 11: The Path to Truly Understanding Your MongoDB Data

IMPORTANCE OF DATA VIZ

Page 13: The Path to Truly Understanding Your MongoDB Data

EARLY DATA VISUALIZATIONS

Charles Minard (1869): Napoleon's march on Moscow and retreat in 1812.

Page 14: The Path to Truly Understanding Your MongoDB Data

Data set I

                            X      Y
                           10   8.04
                            8   6.95
                           13   7.58
                            9   8.81
                           11   8.33
                           14   9.96
                            6   7.24
                            4   4.26
                           12  10.84
                            7   4.82
                            5   5.68
Mean                     9.00   7.50
Variance (Population)   10.00   3.75
Correlation (Pearson)       0.816

Page 16: The Path to Truly Understanding Your MongoDB Data

                            I             II            III             IV
                            X      Y      X      Y      X      Y      X      Y
                           10   8.04     10   9.14     10   7.46      8   6.58
                            8   6.95      8   8.14      8   6.77      8   5.76
                           13   7.58     13   8.74     13  12.74      8   7.71
                            9   8.81      9   8.77      9   7.11      8   8.84
                           11   8.33     11   9.26     11   7.81      8   8.47
                           14   9.96     14   8.10     14   8.84      8   7.04
                            6   7.24      6   6.13      6   6.08      8   5.25
                            4   4.26      4   3.10      4   5.39     19  12.50
                           12  10.84     12   9.13     12   8.15      8   5.56
                            7   4.82      7   7.26      7   6.42      8   7.91
                            5   5.68      5   4.74      5   5.73      8   6.89
Mean                     9.00   7.50   9.00   7.50   9.00   7.50   9.00   7.50
Variance (Population)   10.00   3.75  10.00   3.75  10.00   3.75  10.00   3.75
Correlation (Pearson)       0.816         0.816         0.816         0.817
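
The four data sets in this table are the ones commonly known as Anscombe's quartet, and the point that very different data can share nearly identical summary statistics is easy to check. The following is a minimal Python sketch using only the standard library; the helper names (`population_variance`, `pearson`, `summarize`) are illustrative choices, not something from the talk.

```python
from math import sqrt

# The four data sets from the table (sets I-III share the same X values).
X_123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
X_4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]
QUARTET = {
    "I": (X_123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X_123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X_123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": (X_4, [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def mean(values):
    return sum(values) / len(values)


def population_variance(values):
    # Divide by n (not n - 1), matching the "Variance (Population)" row.
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / len(values)


def pearson(xs, ys):
    # Pearson correlation: covariance over the product of the spreads.
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    spread_x = sum((x - mx) ** 2 for x in xs)
    spread_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(spread_x * spread_y)


def summarize(xs, ys):
    return {
        "mean_x": mean(xs),
        "mean_y": mean(ys),
        "var_x": population_variance(xs),
        "var_y": population_variance(ys),
        "r": pearson(xs, ys),
    }


SUMMARY = {name: summarize(xs, ys) for name, (xs, ys) in QUARTET.items()}

for name, stats in SUMMARY.items():
    print(name, {key: round(value, 3) for key, value in stats.items()})
```

The printed statistics agree across all four sets to about two decimal places, which is exactly why the numbers alone cannot tell the data sets apart and a plot can.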

Page 20: The Path to Truly Understanding Your MongoDB Data

SO YOU WANT TO VISUALIZE?

Page 23: The Path to Truly Understanding Your MongoDB Data

THINGS TO THINK ABOUT

• Use the correct architecture

• Determine what your needs are
  • Multiple data sources?
  • Huge amounts of complex data?
  • Quick self service?

• Choose the right solution for you

Page 24: The Path to Truly Understanding Your MongoDB Data

ARCHITECTURE FOR ANALYTICS

Page 25: The Path to Truly Understanding Your MongoDB Data

HIDDEN REPLICAS

• Hidden secondaries maintain a copy of the primary's data set

• Hidden secondaries are used for workloads with different access patterns

• Cannot become primary

[Diagram: a client talks to a replica set of one primary and several secondaries; an additional secondary with priority 0 and hidden=true serves the analytics workload.]
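
A hidden member is declared in the replica set configuration by setting `priority: 0` and `hidden: true` on that member. The sketch below only builds the configuration document in Python so that it runs without a server; the host names, the set name `rs0`, and the `with_hidden_member` helper are hypothetical. Applying the result to a real deployment would typically go through the `replSetReconfig` admin command (or `rs.reconfig()` in the shell), which is not exercised here.

```python
import copy


def with_hidden_member(config, host):
    """Return a copy of a replica set config with one extra hidden member."""
    new_config = copy.deepcopy(config)  # leave the caller's config untouched
    next_id = max(member["_id"] for member in new_config["members"]) + 1
    new_config["members"].append({
        "_id": next_id,
        "host": host,
        "priority": 0,   # priority 0: this member cannot become primary
        "hidden": True,  # hidden: not advertised to client applications
    })
    # Assumption: the config is sent with replSetReconfig, which expects a
    # version number higher than the one currently in use.
    new_config["version"] += 1
    return new_config


# Hypothetical three-member replica set.
current = {
    "_id": "rs0",
    "version": 1,
    "members": [
        {"_id": 0, "host": "db0.example.net:27017"},
        {"_id": 1, "host": "db1.example.net:27017"},
        {"_id": 2, "host": "db2.example.net:27017"},
    ],
}

proposed = with_hidden_member(current, "analytics.example.net:27017")
print(proposed["members"][-1])
```

Because a hidden member is not advertised to ordinary clients, an analytics tool would usually be pointed directly at that member's host rather than at the replica set as a whole.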

Page 26: The Path to Truly Understanding Your MongoDB Data

TOOLING

Page 27: The Path to Truly Understanding Your MongoDB Data

BUILD YOUR OWN

• Pros
  • Custom-tailored solution: fits exactly as required!

• Cons
  • High investment
  • Maintenance
  • Requires a deep understanding of the underlying tech and its language(s)

Page 28: The Path to Truly Understanding Your MongoDB Data

USE THE TOOLS WE GIVE YOU

Page 29: The Path to Truly Understanding Your MongoDB Data

MONGODB COMPASS

• Developer tool

• Data management and manipulation

• Interesting schema analysis

• Used daily: a good first place to start
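
Schema analysis, in the sense used here, means sampling documents and reporting which fields appear, how often, and with which types. The Python sketch below illustrates that idea on a few in-memory documents. It is not how Compass is implemented; the sample documents and the `analyze_schema` helper are made up for illustration, and only top-level fields are inspected.

```python
from collections import Counter, defaultdict


def analyze_schema(documents):
    """Tally, per top-level field, how often it appears and with which types."""
    type_counts = defaultdict(Counter)
    for doc in documents:
        for field, value in doc.items():
            type_counts[field][type(value).__name__] += 1
    total = len(documents)
    return {
        field: {
            "coverage": sum(types.values()) / total,  # share of docs with the field
            "types": dict(types),                     # type name -> occurrences
        }
        for field, types in type_counts.items()
    }


# Hypothetical sample: "age" is polymorphic, "tags" and "address" are optional.
sample = [
    {"name": "Ada", "age": 36, "tags": ["admin"]},
    {"name": "Grace", "age": "unknown"},
    {"name": "Linus", "age": 52, "address": {"city": "Seattle"}},
]

report = analyze_schema(sample)
for field, info in report.items():
    print(f"{field}: {info['coverage']:.0%} of documents, types={info['types']}")
```

Even this small report surfaces the two things that tend to surprise people in document data: fields that are missing from some documents and fields whose type varies.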

Page 30: The Path to Truly Understanding Your MongoDB Data

WHEN TO USE

• Day-to-day development/operations

• Adding indexes

• Viewing server stats

• Data manipulation

• 10,000 ft to 1 ft view of data

Page 31: The Path to Truly Understanding Your MongoDB Data

BI CONNECTOR

• Visualize and explore MongoDB data in SQL-based BI tools:
  • Automatically discovers the schema
  • Translates complex SQL statements issued by the BI tool into MongoDB aggregation queries
  • Converts the results into a tabular format for rendering inside the BI tool
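
To make the translation step concrete, the sketch below pairs a simple SQL `GROUP BY` query with an aggregation pipeline that expresses the same intent, then evaluates that intent over a few in-memory documents to produce flat rows. The `users` collection, its field names, and the `run_group_count` helper are hypothetical, and the pipeline the BI Connector actually generates may look different.

```python
from collections import Counter

# A query of the kind a SQL-based BI tool might issue (hypothetical schema).
SQL = (
    "SELECT city, COUNT(*) AS n FROM users "
    "WHERE age >= 18 GROUP BY city ORDER BY n DESC"
)

# One aggregation pipeline with the same intent: filter, group and count, sort.
PIPELINE = [
    {"$match": {"age": {"$gte": 18}}},
    {"$group": {"_id": "$city", "n": {"$sum": 1}}},
    {"$sort": {"n": -1}},
]


def run_group_count(documents, min_age):
    """Pure-Python stand-in for the pipeline, so the sketch runs without a server."""
    counts = Counter(doc["city"] for doc in documents if doc["age"] >= min_age)
    # Flat (city, n) rows, largest group first, like a SQL result set.
    return sorted(counts.items(), key=lambda row: row[1], reverse=True)


users = [
    {"name": "Ada", "age": 36, "city": "Seattle"},
    {"name": "Bo", "age": 17, "city": "Seattle"},
    {"name": "Cy", "age": 41, "city": "Austin"},
    {"name": "Di", "age": 29, "city": "Seattle"},
]

rows = run_group_count(users, min_age=18)
print(rows)
```

Against a live deployment, the same pipeline could be passed to a driver's aggregate call (for example `collection.aggregate(PIPELINE)` in PyMongo); the in-memory version is used here only to keep the example self-contained.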

Page 32: The Path to Truly Understanding Your MongoDB Data

BI CONNECTOR

Page 33: The Path to Truly Understanding Your MongoDB Data

WHEN TO USE

• Multiple data sources (not just MongoDB)

• Business analysts

• Extremely powerful, but with a steep ramp-up

Page 34: The Path to Truly Understanding Your MongoDB Data

MONGODB CHARTS

• Lightweight

• Intuitive

• Build visualizations on MongoDB data (nested, polymorphic)

• Share content in a dashboard

Page 35: The Path to Truly Understanding Your MongoDB Data

WHEN TO USE

• When you want quick answers

• No need to flatten / ETL your MongoDB data

• Self service for the technical audience

Page 36: The Path to Truly Understanding Your MongoDB Data

DEMO

Page 37: The Path to Truly Understanding Your MongoDB Data

LIFE CYCLE

1. Acquire
2. Prep
   - Calcs
   - Groups
   - Data types
3. Visualize
   - Bar
   - Pie
   - Line
4. Explore
   - Dashboards
5. Share
   - Export
   - Collaborate
   - Embed

Page 38: The Path to Truly Understanding Your MongoDB Data

THANK YOU
