MT30 Best practices for data lake adoption
-
Upload
dell-emc-world -
Category
Documents
-
view
218 -
download
3
Transcript of MT30 Best practices for data lake adoption
![Page 1: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/1.jpg)
MT30
Best practices: Data lake adoption
Matt Maccaux, Global Big Data Practice Lead
![Page 2: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/2.jpg)
2Dell - Internal Use - Confidential
Agenda
• Two models for big data
• Big data anti-patterns
• Big data best practice
• How to get started?
• Your questions
![Page 3: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/3.jpg)
3Dell - Internal Use - Confidential
Two models for big data
Exploratory analytics
• Full data set – batch
• Explore, test, refine,
iterate
• The output is an algorithm
that will be integrated into
new or existing
applications.
Operationalization
• Limited data set –
Streaming
• The algorithm is integrated
into applications that drive
business decisions.
![Page 4: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/4.jpg)
Big data anti-patterns
![Page 5: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/5.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
![Page 6: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/6.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
![Page 7: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/7.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
![Page 8: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/8.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
![Page 9: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/9.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
![Page 10: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/10.jpg)
Big data best practices
![Page 11: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/11.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
me:~>_
CONTINUUM
![Page 12: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/12.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
Hadoop
Spark
Tableau
Python
TOOL CATALOG
Customer
Alert
Bills
Social
DATACATALOG
Duration
Performance
Normal
Analytics Request Portal
NONSampleData
SampleData
![Page 13: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/13.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
Data Lake
Discover/Map
Transform
Organize/Tag
CATALOG AND PROVISION
ENTERPRISE LOG ANALYSIS
Virtualisation
![Page 14: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/14.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
Virtualised Compute Pool
![Page 15: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/15.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
Data Pool
Meta
-dat
a T
aggi
ng
G
o
v
e
r
n
a
n
c
e
A
n
o
n
y
m
i
s
e
E
n
c
r
y
p
t
i
o
n
Pooln
Pooln
Pooln
Copy
![Page 16: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/16.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
Virtualised Compute Pool
![Page 17: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/17.jpg)
Dell - Internal Use - Confidential
© Copyright 2016 EMC Corporation. All rights reserved.
CD \>_
CONTINUUM
Data Pool
G
o
v
e
r
n
a
n
c
e
A
n
o
n
y
m
i
s
e
E
n
c
r
y
p
t
i
o
n
Pooln
Pooln
Pooln
Copy
Virtualised Compute Pool
![Page 18: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/18.jpg)
18Dell - Internal Use - Confidential
How to get started?
Big Data Technology Advisory
• Interview stakeholders including business users and technical/functional
experts
• Document requirements and gaps
• Define a future-state reference architecture
• Provide a plan/roadmap for implementation
![Page 19: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/19.jpg)
Q&A
![Page 20: MT30 Best practices for data lake adoption](https://reader034.fdocuments.in/reader034/viewer/2022042906/58a6dc921a28abef698b5b15/html5/thumbnails/20.jpg)