Hadoop and Data Virtualization - A Case Study by VHA

36
Page 1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Hadoop and Data Virtualization: A Case Study by VHA with Denodo, Hortonworks and VHA

Transcript of Hadoop and Data Virtualization - A Case Study by VHA

Page 1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Hadoop and Data Virtualization: A Case Study by VHA

with Denodo, Hortonworks and VHA

Page 2 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Speakers

Richard Proctor GM Healthcare Hortonworks

Ravi Shankar CMO Denodo

Ben Blakeney Architecture & Engineering Services VHA

Page 3 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Shifting the Data Paradigm

•  Reactive reporting •  3 – 6 month delay •  Regulatory-centric •  Manual data review/collection •  Repressive data silo’s •  Not sure what data to store •  Expensive storage w/limited options

�  All data types (structured, semi structured, & Unstructured)

�  Near real-time and predictive �  Organization & Patient centric �  Store everything �  Inexpensive storage w/lots options

Data  as  an  independent  business  process(Silo’s  of  data)  

Reac7ve  repor7ng  

Data  as  a  byproduct  of  pa7ent  care   Prospec7ve  analysis  

Primary  audience  –  healthcare  organiza7on  

Secondary  audience    

Secondary  audience    

Primary  audience  –  regulatory  agencies  

Current  data  process  with  latent  architecture  

Modern  data  Architecture  with  Hadoop  

Page 4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

The problem: Current data architecture under pressure AP

PLICAT

IONS  

DATA

   SYSTEM  

REPOSITORIES  

SOURC

ES  

Exis4ng  Sources    ADT,  Pa4ent  Accoun4ng,  GL,  Payroll,    

Physician  entry,  Core  measures,  Pa4ent  Sat,  AHRQ,  External  benchmarks,,  Clinical  Systems  (ED,  Radiology,  PACS),  Other  Sources  (Clinics,  

home  health,  Ambulatory,  LTAC’s)  

RDBMS   EDW   MPP  

Business    Analy4cs  

Custom  Applica4ons  

Packaged  Applica4ons  

OLTP,  ERP,  CRM  Systems  

Unstructured  documents,  emails  

Clickstream  

Server  logs  

Sen7ment,  Web  Data  

Sensor.  Machine  Data  

Geoloca7on  

Value in New data sources

•  Limited Application interaction •  Costly to Scale storage •  Silos of Data

•  Inability to manage new data sources •  Schema on Write vs. Read

Page 5 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

The “Empowered” Patient

Relationship Focused (Values stable doctor’s relationships)

Detests/suspicious of Gadgets

Serial Processor (Email, basic model phone)

Loyalty to Brand

Health as a service

Access to Care

Patient (willing to wait)

Believes experts (not comfortable seeking second opinions)

Goes for Best of Breed (Network of Networks)

Super Connected

Parallel Processor-Integrated (State-of-the-Art)

Loyalty to Value

Take care of Health

Health as a right

Impatient

Asks for Data (researches multiple self enabled searches, demands second opinions)

My Parents

My Children

Page 6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Thank you for the diagram, Robert Wood Johnson Foundation, 2014

6

Comprehensive Health Management

80% of healthcare determinants lie outside the US healthcare delivery system Can healthcare systems expand into these other areas, and become true public health systems?

Page 7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Page 8 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still appears, you may have to delete the image and then insert it again.

Original 24 architects, developers, operators of Hadoop from Yahoo!

ON

LY

100 open source

Apache Hadoop data platform

% Founded in 2011

HADOOP 1 ST provider to go public

IPO Fall 2014 (NASDAQ: HDP)

subscription customers 556

employees across 740+

countries technology partners 1350+ 17

TM

Hortonworks Company Profile

Page 9 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Two great Use cases to realize a dramatic cost savings…

EDW Optimization

OPERATIONS 50%

ANALYTICS 20%

ETL PROCESS 30%

OPERATIONS 50% ANALYTICS

50%

Current Reality EDW at capacity: some usage from low value workloads

Older data archived, unavailable for ongoing exploration

Source data often discarded

Augment w/ Hadoop

Free up EDW resources from low value tasks

Keep 100% of source data and historical data for ongoing exploration

Mine data for value after loading it because of schema-on-read

MPP

SAN

Engineered System

NAS

HADOOP

Cloud Storage

$0 $20,000 $40,000 $60,000 $80,000 $180,000

Fully-loaded Cost Per Raw TB of Data (Min–Max Cost)

Commodity Compute & Storage Hadoop Enables Scalable Compute & Storage at a Compelling Cost Structure

Hadoop Parse, Cleanse

Apply Structure, Transform

Storage Costs and licensing reduction of latent systems $500,000

5 times the amount of usable storage, plus processing power, for about 30% of the cost of traditional enterprise technologies,”

Page 10 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Monitor Patient Vitals in Real-Time with Sensor data

Problem Managing The Volumes of System Sensor Data •  In a typical hospital setting, nurses do rounds and manually monitor patient vital signs.

They may visit each bed every few hours to measure and record vital signs but the patient’s condition may decline between the time of scheduled visits.

•  This means that caregivers often respond to problems reactively, in situations where arriving earlier may have made a huge difference in the patient’s wellbeing.

Solution Hadoop Empowers Healthcare by Converting High Volumes of Sensor Data into a Manageable Set of Data •  New wireless sensors can capture and transmit patient vitals at much higher frequencies,

and these measurements can stream into a Hadoop cluster. •  Caregivers can use these signals for real-time alerts to respond more promptly to

unexpected changes. •  Over time, this data can go into algorithms that proactively predict the likelihood of an

emergency even before that could be detected with a bedside visit.

Benefits Ø Proactively Predict Events

rather than reactively

Ø Real-time Alerts

Ø Capture & Transmit Patient Vitals at Much Higher Frequencies

Ø Improve Patient Satisfaction

Ø Improve operational efficiency

Ø Improved response times

Ø Reduce adverse drug response times

Healthcare

VHA Inc. Confidential information.

Hadoop and Data Virtualization

11 |

VHA Inc. Confidential information.

Since 1977 – when 30 hospital CEOs established VHA as the nation’s first membership organization for acute care providers – the company has applied knowledge in analytics, contracting, consulting and network development to help members achieve their strategic objectives.

VHA is based in Irving, Texas, and has 11 regional offices. Our unique family of companies brings industry-leading innovation and expertise to help organizations thrive in a dynamic health care environment:

A legacy of innovation

"   As the first hospital membership organization, we were born of innovation.

"   We introduced the concept of supply networks, drawing on the power of collaboration to achieve greater cost savings for health organizations.

"   We pioneered comparative data analysis and exchange as well as the industry’s first committed contracting program and private label program, all of which continue to deliver exceptional value today.

Who is VHA?

In 2013, VHA delivered

$2.2 billion in savings and additional value

to members.

12 |

VHA Inc. Confidential information.

At UHC, collaboration drives success.

For nearly 30 years, UHC has been a catalyzing force in:

"   Supporting academic medical centers in their efforts

"   Fostering new ideas

"   Building solid relationships that withstand the test of time

Our members have been agents of progress—driving the advancement of patient care, medical knowledge, and fiscal acuity by coming together to candidly discuss their ideas and vision.

UHC continually expands and strengthens services to offer insights and solutions to members.

As the leader in providing relevant comparative data and as a single-source provider of information and insights that promote change, UHC has created UHC Intelligence™, a versatile suite of business tools that power performance improvement.

Who is UHC?

13 |

UHC offers Transparency.

VHA Inc. Confidential information.

UHC and VHA have a track record of partnering successfully

History of Collaboration

14 |

1998 – Current Novation was formed in 1998 and is a joint venture between UHC and VHA.

1998 – 2011 A supply chain improvement company focused on non-acute care market, Provista was formed in 1998 as a joint venture between UHC and VHA. VHA acquired UHC’s minority interest in 2011.

2013 – Current A subsidiary of Novation, aptitude is the health care industry's first online direct contracting marketplace.

UHC and VHA have formed the largest contracting services company Since forming Novation in 1998, UHC and VHA have worked in partnership to grow Novation to the nation’s largest contracting services company, representing more than $50 billion in purchasing volume and delivering more than $1 billion in contract price savings for UHC and VHA members and other affiliate organizations over the past five years. We have continued to expand on this successful partnership. Our advanced shared analytic capabilities and innovative cost management tools have helped purchasing through Novation to save an additional $1.4 billion, over the last four years.

VHA Inc. Confidential information.

UHC and VHA Have Complementary Strengths Which When Combined Create Enhanced Member Value

A Powerful Network

The Nation’s Leading Academic Medical Centers and Community

Health Care Providers

Foundation

Supply Chain

Management

Advisory Services

Comprehensive Data and Analytics

Core Capabilities

Targeted Solutions

Customized Insights

NewCo

15 |

VHA Inc. Confidential information.

Our new organization offers superior access to leading practices, networking and knowledge sharing for our members, which includes the majority of this country’s preeminent academic medical centers and community-based health systems.

The newly combined organization:

"   Serves more than 5,200 health system members and affiliates.

"   Provides services to nearly 30 percent of the nation’s hospitals, including virtually all the academic medical centers and health systems.

"   Serves more than 118,000 non-acute health care customers.

"   Includes more than $50 billion in purchasing volume, the largest in the industry.

"   Provides services to all of the top 10 hospitals on the US News and World Reports annual list of America’s Top Hospitals.

"   Delivers the industry’s most in-depth clinical data combined with the nation’s most robust supply chain data to address cost and quality.

VHA and UHC Are Now the Largest Member-owned Health Care Company

16 |

VHA Inc. Confidential information.

Context

17 |

VHA Inc. Confidential information.

Move from silos to streamlined data processing….

18 |

Data Internal External

Data

Logic

Apps

Internal External Internal External Internal External

Clinical Supply Academic

Process

Logic

Apps

Logic

Apps

Process Process

Data Data

Process

Logic

Clinical

Supply

Academic

Apps

Apps

Apps

Logic

Logic

VHA Inc. Confidential information.

Management Acquisition Delivery

Where we want to go…

19 |

Data Lake

Bus

ines

s A

cces

s La

yer

Data Warehouse

Discovery Zone

Systems of Record

Oracle

SQL

Other

Exadata

Data Access Layer

Applications

Reports

Dashboards DM

Queries

EDI

Future Capabilities

Data Gateway La

ndin

g Zo

ne

HCO

XYZ

Data Aggregators

Raw Data Useful Information

Data Owners, Technical Support Data Stewards, Data SME/Scientist, Analyst, Advisors

Advisors, Analysts, Members, Collaboratives

DM

DM

Data Owners, Technical Support Data Stewards, Data SME’s, Data Scientists, Analyst, Advisors, Data QA

Advisors, Analysts, Members, Collaboratives

Discovery Zone

Posts

VHA Inc. Confidential information.

Let’s get rolling…

20 |

VHA Inc. Confidential information.

Management Acquisition Delivery

Where we want to go…

21 |

Data Lake

Bus

ines

s A

cces

s La

yer

Data Warehouse

Discovery Zone

Systems of Record

Oracle

SQL

Other

Exadata

Data Access Layer

Applications

Reports

Dashboards DM

Queries

EDI

Future Capabilities

Data Gateway La

ndin

g Zo

ne

HCO

XYZ

Data Aggregators

Raw Data Useful Information

Data Owners, Technical Support Data Stewards, Data SME/Scientist, Analyst, Advisors

Advisors, Analysts, Members, Collaboratives

DM

DM

Data Owners, Technical Support Data Stewards, Data SME’s, Data Scientists, Analyst, Advisors, Data QA

Advisors, Analysts, Members, Collaboratives

Discovery Zone

Posts

VHA Inc. Confidential information.

Built on Hadoop

"   Hortonworks is the distribution

Business Need

"   Move to a modern data architecture –  Disparate data sources into a single data lake –  Flexibility of schema on read (not write) –  Ease of doing analysis on subsets of large data sets –  Capture all types of data (even data that might only have a future purpose) –  Lower cost to store large amounts of data

Data Lake

22 |

Data Lake

VHA Inc. Confidential information.

Area to discover value from data

Access roles:

"   Data scientists and SMEs

"   Product Managers

"   Analysts

"   Data Stewards

Challenge

"   Business users have been trained to use SQL or CSV exports

"   Introduction of Hadoop will require training on PIG and HIVE for access

"   Possibility of slowing down adoption and deriving value from new solution

Data Discovery

23 |

Discovery Zone

VHA Inc. Confidential information.

Utilize data virtualization

"   “Data virtualization is an umbrella term used to describe any approach to data management that allows an application to retrieve and manipulate data without requiring technical details about the data, such as how it is formatted or where it is physically located” – Margaret Rouse, TechTarget.com

Our solution…

24 |

Data Lake

Discovery Zone

VHA Inc. Confidential information.

Proven platform

"   Denodo is our DV platform

Successes in our company

" Salesforce reporting environment (cloud based plug-in)

"   Physician dashboard (disparate data sources)

Data Virtualization

25 |

VHA Inc. Confidential information.

Discovery Zone

"   Utilize Denodo HDFS, HBase and Map Reduce custom wrappers

"   Abstract data from lake –  Protects data source asset –  Enhanced security

"   Simplified access for data discovery users –  Can use SQL to query

"   Easy to augment discovery process –  Can pull in other sources of data to DV view (Excel, PDF, Websites)

Data Virtualization for Discovery Zone

26 |

VHA Inc. Confidential information.

Discovery zone presentation layer

Virtual data views

Data Lake

Systems of record

Architecture

27 |

VHA Inc. Confidential information.

Data Lake approach on Hadoop

"   Simplifies data management

"   reduces data costs

"   Scalable

"   Flexibility

Data Virtualization

"   Simplified data access

"   Less training for business users

"   Faster data discovery

"   Augmented discovery process (adding new sources)

Recommendation and Benefits

28 |

Data Virtualization

Ravi Shankar Chief Marketing Officer

© 2015 Denodo Technologies

What is Data Virtualization?

Data Virtualization combines disparate data sources into a single “virtual” data layer (aka information fabric abstraction) that provides unified access and integrated data services to consuming applications in real-time (right-time).

© 2015 Denodo Technologies

© 2015 Denodo Technologies

Data Virtualization Capabilities

Data Virtualization

Logical abstraction & decoupling

Data federation Real-time, hybrid, cache

Semantic integration & data quality- structured & unstructured

Agile data services provisioning

Unified data governance & security

© 2015 Denodo Technologies

Benefits of Data Virtualization

Better Quality Information § Focus on Business Information Needs §  Include Web / Cloud, Big Data, Unstructured, Streaming § Bigger volumes, richer/easier access to data

Lower Cost & Agility

§ Lower Integration Costs by 80% § Flexibility to Change § Real-time (on-demand) Data Services

Fast Time to Solution

§ Projects in 4-6 Weeks § ROI in <6 months § Adds New IT and Business Capabilities

© 2015 Denodo Technologies

Data Virtualization – Use Cases

Agile Business Intelligence

Big Data, Cloud Integration

Agile Single View Applications

Data Services

Data Virtualization

Access new data sources 60% faster with change requests met in just a few days with IT using 40% less analyst time to support.

Reduced back-office workload by more than 50%. Increased First Call Resolution

rate to over 90% and customer satisfaction to over 94%.

Improved asset performance and proactive maintenance. Increased revenue from sale of services and parts. Reduced warranty costs of parts failure.

Reduced time to create and provision data service from 180

hours to 8 hours.

© 2015 Denodo Technologies

About Denodo Description Denodo is the leader in data virtualization offering the broadest access to structured and

unstructured data exceeding the performance needs of data-intensive organizations for both analytical and operational use cases delivered in a much shorter timeframe than traditional data integration tools.

Headquarters

Palo Alto, CA. Offices in New York, London, Madrid, A Coruña, Chicago, Boise, Houston and Munich. Worldwide sales network through partners.

Leadership

Longest continuous focus on data virtualization and data services. Product leadership. Solutions expertise.

Customers

250+ customers worldwide, including many F500 and G2000 companies across key verticals, such as healthcare, life sciences, technology, media, telecommunications, insurance, financial services, consumer/retail, energy and public sector.

Page 36 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

Next Steps…

Download the Hortonworks Sandbox Learn Hadoop

Build Your Analytic App

Try Hadoop

Learn more about our partnerships and VHA

http://www.denodo.com

http://vha.com

Download Denodo Express for Free The fastest way to Data Virtualization