Hadoop and Data Virtualization - A Case Study by VHA
Page 1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop and Data Virtualization: A Case Study by VHA
with Denodo, Hortonworks and VHA
Speakers
Richard Proctor, GM Healthcare, Hortonworks
Ravi Shankar, CMO, Denodo
Ben Blakeney, Architecture & Engineering Services, VHA
Shifting the Data Paradigm
Current data process with latent architecture
• Reactive reporting
• 3 – 6 month delay
• Regulatory-centric
• Manual data review/collection
• Repressive data silos
• Not sure what data to store
• Expensive storage w/limited options
• Data as an independent business process (silos of data)
• Primary audience – regulatory agencies
• Secondary audience
Modern data architecture with Hadoop
• All data types (structured, semi-structured, & unstructured)
• Near real-time and predictive
• Organization & patient centric
• Store everything
• Inexpensive storage w/lots of options
• Data as a byproduct of patient care
• Prospective analysis
• Primary audience – healthcare organization
• Secondary audience
The problem: Current data architecture under pressure
APPLICATIONS
• Business Analytics
• Custom Applications
• Packaged Applications
DATA SYSTEM
• Repositories: RDBMS, EDW, MPP
SOURCES
• Existing Sources: ADT, Patient Accounting, GL, Payroll, Physician entry, Core measures, Patient Sat, AHRQ, External benchmarks, Clinical Systems (ED, Radiology, PACS), Other Sources (Clinics, home health, Ambulatory, LTACs)
• OLTP, ERP, CRM Systems
• Unstructured documents, emails
• Clickstream
• Server logs
• Sentiment, Web Data
• Sensor, Machine Data
• Geolocation
Value in new data sources
Pressures on the current architecture
• Limited application interaction
• Costly to scale storage
• Silos of data
• Inability to manage new data sources
• Schema on write vs. read
The “Empowered” Patient
My Parents
• Relationship focused (values stable doctor relationships)
• Detests/suspicious of gadgets
• Serial processor (email, basic model phone)
• Loyalty to brand
• Health as a service
• Access to care
• Patient (willing to wait)
• Believes experts (not comfortable seeking second opinions)
My Children
• Goes for best of breed (network of networks)
• Super connected
• Parallel processor, integrated (state-of-the-art)
• Loyalty to value
• Take care of health
• Health as a right
• Impatient
• Asks for data (researches through multiple self-enabled searches, demands second opinions)
Thank you for the diagram, Robert Wood Johnson Foundation, 2014
Comprehensive Health Management
80% of healthcare determinants lie outside the US healthcare delivery system.
Can healthcare systems expand into these other areas, and become true public health systems?
Hortonworks Company Profile
• ONLY 100% open source Apache Hadoop data platform
• Founded in 2011
• Original 24 architects, developers, operators of Hadoop from Yahoo!
• 1st Hadoop provider to go public: IPO Fall 2014 (NASDAQ: HDP)
• 556 subscription customers
• 740+ employees across 17 countries
• 1350+ technology partners
Two great use cases to realize dramatic cost savings…
EDW Optimization
• EDW workload today: OPERATIONS 50%, ANALYTICS 20%, ETL PROCESS 30%
• EDW workload augmented with Hadoop: OPERATIONS 50%, ANALYTICS 50%
Current Reality
• EDW at capacity: some usage from low value workloads
• Older data archived, unavailable for ongoing exploration
• Source data often discarded
Augment w/ Hadoop
• Free up EDW resources from low value tasks
• Keep 100% of source data and historical data for ongoing exploration
• Mine data for value after loading it because of schema-on-read
Commodity Compute & Storage: Hadoop Enables Scalable Compute & Storage at a Compelling Cost Structure
[Chart: Fully-loaded Cost Per Raw TB of Data (Min–Max Cost), axis from $0 to $180,000, comparing MPP, SAN, Engineered System, NAS, Hadoop and Cloud Storage]
Hadoop: Parse, Cleanse, Apply Structure, Transform
Storage cost and licensing reduction from latent systems: $500,000
“5 times the amount of usable storage, plus processing power, for about 30% of the cost of traditional enterprise technologies.”
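Taken at face value, the quoted figures imply a cost per usable terabyte. The short sketch below works through that arithmetic. It assumes the traditional platform is normalized to a cost of 1.0 for 1.0 unit of usable storage and ignores the added processing power; the function name is illustrative and not from the presentation.

```python
def relative_cost_per_usable_tb(storage_multiple: float, cost_fraction: float) -> float:
    """Cost per usable TB relative to a traditional baseline of 1.0.

    storage_multiple: usable storage relative to the baseline (5.0 in the quote).
    cost_fraction: total cost relative to the baseline (0.30 in the quote).
    """
    if storage_multiple <= 0:
        raise ValueError("storage_multiple must be positive")
    return cost_fraction / storage_multiple


# Figures from the quote: 5x the usable storage for about 30% of the cost.
ratio = relative_cost_per_usable_tb(storage_multiple=5.0, cost_fraction=0.30)
print(f"Relative cost per usable TB: {ratio:.2f}")
```

Under these assumptions the cost per usable terabyte is roughly 6% of the traditional baseline.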
Monitor Patient Vitals in Real-Time with Sensor data
Problem: Managing the Volumes of System Sensor Data
• In a typical hospital setting, nurses do rounds and manually monitor patient vital signs. They may visit each bed every few hours to measure and record vital signs, but the patient’s condition may decline between scheduled visits.
• This means that caregivers often respond to problems reactively, in situations where arriving earlier might have made a huge difference in the patient’s wellbeing.
Solution: Hadoop Empowers Healthcare by Converting High Volumes of Sensor Data into a Manageable Set of Data
• New wireless sensors can capture and transmit patient vitals at much higher frequencies, and these measurements can stream into a Hadoop cluster.
• Caregivers can use these signals for real-time alerts to respond more promptly to unexpected changes.
• Over time, this data can go into algorithms that proactively predict the likelihood of an emergency even before it could be detected with a bedside visit.
Benefits
• Proactively predict events rather than react to them
• Real-time alerts
• Capture & transmit patient vitals at much higher frequencies
• Improve patient satisfaction
• Improve operational efficiency
• Improved response times
• Reduce adverse drug response times
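The real-time alerting idea above can be illustrated with a minimal sketch. It is not the system described in the presentation: the record layout, the metric names and the threshold values are all hypothetical and are not clinical guidance. In a real deployment the readings would arrive from a streaming pipeline rather than an in-memory list.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class VitalReading:
    patient_id: str
    metric: str   # hypothetical metric name, e.g. "heart_rate"
    value: float


# Hypothetical illustrative limits (low, high); NOT clinical guidance.
LIMITS = {
    "heart_rate": (40.0, 130.0),
    "spo2": (90.0, 100.0),
}


def alerts(readings: Iterable[VitalReading]) -> Iterator[str]:
    """Yield an alert message for every reading outside its configured range."""
    for r in readings:
        bounds = LIMITS.get(r.metric)
        if bounds is None:
            continue  # no rule configured for this metric
        low, high = bounds
        if not (low <= r.value <= high):
            yield f"ALERT patient={r.patient_id} {r.metric}={r.value}"


# A small in-memory stand-in for a sensor stream.
stream = [
    VitalReading("p1", "heart_rate", 72.0),
    VitalReading("p1", "spo2", 86.0),
    VitalReading("p2", "heart_rate", 141.0),
]
triggered = list(alerts(stream))
for message in triggered:
    print(message)
```

Because `alerts` is a generator, it evaluates each reading as it arrives instead of waiting for a batch, which is the property that distinguishes this from scheduled bedside rounds.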
Healthcare
VHA Inc. Confidential information.
Who is VHA?
Since 1977, when 30 hospital CEOs established VHA as the nation’s first membership organization for acute care providers, the company has applied knowledge in analytics, contracting, consulting and network development to help members achieve their strategic objectives.
VHA is based in Irving, Texas, and has 11 regional offices. Our unique family of companies brings industry-leading innovation and expertise to help organizations thrive in a dynamic health care environment.
A legacy of innovation
• As the first hospital membership organization, we were born of innovation.
• We introduced the concept of supply networks, drawing on the power of collaboration to achieve greater cost savings for health organizations.
• We pioneered comparative data analysis and exchange as well as the industry’s first committed contracting program and private label program, all of which continue to deliver exceptional value today.
In 2013, VHA delivered $2.2 billion in savings and additional value to members.
Who is UHC?
At UHC, collaboration drives success.
For nearly 30 years, UHC has been a catalyzing force in:
• Supporting academic medical centers in their efforts
• Fostering new ideas
• Building solid relationships that withstand the test of time
Our members have been agents of progress, driving the advancement of patient care, medical knowledge, and fiscal acuity by coming together to candidly discuss their ideas and vision.
UHC continually expands and strengthens services to offer insights and solutions to members.
As the leader in providing relevant comparative data and as a single-source provider of information and insights that promote change, UHC has created UHC Intelligence™, a versatile suite of business tools that power performance improvement.
UHC offers transparency.
History of Collaboration
UHC and VHA have a track record of partnering successfully
• 1998 – Current: Novation was formed in 1998 and is a joint venture between UHC and VHA.
• 1998 – 2011: Provista, a supply chain improvement company focused on the non-acute care market, was formed in 1998 as a joint venture between UHC and VHA. VHA acquired UHC’s minority interest in 2011.
• 2013 – Current: A subsidiary of Novation, aptitude is the health care industry's first online direct contracting marketplace.
UHC and VHA have formed the largest contracting services company
Since forming Novation in 1998, UHC and VHA have worked in partnership to grow Novation into the nation’s largest contracting services company, representing more than $50 billion in purchasing volume and delivering more than $1 billion in contract price savings for UHC and VHA members and other affiliate organizations over the past five years. We have continued to expand on this successful partnership. Our advanced shared analytic capabilities and innovative cost management tools have helped those purchasing through Novation save an additional $1.4 billion over the last four years.
UHC and VHA Have Complementary Strengths Which, When Combined, Create Enhanced Member Value
NewCo
• Foundation: A Powerful Network (The Nation’s Leading Academic Medical Centers and Community Health Care Providers)
• Core Capabilities: Supply Chain Management, Advisory Services, Comprehensive Data and Analytics
• Targeted Solutions
• Customized Insights
VHA and UHC Are Now the Largest Member-owned Health Care Company
Our new organization offers superior access to leading practices, networking and knowledge sharing for our members, which include the majority of this country’s preeminent academic medical centers and community-based health systems.
The newly combined organization:
• Serves more than 5,200 health system members and affiliates.
• Provides services to nearly 30 percent of the nation’s hospitals, including virtually all the academic medical centers and health systems.
• Serves more than 118,000 non-acute health care customers.
• Includes more than $50 billion in purchasing volume, the largest in the industry.
• Provides services to all of the top 10 hospitals on the U.S. News & World Report annual list of America’s Top Hospitals.
• Delivers the industry’s most in-depth clinical data combined with the nation’s most robust supply chain data to address cost and quality.
Move from silos to streamlined data processing….
[Diagram: today, the Clinical, Supply and Academic areas each maintain their own stack of Data (internal and external), Process, Logic and Apps. In the target state, a single shared Data layer (internal and external) and a single shared Process layer feed separate Logic and Apps for Clinical, Supply and Academic.]
Where we want to go…
[Architecture diagram: data moves from Raw Data to Useful Information across three stages.]
• Stages: Acquisition, Management, Delivery
• Sources: HCO, XYZ, Data Aggregators
• Components: Data Gateway, Landing Zone, Data Lake, Data Warehouse, Discovery Zone, Systems of Record (Oracle, SQL, Exadata, Other), Data Access Layer, Business Access Layer, DM
• Delivery outputs: Applications, Reports, Dashboards, Queries, EDI, Posts, Future Capabilities
• Roles on the raw data side: Data Owners, Technical Support, Data Stewards, Data SMEs, Data Scientists, Analysts, Advisors, Data QA
• Roles on the useful information side: Advisors, Analysts, Members, Collaboratives
Data Lake
Built on Hadoop
• Hortonworks is the distribution
Business Need
• Move to a modern data architecture
  – Disparate data sources into a single data lake
  – Flexibility of schema on read (not write)
  – Ease of doing analysis on subsets of large data sets
  – Capture all types of data (even data that might only have a future purpose)
  – Lower cost to store large amounts of data
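Schema on read, one of the business needs listed for the data lake, can be illustrated with a minimal sketch. The raw records are stored exactly as they arrive, and each consumer decides at query time which fields to project. The record contents, field names and the `read_with_schema` helper are hypothetical; in a Hadoop data lake the raw lines would live in HDFS rather than in a Python list.

```python
import json

# Raw events kept as received; no table schema was imposed at write time.
RAW_EVENTS = [
    '{"patient_id": "p1", "type": "admit", "unit": "ED"}',
    '{"patient_id": "p2", "type": "vital", "heart_rate": 88}',
    '{"patient_id": "p1", "type": "vital", "heart_rate": 120, "device": "w-7"}',
]


def read_with_schema(raw_lines, fields, where=lambda rec: True):
    """Parse raw records at read time and project only the requested fields.

    Fields missing from a record come back as None instead of failing,
    so new or irregular attributes never block ingestion.
    """
    for line in raw_lines:
        rec = json.loads(line)
        if where(rec):
            yield {name: rec.get(name) for name in fields}


# One consumer's view of the data: vitals only, two columns.
vitals = list(
    read_with_schema(
        RAW_EVENTS,
        fields=["patient_id", "heart_rate"],
        where=lambda rec: rec["type"] == "vital",
    )
)
print(vitals)
```

A different consumer can read the same raw lines with a different field list, for example `["patient_id", "unit"]` for admissions, without any reload. This is what makes it practical to capture data that might only have a future purpose.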
Data Discovery
Discovery Zone: area to discover value from data
Access roles:
• Data scientists and SMEs
• Product Managers
• Analysts
• Data Stewards
Challenge
• Business users have been trained to use SQL or CSV exports
• Introduction of Hadoop will require training on Pig and Hive for access
• Possibility of slowing down adoption and deriving value from the new solution
Our solution…
Utilize data virtualization
• “Data virtualization is an umbrella term used to describe any approach to data management that allows an application to retrieve and manipulate data without requiring technical details about the data, such as how it is formatted or where it is physically located” – Margaret Rouse, TechTarget.com
[Diagram labels: Data Lake, Discovery Zone]
Data Virtualization
Proven platform
• Denodo is our DV platform
Successes in our company
• Salesforce reporting environment (cloud based plug-in)
• Physician dashboard (disparate data sources)
Data Virtualization for Discovery Zone
Discovery Zone
• Utilize Denodo HDFS, HBase and MapReduce custom wrappers
• Abstract data from lake
  – Protects data source asset
  – Enhanced security
• Simplified access for data discovery users
  – Can use SQL to query
• Easy to augment discovery process
  – Can pull in other sources of data to DV view (Excel, PDF, websites)
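The idea of a virtual view that lets discovery users stay in SQL can be sketched with Python's built-in `sqlite3` module. This is an analogy only: it does not use Denodo or its wrappers, and the table names, columns and values are hypothetical. One table stands in for data abstracted from the lake, the other for a spreadsheet pulled in to augment discovery, and the view is the only object the business user queries.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Stand-in for data abstracted from the data lake (hypothetical).
    CREATE TABLE lake_encounters (member_id TEXT, cost REAL);
    -- Stand-in for an augmenting source such as a spreadsheet (hypothetical).
    CREATE TABLE sheet_members (member_id TEXT, region TEXT);

    INSERT INTO lake_encounters VALUES ('m1', 120.0), ('m1', 80.0), ('m2', 50.0);
    INSERT INTO sheet_members VALUES ('m1', 'South'), ('m2', 'West');

    -- The virtual view: consumers see one logical table and never
    -- need to know where or how the underlying data is stored.
    CREATE VIEW v_cost_by_region AS
        SELECT s.region AS region, SUM(e.cost) AS total_cost
        FROM lake_encounters AS e
        JOIN sheet_members AS s USING (member_id)
        GROUP BY s.region;
    """
)

# A discovery user queries the view with plain SQL.
rows = conn.execute(
    "SELECT region, total_cost FROM v_cost_by_region ORDER BY region"
).fetchall()
print(rows)
conn.close()
```

The design point carried over from the slide is the separation of concerns: the underlying sources can change or gain new companions without the consumer's query changing, as long as the view keeps its columns.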
Architecture
[Diagram layers: Discovery zone presentation layer, Virtual data views, Data Lake, Systems of record]
Recommendation and Benefits
Data Lake approach on Hadoop
• Simplifies data management
• Reduces data costs
• Scalable
• Flexible
Data Virtualization
• Simplified data access
• Less training for business users
• Faster data discovery
• Augmented discovery process (adding new sources)
© 2015 Denodo Technologies
What is Data Virtualization?
Data Virtualization combines disparate data sources into a single “virtual” data layer (aka information fabric abstraction) that provides unified access and integrated data services to consuming applications in real-time (right-time).
Data Virtualization Capabilities
• Logical abstraction & decoupling
• Data federation: real-time, hybrid, cache
• Semantic integration & data quality: structured & unstructured
• Agile data services provisioning
• Unified data governance & security
Benefits of Data Virtualization
Better Quality Information
• Focus on business information needs
• Include web / cloud, big data, unstructured, streaming
• Bigger volumes, richer/easier access to data
Lower Cost & Agility
• Lower integration costs by 80%
• Flexibility to change
• Real-time (on-demand) data services
Fast Time to Solution
• Projects in 4-6 weeks
• ROI in <6 months
• Adds new IT and business capabilities
Data Virtualization – Use Cases
• Agile Business Intelligence
• Big Data, Cloud Integration
• Agile Single View Applications
• Data Services
Reported customer results
• Access to new data sources 60% faster, with change requests met in just a few days and IT using 40% less analyst time for support.
• Reduced back-office workload by more than 50%. Increased First Call Resolution rate to over 90% and customer satisfaction to over 94%.
• Improved asset performance and proactive maintenance. Increased revenue from sale of services and parts. Reduced warranty costs of parts failure.
• Reduced time to create and provision a data service from 180 hours to 8 hours.
About Denodo
Description
Denodo is the leader in data virtualization, offering the broadest access to structured and unstructured data, exceeding the performance needs of data-intensive organizations for both analytical and operational use cases, delivered in a much shorter timeframe than traditional data integration tools.
Headquarters
Palo Alto, CA. Offices in New York, London, Madrid, A Coruña, Chicago, Boise, Houston and Munich. Worldwide sales network through partners.
Leadership
Longest continuous focus on data virtualization and data services. Product leadership. Solutions expertise.
Customers
250+ customers worldwide, including many F500 and G2000 companies across key verticals, such as healthcare, life sciences, technology, media, telecommunications, insurance, financial services, consumer/retail, energy and public sector.
Next Steps…
• Download the Hortonworks Sandbox
  – Learn Hadoop
  – Build your analytic app
  – Try Hadoop
• Download Denodo Express for free: the fastest way to data virtualization
• Learn more about our partnerships and VHA
  – http://www.denodo.com
  – http://vha.com