SAP Data Hub Defeat the Data Discord Let Your Data Live in ...€¦ · Defeat the Data Discord Let...

18
Defeat the Data Discord Let Your Data Live in Harmony SAP Data Hub Automation Scalability Quick Facts The Agile Business Processing Your Data Flows Data Refinement © 2018 SAP SE or an SAP affiliate company. All rights reserved.

Transcript of SAP Data Hub Defeat the Data Discord Let Your Data Live in ...€¦ · Defeat the Data Discord Let...

Defeat the Data Discord Let Your Data Live in Harmony

SAP Data Hub

AutomationScalability Quick FactsThe Agile Business

Processing Your Data Flows

Data Refinement

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

Providing visibility and access to a broad range of data systems and assets, the SAP Data Hub solution allows for quick and intuitive creation of powerful, organization-spanning data pipelines. SAP Data Hub optimizes data-pipeline execution speed with a distributed processing approach that allows the data to be processed at the source where possible (see Figure 1). SAP Data Hub meets the governance and security needs of your enterprise, helping to ensure that appropriate policy measures are in place to meet regulatory and corporate requirements.

SAP® Data Hub establishes a new category of software solutions, allowing agile data operations management in a diverse landscape across the organization. This enterprise-ready solution enables governance and orchestration for data refinement and enrichment, using pipelining of many complex data processing operations such as machine learning.

2 / 17

Data Refinement, Enrichment, and Orchestration

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

SAP Data Hub allows for quick and intuitive creation of powerful, organization-spanning data pipelines.

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

3 / 17

Figure 1: Enablement of Data-Driven Applications Across Your Landscape

Enterprise apps and BI tools

SAP HANA® (on premise, cloud, and multicloud)

Sources and systems (SAP, non-SAP,

on premise, and cloud)

SAP® Data Hub

Self-services Metadata management Pipelining

Refinement Orchestration ML and PA

SAP tools for EIM

Big Data services

from SAP

Third-party data lakes

Cloud object stores

ML – Machine learning; PA – Predictive analytics; BI – Business intelligence; EIM – Enterprise information management

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

By integrating data and establishing data-driven pro-cesses across an increasingly diverse data landscape, SAP Data Hub provides a comprehensive answer to an emerging challenge for enterprise customers. The solution enables users to accomplish the following:

• Respond quickly to opportunities and threats through a data operations management console that gives data architects, engineers, and scientists the breadth of visibility and control they need

• Minimize the effort for managing data across the entire landscape and interact from one end to the next while combining Big Data and enterprise data for actionable insights

• Identify opportunities to connect data systems, information to spot new sources, and patterns of value – and resolve data problems

4 / 17

Processing Your Landscape-Wide Data Flows

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

• Accelerate or repair the flow of data through the landscape

• Improve the effectiveness of data results by quickly resolving data quality issues or pipeline friction points

• Significantly reduce the cost and effort of data governance, and manage your metadata to increase automation

• Leverage preexisting and configurable adapters to connect to a variety of systems

• Save time and reduce rework efforts by understanding the impact of data changes before they happen

• Benefit from the scalability and straightforward development that comes with container technologies, as all components will be executed on Kubernetes

5 / 17

With SAP Data Hub, you gain a holistic view of your data landscape, no matter where the data lives, without physically centralizing data but by centralizing the orchestration instead.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

technical access points to a system through an agent of SAP Data Hub. You get smooth data connectivity to various data lakes and storage, such as Apache Hadoop, the SAP Business Warehouse (SAP BW) application, and SAP HANA software, as well as Google Cloud Platform (Google Cloud Storage), Amazon S3 Web service (S3), Azure Data Lake (ADL), and Windows Azure Storage Blob (WASB).

By increasing the depth and breadth of visibility and control across your diverse data landscape, you can act quickly to seize opportunities and respond to threats to data security. And, having a full landscape view of your data quality enables you to make better decisions. LANDSCAPE MANAGEMENTImagine a single tool in which you can centrally manage connectivity of distributed data using a visual and intuitive user interface. With SAP Data Hub, you can manage the software systems and connections in your landscape. This highly detailed data management is also essential when defining different security concepts for addressing various data sensitivities in a diverse landscape. Systems are stand-alone data sources in the distributed data landscape, and connections are

6 / 17

Scalability Across Big Data Stores and Enterprise Systems

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

With SAP Data Hub, you can manage the software systems and connections in your landscape.

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

LAUNCHPADWith the launchpad for SAP Data Hub, you can quickly access tools that can be applied to comprehensive scenarios. You can also embed related tools or create custom links to frequently used tools and pages.

7 / 17

Get smooth data connectivity of various Big Data storage systems, enterprise systems, and business applications.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

a user to see only the connections in the specified URI path. With this feature, it is also possible to make sure that policies do not contain any unused resources or users and to run simulations to test policy decisions.

POLICY AND SECURITY MANAGEMENTIn SAP Data Hub, policy and security management allows for creation, communication, and maintenance of policies and procedures within your enterprise and, therefore, mit-igation of risks. Use this feature for establishing security settings and policies for processes, for modeling objects in SAP Data Hub, for identity control (users, groups, and roles), and for security logging. Administrators have the ability to evaluate a policy directly from the cockpit or via the “policy management” tab, edit an existing resource for a policy, escape the keyword in a filter, and browse existing entities while creating a resource type. Administrators can optionally add a Uniform Resource Identifier (URI) to their connection type resources to limit user ability, allowing

8 / 17

Mitigate risks through creation, communication, and maintenance of policies and procedures within your enterprise.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

9 / 17

pipelines to refine, augment, or enrich data at the source (see Figure 2). Create a comprehensive data landscape by working across diverse data sources and applications with governance.

SIMPLER DATA OPERATIONS THROUGH SEMANTIC REFINEMENT AND ENRICHMENTAfter a straightforward data preparation and profiling process, you can readily create complex, multistep data

Figure 2: Enriching Data at the Source and Automating the Data Pipeline

Data sources Data consumption

Ingest and share

Transform and refine

Enrich and compute

Integrate and orchestrate

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

Governance | Monitoring | Automation

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

10 / 17

When you want to output sensitive information while maintaining privacy for an individual or an organization, the data-mask anonymization option is invaluable. With SAP Data Hub, data discovery and preparation are intuitive and scalable across your enterprise.

DATA DISCOVERY AND PREPARATIONSAP Data Hub provides an innovative approach to self- service data preparation and also takes advantage of integration through the SAP Agile Data Preparation appli-cation. Learn more about your data by accessing, profiling, transforming, enriching, and viewing it as you navigate a connected system. With its intuitive interface, SAP Data Hub combines interactive exploration and the ability to graphically profile your data. With the “data discover” feature, you can gain insights with every click. Every trans-formation step defined by the user is highly structured. Typical transformations are “projection (map and filter),” “data mask,” and “pivot,” as well as flow-control transfor-mations such as “aggregation,” “join,” “union,” and “case.”

Learn more about your data by accessing, profiling, transforming, enriching, and viewing it as you navigate a connected system.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

11 / 17

METADATA MANAGEMENT AND CATALOGINGThe metadata catalog within SAP Data Hub enables you to define, govern, and manage your metadata assets across enterprise systems with disparate sources (see Figure 3). It provides full insight into the systems that the data went through. And it enables anyone in your organization to discover, understand, and consume information about the data with the ability to synchronize and share it, even those who aren’t tech savvy. Gain business value in seeing where and how various data fits together by interpreting information about data quality and structures. With the metadata catalog, different data values, attributes, and objects, such as in SAP HANA and SAP BW, can get semantically translated into proper schemas.

Figure 3: Manage Metadata Assets Across Disparate Systems

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

12 / 17

Automate by Applying Complex Data-Processing Operationsmachine-learning or image-processing functions. The canvas tool for creating unified data workflows lets you orchestrate and execute the data pipelines in a given order graphically via drag-and-drop functionality (see Figure 4). You can do this for several data workflows that are created and executed through SAP Data Hub.

By building information pipelines throughout and beyond the enterprise, you can work across silos with enhanced data agility. Rapidly create data pipelines and take action through scheduling their execution as part of powerful data workflows. Experience outstanding speed by taking advantage of distributed processing across the data landscape – with SAP Data Hub.

MODELER (DATA PIPELINES AND WORKFLOWS)Process data or reuse existing code and libraries through data pipelines consisting of several predefined and cus-tomizable operations. You can access connectors to messaging systems, to databases, and to systems that store and read data. Use process operators to execute any code, or operators for type conversion, and add

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

Process data or reuse existing code and libraries through data pipelines consisting of several predefined and customizable operations.

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

13 / 17

The modeler provides hundreds of predefined operators in the following categories:

• Connectors to messaging systems (Apache Kafka, MQTT, NATS, and WAMP)

• Connectors to store and read data (File, HDFS, S3, and NFS)

• Operators for RESTful clients and services (such as APIs and app services)

• Connectors to databases (SAP HANA and the SAP Vora™ engine)

• Operators for data processing (JavaScript, Python, and Go)

• Operators for process execution (stateful and stateless) • Operators for Apache Spark and R • Operators for machine learning (Tensorflow and machine learning framework)

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

• Operators for image processing (OpenCV) • Operators for digital signal processing • Operators for type conversion

Figure 4: Graphically Display Your Data Pipelines

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

14 / 17

the unified modeling view of SAP Data Hub. You can also suspend or resume them if necessary. Finally, by sched-uling data pipelines in large cluster environments, you can handle batch-driven jobs and streaming jobs in a single environment.

Monitoring and SchedulingSchedule the execution of data pipelines in the dedicated scheduling area. Through the monitoring dashboard, you can keep track of the status of the data workflows and pipelines that you have scheduled for execution within

Keep track of the status of the data workflows and pipelines that you have scheduled for execution.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

15 / 17

Becoming an Agile, Data-Driven Businesscomprehensive data orchestration, and the performance of complex data scenarios with distributed processing. Make better decisions by leveraging a full landscape view and sophisticated data quality. Be able to react quickly to stay ahead of the competition. And, all the while, maintain compliance with security policy dynamically.

By accelerating and scaling your data projects, you can magnify the ways that data benefits your business – gaining new insights, synchronizing across systems, and empowering more users. Establish data streams and create powerful data pipelines to build agile, data-driven applications and processes. Centralized enterprise visi-bility and governance across the data landscape enables you to achieve a systemwide view of data sources, end points, and quality, as well as desired service-level agreements. Provide information that is more valuable and more useful by enriching data flows beyond simple data movement. Improve the success of all your data projects through proper metadata management and governance,

Achieve a system-wide view of data sources, end points, and quality, as well as desired service-level agreements.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

16 / 17

Quick FactsOBJECTIVES

• Manage increasingly complex information and tech-nology landscapes

• Work with data from a variety of sources, including Big Data

• Connect information and data processes and systems

SOLUTION • Unified data-operations management, orchestration, pipelining, and governance

• Monitoring and management functionality to control and distribute data

• Integration of data and processes for on-premise, cloud, and hybrid environments

• Configurable adapters for a variety of systems • Holistic metadata-management capabilities

SUMMARY The SAP® Data Hub solution enables sophisticated and agile data operations management in a diverse land-scape across the organization. It gives you the capability and flexibility to connect all your data, including enter-prise and Big Data, and gain a deeper understanding of data and information processes across various sources. The unified and enterprise-ready solution provides visi-bility and control of data governance and orchestration for data refinement and enrichment, using pipelining of many complex data-processing operations such as machine learning. The distributed processing power enables greater speed and efficiency. SAP Data Hub is also taking advantage of the agility and flexibility that comes with container technologies, using Kubernetes as its primary deployment infrastructure.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

17 / 17

BENEFITS • Increased control and visibility of all your data within one solution

• Better understanding of data from a diverse landscape and valuable business insights

• Improved data security, compliance, and governance • Greater responsiveness to business opportunities and potential data issues

• Lower cost and effort in managing data and metadata

LEARN MORE To find out more, call your SAP representative today or visit us online.

Data Refinement

Processing Your Data Flows

Scalability Automation The Agile Business

Quick Facts

© 2

018

SAP

SE o

r an

SAP

affilia

te c

ompa

ny. A

ll rig

hts

rese

rved

.

© 2018 SAP SE or an SAP affi liate company. All rights reserved.

No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP SE or an SAP affi liate company.

The information contained herein may be changed without prior notice. Some software products marketed by SAP SE and its distributors contain proprietary software components of other software vendors. National product specifi cations may vary.

These materials are provided by SAP SE or an SAP affi liate company for informational purposes only, without representation or warranty of any kind, and SAP or its affi liated companies shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP or SAP affi liate company products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.

In particular, SAP SE or its affi liated companies have no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation, and SAP SE’s or its affi liated companies’ strategy and possible future developments, products, and/or platforms, directions, and functionality are all subject to change and may be changed by SAP SE or its affi liated companies at any time for any reason without notice. The information in this document is not a commitment, promise, or legal obligation to deliver any material, code, or functionality. All forward-looking statements are subject to various risks and uncertainties that could cause actual results to diff er materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, and they should not be relied upon in making purchasing decisions.

SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP SE (or an SAP affi liate company) in Germany and other countries. All other product and service names mentioned are the trademarks of their respective companies.

See https://www.sap.com/copyright for additional trademark information and notices.

Studio SAP | 52802enUS (18/05)

www.sap.com/contactsap

Follow us