SAP HANA Platform - SUSE Linux · Store text and binary files in SAP HANA for native text analysis...

25
SAP HANA Platform The platform for all applications SAP HANA Platform / November 2016

Transcript of SAP HANA Platform - SUSE Linux · Store text and binary files in SAP HANA for native text analysis...

SAP HANA Platform The platform for all applications

SAP HANA Platform / November 2016

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 2

Disclaimer

This presentation outlines our general product direction and should not be relied on in making a

purchase decision. This presentation is not subject to your license agreement or any other agreement

with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to

develop or release any functionality mentioned in this presentation. This presentation and SAP's

strategy and possible future developments are subject to change and may be changed by SAP at any

time for any reason without notice. This document is provided without a warranty of any kind, either

express or implied, including but not limited to, the implied warranties of merchantability, fitness for a

particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this

document, except if such damages were caused by SAP intentionally or grossly negligent.

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 3

SAP HANA Platform is easy to adoptStandard-based and open

Application Services

Choice of application servers

and webservers

Eclipse-based and web

development tool

Include web application server

with Java Script, Java, Node.JS,

C++ runtime support

Support git, github, maven tools

Include HTML5 UI libraries

S A P H A N A P L A T F O R M

APPLICATION SERVICES INTEGRATION & QUALITY SERVICESPROCESSING SERVICES

DATABASE SERVICES

Processing Services

Execute advanced data

processing using SQL

Spatial processing follows

OGC standards, ISO SQL/MM,

GeoJSON

Built-in predictive libraries and

supports R

Database Services

Standard RDBMS

ACID, SQL 92 Compliant

Accessible thru JDBC,

ODBC, JSON, OData

Standard security model

Choice of third-party

administration tools

Integration & Quality

Services

Data movement and

federation with existing DBs

Framework to build custom

adaptors

Integration with Spark and

Hadoop

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 4

Choice of application architectureLeverage existing assets and skills

SAP S/4HANA

SAP NetWeaver

Application Server - ABAP

Core Data Services (CDS)

S A P H A N A P L A T F O R M

Database Services

Integration Services

Custom Applications

Application Server(J2EE, .NET)

Native Custom Applications

ABAP developers use CDS and Open SQL to leverage SAP HANA without coding SAP HANA objects

Custom application developers choose any application server and any database interface

SAP HANA native application developers use SAP HANA application services inside the platform

Processing Services

Database Services

Integration Services

Processing Services

Database Services

Integration Services

Processing Services

Application Services

S A P H A N A P L A T F O R M S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 5

SAP HANA Platform: The platform for all applicationsSimplify, accelerate, innovate

DATABASE SERVICES

Web Server JavaScript

Graphic Modeler

Data Virtualization ELT & Replication

Columnar OLTP+OLAP

Multi-Core & Parallelization

Advanced Compression

Multi-tenancy Multi-Tier Storage

Graph* Predictive Search

DataQuality

SeriesData

Business Functions

Hadoop & Spark Integration

Streaming Analytics

Application Lifecycle Management

High Availability &Disaster Recovery

OpennessDataModeling

Admin &Security

Remote Data Sync

Spatial

Text Analytics

Fiori UX

ALM

</>

APPLICATION SERVICES INTEGRATION & QUALITY SERVICESPROCESSING SERVICES

SAP, ISV and Custom Applications

All Devices

OLTP + OLAP ONE Open Platform ONE Copy of the Data* Graph is in controlled availability

S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 6

New! SAP HANA Desktop Edition For Developers*Smaller Footprint – run SAP HANA on a laptop.

Downloadable Virtual Machine Image for SAP HANA.

• Pre-configured SAP HANA – free to download and use for development purposes.

• No need for a certified appliance – can run on a laptop.

• Limitations – 32GB RAM only.

• Community Support via SCN.

• Early Adopter version to be launched at Sapphire.

Product Capabilities

DATABASE SERVICES

Web Server JavaScript

Graphic Modeler

Data Virtualization ELT & Replication

Columnar OLTP+OLAP

Multi-Core & Parallelization

Advanced Compression

Multi-tenancy Multi-Tier Storage

Graph* Predictive Search

DataQuality

SeriesData

Business Functions

Hadoop & Spark Integration

Streaming Analytics

Application Lifecycle Management

High Availability &Disaster Recovery

OpennessDataModeling

Admin &Security

Remote Data Sync

Spatial

Text Analytics

Fiori UX

ALM

</>

APPLICATION SERVICES INTEGRATION & QUALITY SERVICESPROCESSING SERVICES

OLTP + OLAP ONE Open Platform ONE Copy of the Data

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 7

Choice of SAP HANA High Availability (HA) and Disaster Recovery (DR) optionsEnsuring the most demanding service-levels

Host Auto-Failover (HA) Within one scale-out system

N active nodes, M standby node(s)

Automatically switch to standby node

System Replication (HA & DR) Across multiple systems/locations

Continuous data transfer from memory

Fast switch-over on system failure

Storage

SAP HANA(Primary)

Node

Storage

SAP HANA(Secondary)

Node

Storage Replication (DR) Across multiple systems/locations

Transfer data using storage mirroring

Low cost option

Supports campus, metro, and geo clusters with multiple standbys

SAP HANA

Node 1 Node 2 Standby

Storage

SAP HANA(Primary)

Node

Storage

SAP HANA(Secondary)

Node

Secondary system can be used for Dev/QA

Geo Clusters

Metro Cluster

Sync

AsyncCampus

Cluster

Async

Storage

HANA in detail

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 9

In-memory columnar storeFaster OLTP + OLAP processing on single copy of data

ACID compliant

High speed transactions support

Aggregations on fly

No indexes for fast access

Process compressed data

Optimized for multi-core parallel processing

Single Instruction, Multiple Data (SIMD)

processing support

NUMA optimization to enable future support for very

large (12TB+) nodes (CPU/Memory)

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 10

Parallel processing and SIMD Faster data processing

Each processor can scan a column

or portion of a column in parallel

Core 1 Core 2

Core 3

Core 4

Without SIMD With SIMD

Operation

Source

XopY

Source

X

Y

Operation

Source

X4opY4

Source

X4

Y4

X3opY3

X3

Y3

X2opY2

X2

Y2

X1opY1

X1

Y1

With Single Instruction, Multiple Data (SIMD)

processing, the same processor operation works

on multiple data blocks simultaneously

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 11

Columnar store to process transactions and queriesOLTP+OLAP on single copy of data

Delta storage is optimized for transactions

Delta storage is merged periodically with

main storage

No data duplication – data kept either in

delta storage or in main storage

Column Table

Delta Storage

Main Storage

Read Operation

Write Operation

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 12

Multitenant database containersLower capital and operating expenditure – cloud-ready

Manage multiple databases as a unit

Strong separation of data, resources and users

among tenant databases

Lower capital expenditure with better utilization of

system resources

Lower operating expenditure with simplified

management

System Tenant

SAP HANA System: SID

SAP HANA

Tenant A

SAP HANA Node

SAP HANA

Tenant B

SAP HANA

Tenant C

Scale Up

Node 3

SAP HANA System : SID

Tenant A.1

System

Tenant

Node 2 Node 1

Tenant A.3

Tenant B.1 Tenant B.2

Tenant C

Tenant A.2

Scale Out

Standby Node

System Tenant

(Standby)

System Tenant

(Standby)

System Tenant

(Standby)

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 13

Dynamic tieringRight price/performance balance between memory and disk

Utilize disk-based, column-store technology to store

less frequently used data

Support petabyte scale deployment – not confined by

the size of memory

Is integral part of the single SAP HANA instance –

no data duplication

Transparently manage large data volumes by

automatically moving data among memory, disk and

Hadoop/SAP IQ using Data Lifecycle Manager (DLM)

Hadoop

SAP IQ

S A P H A N A P L A T F O R M

Data Lifecycle Manager (DLM)

Dynamic Tiering

Hot Warm

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 14

Spatial processingNew insights from enriching business data with spatial data

Store and process spatial data with other data types

No need to create spatial indexes, tessellation, etc.

Support open standards compliance (Open Geospatial Consortium - OGC)

Spatial data types

– Points, lines, polygons

– Multi-dimensional support including 3D and measurement dimension

Spatial functions

– Area, distance, within, touches, intersects, adjacent

Built in geo-content

– Maps, political boundaries, roads, Point of Interests (POI)

Spatial join operators for SQL and Calculation Views

– Contains, crosses, intersects, overlaps, touches, within

– Automatically add latitude and longitude to address

S A P H A N A P L A T F O R M

External Geocoding Services

Spatial FunctionsSpatial Data Types

Maps & POIGeocoding

Spatial Engine

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 15

Predictive analyticsTransforming the future with today’s insights

70+ prepackaged predictive algorithms

– Supports association, clustering, classification, regression,

time series etc.

– Supports variety of data – structured, spatial, text, streaming

and series data

AFM graphical modeling tool support to develop predictive

applications using PAL and R-Script

SAP Predictive Analytics leverage Automated Predictive

Libraries (APL) libraries and PAL

SAS processing run natively in SAP HANA eliminating

date redundancy

Leverage R advanced functions transparently

Predictive analytics across multiple data types

SAP HANA

Studio/Web IDE

Application Function

Modeler (AFM)

Tools &

Applications

SAP

Predictive

Analytics

S A P H A N A P L A T F O R M

Predictive Analysis

Libraries (PAL)

R Integration

Predictive

Analysis

Libraries (PAL)

Automated Predictive

Libraries (APL)

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 16

SAP HANA In-Memory Predictive Analytics Predictive Analysis Library (PAL) - algorithms supported

Association Analysis• Apriori

• Apriori Lite

• FP-Growth

• KORD – Top K Rule Discovery

Classification Analysis• CART

• C4.5 Decision Tree Analysis

• CHAID Decision Tree Analysis

• K Nearest Neighbour

• Logistic Regression

• Back-Propagation (Neural Network)

• Naïve Bayes

• Support Vector Machine

• Confusion Matrix

• Parameter Selection & Model Evaluation

Regression• Multiple Linear Regression

• Polynomial Regression

• Exponential Regression

• Bi-Variate Geometric Regression

• Bi-Variate Logarithmic Regression

Probability Distribution• Distribution Fit

• Cumulative Distribution Function

• Quantile Function

Outlier Detection• Inter-Quartile Range Test (Tukey’s Test)

• Variance Test

• Anomaly Detection

• Grubbs Outlier Test

Link Prediction• Common Neighbors

• Jaccard’s Coefficient

• Adamic/Adar

• Katzβ

Data Preparation• Sampling

- Random Distribution Sampling

• Binning

• Scaling

• Partitioning

• Principal Component Analysis (PCA)

Statistic Functions

(Univariate)• Mean, Median, Variance,

Standard Deviation

• Kurtosis

• Skewness

Statistic Functions (Multivariate)• Covariance Matrix

• Pearson Correlations Matrix

• Chi-squared Tests:- Test of Quality of Fit

- Test of Independence

• F-test (variance equal test)

Other• Weighted Scores Table

• Substitute Missing Values

• DenStream clustering

• Adaptive Hoeffding Tree

• Random forest

• Area Under Curve calculation

• Survival analysis (Kaplan-Meier estimate)

Cluster Analysis• ABC Classification

• DBSCAN

• K-Means

• K-Medoid Clustering

• K-Medians

• Kohonen Self Organized Maps

• Agglomerate Hierarchical

• Affinity Propagation

• Gaussian Mixture Model

• Latent Dirichlet Allocation (LDA)

Time Series Analysis• Single Exponential Smoothing

• Double Exponential Smoothing

• Triple Exponential Smoothing

• Forecast Smoothing

• ARIMA/Seasonal Arima

• Brown Exponential Smoothing

• Croston Method

• Forecast Accuracy Measure

• Linear Regression with Damped Trend and Seasonal Adjust

• Test for White Noise, Trend, Seasonality

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 17

R integrationUse leading open source data mining software transparently

Embed R script within SQL script

Execute R script inside R server

Use R vector-oriented format rather

than JDBC/ODBC

Execute multiple R processes in parallel

Leverage 3,500+ R statistical and

graphical packages

R Server

R Process

R-code

Input data in R Format

Results in R Format

Query Processor

R Client

S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 18

Search, text analytics and miningInsights from unstructured data

Store text and binary files in SAP HANA for native text analysis and search

Support various file formats(txt, html, xml, pdf, doc, ppt, xls, rtf, msg)

Automatically detects 31 languages

Search

– Fuzzy, linguistic, synonymous search, using SQL

Text Analytics

– Extract relevant information from text

(Linguistic Markup, Entity, Sentiment Extraction)

Text Mining

– Rank and Categorize documents by comparing with a set

of pre-classified documents

Search Engine

Text Analytics

Linguistic Processing

Entity, Fact Extraction

Text Mining

S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 19

Web serverReduced data movement – app and database services in one platform

Scale applications independently from the database

services with new web application server

Supports choice of programming languages – Server side

JavaScript on Node.js, Java on TomEE and C++ Runtime

container

Core Data Services allow developers to create database

objects and relationships without SQL

Accelerate application development with open source code

management tools – Git, GitHub and Maven

Simplify authentication and authorization with single sign-on

support between application and database services

Scheduled execution of JavaScript and SQLScript programs

Application Services

JavaScript

Node.js

Java

TomEE

C++

Runtime

Container

Database Services

Authentication and Authorization

Application Router

Web Application Server

S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 20

Choice of application architectureLeverage existing assets and skills

SAP S/4HANA

SAP NetWeaver

Application Server - ABAP

Core Data Services (CDS)

S A P H A N A P L A T F O R M

Database Services

Integration Services

Custom Applications

Application Server(J2EE, .NET)

Native Custom Applications

ABAP developers use CDS and Open SQL to leverage SAP HANA without coding SAP HANA objects

Custom application developers choose any application server and any database interface

SAP HANA native application developers use SAP HANA application services inside the platform

Processing Services

Database Services

Integration Services

Processing Services

Database Services

Integration Services

Processing Services

Application Services

S A P H A N A P L A T F O R M S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 21

Smart data accessAccess any data from any source

Manage and query remote tables as local

virtual tables

– Support virtual tables in calculation view and SQL

– Virtual tables can be combined with PAL, BFL, and Spatial

Push query processing to remote databases

Complement functionalities in remote database with

SAP HANA capabilities

Support remote query results caching with HIVE

Provide SDK for adapters based on ODBCIBM DB2, Netezza,

Oracle, MS SQL

Server, Teradata,

SAP HANA, SAP

ASE, SAP IQ

Modeling & SQL Script

S A P H A N A P L A T F O R M

Smart Data Access

Virtual Tables

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 22

Hadoop integrationAd-hoc query capabilities and processing of unstructured data

Indirect access using Spark and Hive with Smart

Data Access

Direct access using Virtual User Defined Function

(vUDF)

– Access HDFS without need for the package, mapper,

and reducer specification

– Invoke custom Map Reduce jobs

– Embed vUDF in SQL

Load data from Hadoop with Smart Data Integration

Unified admin and monitoring tool for SAP HANA

and Hadoop cluster

Speed-up Hadoop data analysis with new SAP

HANA Vora connector

Map Reduce

HDFS

Smart Data Access vUDF

Federate Federate

Smart Data Integration

Federate ELT

VoraConnector

Vora

S A P H A N A P L A T F O R M

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 23

SAP HANA CockpitSimplify administration and monitoring

SAP HANA Cockpit Fiori UX-based web administration tool

manages SAP HANA from any device

SAP HANA Cockpit

Catalog of Fiori tiles to manage hardware resource

utilization and SAP HANA processes

Analyze diagnostic files while the database is down for

faster fault detection and correction

Security dashboard in SAP HANA Cockpit to achieve

visibility into security KPIs

Integrated delta backup capabilities in SAP HANA Cockpit

Hana Express - DEMO

© 2016 SAP SE or an SAP affiliate company. All rights reserved. 25

Demo