SAP Big Data Overview.ppt

download SAP Big Data Overview.ppt

of 22

Transcript of SAP Big Data Overview.ppt

  • AgendaBig Data Definition

    Big Data Evolution

    Big Data Architecture

    Use cases/solutions

    *

  • *Big Data Changing the way we do business.

  • *Big Data DefinitionBig data is an all-encompassing term for any collection of data sets so large or complex that it becomes difficult to process them using traditional data processing applications.

    Turn raw data into insights that drive massive business value. Enable Customers to Achieve Real-Time Business Results on BIG DATA

    Massive data results into problems of :capturingstoring managinganalyzing

    Source of challenges 3 VsVolume - Ranging up to petabytes needs special storageVariety - Multiple data sources producing different data types i.e. social media, unstructured, machine sensors etc Velocity High speed and volume of incoming data for real-time decisions

  • *Volume Variety Velocity

  • *Big Data Evolution Source: SAP

  • *Code HalosData, devices and interactions surrounding all of us

    Code HaloCode Halo /kohd hey-loh/.nounThe information that surrounds people, organizations, processes and products

  • SAP Big Data Patterns aligned to Cognizant Code HalosProviding insight from machines, assets, and devices for better real-time decisions, predictions, and operational performanceProviding insight from high volume and high variety data for real-time analytics & actionable intelligence*

  • Legend*Cognizant SAP Big Data Reference Architecture

    Enterprise HANA 1.0 SPS8AnalyticsDataStoresData Ingest

    Extended Storage(Sybase IQ)

    DATA LAKE BW 7.4

    HANA DataMartData ServicesAnalytics & Big Data VisualizationSAP Big Data Application Types, Data Science and Statistical ModelingM2M and Customer Behavior insightSAP KXEN/Infinite InsightMultiple Regression ModelsLinear ModelsUnivariate/Multivariate models ReportingAnalysisDashboardsExplorationVisualizationPredictiveSAPNon-SAPHadoop Large Scale Data Capture, Generate Analytical Datasets, Train/Validate Predictive Models SLTSQL AnywhereSybase ESPDocuments & EmailsWeb Logs, Click StreamsSocial NetworksMachine GeneratedSensor DataGeo-location DataNon SAPSAP ERPSAP SRMSAP CRMSAP HCMOthersSmart Data AccessSAP HANA Data PlatformHadoop

  • *Cognizant and SAP Big Data Solutions ArchitectureCustomer Behavior Apps Machine to Machine AppsBusiness and Suite AppsCustom HANA SolutionsAdvanced Genome AnalysisVaccine Yield AnalysisRetail Omnichannel AnalyticsMarketing Insights for Downstream Oil & GasAppsAppsHadoop Large Scale Data Capture, Generate Analytical Datasets, Train/Validate Predictive Models SAP HANA PLATFORM.Unified AdministrationApplication DevelopmentSmart Data AccessTransfer DatasetsSmart Data Access

    Extended Storage(Sybase IQ)

    Plant Equipment AnalysisBFSInsuranceCommsCGLSHealthcareTechEducationMediaRetailT&HLogisticsAnalyticsExplorationDashboards, ReportsCharting, VisualizationSAP Predictive AnalyticsSAP KXEN/Infinite InsightSAP LumiraSAP Data ScientistsE&UManuf.SAP BusinessObjects BI*

  • What is Hadoop?

    *Apache Hadoop, an open - source software library, is a framework thatallows for the distributed processing of large data sets across clustersof commodity hardware using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

  • SAP Certified Hadoop distribution partners

    *

  • H2 Power of HANA & Hadoop

    + Modern in-memory platformTransact/analyze in real- timeNative predictive, text, and spatial algorithmsDistributed data storage and processing on commodity hardware Store infinite amounts of unstructured data No-SQL access Combine strengths of different data processing domainsPetascale, columnar database Tight integration with HANA via Smart Data Access Extend HANA tables with hot data in HANA, and warm data in IQ*

  • SAP HANA Smart Data AccessData virtualization for on-premise and hybrid cloud environmentsTransactions + AnalyticsSAP HANAVirtual TablesHANA Tables*

  • SAP HANA - Hadoop Integration

    Integration at ETL layer (HIVE, HDFS, Map Reduce, Pig, Apache HBASE, Floom, Ambari, Oozie, Avro etc. )Federation at BI layer (BOBJ multi-source Universe accessing Hadoop HIVE )Smart Data Access - direct HANA-Hadoop connectivity *

  • Two speed analytics

    Long running batch analytic jobs in Hadoop Push results to SAP HANA Combine with other data, e.g. from SAP Business Suite User accesses result through BI Tools on SAP HANA *

  • Maintain appropriate inventory and strategize buying decisionsImprove Retail distribution , Store operations & Product qualityCombine enterprise and digital information to create Code Halos for consumers and products Analyze sentiment from Twitter, Facebook, LinkedIn or industry-specific social media streams. Omni channel Insight for Retail Powered by Code Halos *Advanced Genome AnalysisReduce delays and minimize the costs associated with new drug discovery by optimizing the process for genome analysisSpeed the decision making for hospitals which conduct cancer detection based on DNA sequence matchingVaccine Yield AnalysisImprove vaccine yield and lower cost of productionReduce the variability in the production process to reduce costsAnalyze large volume of data (spread across disparate platforms) of manufacturing information including: time series (pressures, temperatures, etc.), event, transaction, change control, quality, raw material and environmental dataMarketing Insights for Downstream Oil & GasHow can transactions at Gas pumps & PoS transactions at the Retail Gas outlets be related?How can we engage with the gas consumer in terms of cross sell & up sell marketing offers?How can we retain consumer loyalty & brand?How can we make the consumer return to the same gas station consistently?How can we analyze the buying behaviors of gas vs. retail SKUs bought inside the store?How can we understand the gas consumer wallet & spend?Big Data Solutions in Innovation Lab

  • *

  • *

    2014 Cognizant

    Appendix*

  • SAP HANA - Hadoop Integration

    SP9 includes the following new features:

    Direct access to HDFS file systemDevelop and invoke custom Map Reduce jobsSAP HANA studio as the single IDE to invoke the M/R jobsLeverage HANA Repository design time for Map Reduce job, remote Hadoop source and virtual functionsSupport of remote result caching of virtual function executionData Provisioning support for remote source connectivity for IM in HANA via SAPANA Service/Adapter Framework

    Smart Data Access features:SAP MAX DB SupportStatistics EnhancementsChanged Default System Parameter BehaviorImproved Join RelocationFunction Translate ImprovementRead-only Remote Sources and Smart Data Access Connections*

  • SAP Lumira 1.21 More access to Big Data

    Big Data, big InsightsAmazon EMR Hive 0.11Apache Hive 0.12 & 0.13Cloudera Impala 1.0

    Performance OptimizationSmoother scrolling through large number of columns in prepare room

    *

    **********************