Service Oriented BI - Departament d'Enginyeria de … · where you can use BI tools for analysis....

33
16/11/2011 1 Service Service Oriented Oriented BI BI Service Service Oriented Oriented BI BI 05/07/2011 Alberto Abelló & Oscar Romero 2

Transcript of Service Oriented BI - Departament d'Enginyeria de … · where you can use BI tools for analysis....

16/11/2011

1

ServiceService OrientedOriented BIBIServiceService OrientedOriented BIBI

05/07/2011Alberto Abelló & Oscar Romero 2

16/11/2011

2

“Services are economic activities offered by one t t th t l l i party to another, most commonly employing

time-based performances to bring about desired results in recipients themselves or in objects or other assets for which purchasers have responsibility. In exchange for their money, time and effort, service customers expect to obtain value from access to goods, labor professional skills facilities networks and labor, professional skills, facilities, networks, and systems; but they do not normally take ownership of any of the physical elements involved.”

Lovelock &Wright

05/07/2011Alberto Abelló & Oscar Romero 3

“All managerial themes unique to services All managerial themes unique to services are founded in customers providing significant inputs into the production process.”

Sampson

05/07/2011Alberto Abelló & Oscar Romero 4

16/11/2011

3

Antoni Olivé: Què és un Servei? Ruth Raventós: Service Management Enric Mayol, Tomàs Aluja: Service Is Front Stage Ernest Teniente: Reaching the Goalg Joan Antoni Pastor: CMMI for Services Pere Botella: Plataformes i iniciatives en serveis Albert Abelló: Understanding Service Businesses. Applying Principles of the Unified Services Theory (UST) Antoni Olivé: Introducció a la tecnologia RFID i a la seva aplicació en hospital Enric Martínez: SOA: Service Oriented Architecture Carles Farré: Web Usability Antonio Villegas: Everything you always wanted to know about Web services but were afraid to ask Ferran Martí: Breu Introducció a ITIL Jordi Pradel: SOA des de les trinxeres Antonio Valle: Modelo de Costes. Caso práctico de aplicación p p Jordi Torres: The changing role of the computer scientist in the cloud Carme Quer: SERVQUAL. Un instrument de mesura de la qualitat dels serveis Frederic Marimon: Instruments de mesura de la qualitat dels serveis online Pablo Casado: Browser + Device App Stores. Noves perspectives per al desenvolupament de la web Jesús Bisbal: Towards negotiable SLA-based QoS Support for Data Services Michael D. Myers: Conducting Critical Research in IS, and its relationship with IS Design Research Kazuyosi Hidaka: Service Science Research and Education in Japan Miquel Barceló: Per on va la recerca europea en TIC

05/07/2011Alberto Abelló & Oscar Romero 5

Business Process as a Service

Software as a Service

Platform as a Service

Infrastructure as a Service

05/07/2011Alberto Abelló & Oscar Romero 6

16/11/2011

4

IaaSCl d ti› Cloud computing

PaaS› BigTable› MapReduce

SaaS› CRM › SCM

BaaS BaaS› SOA› QoS

BI on Services› BPM› KPI

05/07/2011Alberto Abelló & Oscar Romero 7

• IBM WebSpherei• Oracle SOA suite

• webMethods• Apache ServiceMix• Microsoft Connected

Services Framework

05/07/2011Alberto Abelló & Oscar Romero 8

Services Framework• Open ESB• etc.

16/11/2011

5

• Salesforce.comCl d9• Cloud9

• Oco• RightNow• Microstrategy• Quantivo

05/07/2011Alberto Abelló & Oscar Romero 9

• Quantivo• Oracle on Demand• etc.

• Google BigTableA Si l DB• Amazon SimpleDB

• Microsoft SDS• FanthomDB• Aster DB• Vertica

05/07/2011Alberto Abelló & Oscar Romero 10

• Vertica• K2 Analytics• etc.

16/11/2011

6

• Amazon EC2• IBM SmartCloud• Google app engine

05/07/2011Alberto Abelló & Oscar Romero 11

• Etc.

05/07/2011Alberto Abelló & Oscar Romero 12

16/11/2011

7

05/07/2011Alberto Abelló & Oscar Romero 13

05/07/2011Alberto Abelló & Oscar Romero 14

16/11/2011

8

“Cloud computing is a model for enabling convenient on-demand network access to a convenient, on demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction ”interaction.

NIST (National Institute of Standards and Technology)

05/07/2011Alberto Abelló & Oscar Romero 15

On-demand self-service On demand self service Broad network access Resource pooling Rapid elasticity Measured service

05/07/2011Alberto Abelló & Oscar Romero 16

16/11/2011

9

Illusion of infinite resources Illusion of infinite resources Elimination of up-front commitment Pay-per-use

05/07/2011Alberto Abelló & Oscar Romero 17

Buy a cow• High upfront investment• High maintenance cost• High maintenance cost• Produces a fixed amount • Stepwise scaling

Buy bottled milk

05/07/2011Alberto Abelló & Oscar Romero 18

Daniel Abadi analogy

Buy bottled milk• Pay-per-use• Lower maintenance cost• Linear scaling• Fault-tolerant

16/11/2011

10

Deployment› Localization› Routing› Authentication

Tuning› PlacementPlacement› Resource partitioning› Service level objectives› Dynamically varying workloads

05/07/2011Alberto Abelló & Oscar Romero 19

Availability of service Data lock-in Data confidentiality Data transfer bottlenecks Performance unpredictability Scalable storage Debugging Debugging Scaling quickly Reputation fate sharing Software licensing

05/07/2011Alberto Abelló & Oscar Romero 20

16/11/2011

11

05/07/2011Alberto Abelló & Oscar Romero 21

Storage as a Service Storage as a Service Database as a Service Processes (Workflow) as a Service Service as a Service Security as a Service Management/Governance as a Service

05/07/2011Alberto Abelló & Oscar Romero 22

16/11/2011

12

05/07/2011Alberto Abelló & Oscar Romero 23

Software in the cloud Software in the cloud› DBMS› Workflow management› Versioning

Cloud software

05/07/2011Alberto Abelló & Oscar Romero 24

16/11/2011

13

Hadoop› Based on Google MapReduce (2004)

Hbase (also Dynamo and Cassandra) › Based on Google BigTable (2006)

05/07/2011Alberto Abelló & Oscar Romero 25

Processes pairs [key value]

Map Merge-Sort Reduce

Processes pairs [key, value] Hides parallelization, fault-tolerance, data

distribution and load balancing

05/07/2011Alberto Abelló & Oscar Romero 26

16/11/2011

14

public void map(LongWritable key, Text value) { String line = value.toString(); StringTokenizer tokenizer = new StringTokenizer(line); g g ( );while (tokenizer.hasMoreTokens()) {

write(new Text(tokenizer.nextToken()), new IntWritable(1)); }

}

public void reduce(Text key, Iterable<IntWritable> values) { int sum = 0; for (IntWritable val : values) {

sum += val.get(); } write(key, new IntWritable(sum));

}

05/07/2011Alberto Abelló & Oscar Romero 27

[“The”,[

Map

[1,1,1,1,…]]

Reduce

05/07/2011Alberto Abelló & Oscar Romero 28

The 1Project 1Gutemberg 1Ebook 1of 1The 1Outline 1Of 1Science, 1Vol. 11 1(of 14), 1by 1

The 57631

16/11/2011

15

Programming model simple yet expressive Programming model simple yet expressive

Able to process structured or unstructured Elastically scalable Fine grained fault tolerance

05/07/2011Alberto Abelló & Oscar Romero 29

Does not benefit from compression Does not benefit from compression Writes intermediate results to disk

› Reduce tasks pull intermediate data Defines the execution plan on the fly

› Schedules one block at a time

05/07/2011Alberto Abelló & Oscar Romero 30

16/11/2011

16

key value

family1 family2 familyn…

column1 column2 columnm…

version1 version2 versionp…

(row:string, column:string[, time:int64])string

05/07/2011Alberto Abelló & Oscar Romero 31

Key

Supports single row transactions Compression per block can be enabled The schema determines the locality of data

05/07/2011Alberto Abelló & Oscar Romero 32

16/11/2011

17

Child Parent

Child Parent

05/07/2011Alberto Abelló & Oscar Romero 33

No ACID No standard Low-level query

Michael Stonebraker

05/07/2011Alberto Abelló & Oscar Romero 34

16/11/2011

18

Do It Yourself• Expensive• Ad hoc development

05/07/2011Alberto Abelló & Oscar Romero 35

Florian Waas analogy

Off the Shelf• Economies of scale• Concrete functionalities

“… But to really unlock the power of Hadoop, y p p,you must be able to efficiently extract data stored across multiple (often tens or hundreds) of nodes with a user-friendly ETL (extract, transform and load) tool that will then allow you to move your Hadoop data i iinto a relational data mart or warehouse where you can use BI tools for analysis. “

Ian FyfePentaho

05/07/2011Alberto Abelló & Oscar Romero 36

16/11/2011

19

DM DMDM

ETL (Extraction, Transformation and Load)

DW

05/07/2011Alberto Abelló & Oscar Romero 37

WWW

Trademark• Expensive• Many functionalities• Mature

05/07/2011Alberto Abelló & Oscar Romero 38

Open source• Free• Simple functionalities• Young

16/11/2011

20

MapReduce Java Oracle

Oracle Grid Engine

Hadoop Distributed File System (HDFS)

05/07/2011Alberto Abelló & Oscar Romero 39

05/07/2011Alberto Abelló & Oscar Romero 40

16/11/2011

21

5%10%15%20%25%30%35%

Service %

05/07/2011Alberto Abelló & Oscar Romero 41

0%5%

On-premises Service-basedCustomization + +/-Implementation time +/- +/-Application shut-off + -Hidden fees - -Security of data + -Process integrity + -

05/07/2011Alberto Abelló & Oscar Romero 42

Guarantee of quality + -

16/11/2011

22

05/07/2011Alberto Abelló & Oscar Romero 43

Business Process as a Services are focused on providing existing business processes through a cloud If there is existing business processes through a cloud. If there is an existing process with steps that are known it can be provided as a service within the catalog. This allows the Cloud Service Provider to automate any steps within the process while leaving the changes transparent to the Cloud Service Consumer.

Software Services allow a Cloud Service Consumer to select a specific software instance that they want created without the need to be aware of where and how it will be hosted. … This allows the Cloud Service Consumer to focus on the characteristics of the application and gives the Cloud Service Provider the freedom to fulfill the request with any resources that will meet the need.

NIST (National Institute of Standards and Technology)

05/07/2011Alberto Abelló & Oscar Romero 44

16/11/2011

23

Reusability Reusability Loose coupling Contract Abstraction Composability Autonomy Statelessness Discoverability

05/07/2011Alberto Abelló & Oscar Romero 45

Distributed SOAcomponents

Design Functionality ProcessDesigned to … Last ChangeDevelopment cycle Long Interactive and iterativeCentered on … Cost BusinessCoordination Blocks OrchestrationCoupling Tight Loose (agile and adaptive)

05/07/2011Alberto Abelló & Oscar Romero 46

Coupling Tight Loose (agile and adaptive)Technologies Homogeneous HeterogeneousProgramming Objects MessagesEncapsulation Partial Full (contracts)

16/11/2011

24

Primitive activity Primitive activity Complex activity

› Atomic transaction› Business activity Orchestration Choreography Choreography

05/07/2011Alberto Abelló & Oscar Romero 47

Business Process Management Business Process Management Service Composition Service Infrastructure and Management

+ + + +BusinessProcesses

Business

05/07/2011Alberto Abelló & Oscar Romero 48

BusinessServices

UtilityServices

DB MapReduce MDMCRMSCM ERP

16/11/2011

25

Service broker(UDDI)

findServiceC

lient

(UDDI)

Service provider

description(WSDL)

t

register

05/07/2011Alberto Abelló & Oscar Romero 49

Busin

ess

obje

ct

request(SOAP)

response(SOAP)

Service Engineering and Design Service Engineering and Design Service Adaptation and Monitoring Service Quality

05/07/2011Alberto Abelló & Oscar Romero 50

16/11/2011

26

DM DMDM

ETL (Extraction, Transformation and Load)

DW

05/07/2011Alberto Abelló & Oscar Romero 51

WWW

05/07/2011Alberto Abelló & Oscar Romero 52

16/11/2011

27

Turns manufacturers into services Turns manufacturers into services Becomes a barrier for competitors By coding customers allows to:

› Instruct staff› Manage queues in call centers› Target offers› Share data with other firms

05/07/2011Alberto Abelló & Oscar Romero 53

› Sell data to other firms

05/07/2011Alberto Abelló & Oscar Romero 54

16/11/2011

28

“You cannot control what you cannot measure”

05/07/2011Alberto Abelló & Oscar Romero 55

05/07/2011Alberto Abelló & Oscar Romero 56

16/11/2011

29

05/07/2011Alberto Abelló & Oscar Romero 57

05/07/2011Alberto Abelló & Oscar Romero 58

16/11/2011

30

05/07/2011Alberto Abelló & Oscar Romero 59

05/07/2011Alberto Abelló & Oscar Romero 60

16/11/2011

31

05/07/2011Alberto Abelló & Oscar Romero 61

05/07/2011Alberto Abelló & Oscar Romero 62

16/11/2011

32

05/07/2011Alberto Abelló & Oscar Romero 63

BI can benefit from services at four levels BI can benefit from services at four levels› IaaS› PaaS› SaaS› BaaS

Services benefit from BI Services benefit from BI› KPI and Balance Scorecards› Process mining

05/07/2011Alberto Abelló & Oscar Romero 64

16/11/2011

33

Mell, P., Grance, T.: The NIST Denition of Cloud Computing. Special Publication 800 145 National Institute of Standards Special Publication 800-145, National Institute of Standards and Technology (January 2011), draft

Abadi, D.J.: Data management in the cloud: Limitations and opportunities. IEEE Data Engineering Bulletin 32(1), 3-12 (2009)

Stonebraker, M., et al.: MapReduce and parallel DBMSs: friends or foes? Communication of ACM 53(1), 64-71 (2010)

Hostmann, B.: Business Intelligence as a Service: Findings and Recommendations Research G00164653 Gartner and Recommendations. Research G00164653, Gartner (January 2009)

Erl, T.: Service Oriented Architecture. Prentice Hall (2006) Castellanos, M., et al.: Automating the loading of business

process data warehouses. In EDBT’2009. pp. 612-623. ACM Josep Carmona, Jordi Cortadella: Process Mining Meets

Abstract Interpretation. ECML/PKDD (1) 2010: 184-19905/07/2011Alberto Abelló & Oscar Romero 65