Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and...

36
Information Systems Introduction to concepts, requirements, approaches, and best-practices for designing Information systems in hybrid data infrastructure Pasquale Pagano

Transcript of Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and...

Page 1: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Information SystemsIntroduction to concepts, requirements, approaches, and best-practices for designing Information systems in hybrid data infrastructure

Pasquale Pagano

Page 2: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Pasquale Pagano

12

/12

/16

Info

rma

tion S

yste

ms

2

• Education• Master Degree in Computer Science

• Ph.D in Information Engineering on Distributed Systems

• Organization• CNR – ISTI, InfraScience Group

• Experience• D4Science Hybrid Data Infrastructure, Technical Director• gCube Open-Source Framework, Technical Director• BlueBRIDGE EU Project, Technical Director• SoBigData EU Project, Infrastructure Manager• Parthenos EU Project, Infrastructure Operation Manager

• Bio and contact• it.linkedin.com/in/pasqualepagano/• [email protected]

Page 3: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Outline

Information System

• What it is and how to define it

Context

• Hybrid cloud-based infrastructure

Resource Registry

• Hybrid cloud-based infrastructure information system

Conclusions

12

/12

/16

Info

rma

tion S

yste

ms

3

Page 4: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Information Systems

An information system (IS) is

• any organized system for the collection, organization, storage and communication of information

• an integrated set of components for collecting, storing, and processing data and for providing information, knowledge, and digital products [Encyclopaedia Britannica]

Information consists of data that is

1. accurate and timely,

2. specific and organized for a purpose,

3. presented within a context that gives it meaning and relevance,

4. can increase understanding and decrease uncertainty

12

/12

/16

Info

rma

tion S

yste

ms

4Introduction

Page 5: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Information Systems

An information system (IS) is

• a combination of hardware, software, infrastructure and trained personnel organized to facilitate planning, control, coordination, and decision making in an organization [businessdictionary]

Trained personnel consists of human resources and :

1. procedures for using, operating, and maintaining the information system

2. set of basic principles and associated guidelines, a.k.a policies, formulated and enforced to direct and limit actions in pursuit of long-term goals

12

/12

/16

Info

rma

tion S

yste

ms

5Introduction

Page 6: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Information Systems

An information system (IS) is

• a software system to capture, transmit, store, retrieve, and manipulate data produced by software systems to provide access to information, thereby supporting people, organizations, or other software systems [MIT Press]

Software systems become producer and consumer of the Information System making it at the core of their business activities

12

/12

/16

Info

rma

tion S

yste

ms

6Introduction

Page 7: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Information Systems Definition

A software system

• to capture, transmit, store, retrieve, and manipulate data produced by software systems

• to provide access to information, organized for a purpose and within a contextual domain• used, accessed, and maintained according to well-known procedures

operated under the limit of the (evolving) organization policies

• to support people within an organization and other software systems

12

/12

/16

Info

rma

tion S

yste

ms

7Introduction

Page 8: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

CONTEXT

Hybrid cloud-based infrastructure

12

/12

/16

Info

rma

tion S

yste

ms

8

Page 9: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

e-Infrastructures

e-Infrastructures enable researchers in different locations across the world to collaborate in the context of their home institutions or in national or multinational scientific initiatives.

They can work together by having shared access to unique or distributed scientific facilities (including data, instruments, computing and communications)

12

/12

/16

Info

rma

tion S

yste

ms

9Context

Page 10: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

e-Infrastructures 12

/12

/16

Info

rma

tion S

yste

ms

10Context

Page 11: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

e-Infrastructures

Data e-Infrastructure: an e-Infrastructure promoting data

sharing and consumption. Addresses the needs of the

research activity performed by a certain community.

Info

rma

tion S

yste

ms

12

/12

/16

11Context

Page 12: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

e-Infrastructures

Computational e-Infrastructure: an e-Infrastructures

offering computational resources distributed in a network

environment. Uses Cloud computing to execute calculations

with a large number of connected computers. Offers

collaboration facilities for scientists to share experimental

results

Info

rma

tion S

yste

ms

12

/12

/16

12Context

Page 13: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Requirements for e-Infrastructures

• Support collaborative research and experimentation

• Implement Reproducibility-Repeatability-Reusability

• Allow sharing of data, methods, workflows, and findings

• Grant open access to produced scientific knowledge and data

• Tackle simplified access to existing computing and storage resources

• Ensure low operational and maintenance costs

• Manage heterogeneous data and service access policies

13

12

/12

/16

Info

rma

tion S

yste

ms

Context

Page 14: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Virtual Research Environment 12

/12

/16

Info

rma

tion S

yste

ms

14

An operational environment

• Where set of resources (data,

services, computational, and

storage resources)

• are assigned to group of users

via interfaces

• for a limited timeframe

L. Candela, D. Castelli, P. Pagano (2013) Virtual Research Environments: An Overview and a Research Agenda. Data Science Journal, Vol. 12

Created on demand

Regulated by tailored policies

No cost for the resource providers

Open to host and operate custom software

Context

Page 15: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

D4ScienceEuropean e-Infrastructure

D4Science is both a Data and a Computational e-

Infrastructure that federates other e-Infrastructures across

administration domains - Hybrid Data Infrastructure

Moreover, it

• Implements the notion of e-

Infrastructure/platform/software as-a-Service

• it offers on demand access to data management services and

computational facilities;

• is policies-driven through the true implementation of Virtual

Research Environments

12

/12

/16

Info

rma

tion S

yste

ms

15Context

Page 16: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Infrastructure as a Service

Infrastructure as a service (IaaS) is a standardized, highly automated offering, where compute resources, complemented by storage and networking capabilities are owned and hosted by a service provider and offered to customers on-demand.

• IaaS also hosts users' applications and handles tasks including system maintenance, backup, and recovery planning.

• Customers are able to self-provision this infrastructure, using a Web-based graphical user interface that serves as an IT operations management console for the overall environment.

• API access to the infrastructure may also be offered as an option.

12

/12

/16

Info

rma

tion S

yste

ms

16Context

Page 17: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Cloud Computing

• IaaS is one of three main categories of cloud computing services, complemented by

• Software as a Service (SaaS)

• software distribution model in which applications are made available to customers over the Internet.

• removes the need to install and run applications on owned data center.

• eliminates the expense of hardware acquisition, provisioning and maintenance, as well as software licensing, installation and support.

• Platform as a Service (PaaS)

• cloud computing model that delivers application development frameworks to its users as a service.

12

/12

/16

Info

rma

tion S

yste

ms

17Context

Page 18: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Cloud Computing Characteristics

• On-demand• Provision of computing resources, such as server, service, and

storage, as needed without requiring human interaction

• Broad network access• Resources are available over a network

• Resource pooling• Resources pooled to serve multiple users using a multi-tenant model,

with physical and virtual resources dynamically assigned and reassigned according to consumer demand

• Rapid elasticity• Resources elastically provisioned and released, automatically, to

horizontally scale rapidly outward and inward as needed

• Measured service• Resources usage is monitored, controlled, and reported

12

/12

/16

Info

rma

tion S

yste

ms

18Context

Page 19: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

D4Science is an hybrid cloud-based infrastructuretechnologies integrated to provide

elastic access and usage of data and data-management capabilities

12

/12

/16

Info

rma

tion S

yste

ms

Humanities and Cultural Heritage

Social Mining

Environmental Studies

Biological and Ecological Studies

Context 19

Page 20: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

D4Science Service Provision

12

/12

/16

Info

rma

tion S

yste

ms

20

Empowered Hardware

Package

Repository HW

gHN

Failure Recovery

HWgHN

HWgHN

Service provision continuity

HWgHN

HW gHN

Balancing utilization with head room

Dynamic Load Balancing

WS

State

WS

State

CPU Usage

30%

CPU Usage

90%

Rapid deployment

Production

HW

gHNPackage

Repository

WS

Dynamic deployment

…WS

State

WS

State WS

State

WS

State

…WSWS

Context

Page 21: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

12

/12

/16

Info

rma

tion S

yste

ms

D4Science is an hybrid cloud-based infrastructure

• 63 VREs hosted

• +3100 users

• in 44 countries

• from +80 Institutions

• + 430 millions service calls a year

• + 1600 distinct caller hosts

• +25,000 derivative data/month

• +50 data providers

• over a billion quality records

• +20,000 temporal datasets

• +50,000 spatial datasets

• 99.8% service availability

Context 21

Page 22: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Hybrid cloud-based infrastructurechallenges

Hundred software systems opportunistically deployed on demand

• The software systems to manage are not known at design time

• The location of any service is known only at runtime

• Any software system has to discover the location of the targeted service before to use it

• All software systems have to be monitored, controlled, and reported

• Status, load, exploitation usage, and accounting data have to be constantly updated to enable elasticity and pooling of resources

All these data are managed by the infrastructure Resource Registry

12

/12

/16

Info

rma

tion S

yste

ms

22Resource Registry

Page 23: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

RESOURCE REGISTRY

Hybrid cloud-based infrastructure information system

12

/12

/16

Info

rma

tion S

yste

ms

23

Page 24: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registry

The infrastructure Resource Registry is an Information System designed to support the operation of an hybrid cloud-based infrastructure

• To capture, transmit, store, retrieve and manipulate data from any software system enabled on the infrastructure• Location and properties• Status, load, exploitation usage, and accounting data

• To provide access to information, organized to enable• Monitoring, validation, and reporting• Elasticity and pooling of resources

• To support any software system to• Discover services and infrastructure

resources

12

/12

/16

Info

rma

tion S

yste

ms

24Resource Registry

Page 25: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registry abstract system view

The Resource Registry - core of a SOA within the complexities of an hybrid cloud-based infrastructure – must enable

• a set of resource management functions

• enabling functions• publication, discovery

• monitoring, deployment

• contextualization, security, execution

• data management functions• access, store

• index, search

• transfer, transform

• plus a set of applications

• built against those functions

12

/12

/16

Info

rma

tion S

yste

ms

25Resource Registry

Page 26: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registryabstract system view

• Resource types: abstract view over functions

• defined by specifications

• multiple implementations, over time / concurrently

• different implementations, different information

• system cannot globally define them

• implementations produce/consume different facets, independently

• resource semantics dynamic

• no longer predefined in class hierarchies

• implicitly captured by current facets

• changes over time / across “similar” resources

12

/12

/16

Info

rma

tion S

yste

ms

26Resource Registry

Page 27: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registry 12

/12

/16

Info

rma

tion S

yste

ms

27

resource

registry

Resource Registry

Page 28: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registryresource model

• defines a framework for collecting facets

• some common properties

• a loose binding to XML/Json

• all resources have:

• A unique identifier

• optional name and description

• one or more policies

• zero or more facets

• uniquely identified

• arbitrary otherwise

12

/12

/16

Info

rma

tion S

yste

ms

28Resource Registry

Page 29: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registryresource model

12

/12

/16

Info

rma

tion S

yste

ms

29Resource Registry

Page 30: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource ModelEntities and Relations

12

/12

/16

Info

rma

tion S

yste

ms

30

ResourceInstance

Resource Schema

FacetInstance

FacetSchema

consistsOf

1..n

consistsOf

1..n

conformsTo

1..n

Application Domain

defines

1..ndefines

1..n

Context

contains

1..n

belongsTo

1..n

isRelatedTo

0..n

conformsTo

1..n

Resource

isDescribedBy

1..1

isParentOf

1..n

Resource Registry

Page 31: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource ModelEntities and Relations

12

/12

/16

Info

rma

tion S

yste

ms

31Resource Registry

Page 32: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Modelmilestones

• Open-ended model for describing resources

• Open-ended set of manageable resources

• Ability to evolve with the evolving needs of the infrastructure at no cost for its clients

• by supporting new types of resources at run-time

• by supporting evolution in the way a resource is described

• by supporting the same resource type described by using different models

12

/12

/16

Info

rma

tion S

yste

ms

32Resource Registry

Page 33: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Resource Registryarchitecture O

cto

be

r 1

8th

20

16

To

wa

rds th

e n

ew

In

form

ation S

yste

m

33

Resource RegistryGraph DBGraph DB

Graph DB

IS-Model

Any Service

PEP

PDP

High Availability Proxy

Resource Registry Client Resource Registry Publisher

Resource Registry

Page 34: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Conclusions

• Any information system has to be designed for a purpose and within a contextual domain

• A Resource Registry is an Information System designed to support the operation of an infrastructure

• Open-ended model since infrastructure resources may not known in advance

• Open-ended set of manageable resources since an infrastructure lifetime may span several decades

• Non-functional requirements - e.g. availability, reliability – are key requirements to consider in the design phase

12

/12

/16

Info

rma

tion S

yste

ms

34

Page 35: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

Further Reading• Candela, Leonardo, Donatella Castelli, and Pasquale Pagano. "Virtual research environments: an overview and a research agenda." Data

Science Journal 12.0 (2013): 65-91

• Papazoglou, Mike P., and Willem-Jan Van Den Heuvel. "Service oriented architectures: approaches, technologies and research issues." The VLDB journal 16.3 (2007): 389-415.

• Papazoglou, Mike P. "Service-oriented computing: Concepts, characteristics and directions." Web Information Systems Engineering, 2003. WISE 2003. Proceedings of the Fourth International Conference on. IEEE, 2003.

• Sivashanmugam, Kaarthik, Kunal Verma, and Amit Sheth. "Discovery of web services in a federated registry environment." Web Services, 2004. Proceedings. IEEE International Conference on. IEEE, 2004

• Khouja, Mehdi, and Carlos Juiz. "Enhanced service discovery via shared context in a distributed architecture." Web Services (ICWS), 2015 IEEE International Conference on. IEEE, 2015.

• Zhu, Fen, Matt W. Mutka, and Lionel M. Ni. "Service discovery in pervasive computing environments." IEEE Pervasive computing 4.4 (2005): 81-90.

• Chakraborty, Dipanjan, et al. "Toward distributed service discovery in pervasive computing environments." IEEE Transactions on Mobile computing5.2 (2006): 97-112.

• Zhang, Liang-Jie, and Qun Zhou. "CCOA: Cloud computing open architecture." Web Services, 2009. ICWS 2009. IEEE International Conference on. Ieee, 2009.

• Zhang, Qi, Lu Cheng, and Raouf Boutaba. "Cloud computing: state-of-the-art and research challenges." Journal of internet services and applications 1.1 (2010): 7-18.

• Wei, Yi, and M. Brian Blake. "Service-oriented computing and cloud computing: challenges and opportunities." IEEE Internet Computing14.6 (2010): 72.

• Garofalakis, John, et al. "Web service discovery mechanisms: Looking for a needle in a haystack." International Workshop on Web Engineering. Vol. 38. 2004.

• Sotomayor, Borja, et al. "Virtual infrastructure management in private and hybrid clouds." IEEE Internet computing 13.5 (2009): 14-22.

• Rodero-Merino, Luis, et al. "From infrastructure delivery to service management in clouds." Future Generation Computer Systems 26.8 (2010): 1226-1240.

• Zhang, Xuechai, Jeffrey L. Freschl, and Jennifer M. Schopf. "A performance study of monitoring and information services for distributed systems." High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on. IEEE, 2003.

12

/12

/16

Info

rma

tion S

yste

ms

35

Page 36: Information Systems - unipi.it · Information Systems Introduction to concepts, ... Humanities and Cultural Heritage s ... "Service oriented architectures: approaches, technologies

THANK YOU

Acknowledgement:Fabio Simeoni, Luca Frosini, Manuele SimiCNR – ISTI InfraScience

12

/12

/16

Info

rma

tion S

yste

ms

36

The content of this presentation is released under the

Creative-Commons CC-BY-SA license