The Brave New World of Data · Uniform Resource Identifiers (URI’s) instead of words Linked Open...

Post on 22-Sep-2020




The Brave New World of Data

Prof Dr Pieter Ballon, Director imec-SMIT, Vrije Universiteit Brussel, Belgium

INTRO

imec is the world-leading R&D and innovation hub in nanoelectronics and digital technology

imec domains

186 Members

32 Large companies

58 SMEs

81 Research institutions

15 Others

83 Full members

103 Associate members

Present in 28 countries

Industry-driven and fully self-financed international not-for-profit organisation under Belgian law

Academia / Research: 44%

Industry large: 17%

Industry SME: 31%

Others: 8%

Digitising European Industry

Building a European Data Economy

Developing a European Data Infrastructure

Digital Skills

Contributing to the Digital Transformation in Europe

TF3: Community

TF1: Programme

TF2: Impact

TF4: Communication

BDVA is taking care…of many different aspects of the big data

HPC – Big Data

TF5: Policy & Societal

TF6: Technical

TF6-SG1: Data Management

TF6-SG2: Data Processing Architectures

TF6-SG3: Data Analytics

TF6-SG4: Data Protection and Pseudonymisation Mechanisms

TF6-SG5: Advanced Visualisation and User Experience

TF6-SG6: Standardisation

TF7: Application

TF7-SG1: Emerging Application Areas

TF7-SG2: Telecom

TF7-SG3: Healthcare

TF7-SG4: Media

TF7-SG5: Earth observation & geospatial

TF7-SG6: Smart Manufacturing Industry

TF7-SG7: Mobility and Logistics

TF7-SG8: Smart Cities

TF8: Business

TF8-SG1: Data entrepreneurs (SMEs and startups)

TF8-SG2: Transforming traditional business (Large Enterprise)

TF8-SG3: Observatory on Data Business Models

TF9: Skills and Education

TF9-SG1: Skill requirements from European industries

TF9-SG2: Analysis of current curricula related to data science

TF9-SG3: Liaison with existing educational projects

Trust in Data Driven Decision Making

• Trusted co-evolution between humans and AI-based systems

• Legal issues with data decisions

• Trust in algorithms and data

Scaling Industrial Cooperation Models in the Data Economy

Data Skills and Know-How

Convergence of Digital Infrastructure

Future Challenges of the European Data Economy and Society

THE BRAVE NEW WORLD OF DATA

Data is (more than) the new oil

Data drives digital business models

e.g. Data supplier – Data quality guarantor – Data enabler

Data for operational efficiency

16.3 million packages and 39.5 million tracking requests per day

Telematics sensors on 46,000 vehicles led to fuel savings of 31 million litres per year

Data for new/improved value propositions

Personal medical data is among the most sensitive and valuable data. PatientsLikeMe offers such value that users give it up willingly. Anonymised data is sold to, among others, pharmaceutical companies.

Maas.fi

Data sharing models

Data disrupts economy

Data disrupts public governance

AN OPEN DATA ECOSYSTEM

And in many other domains…

Environment (e.g. air and water quality, noise measurements…)

Public domain (e.g. detailed information on infrastructure, green spaces, vacant buildings…)

Health and wellbeing (e.g. location and availability of places in daycare, schools…)

Public services (e.g. garbage collection, maintenance of the public domain…)

Safety (e.g. crowd management, citizens complaints, police interventions…)

Tourism (e.g. visitor flows, profiling…)

Need for structured, open data that can be linked to create solutions to these challenges

SMART FLANDERS RESULTS

Inter-city harmonisation

– Open Data Charter

International harmonisation

– Open & Agile Smart Cities

Data Pilot

Open Data Charter general overview

Charter is available to the cities in draft version and still confidential

Contains the strategic and technical principles cities and their partners will adhere to when publishing data

1. General principles (open by default, only once, …)

2. Data collection (types of data, ownership of data, …)

3. Data processing (data governance, formats and standards, data quality, …)

4. Data publication (decentralised publication, licences, reuser engagement, …)

Will link to practical information and tools signatories can use in applying the charter, such as documentation, paragraphs to be used in tenders, technical background and so on

Will be ready to sign by the end of this year

Increasing interoperability and creating impact

Syntactic, semantic, technical, legal

Organisational: Ensuring organisations communicate, rethink and adapt processes

Business: Ensuring data is reused and generates sustainable economic impact

Societal: Ensuring data is useful and has societal impact

[Figure: "Your system" surrounded by question marks]

Challenge: maximising reuse and increasing interoperability

Sharing data on the web

Data dumps

Smart servers

Entire query languages over HTTP

Dataset split in fragments

Smart agents

algorithms as a service

High client-side effort

Possible to ask any question

Possibility to federate queries

High server-side effort

Not possible to ask any question

Not built to ask questions beyond the silo

Data publishing vs. Data services
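The middle option on this axis, a dataset split in fragments, can be illustrated with a minimal sketch. It assumes a simple in-memory "server" that only hands out fixed-size pages with a pointer to the next one, while the client does the filtering itself. The names `get_fragment`, `iterate_fragments`, `query`, the page size and all `example.org` identifiers are hypothetical and not taken from the slides; only the Gent parking URI, the RDF type URI and the DATEX term appear in the deck.

```python
from typing import Dict, Iterator, List, Optional, Tuple

Triple = Tuple[str, str, str]

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
PARKING = "http://vocab.datex.org/terms#UrbanParkingSite"
STREET = "https://example.org/vocab#Street"  # hypothetical class

# Hypothetical dataset; only the first subject URI comes from the slides.
DATASET: List[Triple] = [
    ("https://stad.gent/id/parking/P10", RDF_TYPE, PARKING),
    ("https://example.org/id/parking/A", RDF_TYPE, PARKING),
    ("https://example.org/id/street/B", RDF_TYPE, STREET),
    ("https://example.org/id/parking/C", RDF_TYPE, PARKING),
    ("https://example.org/id/street/D", RDF_TYPE, STREET),
]

PAGE_SIZE = 2  # assumed fragment size


def get_fragment(page: int) -> Dict[str, object]:
    """Server side: return one page of triples plus the next page number."""
    start = page * PAGE_SIZE
    items = DATASET[start:start + PAGE_SIZE]
    has_next = start + PAGE_SIZE < len(DATASET)
    return {"items": items, "next": page + 1 if has_next else None}


def iterate_fragments() -> Iterator[Triple]:
    """Client side: follow 'next' pointers until the dataset is exhausted."""
    page: Optional[int] = 0
    while page is not None:
        fragment = get_fragment(page)
        yield from fragment["items"]
        page = fragment["next"]


def query(predicate: str, obj: str) -> List[str]:
    """Client side: answer a question the server never implemented."""
    return [s for s, p, o in iterate_fragments() if p == predicate and o == obj]


parkings = query(RDF_TYPE, PARKING)
print(parkings)
```

The server stays cheap because it only serves pages. The cost of answering the question moves to the client, which is the trade-off the axis describes.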


“Sint Pietersplein” → https://stad.gent/id/parking/P10

“is a” → http://www.w3.org/1999/02/22-rdf-syntax-ns#type

“Parking” → http://vocab.datex.org/terms#UrbanParkingSite

Uniform Resource Identifiers (URIs) instead of words
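As an illustration of replacing words with URIs, the sketch below turns the statement "Sint Pietersplein is a Parking" into a single line in N-Triples syntax, using the three URIs shown on the slide. The lookup table `WORD_TO_URI` and the helper `statement_to_ntriple` are hypothetical names introduced here for illustration only.

```python
from typing import Tuple

# Word-to-URI mapping exactly as given on the slide.
WORD_TO_URI = {
    "Sint Pietersplein": "https://stad.gent/id/parking/P10",
    "is a": "http://www.w3.org/1999/02/22-rdf-syntax-ns#type",
    "Parking": "http://vocab.datex.org/terms#UrbanParkingSite",
}


def statement_to_ntriple(statement: Tuple[str, str, str]) -> str:
    """Replace each word by its URI and emit one N-Triples line."""
    subject, predicate, obj = (WORD_TO_URI[word] for word in statement)
    return f"<{subject}> <{predicate}> <{obj}> ."


line = statement_to_ntriple(("Sint Pietersplein", "is a", "Parking"))
print(line)
```

Because every term is now a globally unique identifier, a system in another city can interpret the statement without knowing the local labels.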

Linked Open Data Fragments

1. Local URI strategy: ensure there is a definition available when going to a URI

2. Use these URIs instead of other identifiers

3. In procuring new solutions, include the Resource Description Framework (RDF) as a way to describe data
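Point 1 above, a definition being available when going to a URI, is commonly checked by requesting the URI with an `Accept` header that asks for an RDF serialisation. The sketch below only builds such a request and does not send it. Whether the server behind the example URI supports content negotiation, or answers with `text/turtle`, is an assumption, and `build_lookup_request` is a hypothetical helper name.

```python
from urllib.request import Request


def build_lookup_request(uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare a GET request asking for a machine-readable definition of a URI."""
    return Request(uri, headers={"Accept": media_type}, method="GET")


req = build_lookup_request("https://stad.gent/id/parking/P10")
print(req.get_method(), req.full_url, req.get_header("Accept"))

# To actually dereference the URI (needs network access, and assumes the
# server still resolves this identifier):
#   from urllib.request import urlopen
#   with urlopen(req) as response:
#       print(response.read().decode("utf-8"))
```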

Real time parking availability as Linked Open Data

Smart Flanders proof of concept: linked.open.gent

DATA PRIVACY & ALGORITHMIC ACCOUNTABILITY

Privacy is a multi-stakeholder issue


The need for multi-stakeholder Privacy Impact Assessment

Kortrijk to track visitors via mobile phone

13/06/2017 at 10:24, by Jonas Mayeur and Hannes Cattebeke

Knowing to within ten metres where visitors to events, shops, museums and hotels are. That is the plan of the Kortrijk city council. The information is to come from mobile operators, from cameras and from the free Wi-Fi network in the city. ‘To measure is to know.’

Photo: BELGA

(http://www.standaard.be/)


Privacy is about transparency


Profile transparency tool for empowerment of social media users


Algorithmic accountability and transparency – Certification?

Literacy and readability

New forms of literacy required

DATA COMPETENCE PROFILES

People-centred algorithms


Example: Decision algorithm exploration to support triage of patients

AGENDA


• 15:30 Keynote

• Pieter Ballon, Director, imec-SMIT, VUB

Topic: The Brave New World of Data

• 16:00 – 17:30 Presentations and Panel discussion

• Irene López de Vallejo, Director of Collaborative research and international Development at Digital Catapult, London

Topic: SMEs innovating with (personal) data: Challenges, barriers and needed interventions

• Maurizio Cecchi, Manager at Telecom Italia

Topic: A European data market to boost a data driven economy: the telecoms perspective

• Nozha Boujemaa, Director of Research and Advisor to the CEO of INRIA on Big Data, Head of TransAlgo, member of the BDVA Board of Directors

Topic: Technical Insights in the implementation of trust and transparency in data and algorithms

• Amardeo Sarma, General Manager at NEC Laboratories Europe and Chairman of the Board of Directors of Trust in Digital Life (TDL)

Topic: GDPR and security issues related to the European data market