The Brave New World of Data · Uniform Resource Identifiers (URI’s) instead of words Linked Open...
Transcript of The Brave New World of Data · Uniform Resource Identifiers (URI’s) instead of words Linked Open...
PUBLIC
The Brave New World of Data
Prof Dr Pieter BallonDirector imec-SMIT, Vrije Universiteit Brussel, Belgium
INTRO
is the world-leading R&D and innovation hub
in nanoelectronics and digital technology
imec domains
186 Members
32 Large companies
58 SMEs
81 Research institutions
15 Others
83 Full members
103 Associate members
Present in 28 countries
Industry-driven and fully self-financed
international non–for-profit organisation under
Belgian law
Academia / Research
44%
Industry large17%
Industry SME31%
Others8%
Digitising European Industry
Building a European DataEconomy
Developing a European Data Infrastructure
Digital Skills
Contributing to the Digital Transformation in Europe
TF3: Community
TF1: Programme
TF2: Impact
TF4: Communication
BDVA is taking care…of many different aspects of the big data
HPC – Big Data
TF5: Policy
& Societ
al
Policy & Societal
TF6:
Technical
TF6-SG1: Data Management
TF6-SG2: Data Processing
Architectures
TF6-SG3: Data Analytics
TF6-SG4: Data Protection and
PseudonymisationMechanisms
TF6-SG5: Advanced Visualisation and User Experience
TF6-SG6: Standardisation
TF7:
Application
TF7-SG1: Emerging Application Areas
TF7-SG2: Telecom
TF7-SG3: Healthcare
TF7-SG4: Media
TF7-SG5: Earth observation &
geospatialTF7-SG6: Smart
Manufacturing Industry
TF7-SG7: Mobility and Logistics
TF7-SG8: Smart Cities
TF8:
Business
TF8-SG1: Data
entrepreneurs (SMEs and
startups)
TF8-SG2: Transforming
traditional business(Large
Enterprise)
TF8-SG3: Observatory
on Data Business Models
TF9:
Skills and Education
TF9.SG1: Skill
requirements from
European industriesTF9SG2:
Analysis of current
curricula related to data
science
TF9.SG3: Liaison with
existing educational
projects
Trust in Data Driven Decision Making
• Trusted co-evolution between humans and AI-based systems
• Legal issues with data decisions
• Trust in algorithms and data
Scaling Industrial Cooperation Models in the Data Economy
Data Skills and Know-HowConvergence of Digital
Infrastructure
Future Challenges of the European Data Economy and Society
THE BRAVE NEW WORLD OF DATA
Data is (more than) the new oil
Data drives digital business models
e.g. Data supplier – Data quality guarantor – Data enabler
Data for operational efficiency
16.3 million packages and 39.5 million tracking requests per day
telematics sensors on 46,000 vehicles led to 31 million liter fuel saving per year
Data for new/improved value propositions
Personal medical data is about the most sensitive and valuable data. Patientslikeme offers such value that users give up willingly. Anonymised data is sold to a.o. pharmaceutical companies.
Maas.fi
Data sharing models
Data disrupts economy
Data disrupts public governance
AN OPEN DATA ECOSYSTEM
And in many other domains…
Environment (e.g. air and water quality, noise measurements…)
Public domain (e.g. detailed information on infrastructure, green spaces, vacant buildings…)
Health and wellbeing (e.g. location and availability of places in daycare, schools…)
Public services (e.g. garbage collection, maintenance of the public domain…)
Safety (e.g. crowd management, citizens complaints, police interventions…)
Tourism (e.g. visitor flows, profiling…)
Need for structured, open data that can be linked to create solutions to these challenges
SMART FLANDERS RESULTS
Inter-city harmonisation
–Open Data Charter
SMART FLANDERS RESULTS
Inter-city harmonisation
–Open Data Charter
International harmonisation
–Open & Agile Smart
Cities
SMART FLANDERS RESULTS
Inter-city harmonisation
–Open Data Charter
International harmonisation
–Open & Agile Smart
Cities Data Pilot
Data Pilot
Data Pilot
Data Pilot
Open Data Charter general overview
Charter is available to the cities in draft version and still confidential
Contains the strategic and technical principles cities and their partners will adhere to when publishing data
1. General principles (open by default, only once, …)2. Data collection (types of data, ownership of data, …)3. Data processing (data governance, formats and standards, data quality, …)4. Data publication (decentralised publication, licences, reuser engagement, …)
Will link to practical information and tools signatories can use in applying the charter, such as documentation, paragraphs to be used in tenders, technical background and so on
Will be ready to sign by the end of this year
bu
sin
ess
soci
etal
org
ansa
tio
nal
Increasing interoperability and creating impact
syntactic
semantic
technical
legal
OrganisationalEnsuring organisations communicate, rethink and adapt processes
BusinessEnsuring data is reused and generates sustainable economic impact
SocietalEnsuring data is useful and has societal impact
Your system
?
?
?
?
?
?
Challenge: maximising reuse and increasing interoperability
Sharing data on the web
Data dumps
Smart servers
Entire query languages over HTTP
Dataset split in fragments
Smart agents
algorithmsas a service
High client-side effortPossible to ask any questionPossibility to federate queries
High server-side effortNot possible to ask any question
Not built to ask questions beyond the silo
Data publishing vs. Data services
Data dumps
Smart servers
Entire query languages over HTTP
Dataset split in fragments
Smart agents
algorithmsas a service
High client-side effortPossible to ask any questionPossibility to federate queries
High server-side effortNot possible to ask any question
Not built to ask questions beyond the silo
Data publishing vs. Data services
“Sint Pietersplein” → https://stad.gent/id/parking/P10“is a” → http://www.w3.org/1999/02/22-rdf-syntax-ns#type“Parking” → http://vocab.datex.org/terms#UrbanParkingSite
Uniform Resource Identifiers (URI’s)instead of words
Linked Open Data Fragments
1. Local URI strategy: ensure there is a definition available when going to a URI2. Use these URIs instead of other identifiers3. In procuring new solutions, include the Resource Description Framework (RDF) as a
way to describe data
Real time parking availability as Linked Open Data
Smart Flanders proof of conceptlinked.open.gent
DATA PRIVACY & ALGORITHMIC ACCOUNTABILITY
Privacy is a multi-stakeholder issue
33
The need for multi-stakeholder Privacy Impact Assessment
Kortrijk gaat bezoekers volgen via gsm13/06/2017 om 10:24 door Jonas Mayeur en Hannes Cattebeke
Tot op tien meter nauwkeur ig weten waar bezoekers van evenementen, winkels, musea en hotels zich bevinden. Dat is het plan van het
stadsbestuur van Kortr ijk. Die informatie moet komen van gsm operatoren, van camera’s en van het gratis wifinetwerk in de stad.
‘M eten is weten.’
Foto: BELGA
(http://www.standaard.be/)
(/Zoeken)
Privacy is a multi-stakeholder issue
34
multi-stakeholder Privacy Impact Assessment
Privacy is about transparency
Privacy is about transparency
36
Profile transparency tool for empowerment of social media users
37
Privacy is about transparencyAlgorithmic accountability and transparency – Certification?
Literacy and readability
New forms of literacy required
DATA COMPETENCE PROFILES
People-centred algorithms
39
Example: Decision algorithm exploration to support triage of patients
AGENDA
Agenda
• 15:30 Keynote
• Pieter Ballon, Director, imec-SMIT, VUB
Topic: The Brave New World of Data
• 16:00 – 17:30 Presentations and Panel discussion
• Irene López de Vallejo, Director of Collaborative research and international Development at Digital Catapult, London
Topic: SMEs innovating with (personal) data: Challenges, barriers and needed interventions
• Maurizio Cecchi, Manager at Telecom Italia
Topic: An European data market to boost a data driven economy: the telecoms perspective
• Nozha Boujemaa, Director of Research, Advisor to the CEO of INRIA in Big Data, in INRIA, Head of TransAIgo, member of the BDVA Board of Directors
Topic: Technical Insights in the implementation of trust and transparency in data and algorithms
• Amardeo Sarma, General Manager at NEC Laboratories Europe and Chairman of the Board of Directors of Trust in Digital Life (TDL)
Topic: GDPR and security issues related to the European data market