SC6 Workshop 1: From your data to data stories - BigDataEurope, SC6 Workshop
BigDataEurope: Project Introduction @ Year #1 Workshops
-
Upload
bigdataeurope -
Category
Technology
-
view
58 -
download
0
Transcript of BigDataEurope: Project Introduction @ Year #1 Workshops
Empowering Communities with Data Technologies
Year #1 Series of WorkshopsGeneral Overview
BIG DATA EUROPE
The Motivation – Big Data
Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone.
This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few.
This data is big data. Source: IBM
Big Data
15 avr. 2023www.big-data-europe.eu
BIG DATA
Open DataLinked
DataLinked Open Data
Data Repositories
DatabasesData
LibrariesCatalogues
Social
Media
Big Data: Dimensions
www.big-data-europe.eu
Volume
Velocity
Variety
10001010101010101010101001010010101010101001010010101001010010101001010010100101010010101010010101000101010101001010101010101010100101010101010100101010111000101010101010101010100101001010101010100101001010100101001010100101001010010101001010101001010100010101010100101010101
10001010101010101010101001010010101010101001010010
…….………….……………..……..……………
1 0
1000101010101010101010100101001010101010100101oo11
Veracity
!
Big Data Dimensions
HealthClimateEnergyTransportFoodSocietiesSecurity
Big Data in Europe: Challenges, Opportunities
www.big-data-europe.eu
Loremipsumdolors
KSDJOPSCKKSDKA
B
LKASJLLAWWDS
wpweppepwpisi
owe
10101001101010010101
0
Regional Data Repositories
#1: Compile, Harmonise, Publish10101001101010010101
0
10101001101010010101
0
#2: Interlink, Centralise Access, Explore
101010100101010101001011010001010101010010101010100001011010001010101010010101010100100101010100101010101001011010
001010
Data Eleme
nt
related relate
d
#3: Analyse, Discover, Visualise#4: Mashup, Cross-domain Exploitation
Journalists
Citizens Industry
Authorities
Big Data in Europe: Obstacles
15 avr. 2023www.big-data-europe.eu
#1 Big Data “Variety“ problem Multiple Data Sources Required: Integration, Harmonisation
#2 Opening-up Data concerns Loss of control, lack of tracking Reservations about large corporations
#3 Limited Skills, Training, Technology
Lack of Data Scientists Lack of Generic Architectures, components
Data Value Chain Evolution
15 avr. 2023www.big-data-europe.eu
Extraction, Curation Quality, Linking, Integration
Publication, Visualization, Analysis
Extraction, Curation, Quality, Linking, Integration, Publication,
Visualization, Analysis
Health
TransportSecurity
Extraction Curation Quality Linking Integration Publication Visualization Analysis
Data Repositories Linked Open Data Cloud
Stage 1
Stage 2
Stage 3
Food SocietiesClimate Energy
BigDataEurope – The Project
15 avr. 2023www.big-data-europe.eu
Rationale Show societal value of Big Data Lower barrrier for using big data
technologieso Required effort and resourceso Limited data science skills
Help establishing cross-lingual/organizational/domain Data Value Chains 15 avr. 2023
BigDataEurope: Objectives
15 avr. 2023www.big-data-europe.eu
COORDINATIONStakeholder Engagement
(Requirements Elicitation)
SUPPORTDesign, Realise, Evaluate
Big Data Aggregator Platform
Create and Manage Societal Big Data Interest
Groups
Cloud-deployment ready Big Data Aggregator
Platform
CSA Measures
Results
BigDataEurope: Consortium
Big Data Ecosystems: Orthogonal Dimensions
15 avr. 2023www.big-data-europe.eu
Generic Big Data Enabling Technologies
Data Value Chain
Data Generation & Acquisition
Data Analysis & Processing
Data Storage & Curation
Data Visualization &
Usage
Data-driven Services
Soci
etal
Cha
lleng
es
Dom
ain
Spec
ific D
ata
Asse
ts &
Tec
hnol
ogy Healthcare
Food Security
Energy
Intelligent Transport
Climate & Environment
Inclusive & Reflective Societies
Secure Societies
Work Packages & Phases
Community Building
M1-M12 M13-M24 M25-M36
Enabling Technologies
Component Integration
Uptake
Integrator Deployment
Community Assessment
WP3 – Big Data Generic Enabling Technologies & Architecture
WP5 – Big Data Integrator Instances
WP7 – Dissemination & Communication
WP2 – Community Building & Requirements
WP4 – Big Data Integrator Platform
WP6 – Real-life Deployment & User Evaluation
Societal Domains, Focus Areas, Data assetts
Societal Domain Preliminary Big Data Focus area Selected Key Data assets
Life Sciences & Health
Heterogeneous data Linking & integration
Biomedical Semantic Indexing & QA
ACD Labs / ChemSpider, ChEBI, ChEMBL, Con-ceptWiki, DrugBank, EN-ZYME, Gene Ontology, GO Annotation, Swis-
sProt, UniProt, Wik-iPathways, PubMed, MeSH, Disease Ontology (DO), Joint Chemical Dic-tionary (Jochem), Bio-ASQ
datasets
Food & Agriculture
Large-scale distributed data integration
INFOODS, AQUASTAT Green Learning Network (GLN), Agricultural Bibliography Network (ABN), AGRIS, AquaMaps,
Fishbase
EnergyReal-time monitoring, stream
processing, data analytics, and decision support
European Energy Exchange Data, smart meter measurement data, gas/fuels/energy market/price data, consumption
statistics, equipment condition monitoring data)
TransportStreaming sensor network & geo-
spatial data integration
GTFS data, OSM/ LinkedGeoData, MobilityMaps, Transport sensor data, ROSATTE Road safety attributes, European Road
Data Infrastructure - EuroRoadS
ClimateReal-time monitoring, stream
processing, and data analytics.
European Grid Infrastructure (EGI), Databases hosting atmospheric data. Several software frameworks for
simulation, calibration and reconstruction.
Social SciencesStatistical and research data linking &
integration
Federated social sciences data catalogs, statistical data from public data portals and statistical offices (e.g. EuroStats,
UNESCO, WorldBank)
SecurityReal-time monitoring, stream
processing, and data analytics.Image data analysis
Earth Observation data (e.g. Very High Resolution Satellite Imagery acquired from commercial providers and
governmental systems) and collateral data for supporting CFSP/CSDP missions and operations, Databases hosting
atmospheric Data. Experimental and simulation data concerning dispersion of hazardous substances
Stakeholder Engagement Cycle
Data Aggregator Platform: Blueprint
Batch Layer
Speed Layer
Data Storage
Real-time data &
Transactions …
Batch View
Real-time View
mess
age p
ass
ing
message passing
Applications & Showcases
Real-time dashboards
Domain-specific BDE apps
Big Data AnalyticsIn-stream Mining
BD
E P
latfo
rm &
In
tellig
ence
Input dataStreamSpatialSocialStatistical TemporalTransactionalImagery
+ Semantic Layer (Retaining Semantics using LD approach )
Lambda Architecture
Current Activities – Year#1 2015 BDE Societal Workshops (7)
Plannedo Schedule on Website
7 W3C Interest Groups set up: Please Join!o SC1: HEALTH https://www.w3.org/community/bde-health/joino SC2: FOOD & AGRICULTURE https://www.w3.org/community/bde-food/o SC3: ENERGY https://www.w3.org/community/bde-energy/ o SC4: TRANSPORT https://www.w3.org/community/bde-transport/o SC5: CLIMATE & ENVIRONMENT https://www.w3.org/community/bde-climate/o SC6: SOCIETIES https://www.w3.org/community/bde-societies/ o SC7: SECURITY https://www.w3.org/community/bde-secure-societies/
www.big-data-europe.eu