Putting the L in front: from Open Data to Linked Open Data
-
Upload
martin-kaltenboeck -
Category
Technology
-
view
2.473 -
download
5
description
Transcript of Putting the L in front: from Open Data to Linked Open Data
Putting the L in frontFrom Open Data to Linked Open Data (LOD)Government Linked Data Workshop, OGD Camp 2011, Warsaw, Poland
Martin Kaltenböck
These slides are published under : http://creativecommons.org/licenses/by/3.0
@lod2project@semwebcompany
2
Open Government Data is a worldwide movement to open data (& information) of the government / public administration* - that is NOT personal (individual related) – in human- and maschine readable open formats (non proprietary) for use & re use!
OPEN stands for lowering the barriers to ensure as broad as possible re-use (for everybody)!
There is a new paradigm in publishing Open Government Data
= look, take and play!
* ….. data and information produced or commissioned by government or government controlled entities
What is Open (Government) Data?
3
What is important when thinking about open data in use?•Interoperability to ensure broad & easy use & re-use•Human AND machine readable data and meta data•In open formats•For smooth and cost efficient data integration•To generate effects on several levels: local – regional – national – EU wide & worldwide
For several target groups with several interests!•Public administration (also for internal use)•Politicians & decision makers•Citizens (Citizen Analysts)•Economy & Industry (data integration, -enrichment, APPs)•(Data) Journalists, media & publishers•Academia & Science
What is Important? For Whom?
From Open Data to Linked Open Data
5 Stars for Open Data by Tim Berners Lee
5
The Evolution of the Internet
6
Docu
men
ts
on
th
e W
eb
Web
of
Docu
men
tsA
KA
Hyp
ert
ext
Data
on
th
e W
eb
A
KA
Op
en
Data
Web
of
Data
AK
A H
yp
erd
ata
• Give things an URI!• Use RDF for Publishing!• Link your Data to other Data (as well as the data models)!• Provide a Standard-API on top
•Provide an API!
•Organise Data!•License Data!•Raw Data now!
7
Publish Data!
Use Web-Technologies
Use Linked Data!• The web is an Ecosystem• Networked Data creates Network Effects• Lowers Costs of Data Integration
From Open Data to Linked Open Data
8
Data Integration on User Level and Application Level
The Idea of Hyperdata
9
Integration on Data Level
Application and/or API
The Idea of Hyperdata
11
CC-BY-SA von campuspartybrasil (flickr)
Linked Data realizes the vision of evolving the Web into a global data commons, allowing applications to operate on top of an unbounded set of data sources, via standardised access mechanisms.
I expect that Linked Data will enable a significant evolutionary step in leading the Web to its full potential.
The Vision of the new Internet
Linked Open Data benefits & LOD in use
Misconceptions about Linked Open Data
• All of us have to use ONE schema
• Everything needs to be switched to RDF
• We all have to learn SPARQL, there are no standard (web) APIs
• LOD is a pure academic approach
• LOD can only be used by Semantic Web experts
• We have to change our data integration & -management approaches 13
14
• Enables web-scale data publishing - distributed publication with web-based discovery mechanisms
• Everything is a resource – follow your nose to discover more about properties, classes, or codes within a code list
• Everything can be annotated - make comments about observations, data series, points on a map
• Easy to extend - create new properties as required, no need to plan everything up-front
• Easy to merge - slot together RDF graphs, no need to worry about name clashes
• Easy use and re-use on top of common schemas AND schema mapping
• Allows complex querying of several distributed data sources & systems
The Power of Linked Open Data
• Less replication (offering same datasets in different places)
• Encouragement to re-use existing datasets
• Clear which datasets are providing similar / same information
• More innovation because datasets can be put in a new context and lead to interesting applications
• Put information in context and thereby create knowledge
The Benefits of Linked Open Data
Linked Open Data as a ‚national digital infrastructure‘
16
LONDON
VIENNA
UK
BERLIN
AUSTRIA
SCHOOLS PISA-RESULTS
PUBLIC BUILDINGS
SUPERINTENDENT‘S REPORT
TYPES OF SCHOOLS
PRIV. RANKING
TYPES OF SCHOOLS
SCHOOLS
SCHOOLS
SCHOOL FEES
EDU STATISTICS
START A CATALOGUE
RELEASE DATASETS
GIVE THEM A URI
LINK THEM
MAP IT
DEMOGRAPHIC
BUDGET
Cost of Data Integration – 2 Approaches
19
Source: Price Waterhouse Coopers – Technology Forecast, Spring 2009
Can we afford to mash the data
with ours?
Can we afford to mash the data
with ours?
20
LOD – Huge Economic Potentials
http://www.mckinsey.com/mgi/publications/big_data/index.asp
Data have swept into every industry and business function and are now an important factor of production, alongside labor and capital.
Harnessing big data in the public sector has enormous potential, too. If US health care were to use big data creatively and effectively to drive efficiency and quality, the sector could create more than $300 billion in value every year. Two-thirds of that would be in the form of reducing US health care expenditure by about 8 percent.
In the developed economies of Europe, government administrators could save more than €100 billion ($149 billion) in operational efficiency improvements alone by using big data, not including using big data to reduce fraud and errors and boost the collection of tax revenues.
And users of services enabled by personal location data could capture $600 billion in consumer surplus.
21
Benefit for Developers: APIs
Students 2010/2011 in cities &
communities by school types
Places sorted by city & community numbers
and postal codes
getNumberStudents(PostalCode)
Postal Code: GO
22
Benefit for Developers: LOD
Students 2010/2011 in cities
& communities and by school
types
Places sorted by city & community numbers and by
postal codes
getNumberStudents(PostalCode)
Geoname
s
accident frequency
2010/2011 by cities &
communities
OSM
© Semantic Web Company – http://www.semantic-web.at/ 23
Where is the Semantic Web used? Under the hood of lots of products!
Creating Knowledge out of Interlinked Data
Creating Knowledge out of Interlinked Data
Research focus
•Very large RDF data management•Enrichment & Interlinking•Fusion & Information Quality•Adaptive User Interfaces
3 Use Cases
•Media & Publishing•Linked Enterprise Data•Open Government Data
15 Partners of 12 countries
•University of Leipzig, Germany•DERI Galway, Ireland•FU Berlin, Germany•Semantic Web Company, Austria•OpenLink Software, UK•TenForce, Belgium•Exalead, France•Wolters Kluwer, Germany•Open Knowledge Foundation, UK•CWI, Netherlands•Zemanta, Slovenia•University of Economics, Czech Republic•Instytut Informatyki Gospodarczej, Poland•MP: Institute Mihajlo Pupin, Serbia•KAIST, Korea
LOD2 in a Nutshell
ObjectiveApplying Linked Data technologies in an enterprise stack to support Human Resources (HR) related issues.ENTERPRISE
APPLICATIONS
Exalead, France
MEDIA&
PUBLISHING
Wolters KluwerGermany
LINKED OPEN GOVERNMENT
DATA
Open Knowledge Foundation, UK
ObjectiveImproving accessibility, findability and reusability of Open Government Datapublicdata.eu
ObjectiveSupporting content-related production workflows in the media & publishing industry.
3 Use Cases of LOD2
publicdata.eu – Where LOD can help
© Semantic Web Company – http://www.semantic-web.at/ 27
Problem Statement
• Data catalogue federation: lots of data catalogues in Europe
• No harmonised Metadata
• Many different languages
• Many different formats
• Strong need of broad use & re-use
• Strong need of Interoperability
publicdata.eu – Where LOD can help
© Semantic Web Company – http://www.semantic-web.at/ 28
LOD(2) Approach
• Harmonise meta data via LOD (From ‘soup 2 sense’)
• Provide semantic search mechanisms over all languages & distributed sources
• Create and provide easy to use LOD converters to publish LOD
• Tools for: Analysing & Visualising
• Provision of standard (web) APIs
LOD2: for Public Procurement
© Semantic Web Company – http://www.semantic-web.at/ 29
LOD2 for a Distributed Markeplace for Public Sector Contracts
•Government meets the commerce•Establishing a functional distributed marketplace for public procurement by integrating data in a linked data infrastructure•Crossing boundaries of dedicated applications for tenders, electronic markets, or e-shops.
Goals•Increase business participation in public procurement•Enhance transparency of the public procurement processes•Achieve more efficient resource allocation•Provide for better informed decisions•Defragmentation of public sector demand
Free to Use: The LOD2 Stack
30
1st release of the LOD2 stack available at: http://stack.lod2.euIntegrated distribution of aligned tools which support the life-cycle of Linked Data from The LOD2 Stack also makes use of dataset metadata repositories such as thedatahub.org and http://publicdata.eu.
Online demo athttp://demo.lod2.eu/lod2demo.
Virtual machine imagehttp://stack.lod2.eu/VirtualMachines/
Details & instructionsHOWTO Start document
LOD2: Outreach & Information
31
Open Government Stakeholder Survey 2012 – early 2012http://survey.lod2.eu
Publink LOD Consulting Service – 2nd Call opens November 2011http://lod2.eu/Article/Publink.html
LOD2 webinar series on Linked Open Data tools starting soon!
Where you can find all informationWebsite: http://lod2.euWeblog: http://blog.lod2.eu Twitter: http://twitter.com/lod2projectMailing List: [email protected] SlideShare: http://www.slideshare.net/lod2project
Remark: please use #lod2 on twitter for your posts & connect with: @lod2project Many thanks in advance!! AND: #lod2stack coming soon!
LOD: Interested to learn more?!
32
Join our free webinar!Linked (Open) Data
Management09.11. 2011 – 04.00 – 05.00pm
CEThttp://bit.ly/npomAJ
•evaluate possibilities which linked data can offer•understand how to transform your data to linked data in a controlled and strategic manner•get an idea why linked data technologies are the "agile way of enterprise data integration"•see how to profit from oodles of free available data in the cloud•understand how and why publishing interesting and relevant parts of your data could be a strategic option for you
Infos & Kontakt
33
Martin Kaltenböck, CMCSemantic Web Company GmbH (SWC)
Lerchenfelder Gürtel 43A-1160 Wien
Semantic Web Company (SWC)
Webhttp://www.semantic-web.athttp://blog.semantic-web.at
Phone: +43-1-402 12 35–25
Open Government Data OGD Austria
Austrian Chapter of the OKFN
Webhttp://gov.opendata.at
LOD2Creating Knowledge out
of Interlinked Data
Webhttp://www.lod2.euBloghttp://blog.lod2.eu
Putting the L in frontFrom Open Data to Linked Open Data
Annex
35
Open Government Data Stakeholder Survey 2010http://survey.lod2.eu
I current consume/publish data in the following formats
Interoperabilty & Standards
36
Open Government Data Stakeholder Survey 2010http://survey.lod2.eu
I would ideally like to consume/publish data in the following formats
Interoperabilty & Standards
37
Interoperabilty & Standards
Henry Maudslay (1771 – 1831)
He also developed the first industrially practical screw-cutting lathe in 1800, allowing standardisation of screw thread sizes for the first time. This allowed the concept of interchangeability (a idea that was already taking hold) to be practically applied to nuts and bolts. Before this, all nuts and bolts had to be made as matching pairs only. This meant that when machines were disassembled, careful account had to be kept of the matching nuts and bolts ready for when reassembly took place.http://en.wikipedia.org/wiki/Henry_Maudslay
John Sheridan, OGD2011 Conference, Vienna, Austria: http://www.slideshare.net/semwebcompany/linking-uk-government-data-john-sheridan/
Also take the Metadata into account!
Towards Open Government Metadata (Sept 2010)
In the EU eGovernment Action Plan semantic interoperability is mentioned “…as an essential precondition for open, flexible delivery of eGovernment services”.
Metadata IgnorancePublic administrations should become aware of the importance of Metadata in eGovernment and the need for coherent relevant management policies.
Scattered or Closed MetadataPublic administrations should organize the scattered Metadata in structured repositories,catalogues or libraries and provide open access to the collected resources.
Open Metadata for Humans & Open Reuseable MetadataPublic administrations should provide services to query, browse and export their Metadata in amachine-readable and preferably non-proprietary format (e.g. CSV, XML).
Linked Open MetadataPublic administrations should consider applying linked metadata policies, including use of RDFto document their Metadata, persistent design, use and maintenance of URIs, linking toexternal vocabularies/schemata, harmonize their resources to third parties' resources etc.
http://www.semic.eu/semic/view/documents/towards_open_government_metadata.pdf
As a reegle mash-up, the country energy profiles provide a comprehensive set of energy-related information on over 200 countries including:
oContact details of key stakeholders
oOverview on policy and regulations
oLinks to project outcome documents programmes and tools
oStatistical data visualised charts
oMaps of renewable energy sources
www.reegle.info
Linked Open Data in Use: Best Practise