Trentino government linked open geodata: first resultsinspire.ec.europa.eu › events ›...
Transcript of Trentino government linked open geodata: first resultsinspire.ec.europa.eu › events ›...
Trentino government linked open
geodata: first results
Pavel Shvaiko, Informatica Trentina
Feroz Farazi, University of Trento
Daniela Ferrari, Segreteria SIAT, PAT
Giuliana Ucelli, Segreteria SIAT, PAT
Lorenzino Vaccari, Joint Research Center
Vincenzo Maltese, University of Trento
Veronica Rizzi, University of Trento
A. Ivanyukovich, Trient Consulting Group
Outline
� Introduction
� Linked Data
� RDF
� Trentino Geo-data
� Publishing Pipeline
� Conversion
� Linking
� Sharing
�Mash-up application
� Conclusion
1 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
2 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Introduction
� Open Government Data
� Data owned by the government authorities
� Making data publicly available
� Principles to comply with
� Complete, Primary
� Timely, Accessible
� Machine processable, Non discriminatory
� Non proprietary format, Open license
� Advantages
� To help the government function better; transparency, citizens’ involvement
� To leverage the data to produce new business models and opportunities
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
3 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Linked Data
� Linked data is about both
� Publishing data on the web
� Making links to external data sources
� The four rules
(i) URIs as the identifiers of things
(ii) HTTP URIs
(iii) Information should be served against a URI
(iv) Make links to other data sources
� 5 star rating system data(★) On the web, any format, open license (2★) Machine-readable structured data
(3★) Non-proprietary format (e.g., CSV or TSV) (4★) In RDF and SPARQL to retrieve
(5★) Linked to other external data sources
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
4 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
RDF
� RDF (Resource Description Framework)
� A language for representing data in the Semantic Web
� a simple data model for making statements
� the capability to perform inference on the statements
� Data model in RDF
� The data model in RDF is a graph data model
� An edge with two connecting nodes form a triple
� Triple elements are subject, object and predicate
� RDF representation
� URIs to identify subjects, objects and predicates
� Objects can be Literals
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
5 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Trentino GeoData
� Context
� Original datasets used in the semantic geo-catalogue application
� The datasets consist of both data (shape files) and metadata (xml)
� Production of Linked Open Data
� 161 datasets were converted to RDF
� By following Open Government Data paradigm
� By following Linked Open Data paradigm
� Goal of producing Linked Open Data
� To utilize its power and potential in developing applications quickly
� To obtain insights on how the services can be improved
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
DataShape files
GeoToolsData
XML files
MetadataXML files
SAX Parser
Jena RDF
� Linking
� Links were established to external data sources (e.g., DBPedia, Freebase)
� High quality of the links were guaranteed through validating them manually
� We linked classes (e.g., lake), and entities (e.g., Lake Garda) using OWL:sameAs
� Sharing
� The produced RDF data is made available on a web server for sharing
� For each class a different RDF file is produced
� The data and metadata were published with CC-Zero license
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
� Conversion
7 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Publishing Pipeline
� Metadata in RDF
<rdf:Description rdf:about =
"http://www.territorio.provincia.tn.it/geodati/p_tn:piste_ciclabili">
<dc:language>it</dc:language>
<dcmibox:westlimit>10.41</dcmibox:westlimit>
<dc:identifier>http://www.naturambiente.provincia.tn.it/</dc:identifier>
<dc:format>shp</dc:format>
<dc:rights>Dato pubblico</dc:rights>
<dc:title>Piste ciclabili</dc:title>
<dc:creator>Dipartimento Risorse Forestali e Montane</dc:creator>
</rdf:Description>
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
8 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Publishing Pipeline
� Data in RDF
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
<rdf:Description rdf:about="http://www.territorio.provincia.tn.it/geodati/resource/piste_ciclabili">
<rdf:type rdf:resource="http://www.w3.org/2002/07/owl#Class"/>
<owl:sameAs rdf:resource =
"http://rdf.freebase.com/ns/guid.9202a8c04000641f8000000000428308"/>
</rdf:Description>
<rdf:Description rdf:about =
"http://www.territorio.provincia.tn.it/geodati/resource/piste_ciclabili/529">
<geontology:length rdf:datatype =
"http://www.w3.org/2001/XMLSchema#double">1445.8484810675</geontology:length>
<rdfs:label xml:lang="it">Mori - torbole</rdfs:label>
<rdf:type rdf:resource="http://www.territorio.provincia.tn.it/geodati/resource/piste_ciclabili"/>
<rdfs:label xml:lang="it">529</rdfs:label>
<geo:geometry rdf:resource =
"http://www.territorio.provincia.tn.it/geodati/resource/piste_ciclabili_529"/>
</rdf:Description>
<rdf:Description rdf:about =
"http://www.territorio.provincia.tn.it/geodati/resource/piste_ciclabili_529">
<geontology:polyline>646339.346896746,5082179.74045936
…
645575.851739799,5081173.68539361
</geontology:polyline>
</rdf:Description>
Class
Instance
Geometric Shape Polyline
Structured Data Text
RDFizers for CSV, XML, Excel, …
Entity
Calais)
Entity Extractor (e.g. Calais)
Relational Database
Data Source with API
RDF StoreRDF files
CMS with RDFa Output (e.g. Drupal)
Custom Linked Data Wrapper
Linked Data Interface (e.g. Pubby)
RDB-to-RDF Wrapper (e.g. D2R)
Web Server (e.g. Apache)
Linked Data on the Web
3. Data
Sharing
2. Data Representation
1. Data Conversion
Raw Data
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
T. Heath and C. Bizer. Linked Data book (2011)
10 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Mash-up application
� The developed mash-up on top of the produced Linked Data
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
11 INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Conclusion
�We have presented our experimental work on:
� Producing Trentino Government data as Linked Open Data
� Releasing in total 161 Trentino Government datasets
� Representation language and vocabulary
� RDF for representing both the data and metadata
� WGS84 vocabulary for data and Dublin Core for metadata
� OWL for linking to external sources such as DBPedia and Freebase
� Future work� Includes the definition of geometrical terms through the NewGeo Geometry
� publication of the datasets on the LOD cloud
INTRODUCTION :: LINKED DATA :: RDF :: TRENTINO GEO-DATA :: PUBLISHING PIPELINE :: MASH-UP :: CONCLUSION
INSPIRE Conference – June 27th 2012 Feroz Farazi – University of Trento
Thank you!