Semantic Web For Hack Days

28
Semantic Mashups for Hack Days Laurent Lefort, CSIRO ICT Centre, Information Engineering Laboratory Canberra Semantic Meetup, 2 August 2010 Photo by JJ Halans

description

Canberra Semantic Web Meetup, 2 August 2010 The talk objective is to encourage the Meetup members to participate and prepare the Sydney Amped Hack Day (October 16 in Sydney: http://ampedweb.org/ ).

Transcript of Semantic Web For Hack Days

Page 1: Semantic Web For Hack Days

Semantic Mashups for Hack DaysLaurent Lefort, CSIRO ICT Centre, Information Engineering Laboratory

Canberra Semantic Meetup, 2 August 2010

Photo by JJ Halans

Page 2: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Bio

• Over 25 years ago: Computer science engineer

• Almost 10 years ago: CSIRO ICT Centre, Canberra

• Over 5 years ago: Semantic Web researcher

• Ontology design patterns

• 1 year ago: W3C Australia Office manager & guest programmer for the W3C Track at Web Directions South

• Armin Haller (Meetup co-organiser) is now in charge of the Australian Office

• About 1 year ago: GovHack host

• Now: W3C Semantic Sensor Network XGco-chair

• To be continued …

Page 3: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Taste le …

I don’t use (yet)

I use

I want to use more

Linked Data Api

(Gloze)

2222

1111

PURLz

Callimachus

Page 4: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

My semantic web journey

0.01

0.1

1

10

100

1000

o11

o12

o65

o56

o23

o22

o30

o49

o24

o57

o27

Fact++RACERPellet(Direct)CEL

WHEN

WHEREIN

WHATObservationSampling Feature

Observed

property

Metadata

Procedure

Result

Time* Geometrical types

Units

Quantities

Taxa

Chemistry

Temporal types

Coord. Sys

Vertical Coord. Sys

Medium

Fraction

Processing & interpolation

Validation &

quality flag

Sensor

(Instrument)

Station

PlatformSite

Water

Feature

Result type

Sampled Feature

Institution and project

System and author

Security classif.

Transaction type

Gauge/weir

layout/profile

Missing data

Feature property

?

?

?

?

Survey type

Process

Action

Event

Multi-dependent concepts

Feature-dep. parameters

Feature-indep. parameters

Abstract concepts

Semi-abstract concepts

Semi-primitive concepts

Primitive concepts

O&M amd GFM stereotypes

Simple classes

Classes w/ ident. instances

Onto category to be defined

Time* : two O&M stereotypes (sampling time and result time)

Features types

?

?

Surfing www to learn about o. from Gio W.?

1995 2004 2006 2007 2008

OWL-ifyingthe “motherof all colorcharts”

EL+-ifyingaircraft data to test reasonersover large o.

OWL-ifyingO&M-based stds: uml, xsd, vocabs

Semantic Sensor ontology

OWL-ifyingwater vocabs(4-in-one o.)

2009 2009 2009 2009 2010

SKOS-ifyingUoM and quantities

Semantic Markup for XML-based stds

TriplifyingUoM and quantities

Linking Open Standards !

=xlink:role C

=xlink:href myC

Page 5: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Semantic Web and Web of Data

SKOS without dataOntologist

Will use OWLLink

Linked Open Data user

Also uses SPARQL

SKOS with data

I want both

I need loose coupling (follow my nose) I need tight coupling (reasoner-boosted links)

Page 6: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

The Semantic Intensity scale

• RIF: to exchange rules among rule systems (which use a logic which is different to what OWL uses)

• ---- Ontologies stop here ----• OWL2: description logic stretched as far

as possible without breaking it: richer axioms, meta-modelling, additional capabilities to handle data

• OWL 1: triples arranged according to description logic principles

• SPARQL: triples arranged according to “web-ready schema”

• SKOS: triples structured as thesaurus, simple taxonomies

• RDF: triples

2222

1111

Low Intensity apps

High Intensity

Page 7: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Producing good quality ontologies canhelp to get good linked data pathways

• An ontology is always an approximation of the intended model

• To avoid mismatches between the intent and the result, it is better to have:

• domain experts with a broader knowledge of the domain

• experts with a narrower field of expertise

• But for experts to work efficiently together, you’ll need to solve the linguistic issues first

Figure by Krzysztof Janowicz , “invited expert”, SSN XG

Page 8: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Solving linguistic issues (W3C Semantic Sensor Network XG example)

• VIM_terms• Measurement (VIM 2.1): process of experimentally obtaining one or more

quantity values that can reasonably be attributed to a quantity• Measurement (MINET): Process of associating numbers, in an empirical

and objective way, to characteristics of objects and events of the real world in a way so as to describe them

• SWE_terms• Measurement (noun) (SensorML): an observation whose result is a

measure• Measurement (verb) (SensorML): An instance of a procedure to estimate

the value of a natural phenomenon, typically involving an instrument or sensor. This is implemented as a dynamic feature type, which has a property containing the result of the measurement. The measurement feature also has a location, time, and reference to the method used to determine the value. A measurement feature effectively binds a value to a location and to a method or instrument.

• Wikipedia_terms• Measurement (Wikipedia) In science, measurement is the process of

obtaining the magnitude of a quantity, such as length or mass, relative to a unit of measurement, such as a meter or a kilogram. The term can also be used to refer to the result obtained after performing the process.

• Tip: check if Wikipedia has a disambiguation list

Page 9: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

OWL in Protege

Protégé is coupled to a description logic reasonerwhich can infer subsumption relationships

Page 10: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

SKOS example in SESAME (with data)

RDF data can be queried by others using a REST-based service (what is know as a SPARQL endpoint)

Page 11: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

OWL with data in Protégé

OWL skeleton populated with data (here SysML-QUDV)

Must have: a def. and a citation

Page 12: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Semantic Mashup options across the 3 main families of W3C standards: XML, Semantic Web and HTML

XML + HTML(e.g. KML)

“Lifted” XMLXML annotated with OWL, SKOS, RDF

HTML + RDFa

HTML + RDFa + content sourced from annotated XML, SPARQL endpoints or even other RDFa pages

HTML + content from SPARQLendpoints: OWL, SKOS, RDF

Semantic Web resources

Page 13: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Linking Open Standards

• Watch for the transition from XML-based standards to either RDF-only or hybrid XML-RDF standards

• Watch for the transition from HTML-based standards to either hybrid XML-RDF-HTML standards

• Standard Development Organisations should work together to agree on common approaches to:

• Develop and apply a core set of principles and methods to mix specifications snippets from XML, RDF and HTML

• Including extensible APIs like the SPARQL and RDFa

• Allow tight and loose couplings • between ontologies• between ontologies and linked open data resources• between linked open data resources

• Split ontologies into manageable modules: the issue here is that the definition of a module can vary a lot …

Page 14: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

For the semantic hack fest, we have the Open Data, but it’s not linked (not avail. as SPARQL)!

Page 15: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

LOD constellation: 13.1 billions triples and 142 millions RDF links (late 2009)

Page 16: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Adding Australian content to the LOD constellation Geo/Gov, People/communities, Science

LinkedMDB

Gov-Track

US Census Data

riese

Geo-names

lingvoj

World Fact-book

Euro-stat

flickr wrappr

Open Calais

RevyuSIOCSites

Doap-space Flickr

exporter

FOAF Profiles

Crunch base

Open-Guides

Pub Guide

W3CWordNet

Open Cyc

UMBEL

Yago

DBpedia

Freebase

Virtuoso Sponger

DBLPHannover

RDF Book Mashup

Project Guten-berg

DBLPBerlin

IEEE

CiteSeer

DBLP RKBExplorer

eprints

RKBECS South-

ampton

Linked GeoData

TO DO: Australian clouds

TO DO: Australian links

?

?

?

Also to consider …all the cultural clouds (e.g. Music)All the health & life sciences clouds

Page 17: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Class-to-Class link pathways in the UMBEL LOD Constellation (Oct 2008)

damltime -- Time Zone o. doap -- Description of a Project o. foaf -- Friend-of-a-Friend o. geo -- Geo wgs84 o. geonames -- GeoNames o. opencyc -- OpenCycknowledge base sioc -- Socially Interlinked Online Communities o. sioc_types -- SIOC extension umbel -- Upper Mapping and Binding Exchange Layer o. wordnet -- WordNet lexical o.

Page 18: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Why a semantic hack fest?To ease the transition from Web 2.0 to Web 3.0

• Mashups developed for govhack, apps4nsw, app-my-state VIChave used semantic web technos very rarely

• Report of the Gov 2.0 Taskforce• […] Information should be: free, easily discoverable, based on

open standards and therefore machine-readable, properly documented and therefore understandable, licensed to permit freereuse and transformation by others

• […] governments have a role to play in leading and encouraging the uptake of Web 3.0 technologies in support of greater innovation based on the reuse of public sector information and enhanced citizen/government interaction.

• e.g. RDFa usage in data.gov.au

• Having a semantic hack fest can help: • To demonstrate benefits on concrete examples: hack days are a

great way to get awesome results • To accelerate adoption at a larger scale by industry

Page 19: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

The Amped Hack day

• Two free events organised by Web Directions South at the end of the Web Directions Conference

• Atlanta September 25th• Sydney October 16th

• Sydney event• Hosted by the PowerHouse Museum• Not yet fully booked: http://ampedweb.org/

• John Allsopp “likes the idea”• Semantic Geeks! I love it. I think this is a great addition.

Page 20: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Semantic Mashup for Hack days: What’s required

• Ontologies which could add value to mashups: e.g. my colour ontology for images,, …

• Australian content to play with: triplified (linkable) datasets for different domains: .gov (and .com, .org), geo + …, science, libraries/museums

• Lots of tools with hackable examples • More publicity:

• Semantic hackfest talk scheduled in the W3C Australia track at the Web Directions South Conference (just before Amped).

• Public announcement elsewhere: SW Meetup, THATCamp, others?

• A workspace or two: one to prepare and one for the hack fest participants

• Helpers (before the hack day) and mentors (during the event) for all of the above

Page 21: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Ideas for the hack day

• 1st idea. Mix:• Paul Hagon Searching by Colour Metadata Australia 2010

• My colour ontology (need some rework)• If we put the two approaches together, we should learn if:

• We can build a better colour ontology

• We can build better links between colours and tags or topics

• We can demonstrate that ontologies are really adding value

• 2nd idea. Mix:• Govhack’s mashups based on data about organisations

(companies, government agencies)

• The organisation ontology published by Epimorphics

Page 22: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Paul’s idea: correlate the flickr tags and the colours of images selected with image magick

Paul Hagon Searching by ColourMetadata Australia 2010

Page 23: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Bonus we can have a colour ontology which provides the true tags

Color QualeSpatial Region

NBS ISCCcentroids

Individual colors

MunsellHue

• HTML tables• 267 centroids

• 1000’s of individual colours http://tx4.us/

Page 24: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

It is also citeable: the original references are provided

House and Garden magazine, in Supplementary Index of Color names (Dictionary of Color)

U.S. Department of Agriculture Soil Charts

Postage-Stamp Color Names, William H. Beck

National Research Council, Rock-Color Chart

Commercial Standard CS156-49, Colors for Polystyrene Plastics

Commercial Standard CS147-47, Colors for Molded Urea Plastics

Horticultural Colour Charts, R. F. Wilson

Federal Specification TT-C-595, Colors; (for) Ready-Mixed Paints

Colour Terms in Biology, H.A. Dade

American Association of Textile Chemists and Colorists and Society of Dyers and Colourists

Textile Color Card Association Standard Color Card of America and U.S. Army Color Card

Taylor, Knoche & Granville, Descriptive Color Names Dictionary

Ridgway, Color Standards and Color Nomenclature

Plochere Color System

Maerz and Paul, Dictionary of Color

Kelly and Judd (1976)Color Universal Language and Dictionary of Names. National Bureau of Standards special publication 440

Page 25: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Random example Amber Yellow

http://tx4.us/nbs/nbs-a.htm

Page 26: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Another example: organizations (.gov, .com and .org.)

• Some effort required to fix the missing links: ontology then data• Company codes: ANZSIC vs. UNSPCC

• Government codes: AGIFT thesaurus vs. UK’s Government Category List

• Epimorphics (UK) and W3C eGov IG working on this topic• Organization ontology developed to be LOD-ready

Page 27: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

Linked Open Data (LOD) opportunities

• W3C• eGov IG or incubators

• local initiative (W3C Australia to help – status to be defined)• W3C track speakers (Web Directions South)

• Linked Open Data Around the Clock project (EU)• Contact: Michael Hausenblas (DERI) • Semantic Web Atlanta already involved (advisory committee)

• Amped hack day • Opportunity to engage with front end developers• Opportunity to liaise with other SW Meetups (Sydney in Australia,

Atlanta and maybe Austin in the US)

Page 28: Semantic Web For Hack Days

CSIRO. Semantic Mashups for Hack days

CSIRO ICT CentreLaurent Lefort @laurentlefortOntologist, Information Engineering Lab W3C Australia Office @W3CAustralia

Photo: Web DirectionsCC BY-NC 2.0