Introduction to SDshare

Post on 15-Jan-2015


An introduction to the SDshare protocol for replicating/syndicating Atom feeds of changes in Topic Maps or RDF stores

Transcript of Introduction to SDshare

1

An introduction to SDshare

2011-03-15
Lars Marius Garshol, <larsga@bouvet.no>
http://twitter.com/larsga

2

Overview of SDshare

3

SDshare

• A protocol for tracking changes in a semantic datastore
  – essentially allows clients to keep track of all changes, for replication purposes
• Supports both Topic Maps and RDF
• Based on Atom
• Highly RESTful
• A CEN specification

4

Basic workings

[Diagram: the Server sends a stream of Fragments to the Client]

Server publishes fragments representing changes in the datastore.

Client pulls these in, updating its local copy of the dataset.

There is, however, more to it than just this

5

What more is needed?

• Support for more than one dataset per server
  – this means: more than one fragment stream
• How do clients get started?
  – a change feed is nice once you've got a copy of the dataset, but how do you get a copy?
• What if you miss out on some changes and need to restart?
  – must be a way to reset the local copy
• The protocol supports all this

6

Two new concepts

• Collection
  – essentially a dataset inside the server
  – exact meaning is not defined in the spec
  – will generally be a topic map (TMs) or a graph (RDF)
• Snapshot
  – a complete copy of a collection at some point in time

7

Feeds in the server

[Diagram: Overview feed → Collection feeds; each collection feed links to a Fragment feed (→ fragments) and a Snapshot feed (→ snapshots)]

8

An overview feed

<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:sdshare="http://www.egovpt.org/sdshare">
  <title>SDshare feeds from localhost</title>
  <updated>2011-03-15T18:55:38Z</updated>
  <author>
    <name>Ontopia SDshare server</name>
  </author>
  <id>http://localhost:8080/sdshare/</id>
  <link href="http://localhost:8080/sdshare/"></link>
  <entry>
    <title>beer.xtm</title>
    <updated>2011-03-15T18:55:38Z</updated>
    <id>http://localhost:8080/sdshare/beer.xtm</id>
    <link href="collection.jsp?topicmap=beer.xtm"
          type="application/atom+xml"
          rel="http://www.egovpt.org/sdshare/collectionfeed"></link>
  </entry>
  <entry>
    <title>metadata.xtm</title>
    <updated>2011-03-15T18:55:38Z</updated>
    <id>http://localhost:8080/sdshare/metadata.xtm</id>
    <link href="collection.jsp?topicmap=metadata.xtm"
          type="application/atom+xml"
          rel="http://www.egovpt.org/sdshare/collectionfeed"></link>
  </entry>
</feed>
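As a sketch of the hypertext navigation this feed enables, the following Python snippet (an illustration, not part of the spec) parses a trimmed copy of the overview feed and extracts the collection feed links via their rel value:

```python
# Sketch: discovering collection feeds from an SDshare overview feed.
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"
COLLECTION_REL = "http://www.egovpt.org/sdshare/collectionfeed"

overview = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>SDshare feeds from localhost</title>
  <entry>
    <title>beer.xtm</title>
    <link href="collection.jsp?topicmap=beer.xtm"
          type="application/atom+xml"
          rel="http://www.egovpt.org/sdshare/collectionfeed"/>
  </entry>
</feed>"""

def collection_links(feed_xml):
    """Return (title, href) pairs for every collection feed in the overview."""
    root = ET.fromstring(feed_xml)
    result = []
    for entry in root.iter(ATOM + "entry"):
        title = entry.findtext(ATOM + "title")
        for link in entry.iter(ATOM + "link"):
            if link.get("rel") == COLLECTION_REL:
                result.append((title, link.get("href")))
    return result

links = collection_links(overview)  # each pair: (collection title, feed URL)
```

Note that only the overview URL needs to be known in advance; everything else is discovered from link elements.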

9

The snapshot feed

• A list of links to snapshots of the entire dataset (collection)

• The spec doesn't say anything about how and when snapshots are produced

• It's up to implementations to decide how they want to do this

• It makes sense, though, to always have a snapshot for the current state of the dataset

10

Example snapshot feed

<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:sdshare="http://www.egovpt.org/sdshare">
  <title>Snapshots feed for beer.xtm</title>
  <updated>2011-03-15T19:12:34Z</updated>
  <author>
    <name>Ontopia SDshare server</name>
  </author>
  <id>file:/Users/larsga/data/topicmaps/beer.xtm/snapshots</id>
  <sdshare:ServerSrcLocatorPrefix>file:/Users/larsga/data/topicmaps/beer.xtm</sdshare:ServerSrcLocatorPrefix>
  <entry>
    <title>Snapshot of beer.xtm</title>
    <updated>2011-03-15T19:12:34Z</updated>
    <id>file:/Users/larsga/data/topicmaps/beer.xtm/snapshot/0</id>
    <link href="snapshot.jsp?topicmap=beer.xtm"
          type="application/x-tm+xml; version=1.0" rel="alternate"></link>
  </entry>
</feed>
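To bootstrap, a client would pick a snapshot from this feed and download it. A minimal Python sketch against a trimmed copy of the feed above; sorting by timestamp to find the newest entry is an assumption, not something the spec mandates:

```python
# Sketch: picking the newest snapshot from an SDshare snapshot feed.
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"

snapshot_feed = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <title>Snapshot of beer.xtm</title>
    <updated>2011-03-15T19:12:34Z</updated>
    <link href="snapshot.jsp?topicmap=beer.xtm"
          type="application/x-tm+xml; version=1.0" rel="alternate"/>
  </entry>
</feed>"""

def newest_snapshot(feed_xml):
    """Return the href of the most recently updated snapshot entry.
    ISO 8601 timestamps in the same zone sort correctly as strings."""
    root = ET.fromstring(feed_xml)
    entries = sorted(root.iter(ATOM + "entry"),
                     key=lambda e: e.findtext(ATOM + "updated") or "")
    link = entries[-1].find(ATOM + "link")
    return link.get("href")
```

The client would then GET that href and load the result as its initial local copy.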

11

The fragment feed

• For every change in the topic map, there is one fragment
  – the granularity of changes is not defined by the spec
  – it could be per transaction, or per topic changed
• The fragment is basically a link to a URL that produces a part of the dataset

12

An example fragment feed

<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:sdshare="http://www.egovpt.org/sdshare">
  <title>Fragments feed for beer.xtm</title>
  <updated>2011-03-15T19:21:20Z</updated>
  <author>
    <name>Ontopia SDshare server</name>
  </author>
  <id>file:/Users/larsga/data/topicmaps/beer.xtm/fragments</id>
  <sdshare:ServerSrcLocatorPrefix>file:/Users/larsga/data/topicmaps/beer.xtm</sdshare:ServerSrcLocatorPrefix>
  <entry>
    <title>Topic with object ID 4521</title>
    <updated>2011-03-15T19:20:03Z</updated>
    <id>file:/Users/larsga/data/topicmaps/beer.xtm/4521/1300216803730</id>
    <link href="fragment.jsp?topicmap=beer.xtm&amp;topic=4521&amp;syntax=rdf"
          type="application/rdf+xml" rel="alternate"/>
    <link href="fragment.jsp?topicmap=beer.xtm&amp;topic=4521&amp;syntax=xtm"
          type="application/x-tm+xml; version=1.0" rel="alternate"/>
    <sdshare:TopicSI>http://psi.example.org/12</sdshare:TopicSI>
  </entry>
</feed>
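A client consuming this feed needs each entry's TopicSI and a fragment link in a syntax it understands. A Python sketch over a trimmed copy of the feed; preferring application/rdf+xml is just an example choice:

```python
# Sketch: extracting (TopicSI, fragment URL) pairs from a fragment feed.
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"
SD = "{http://www.egovpt.org/sdshare}"

fragment_feed = """<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:sdshare="http://www.egovpt.org/sdshare">
  <entry>
    <title>Topic with object ID 4521</title>
    <updated>2011-03-15T19:20:03Z</updated>
    <link href="fragment.jsp?topicmap=beer.xtm&amp;topic=4521&amp;syntax=rdf"
          type="application/rdf+xml" rel="alternate"/>
    <link href="fragment.jsp?topicmap=beer.xtm&amp;topic=4521&amp;syntax=xtm"
          type="application/x-tm+xml; version=1.0" rel="alternate"/>
    <sdshare:TopicSI>http://psi.example.org/12</sdshare:TopicSI>
  </entry>
</feed>"""

def fragments(feed_xml, preferred_type="application/rdf+xml"):
    """Yield (topic_si, fragment_href), picking the link whose MIME type matches."""
    root = ET.fromstring(feed_xml)
    for entry in root.iter(ATOM + "entry"):
        si = entry.findtext(SD + "TopicSI")
        for link in entry.iter(ATOM + "link"):
            if link.get("type") == preferred_type:
                yield si, link.get("href")

pairs = list(fragments(fragment_feed))
```

The same entry offering both XTM and RDF links is how one server serves both kinds of clients.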

13

What is a fragment?

• Essentially, a piece of a topic map
  – that is, a complete XTM file that contains only part of a bigger topic map
  – typically, most of the topic references will point to topics not in the XTM file
• Downloading more fragments will yield a bigger subset of the topic map
  – the automatic merging in Topic Maps will cause the fragments to match up
• Exactly the same applies in RDF

14

An example fragment

<topicMap xmlns="http://www.topicmaps.org/xtm/1.0/"
          xmlns:xlink="http://www.w3.org/1999/xlink">
  <topic id="id4521">
    <instanceOf>
      <subjectIndicatorRef xlink:href="http://psi.garshol.priv.no/beer/pub"></subjectIndicatorRef>
    </instanceOf>
    <subjectIdentity>
      <subjectIndicatorRef xlink:href="http://psi.example.org/12"></subjectIndicatorRef>
      <topicRef xlink:href="file:/Users/larsga/data/topicmaps/beer.xtm#id2662"></topicRef>
    </subjectIdentity>
    <baseName>
      <baseNameString>Amundsen Bryggeri og Spiseri</baseNameString>
    </baseName>
    <occurrence>
      <instanceOf>
        <subjectIndicatorRef xlink:href="http://psi.ontopia.net/ontology/latitude"></subjectIndicatorRef>
      </instanceOf>
      <resourceData>59.913816</resourceData>
    </occurrence>
    ...
  </topic>
  ...
</topicMap>

15

Applying a fragment

• The feed contains a URI prefix
  – this is used to create item identifiers tagging statements with their origin
• For each TopicSI, find that topic, then
  – for each statement, remove the matching item identifier
  – if the statement now has no item identifiers, delete it
• Merge in the received fragment
  – then tag all statements in it with the matching item identifier
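The bookkeeping above can be sketched with a deliberately simplified data model: statements as plain tuples, and a dict mapping each statement to its set of item identifiers (origins). This is an illustration of the algorithm, not a real Topic Maps implementation:

```python
# Sketch of applying a fragment, on a toy statement store.
# store: dict {statement tuple: set of item identifiers (origins)}

def apply_fragment(store, topic_sis, fragment, prefix):
    """topic_sis: subject identifiers of the changed topics.
    fragment: the statements received for those topics.
    prefix: the feed's ServerSrcLocatorPrefix, identifying the source."""
    for stmt in list(store):
        subject = stmt[0]
        if subject in topic_sis:
            # remove this source's item identifier from the statement
            store[stmt].discard(prefix)
            # a statement no source vouches for any more is deleted
            if not store[stmt]:
                del store[stmt]
    # merge in the fragment, tagging each statement with this source
    for stmt in fragment:
        store.setdefault(stmt, set()).add(prefix)
    return store

# usage: the old statement from this source is replaced by the new one
store = {("http://psi.example.org/12", "name", "Old name"): {"src"}}
apply_fragment(store,
               {"http://psi.example.org/12"},
               [("http://psi.example.org/12", "name",
                 "Amundsen Bryggeri og Spiseri")],
               "src")
```

Note how statements also tagged by other sources survive the deletion step, and how applying the same fragment twice leaves the store unchanged, which is the idempotence property claimed on the next slide.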

16

Properties of the protocol

• HATEOAS
  – uses hypertext principles
  – only endpoint is that of the overview feed
  – all other URLs available via hypertext
• Applying a fragment is idempotent
  – i.e. the result is the same, no matter how many times you do it
• Loose binding
  – very loose binding between server and client
• Supports federation of data
  – client can safely merge data from different sources

17

SDshare push

• In normal SDshare, data receivers connect to the data source
  – basically, they poll the source with GET requests
• However, the receiver is not always allowed to make connections to the source
  – SDshare push is designed for this situation
• Solution is a slightly modified protocol
  – source POSTs Atom feeds with inline fragments to the recipient
  – this flips the server/client relationship
• Not part of the spec; unofficial Ontopia extension

18

Uses of SDshare

19

Example use case #1

[Architecture diagram: Frontend · Portal · Ontopia DB2TM · JDBC · Database]

20

Example use case #1

[Architecture diagram: Frontend · Portal · Ontopia DB2TM · Database · ESB · Service #1 · Service #3 · SDshare · Ontopia SDshare]

21

NRK/Skole today

[Architecture diagram: Editorial server (MediaDB, DB2TM) · DB server 1 · DB server 2 · JDBC · Prod #1 · Prod #2 · Database · Server · Production environment · nrk-grep.xtm export/import across the firewall]

22

NRK/Skole with SDshare push

[Architecture diagram: Editorial server (MediaDB, DB2TM) · DB server 1 · DB server 2 · JDBC · Prod #1 · Prod #2 · Database · Server · Production environment · SDshare push across the firewall]

23

Hafslund

[Architecture diagram: ERP · GIS · CRM · ... · UMIC · Archive · Search engine]

24

Hafslund architecture

• The beauty of this architecture is that SDshare insulates the different systems from one another

• More input systems can be added without hassle

• Any component can be replaced without affecting the others

• Essentially, a plug-and-play architecture

25

A Hafslund problem

• There are too many duplicates in the data
  – duplicates within each system
  – also duplication across systems
• How to get rid of the duplicates?
  – unrealistic to expect cleanup across systems
• So, we build a deduplicator
  – and plug it in...

26

DuKe plugged in

[Architecture diagram: ERP · GIS · CRM · ... · UMIC · Archive · Search engine · Dupe Killer]

27

Implementations

28

Current implementations

• Web3
  – both client and server
• Ontopia
  – ditto, plus SDshare push
• Isidorus
  – don't know
• Atomico
  – server framework only; no actual implementation

29

Ontopia SDshare server

• Event tracker
  – taps into the event API, where it listens for changes
  – maintains in-memory list of changes
  – writes all changes to disk as well
  – removes duplicate changes and discards old changes
• Web application based on tracker
  – JSP pages producing feeds and fragments
  – one fragment per changed topic, sorted by time
  – only a single snapshot of the current state of the TM

30

Ontopia SDshare client

• Web UI for management
• Pluggable frontends
• Pluggable backends
• Combine at will
• Frontends
  – Ontopia: event listener
  – SDshare: polls Atom feeds
• Backends
  – Ontopia: applies changes to Ontopia locally
  – SPARQL: writes changes to RDF repo via SPARUL
  – push: pushes changes over SDshare push

[Diagram: SDshare client · Web UI · Ontopia events · Core logic · Ontopia backend · SPARQL Update · SDshare push]

31

Web UI to client

32

Problems with the spec

33

What if many fragments?

• The size of the fragments feed grows enormous
  – expensive if polled frequently
• Paging might be one solution
  – basically, end of feed contains pointer to more
• "since" parameter might be another
  – allows client to say "only show me changes since ..."
• Probably need both in practice

http://projects.topicmapslab.de/issues/3675
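To illustrate the proposed "since" parameter (not part of the current spec, so this helper is purely hypothetical), a client might construct its polling URL like this:

```python
# Sketch: building a fragment-feed polling URL with the proposed
# "since" parameter. Illustrative only; the spec does not define it.
from urllib.parse import urlencode

def fragments_url(base, since=None):
    """Append ?since=... or &since=... to the feed URL, if given."""
    if since is None:
        return base
    sep = "&" if "?" in base else "?"
    return base + sep + urlencode({"since": since})

# timestamps get percent-encoded, e.g. the colons in 2011-03-15T19:21:20Z
url = fragments_url("fragments.jsp?topicmap=beer.xtm", "2011-03-15T19:21:20Z")
```

The client would remember the newest "updated" value it has seen and pass it back on the next poll.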

34

Ordering of fragments

• Should the spec require that fragments be ordered?
  – not really necessary if all fragment URIs return the current state (instead of the state at the time the fragment entry was created)

35

RDF fragment algorithm

• The one given in the spec makes no sense

• Relies on Topic Maps constructs not found in RDF

• Really no way to make use of it

http://projects.topicmapslab.de/issues/4013

36

Our interpretation

• Server prefix is URI of RDF named graph

• Fragment algorithm therefore becomes
  – delete all statements about changed resources
  – then add all statements in fragment

• Means each source gets a different graph
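Under this interpretation, applying a fragment maps naturally onto a SPARQL 1.1 Update request. A sketch that builds such a request as a string; the predicate in the usage example is hypothetical:

```python
# Sketch: turning one RDF fragment into a SPARQL 1.1 Update under the
# interpretation above (server prefix = named graph URI).

def fragment_update(graph_uri, resources, triples):
    """First delete every statement about the changed resources in this
    source's named graph, then insert the fragment's triples.
    'triples' are (s, p, o) with terms already formatted for SPARQL."""
    deletes = "\n".join(
        "DELETE WHERE { GRAPH <%s> { <%s> ?p ?o } };" % (graph_uri, r)
        for r in resources)
    inserts = "\n  ".join("%s %s %s ." % t for t in triples)
    return "%s\nINSERT DATA { GRAPH <%s> {\n  %s\n} }" % (
        deletes, graph_uri, inserts)

update = fragment_update(
    "file:/Users/larsga/data/topicmaps/beer.xtm",  # the ServerSrcLocatorPrefix
    ["http://psi.example.org/12"],
    [("<http://psi.example.org/12>",
      "<http://example.org/name>",                 # hypothetical predicate
      '"Amundsen Bryggeri og Spiseri"')])
```

Because all of a source's statements live in its own named graph, deleting "all statements about the resource" cannot disturb data federated in from other sources.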

37

TopicSL/TopicII

• Currently, topics can only be identified by subject identifier
  – but not all topics have one
• Solution
  – add elements for subject locators and item identifiers

http://projects.topicmapslab.de/issues/3667

38

Paging of snapshots?

• What if the snapshot is vast?
  – clients probably won't be able to download and store the entire thing in one go

• Could we page the snapshot into fragments?

• Or is there some other solution?

http://projects.topicmapslab.de/issues/4307

39

How to tell if the fragment feed is complete?

• When reading the fragment feed, how can we tell if there are older fragments that have been discarded?
  – and how can we tell which fragment was the newest to be thrown away?
• Without this there's no way to know for certain whether you've lost fragments if the feed stops before the newest fragment you've got
  – and if you're using "since" it always will stop before the newest fragment...

• Make new sdshare:foo element on feed level for this information?

http://projects.topicmapslab.de/issues/4308

40

Blank nodes are not supported

• What to do?

http://projects.topicmapslab.de/issues/4306

41

More information

• SDshare spec
  – http://www.egovpt.org/fg/CWA_Part_1b
• SDshare issue tracker
  – http://projects.topicmapslab.de/projects/sdshare
• SDshare use cases
  – http://www.garshol.priv.no/blog/215.html