2013-05-29 Taverna Provenance

8
TAVERNA PROVENANCE Stian Soiland - Reyes, University of Manchester https://github.com/wf4ever/taverna - prov This work is licensed under a Creative Commons Attribution 3.0 Unported License 2013-05-29

description

Slide deck presenting the Provenance support of Taverna workflow system, detailing architecture, ontologies and how results are exported as Research Object bundles, including the PROV-O provenance of the workflow run. This upload is the PDF version, for PPTX source, see https://www.slideshare.net/soilandreyes/20130529-taverna-provenance-pptx-source/

Transcript of 2013-05-29 Taverna Provenance

Page 1: 2013-05-29 Taverna Provenance

TAVERNAPROVENANCE

Stian Soiland-Reyes, University of Manchester

https://github.com/wf4ever/taverna-prov

This work is licensed under a

Creative Commons Attribution 3.0 Unported

License

2013-05-29

Page 2: 2013-05-29 Taverna Provenance

ARCHITECTURE

Provenance

Workflow

Workflow run

Process run (iteration)

Parameter bindings

Data

Lists

Values

References

Errors

Process1

portA B C

D E

Process2

portA B C

D E

Invoke

Retry

Failover

Loop

Error bounce

Provenance

Parallelise

Processor

dispatch stack

layer injected by plugin

P Missier, S Soiland-Reyes, S Owen, W Tan, A Nenadic, I Dunlop, C

Goble: (2010, January). Taverna, reloaded. In Scientific and

Statistical Database Management (pp. 471-481). Springer Berlin

Heidelberg. DOI 10.1007/978-3-642-13818-8_33

captures provenance trace

Workflow execution

Page 3: 2013-05-29 Taverna Provenance

ONTOLOGY STACK

tavernaprov

• Lists, errors, byte content, checksums

wfprov + wfdesc

• Workflow execution, parameters, processes

PROV-O

• Activity start/stop, generation of values

http://purl.org/wf4ever/wfprov#

http://www.w3.org/ns/prov-o#

http://ns.taverna.org.uk/2012/tavernaprov/

Page 4: 2013-05-29 Taverna Provenance

INTERMEDIATE RESULTS

• Within the Taverna Workbench, the provenance database is

used for showing intermediate results and previous runs

Clicking a processor

Inputs and outputs of individual invocations

Page 5: 2013-05-29 Taverna Provenance

WORKFLOW RESULTS (FOLDER)

workflowrun.prov.ttl

(RDF)

outputA.txt

outputC.jpg

outputB/

Folder structure

intermediates/

1.txt2.txt

3.txt

de/def2e58b-50e2-4949-9980-fd310166621a.txt

Workflow outputs, one file per value

Provenance trace

Values from intermediate steps in workflow

Page 6: 2013-05-29 Taverna Provenance

WORKFLOW RESULTS (BUNDLE)

workflowrun.prov.ttl

(RDF)

outputA.txt

outputC.jpg

outputB/

https://w3id.org/bundle

intermediates/

1.txt2.txt

3.txt

de/def2e58b-50e2-4949-9980-fd310166621a.txt .ro/manifest.json

inputA.txtworkflow

URI

references

attribution

execution

environment

Aggregating in Research Object

ZIP folder structure (RO Bundle)

mimetype

application/vnd.wf4ever.robundle+zip

Page 7: 2013-05-29 Taverna Provenance

ACKNOWLEDGEMENTS

• Paolo Missier – initial provenance engine for Taverna 2

• Ian Dunlop – provenance capture execution layer

• Khalid Belhajjame – ontologies

• Alexandra Nenadic – intermediates, folder structure

• W3C Provenance working group – PROV-O

• Funded by European Commission’s 7th FWP FP7-ICT-2007-6

270192 and ESPRC platform grant EP/G026238/1

Page 8: 2013-05-29 Taverna Provenance

QUESTIONS?

Twitter: @soilandreyes

Skype: soiland

Email: [email protected]

http://soiland-reyes.com/stian/work/

http://practicalprovenance.wordpress.com/