openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The...

20
openUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić, Johannes Schabauer, Gerhard Dünnebeil

Transcript of openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The...

Page 1: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

openUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way

Bojan Božić, Johannes Schabauer,

Gerhard Dünnebeil

Page 2: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Sponsors

The research leading to these results has received funding from the European Union Seventh Framework Program FP7/2007-2013 under grant agreement number 287867. Link: carbotraf.eu

Page 3: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Introduction

§  Definition of openUWEDAT §  Data Aquistion System for Environmental Data §  Long Time Storage and Availability §  Analysis and Alarming §  Application wich uses the TimeSeries Toolbox

•  TS-Toolbox is a framework •  openUwedat is a productline of ready to use applications

§  Fields of Application §  Air Monitoring §  Water Monitoring §  Traffic Monitoring

Page 4: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Definition of a Timeseries

A time series is a sequence of data points, measured typically at successive points in time spaced at uniform time intervals.

(Wikipedia 2014)

Page 5: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Definition of a Timeseries Concept of a Datapoint

Properties (Metadata) ts:geo_lat = 48.20135

ts:geo_lon = 15.620257

...

...

Zeit spectrum status

ValueKeys

00:00

00:10

00:20

00:30

...

...

[20.0,14.3,10.2,...]

[19.5,13.8,11.0,...]

[18.7,13.5,10.8,...]

[20.1,14.2,11.3,...]

...

...

OK

OK

FEHLER

OK

...

...

Slot

Values

Page 6: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Definition of a Timeseries Temporal Classification

Time

Type of Vehicle

Irregular Time Series

Time

Temperature Samples

Quasi Regular Time Series

Car

Lorry

Time

Temperature Mean Values

Regular Time Series

Page 7: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Pipe Oriented Architecture Simple Pipe

Source Destination ?

Page 8: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Pipe Oriented Architecture Complex Pipe

Source Destination

Prozessor

? ?

Page 9: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Pipe Oriented Architecture Simple Application

Thermometer

Temperature in °K

Meanvalue Generation

Temperature- Meanvalue in °K

Calculation

Temperature Meanvalue in °C

Database

Datasource Processor Destination

Page 10: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Pipe Oriented Architecture Source & Destination

Thermometer Annual Meanvalue Generation

Temperature (annual)

Database

Datasource Processor Destination

Database

Temperature (minute)

Temperature (minute)

Page 11: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Pipe Oriented Architecture Caching

Source

Caching Processor

Destination 1

Destination 2

Page 12: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Formula 3 Introduction

§  Works between source and destination §  Describes a timeseries processor §  Comparable to regular expressions: very

compact and very powerful §  Usually not longer than one line, but covers

functionality of whole pages with normal programming languages

§  Seamless integration to TS Toolbox

Page 13: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Formula 3 Primitive Operations §  Multiplication of all Values of a Timeseries with a

Constant §  @A << A[i] * 2>>

§  Calculating the Sum out of two Timeserieses §  @A @B << A[i]+B[i] >>

§  Calculating the Difference of two consecutive slots §  @A << A[i] – A[i-1] >> §  @A << A[i] – A[i-1] if A[i-1] != none otherwise A[0]>

Page 14: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Formula 3 Working with Aggregates

§  Calculating the floating mean over a logical index §  @A << mean( A[i-3..i] ) >>

§  Calculating the Floating Mean over a temporal index §  @A << mean(A ]t-10mins .. t]) >> every 10 mins

Page 15: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Formula 3 implementation

§  Design goal: usable as a „standalone” library on major platforms (Java, .NET, “native”)

§  Implemented in Python §  TS Toolbox contains Formula 3 running on Jython

15 05.05.10

Page 16: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

The Time Series Toolbox

§  A set of API's and software components §  For building services and applications §  That work with time series data (record, store, process,

publish ...)

16 05.05.10

Page 17: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Layered Architecture

§  Backend components (Data Sources) §  Processing and control components §  Frontend components (Data Sinks)

17 05.05.10

Time  Series  API

Time  Series  APISOS

DataSourceCSV

DataSourceRRD

DatasourceAnySen

Datasource

Formula  3 Caching

SOS  Frontend Universal  Data  Pump

Page 18: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Time Series Toolbox Availability

§  Available under dual license: Open source (GPL) and commercial license

§  Development release at: http://ts-toolbox.ait.ac.at/

§  For more info contact: [email protected]

18 05.05.10

Page 19: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

Conclusion

§  openUwedat supports §  Multidimensional values of a certain value-type §  Sematic multidimensionality with the concept of value-keys §  Regular and irregular timeserieses

§  Open interfaces to §  gather data from various sensors §  store it in various databases §  offer other popular interfaces like OGI SOS

§  Flexible dataprocessing with §  Formula 3 §  Custom dataprocessors in any language (e.g. Java, Python)

Page 20: openUwedat - NEMCnemc.us/docs/2014/Presentations/Wed-Data Management-14.2-Bozic.pdfopenUwedat The Freedom to Process Any Environmental Observation in the Most Natural Way Bojan Božić,

AIT Austrian Institute of Technology your ingenious partner Dr. Bojan Božić AIT 2444 Seibersdorf [email protected]