Wolfgang Lehner - Université libre de...

86
© Prof. Dr.-Ing. Wolfgang Lehner | Forecasting and Data Imputation Strategies in Database Systems Wolfgang Lehner 11.07.2013

Transcript of Wolfgang Lehner - Université libre de...

Page 1: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner |

Forecasting and Data Imputation Strategies in Database Systems

Wolfgang Lehner

11.07.2013

Page 2: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 2

> Novel type of applications !!!

Forecasting and Data Imputation Strategies in Database Systems

data crunching meets number crunching

data volume

computational complexity

Reporting

OLAP

Data Mining (classification,

association rules, ...)

Forecasting

Data Imputation

Simulation

Page 3: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 3

> The NetFlix Competition

Forecasting and Data Imputation Strategies in Database Systems

Page 4: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 4

> The NetFlix Competition (2)

BellKor's Pragmatic Chaos

Forecasting and Data Imputation Strategies in Database Systems

Page 5: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 5

> The NetFlix Competition (3)

Forecasting and Data Imputation Strategies in Database Systems

Page 6: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 6

> The NetFlix Competition (4)

About Schmidt Lost In Translation Sideways

Alice 4 2

Bob 3 2

Michael 5 3

(2.24) (1.92) (1.18)

(1.98)

(1.21)

(2.30)

3.8

2.3

4.4

2.7

2.3

2.7 5.2

4.4

2.7

Forecasting and Data Imputation Strategies in Database Systems

Page 7: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 7

> The NetFlix Competition (6)

>10

.00

0.0

00

>100.000

Movies U

sers

Forecasting and Data Imputation Strategies in Database Systems

Page 8: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 8

> A simple experiment ...

Forecasting and Data Imputation Strategies in Database Systems

Page 9: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 9

> ... our test object

different movies d

iffe

ren

t u

sers

color code := user rating

Forecasting and Data Imputation Strategies in Database Systems

Page 10: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 10

> The Experiment ...

Phase 1: drop 75% of all pixels

Forecasting and Data Imputation Strategies in Database Systems

Page 11: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 11

> The Experiment ...

Phase 2: Random permutation of rows and columns

Forecasting and Data Imputation Strategies in Database Systems

Page 12: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 12

> The Experiment ... Phase 3: Determine the latent factors

Forecasting and Data Imputation Strategies in Database Systems

Page 13: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 13

> The Experiment ... Phase 4: Reconstruction

Forecasting and Data Imputation Strategies in Database Systems

Page 14: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 14

> The Experiment ...

Phase 5: Final Result Generation

Forecasting and Data Imputation Strategies in Database Systems

Page 15: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 15

> The Experiment ...

25%

LFM

Forecasting and Data Imputation Strategies in Database Systems

Page 16: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 16

> Time Series Forecasting

Multi-Dimensional Time Series Data

Forecasting and Data Imputation Strategies in Database Systems

Page 17: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 17

> Forecast Models

Given

Time series with numerical values as training data

Goal

Predict future values for arbitrary future point in times (forecast horizon) Include trend and seasonality

Applications

Planning of sales and budget Price development Inventory, manufacturing Climate, weather, environment Economic indicators Stocks

“It’s tough to make predictions, especially about the future.”

-- Mark Twain

Forecasting and Data Imputation Strategies in Database Systems

Page 18: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 18

> Forecast Models

Diverse Time Series Data

Find forecast model that minimizes the forecast error

Example: Exponential Smoothing Framework

30 variants (additional additive or multiplicative errors)

Trend/Season No Season Add. Season Mult. Season

No Trend

Add. Trend

Add. Damp. Trend

Mult. Trend

Mult. Damp. Trend

Forecasting and Data Imputation Strategies in Database Systems

Page 19: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 19

> Agenda

Mathematical Foundations

In-DBMS Time Series Forecasting

Flash-Forward Query Project

Query Processing and Optimization

Model Configuration Advisor

Forecasting the Data Cube Selection of Model Configurations

Notification-based Forecast Queries

Beyond Forecast Models

Model Refinement

Forecasting and Data Imputation Strategies in Database Systems

Page 20: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 20

>

Mathematical Foundations

Forecasting and Data Imputation Strategies in Database Systems

Page 21: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 21

> Model-Based Forecasting

Forecast Model

Statistical time series description (model)

(Recursive) Forecasting Process

Model Identification Model Estimation Forecasting and Model Update Model Evaluation Model Adaptation

Model Identification

Model Estimation

Forecasting

Model Evaluation

Model Adaptation

Model Maintenance

Time Series Data

Model Type AR(2)

Model Parameters φ1=0.55, φ2=0.45

New Time Series

Values Ui

Forecasting Values Fi+h

Model Creation Model Usage

Forecasting and Data Imputation Strategies in Database Systems

Page 22: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 22

> Model Identification

Forecast Model Types / Classes

Base Forecast Models

Exponential Smoothing

Machine Learning

(Auto)Regression

Domain-Specific Extensions

HWT (Single-Equation)

EGRV (Multi-Equation)

BN (Bayesian Networks)

SVM (Support Vector Machines)

SVR (Support Vector Regression)

ANN (Artificial Neural Networks)

Black-Box Gray-Box

White-Box

AR MA

ARMA ARIMA

SARIMA ARMAX

MLR (Multiple Linear Regression)

SESM (Single Exponential Smoothing)

DESM (Double Exponential Smoothing)

TESM / HoltWinters

(Triple Exponential Smoothing)

Forecasting and Data Imputation Strategies in Database Systems

Page 23: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 23

> Model Estimation

Problem

Instantiate a forecast model w.r.t. meta model and training data set

Example

Forecast Model Type AR(2): Error Metric: MSE

Parameter Estimator

L-BFGS-B

2211ˆ

ttt yyy

eMSE=827,354.4

0.23

0.56

eMSE= 211,204.7

Forecasting and Data Imputation Strategies in Database Systems

Page 24: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 24

> Model Usage

Forecasting

Use the estimated forecast model Create h forecast values (forecast horizon) Update model state for new measurements (e.g., exponential smoothing)

Example Forecast (EGRV)

SMAPEvshort

=0.0021 SMAPElong

=0.0755

21 23.056.0ˆ ttt yyy

30min 1year

Forecasting and Data Imputation Strategies in Database Systems

Page 25: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 25

> Model Maintenance

Model Evaluation

Goal: Trigger model adaptation only if necessary

Fixed Interval Techniques (# updates, time interval) Continuous Evaluation

Techniques (threshold, on-demand)

Model Adaptation

Goal: Adapt the forecast model to the changed time series (if necessary)

Model Re-Identification Model Re-Estimation (old model as start point)

h=2

Forecasting and Data Imputation Strategies in Database Systems

Page 26: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 26

>

In-DBMS Time Series Forecasting

Forecasting and Data Imputation Strategies in Database Systems

Page 27: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 27

> MAD Skills

Magnetic

„Attract data and practitioners“ Use all available data

sources independent of their quality

Agile

„Rapid iteration: ingest, analyze, productionalize“ Continuous and rapid evolution of physical and

logical structures ELT (Extraction, Loading, Transformation)

Deep

„Sophisticated analytics in Big Data“ Extended algorithmic runtime environment Ad-hoc advanced analytics and statistics

Forecasting and Data Imputation Strategies in Database Systems

Page 28: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 28

> Integrated Data Analytics

Integration of advanced analytics into scalable database management systems

Traditionally forecasting is performed manually in external statistical systems Support of transparent and automatic in-DBMS forecasting

Analysis (e.g., R, SPSS)

DBMS

Traditional In-DBMS Forecasting

Predictive DBMS

Forecasting and Data Imputation Strategies in Database Systems

Page 29: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 29

> Integrated Forecasting

Model-based Prediction

We employ (classical) time series models (e.g. exponential smoothing, ARIMA) Used to Explain time series from it’s history And—possibly—from exogenous inputs

Related Work

Customized functions with proprietary languages SQL Server 2012: ARIMA, autoregressive trees Oracle: exponential smoothing, non-linear regression

Bi-directional communication Reuse existing statistical tools (e.g. R) SAP HANA, Oracle, IBM Netezza

Model-based views

Key Elements

Declarative querying Automatic model creation Automatic model maintenance Forecast model advisor

Forecasting and Data Imputation Strategies in Database Systems

Page 30: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 30

> Scenario

Evaluate Error

Adjust the Model

Model Forecast Model Forecast

AS OF AS OF

Forecasting and Data Imputation Strategies in Database Systems

Page 31: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 31

> System Architecture

Model Index

Base Tables

Model Pool

Time Series Time Series Time Series

Model Model

Model

Model

Query Interface

Forecast Queries Inserts

Model Creation • Manually by experts • Automatically • Ensemble-based

modeling

Transparent Model Usage • Model matching • Reuse of models • Optimization of adhoc-

queries

Model Maintenance • Model matching • Model evaluation • Model maintenance

Model Advisor • Physical design

recommendation

Forecasting and Data Imputation Strategies in Database Systems

Page 32: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 32

> F2DB

Forecasting and Data Imputation Strategies in Database Systems

Page 33: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 33

> F2DB

Forecasting and Data Imputation Strategies in Database Systems

Page 34: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 34

>

Query Processing and Optimization

Forecasting and Data Imputation Strategies in Database Systems

Page 35: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 35

> Relational Integration

SQL Forecast Query

Query Plan

Forecast operator Ψ

SELECT Date, Quantity

FROM Sales, Article

WHERE S_ID = A_ID AND

Name = ’Article A’

AS OF 2013-05-02

… S_ID Date Qty

… 1 2013-04-28 6

… 1 2013-04-29 5

… 2 2013-04-30 1

… 1 2013-04-30 7

A_ID Name …

1 Article A …

2 Acticle B …

Date Qty

2013-04-28 6

2013-04-29 5

2013-04-30 7

2013-05-01 6

2013-05-02 6.5

Article

⋈S_ID=A_ID

Sales

Ψk=2

σ Name= ‚Article A'

πDate, QTY

Forecasting and Data Imputation Strategies in Database Systems

Page 36: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 36

> Query Restructuring

Selection invariance

Condition filtering out whole clusters

Projection invariance

If neither time, measure or cluster attribute

Union invariance

On different relations

σ product= 'HTC'

sales

Ψk=2

πdate, quantity

σ product= 'HTC'

sales

Ψk=2

π product, date, quantity

sales2 sales1

σ product= 'HTC' σ product= 'NOKIA'

sales2 sales1

σ product= 'HTC' σ product= 'NOKIA'

Ψk=2

Ψk=2 Ψk=2

Influences only runtime of FQs date product quantity

Jun 2013 HTC 36.000

Jul 2013 HTC 38.000

Jun 2013 Nokia 5.000

Jul 2013 Nokia 3.000

Forecasting and Data Imputation Strategies in Database Systems

Page 37: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 37

> Cost-based Optimization

Restructuring might also influence accuracy

E.g. join, aggregation

sales2

⋈sales1.date=sales2.date

πsales1.date, sales1.quantity – sales2.quantity

Ψk=2

sales2

⋈sales1.date=sales2.date

πsales1.date, sales1.quantity – sales2.quantity

sales1

Ψk=2

sales1

Ψk=2

Expensive model creation

Cost model required: accuracy and runtime

Expensive join

Forecasting and Data Imputation Strategies in Database Systems

Page 38: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 38

>

E

Example: Aggregation Queries

Build Model

Build Model

Build Model

Build Model

+

Shop 1

Shop 2

Shop 3

Aggregate time series

+

How to compute aggregation forecast queries?

Runtime Accuracy

Aggregate Forecast

Forecast Aggregate

Sample Forecast Estimate

Base time series

Forecasting and Data Imputation Strategies in Database Systems

Page 39: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 39

> Example: Aggregation Queries

Uniform Estimation

Estimation with Historical Ratios

10

10

ŷ?

t+1

9

9

49

67

8

8

48

64

base time series

aggregate = 𝛼 𝐹𝑜𝑟𝑒𝑐𝑎𝑠𝑡

t t-1

𝛼 =# 𝑏𝑎𝑠𝑒 𝑠𝑒𝑟𝑖𝑒𝑠

# 𝑏𝑎𝑠𝑒 𝑠𝑒𝑟𝑖𝑒𝑠 𝑖𝑛 𝑠𝑎𝑚𝑝𝑙𝑒

3

2∙ 20 = 30

𝑟𝑎𝑡𝑖𝑜 =𝑏𝑎𝑠𝑒 𝑠𝑒𝑟𝑖𝑒𝑠

𝑎𝑔𝑔𝑟𝑒𝑔𝑎𝑡𝑒 𝑠𝑒𝑟𝑖𝑒𝑠 𝛼 =

1

𝑟𝑎𝑡𝑖𝑜𝑠

19

67+9

67

= 3.7 ∙ 20 = 74

Model

Model

Forecasting and Data Imputation Strategies in Database Systems

Page 40: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 40

> Example: Aggregation Queries

Calculation of Historical Ratios

Different approaches possible Simple averages Lagged proportions …

Seasonality of data is important

Combined strategy: mixture of past ratios and ratio one season ago

10 9 8 …

agg 67 64

base

t+1 t t-1

Jul 12 Feb 12 Jul 11 time

Forecasting and Data Imputation Strategies in Database Systems

Page 41: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 41

> Example: Aggregation Queries

agg

Build Model +

Build Model

Build Model

Build Model

base

+

Forecasting and Data Imputation Strategies in Database Systems

Page 42: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 42

> Example: Aggregation Queries

Aggregation - Sales

Forecasting and Data Imputation Strategies in Database Systems

Page 43: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 43

>

Model Configuration Advisor

Forecasting and Data Imputation Strategies in Database Systems

Page 44: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 44

> Forecasting the Data Cube

product

time

city region

P1 P2 P3 P4

C1

C2

C3

C4

R1

R2

SELECT time, measure

FROM facts

WHERE product = P4

AND city = C4

time

measure

AS OF now() + 1 day

Forecasting and Data Imputation Strategies in Database Systems

Page 45: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 45

> Forecasting the Data Cube

product

time

city region

P1 P2 P3 P4

C1

C2

C3

C4

R1

R2

SELECT time, SUM(measure)

FROM facts

WHERE product = P4

AND region = R2

GROUP BY time

AS OF now() + 1 day

C1 C2 C3 C4

R1 R2

*

M

M M

M

M

M M M

Forecasting and Data Imputation Strategies in Database Systems

Page 46: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 46

> Forecast Model Advisor

Traditional Database Systems Statistical Database Systems

Workload Workload

Materialized View and Index Advisor

Forecast Model Advisor

Physical Design Recommendation

Physical Design Recommendation

Optimize Runtime

M M M

Optimize Runtime and Accuracy

Forecasting and Data Imputation Strategies in Database Systems

Page 47: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 47

> Conceptual Data Model

Base Time Series

Aggregated Time Series

Conceptually, we organize the aggregation possibilities as a directed time series hyper graph

C1 C2 C3 C4

R1 R2

*

Forecasting and Data Imputation Strategies in Database Systems

Page 48: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 48

> Much larger in reality

Forecasting in Data Management Umgebungen

Page 49: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 49

> Forecast Models

Each node (or time series) may be associated with a forecast model

Time series methods

Exponential Smoothing ARIMA

M

A query describes one or several nodes in the hyper graph

Forecasting and Data Imputation Strategies in Database Systems

Page 50: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 50

> Calculation of Forecast Values

Forecast values of a node can be computed by any nodes in the graph

Derivation weight k

Based on history of source and target time series

M k

Aggregation

M M M

k=1

Direct

M k=1

Disaggregation

M k<1

Forecasting and Data Imputation Strategies in Database Systems

Page 51: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 51

>

M

M

Evaluation of Model Configurations

Model configuration

Configuration evaluation

Assignment of models

Assignment of derivation schemes +

Costs Forecast accuracy

M

M

Forecasting and Data Imputation Strategies in Database Systems

Page 52: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 52

> Overview Configuration Advisor

Candidate Selection

Evaluation

Control

Output

Time Series Graph

Forecast Error

Model Costs

Indicator

Acceptance Criteria

Stop Criteria

Prepare

Init Model Conf

Model Conf

Model Conf

Forecasting and Data Imputation Strategies in Database Systems

Page 53: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 53

> Indicators

Heuristically indicate the expected benefit of a model at a node (without building the model)

Focus on time series relationships Measure to specify the derivation error between two nodes Low indicator value low error (good derivation) High indicator value high error (poor derivation)

Indicator

Historical Error

Weight Variance

Forecasting and Data Imputation Strategies in Database Systems

10, 20, 32, 40

1, 2, 3, 4

10, 20, 32, 40

1, 2, 3, 4

0.1, 0.1, 0.09, 0.1 0, 0, 0.06, 0

Page 54: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 54

> Indicators

Local indicator arrays

Derivation errors of one node

Global indicator array

Minimum over all local arrays

0

MIN

Forecasting and Data Imputation Strategies in Database Systems

Page 55: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 55

> Selection of Model Configurations

Select start configuration

0 5 2 3

M

Forecasting and Data Imputation Strategies in Database Systems

Page 56: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 56

> Selection of Model Configurations

0 5 2 3

Add Delete

Select start configuration

Candidate selection

Preselection

M

Forecasting and Data Imputation Strategies in Database Systems

Page 57: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 57

> Selection of Model Configurations

0 5 2 3

Add Delete

Select start configuration

Candidate selection

Preselection

M

Forecasting and Data Imputation Strategies in Database Systems

Page 58: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 58

> Selection of Model Configurations

0 5 2 3

Add

0 0 1 3

1 2

Select start configuration

Candidate selection

Preselection Ranking

Delete

M

Forecasting and Data Imputation Strategies in Database Systems

Page 59: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 59

> Selection of Model Configurations

0 5 2 3

Add

1

Delete

M

M

Select start configuration

Candidate selection

Preselection Ranking

Evaluation

Model creation Acceptance

M

Forecasting and Data Imputation Strategies in Database Systems

Page 60: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 60

> Selection of Model Configurations

0 5 2 3

Add Delete

M

Select start configuration

Candidate selection

Preselection Ranking

Evaluation

Model creation Acceptance

M

Forecasting and Data Imputation Strategies in Database Systems

Page 61: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 61

> Selection of Model Configurations

0 5 2 3

Add Delete

M

Select start configuration

Candidate selection

Preselection Ranking

Evaluation

Model creation Acceptance

0 0 1 3

M

Forecasting and Data Imputation Strategies in Database Systems

Page 62: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 62

> Selection of Model Configurations

0 0 1 3

Add Delete

M

Select start configuration

Candidate selection

Preselection Ranking

Evaluation

Model creation Acceptance

M

Model costs: 2 Forecast error: 10 %

Forecasting and Data Imputation Strategies in Database Systems

Page 63: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 63

> Indicator Accuracy

Correlation between indicators and real forecast error

Forecasting and Data Imputation Strategies in Database Systems

Page 64: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 64

> Accuracy Analysis

Static approaches – data independent

Tourism Sales

Forecasting and Data Imputation Strategies in Database Systems

Page 65: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 65

> Accuracy Analysis

Tourism Sales

Dynamic approaches – empirical selection

Forecasting and Data Imputation Strategies in Database Systems

Page 66: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 66

> Runtime Analysis

Scalability

Forecasting and Data Imputation Strategies in Database Systems

Page 67: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 67

>

Subscription-Based Forecast Queries

Forecasting and Data Imputation Strategies in Database Systems

Page 68: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 68

> Motivation

Energy data management systems

Provide stable energy supply while including larger amounts of renewable energy Continously require forecasts of energy demand and supply

Subscription-based forecast queries

Subscriber

Process Forecasts

Forecasting

Model Model

Subscribe

Notify with forecasts

Forecasting and Data Imputation Strategies in Database Systems

Page 69: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 69

> Subscription-Based Forecast Queries

When to notify subscriber?

365, 412, 546

After each new real value …

As less as possible …

365, 412, 546, 459, 460, 549, 340, 400, 128, 943,

Optimal …

365, 412, 546

365, 412, 546, 756, 349, 321

Many notifications High subscriber costs

Long messages Low accuracy Resend messages

High subscriber costs

Reduce number of notifications and notification length

Forecasting and Data Imputation Strategies in Database Systems

Page 70: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 70

> Definition

SELECT datetime, production FROM ts_powerproduction WHERE type = „wind“ FORECAST 3 hours THRESHOLD 0.1

Parameters

Time series description Minimum continous forecast horizon Accuracy threshold

initial forecasts

threshold real

data

new forecasts

Forecasting and Data Imputation Strategies in Database Systems

Page 71: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 71

> Processing Model

Horizon Violation

When? Subscriber has less than minimum horizon What? Send missing values + horizon extension

Threshold Violation

When? Threshold is violated What? Resend all values

SELECT datetime, production FROM ts_powerproduction WHERE type = „wind“ FORECAST 3 hours THRESHOLD 0.1

horizon horizon

extension

first notification

horizon violation

threshold violation

time

Forecasting and Data Imputation Strategies in Database Systems

Page 72: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 72

> Subscriber Cost Model

Processing costs of the subscriber

Analytically known or learned function Depend on the forecast horizon Complete costs 𝐹𝐶 Complete restart of processing

Incremental costs 𝐹𝐼 Processing of additional values

first notification

horizon violation

threshold violation

𝐹𝐶

𝐹𝐶

𝐹𝐼

Subscriber

Process Forecasts

Forecasting and Data Imputation Strategies in Database Systems

Page 73: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 73

> Optimization Goal

Assume we know …

The sequence of threshold + horizon violations The subscriber cost functions 𝐹𝐶 and 𝐹𝐼

Subscriber costs over subscription lifetime

Sum over all 𝐹𝐶 and 𝐹𝐼

Optimization Goal

Find forecast horizon that minimizes total costs Depends on subscriber cost function Depends on forecast accuracy

How to get threshold violations? Analyze past to predict future

Forecasting and Data Imputation Strategies in Database Systems

Page 74: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 74

> Computation Approaches

Core Idea: Calculate best forecast horizon using our cost model on the time series history

Offline – Static

One forecast horizon over whole lifetime

Offline – Dynamic

Adapts to periodic changes of time series accuracy Sequence of forecast horizons for time slices

Online

Adapts to arbitrary changes in data or cost functions Continously adapts forecast horizon

k

t

k1 k2 k1 k2

t

k1 k2 k3 k4

t

Forecasting and Data Imputation Strategies in Database Systems

Page 75: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 75

> Experimental Setting

Real-world energy demand and supply data sets

National Energy Demand Household Energy Demand National Wind Supply

Subscriber cost functions

Synthetic linear function Real world cost function (obtained from MIRABEL)

Forecast Methods

Tailor-made for short-term energy forecasting Extension of exponential smoothing

Forecasting and Data Imputation Strategies in Database Systems

Page 76: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 76

> Experimental Evaluation

Comparison of Approaches

Fixed subscription parameters and linear cost function

Evaluation of Time Slice Approach

Forecasting and Data Imputation Strategies in Database Systems

Page 77: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 77

> Experimental Evaluation

Influence of subscriber threshold

Relationship between number of notifications, subscriber costs and runtime Real-world cost function

number of notifications subscriber costs runtime

Forecasting and Data Imputation Strategies in Database Systems

Page 78: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 78

> Experimental Evaluation

Cost Model Validation

Estimated costs vs. real costs Real-world cost function Queries with increasing complexity (Q1, Q2, Q3)

Forecasting and Data Imputation Strategies in Database Systems

Page 79: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 79

>

Beyond Forecast Models

Towards a Model-based Database System

Forecasting and Data Imputation Strategies in Database Systems

Page 80: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 80

>

Forecast Impute Refine

Adjust

Aggregate

Updates

Estimate

History

FIRA² - Approach

Forecast

Base approach Predict missing data from history Requires no known data

Impute

Exploit local patterns Infer missing data from similar units Requires adequate set of known data

Refine

Detect local and global shifts Infer error yields new forecasts Requires few known data

Aggregate

Calculate aggregate (e.g. report)

Adjust

Maintain models and synopses Optimize accuracy of estimates

Forecasting and Data Imputation Strategies in Database Systems

Page 81: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 81

> Refine

How to include new real data?

Data delivery may be late

There might be missing data for the last period Reports still have to be generated Estimate missing data

83,5

84,5

85,5

86,5

87,5

88,5

89,5

-1 0 1 2 3 4 5 6 7

qu

anti

ty (

in m

illio

ns)

number of available base real values

forecasted aggregate

real aggregate

Shop 1 Shop 2

Shop 3 Shop 4

Shop 5 Shop 6

Shop 7

Forecasting and Data Imputation Strategies in Database Systems

Page 82: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 82

> Refine

Refine I

Refine II

Refine III

10

10

ŷ?

t+1

9

9

49

67

8

8

48

64

t t-1

74

t+1

forecast real

8

50

?

base time series

aggregate

error correction

𝛼 𝑅𝑒𝑎𝑙

𝛼 (𝑅𝑒𝑎𝑙 + 𝐹𝑜𝑟𝑒𝑐𝑎𝑠𝑡)

𝛼 𝑅𝑒𝑎𝑙 + 𝐹𝑜𝑟𝑒𝑐𝑎𝑠𝑡 − 𝛽 𝐸𝑟𝑟𝑜𝑟

Forecasting and Data Imputation Strategies in Database Systems

Page 83: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 83

> Refine

Refine III: Estimation of forecast errors

10

10

t+1 t+1

forecast real

8

50

Case 1

Forecast and real value Calculate true model

error

Case 2/3

No real value No error calculation

Case 4

No forecast value Estimate model error

error

8 10 -

?

74 ?

base time series

aggregate

50 50 50 -

Forecasting and Data Imputation Strategies in Database Systems

Page 84: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 84

> Experimental Evaluation

Refine – Sales Refine – Wind production

Forecasting and Data Imputation Strategies in Database Systems

Page 85: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner | | 85

> Conclusions

FFQ project

Provide forecasting as 1st class citizen within a database system Preserve logical and physical data independence (e.g. transparent model usage,

transparent model maintenance, and model creation) Extend traditional processing and optimization techniques Apply concept of traditional index advisors to foreast models

Towards a model-based database system

Data is increasingly inconsistent, incomplete and imprecise Extend concept of models to other use cases (missing data, uncertain data, data

compression …)

Forecasting and Data Imputation Strategies in Database Systems

Page 86: Wolfgang Lehner - Université libre de Bruxellescs.ulb.ac.be/conferences/ebiss2013/files/lehner_ebiss2013.pdfExplain time series from it’s history ... Wolfgang Lehner | | 34 > Query

© Prof. Dr.-Ing. Wolfgang Lehner |

Forecasting and Data Imputation Strategies in Database Systems

Wolfgang Lehner

11.07.2013