The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package...

101
Introduction The Dataset package Perspectives The Dataset Project Emmanuel Rousseaux Institut d’études démographiques et du parcours de vie Université de Genève 1211 Genève 4, Suisse [email protected] Rousseaux E. The Dataset Project May 24, 2012 1/30

Transcript of The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package...

Page 1: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

The Dataset Project

Emmanuel Rousseaux

Institut d’études démographiques et du parcours de vieUniversité de Genève1211 Genève 4, Suisse

[email protected]

Rousseaux E. – The Dataset Project May 24, 2012 1/30

Page 2: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

Outline

Introduction

The Dataset package

Perspectives

Rousseaux E. – The Dataset Project May 24, 2012 2/30

Page 3: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Plan

Introduction

The Dataset package

Perspectives

Rousseaux E. – The Dataset Project May 24, 2012 3/30

Page 4: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Teaching and Research Assistant at the Departement ofEconomics, SES, Unige

PhD directed byI Gilbert Ritschard, iDemo, UnigeI Michel Léonard, ISS, Unige

Rousseaux E. – The Dataset Project May 24, 2012 4/30

Page 5: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Teaching and Research Assistant at the Departement ofEconomics, SES, Unige

PhD directed byI Gilbert Ritschard, iDemo, UnigeI Michel Léonard, ISS, Unige

Rousseaux E. – The Dataset Project May 24, 2012 4/30

Page 6: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Teaching and Research Assistant at the Departement ofEconomics, SES, Unige

PhD directed byI Gilbert Ritschard, iDemo, UnigeI Michel Léonard, ISS, Unige

Rousseaux E. – The Dataset Project May 24, 2012 4/30

Page 7: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Teaching and Research Assistant at the Departement ofEconomics, SES, Unige

PhD directed byI Gilbert Ritschard, iDemo, UnigeI Michel Léonard, ISS, Unige

Rousseaux E. – The Dataset Project May 24, 2012 4/30

Page 8: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

(some) Research interestsI Decision TreesI Association rules miningI Nature-based optimization algorithmsI Health sociologyI Cognitive psychology

Rousseaux E. – The Dataset Project May 24, 2012 5/30

Page 9: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

(some) Research interestsI Decision TreesI Association rules miningI Nature-based optimization algorithmsI Health sociologyI Cognitive psychology

Rousseaux E. – The Dataset Project May 24, 2012 5/30

Page 10: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

(some) Research interestsI Decision TreesI Association rules miningI Nature-based optimization algorithmsI Health sociologyI Cognitive psychology

Rousseaux E. – The Dataset Project May 24, 2012 5/30

Page 11: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

(some) Research interestsI Decision TreesI Association rules miningI Nature-based optimization algorithmsI Health sociologyI Cognitive psychology

Rousseaux E. – The Dataset Project May 24, 2012 5/30

Page 12: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

(some) Research interestsI Decision TreesI Association rules miningI Nature-based optimization algorithmsI Health sociologyI Cognitive psychology

Rousseaux E. – The Dataset Project May 24, 2012 5/30

Page 13: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

(some) Research interestsI Decision TreesI Association rules miningI Nature-based optimization algorithmsI Health sociologyI Cognitive psychology

Rousseaux E. – The Dataset Project May 24, 2012 5/30

Page 14: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

NCCR LIVES "Overcoming vulnerability: life courseperspectives"

I work within the IP14 "Measuring life sequences and thedisorder of lives" leaded by Gilbert Ritschard

Aims at providing ad hoc methods for life course analysis inorder to have more insight about dynamics of vulnerability

Rousseaux E. – The Dataset Project May 24, 2012 6/30

Page 15: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

NCCR LIVES "Overcoming vulnerability: life courseperspectives"

I work within the IP14 "Measuring life sequences and thedisorder of lives" leaded by Gilbert Ritschard

Aims at providing ad hoc methods for life course analysis inorder to have more insight about dynamics of vulnerability

Rousseaux E. – The Dataset Project May 24, 2012 6/30

Page 16: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

NCCR LIVES "Overcoming vulnerability: life courseperspectives"

I work within the IP14 "Measuring life sequences and thedisorder of lives" leaded by Gilbert Ritschard

Aims at providing ad hoc methods for life course analysis inorder to have more insight about dynamics of vulnerability

Rousseaux E. – The Dataset Project May 24, 2012 6/30

Page 17: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 18: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 19: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 20: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 21: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 22: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 23: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

OverviewI Providing a software framework for handling survey data

in RI Providing a software framework for handling life courses

as a wholeI Providing new mining tools for rare events

I Decision trees for the discovery of vulnerable profilesI Multi-channel association rules mining

I Apply these tools for getting new insight on poor healthsituations

Rousseaux E. – The Dataset Project May 24, 2012 7/30

Page 24: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 25: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 26: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 27: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 28: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 29: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 30: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 31: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Motivation

I Currently no framework to handle survey data in RI Possible in SPSS, SAS, Stata

However

I In theses commercial softwares, no rigourous consistencytest

I No real standard for sharing datasetI A lot of methods in the state-of-the-art are provided on R

only

Rousseaux E. – The Dataset Project May 24, 2012 8/30

Page 32: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 33: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 34: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 35: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 36: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 37: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 38: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Goals

I store and manipulate life courses dataI sophisticated managment of missing valuesI automatic consistency testsI representativity of the initial population testsI user-oriented functionsI automatic summaries

Rousseaux E. – The Dataset Project May 24, 2012 9/30

Page 39: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Representativity is central. Efforts have to be made for helpingthe user

I real structure for handling weights in the databaseI representativity checks on each variableI compute new weights to correctly balance a subdataset

Rousseaux E. – The Dataset Project May 24, 2012 10/30

Page 40: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Representativity is central. Efforts have to be made for helpingthe user

I real structure for handling weights in the databaseI representativity checks on each variableI compute new weights to correctly balance a subdataset

Rousseaux E. – The Dataset Project May 24, 2012 10/30

Page 41: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Representativity is central. Efforts have to be made for helpingthe user

I real structure for handling weights in the databaseI representativity checks on each variableI compute new weights to correctly balance a subdataset

Rousseaux E. – The Dataset Project May 24, 2012 10/30

Page 42: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Proposal: the ’Dataset’ project

I 2 librairies in R:I ’Dataset’: for cross-sectional survey dataI ’stDataset’: for spatio-temporal survey data

I Full S4I https://r-forge.r-project.org/projects/dataset/

Rousseaux E. – The Dataset Project May 24, 2012 11/30

Page 43: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Proposal: the ’Dataset’ project

I 2 librairies in R:I ’Dataset’: for cross-sectional survey dataI ’stDataset’: for spatio-temporal survey data

I Full S4I https://r-forge.r-project.org/projects/dataset/

Rousseaux E. – The Dataset Project May 24, 2012 11/30

Page 44: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Proposal: the ’Dataset’ project

I 2 librairies in R:I ’Dataset’: for cross-sectional survey dataI ’stDataset’: for spatio-temporal survey data

I Full S4I https://r-forge.r-project.org/projects/dataset/

Rousseaux E. – The Dataset Project May 24, 2012 11/30

Page 45: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Proposal: the ’Dataset’ project

I 2 librairies in R:I ’Dataset’: for cross-sectional survey dataI ’stDataset’: for spatio-temporal survey data

I Full S4I https://r-forge.r-project.org/projects/dataset/

Rousseaux E. – The Dataset Project May 24, 2012 11/30

Page 46: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Proposal: the ’Dataset’ project

I 2 librairies in R:I ’Dataset’: for cross-sectional survey dataI ’stDataset’: for spatio-temporal survey data

I Full S4I https://r-forge.r-project.org/projects/dataset/

Rousseaux E. – The Dataset Project May 24, 2012 11/30

Page 47: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

About the speakerNCCR LIVESGoals of the thesis projectWhat will be presented this day

Proposal: the ’Dataset’ project

I 2 librairies in R:I ’Dataset’: for cross-sectional survey dataI ’stDataset’: for spatio-temporal survey data

I Full S4I https://r-forge.r-project.org/projects/dataset/

Rousseaux E. – The Dataset Project May 24, 2012 11/30

Page 48: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Plan

Introduction

The Dataset package

Perspectives

Rousseaux E. – The Dataset Project May 24, 2012 12/30

Page 49: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

In the Dataset package we mainly definedI the Variable object: store one measure on individualsI the Dataset object: store the full output of the survey

Rousseaux E. – The Dataset Project May 24, 2012 13/30

Page 50: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

In the Dataset package we mainly definedI the Variable object: store one measure on individualsI the Dataset object: store the full output of the survey

Rousseaux E. – The Dataset Project May 24, 2012 13/30

Page 51: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

In the Dataset package we mainly definedI the Variable object: store one measure on individualsI the Dataset object: store the full output of the survey

Rousseaux E. – The Dataset Project May 24, 2012 13/30

Page 52: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 53: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 54: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 55: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 56: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 57: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 58: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Variable object is represented byI codes: vector of codes for each individualsI missings: vector specifying the coding of missings valuesI values: vector specifying the coding of valid casesI description: a label

Then the Variable is declined into different kind of measures

Rousseaux E. – The Dataset Project May 24, 2012 14/30

Page 59: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Variable (V)

Quantitative (V)

Scale svar()

Weighting wvar()

Timestamp tvar()

Categorical (V) cvar()

Ordered (V) Nominal nvar()

Ordered Binary bvar() Ordinal ovar()

Figure: Class diagramme of objects inheriting of the Variable class.

Rousseaux E. – The Dataset Project May 24, 2012 15/30

Page 60: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Standard builders for VariablesI svarI cvarI ovar

Rousseaux E. – The Dataset Project May 24, 2012 16/30

Page 61: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Standard builders for VariablesI svarI cvarI ovar

Rousseaux E. – The Dataset Project May 24, 2012 16/30

Page 62: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Standard builders for VariablesI svarI cvarI ovar

Rousseaux E. – The Dataset Project May 24, 2012 16/30

Page 63: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Standard builders for VariablesI svarI cvarI ovar

Rousseaux E. – The Dataset Project May 24, 2012 16/30

Page 64: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Demo 1

Working with Variable objects

Rousseaux E. – The Dataset Project May 24, 2012 17/30

Page 65: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 66: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 67: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 68: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 69: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 70: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 71: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 72: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

The Dataset object is represented byI variables: list of variablesI name: name of the datasetI description: a long labelI row.names: names for rowsI weights: a variable used for weightingI control: some control variablesI infos: a list for storing other information the user want to

share

Rousseaux E. – The Dataset Project May 24, 2012 18/30

Page 73: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

How to create a Dataset objectI from an existing data.frameI from a SPSS fileI by hand (as a list of Variable objects)

Rousseaux E. – The Dataset Project May 24, 2012 19/30

Page 74: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

How to create a Dataset objectI from an existing data.frameI from a SPSS fileI by hand (as a list of Variable objects)

Rousseaux E. – The Dataset Project May 24, 2012 19/30

Page 75: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

How to create a Dataset objectI from an existing data.frameI from a SPSS fileI by hand (as a list of Variable objects)

Rousseaux E. – The Dataset Project May 24, 2012 19/30

Page 76: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

How to create a Dataset objectI from an existing data.frameI from a SPSS fileI by hand (as a list of Variable objects)

Rousseaux E. – The Dataset Project May 24, 2012 19/30

Page 77: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Demo 2

Preprocessing step: starting from a data set in a spss file, wewant get the data in R, et create our data set for our study

importing - recoding - exporting

Rousseaux E. – The Dataset Project May 24, 2012 20/30

Page 78: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Native operations on Dataset/Variable objectsI recoding VariablesI bivariate analysisI logistic regression

Rousseaux E. – The Dataset Project May 24, 2012 21/30

Page 79: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Native operations on Dataset/Variable objectsI recoding VariablesI bivariate analysisI logistic regression

Rousseaux E. – The Dataset Project May 24, 2012 21/30

Page 80: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Native operations on Dataset/Variable objectsI recoding VariablesI bivariate analysisI logistic regression

Rousseaux E. – The Dataset Project May 24, 2012 21/30

Page 81: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Native operations on Dataset/Variable objectsI recoding VariablesI bivariate analysisI logistic regression

Rousseaux E. – The Dataset Project May 24, 2012 21/30

Page 82: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

IntroductionObject VariableObject DatasetPerforming a preprocessing stepNative analysis tools provided

Demo 3

Launching analysis

bivariate analysis - logistic regression

Rousseaux E. – The Dataset Project May 24, 2012 22/30

Page 83: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Plan

Introduction

The Dataset package

Perspectives

Rousseaux E. – The Dataset Project May 24, 2012 23/30

Page 84: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Finish up implementingI Weights managment toolsI Basic spatial toolsI Easy data capture of a surveyI (Detection/correction of bugs)

Rousseaux E. – The Dataset Project May 24, 2012 24/30

Page 85: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Finish up implementingI Weights managment toolsI Basic spatial toolsI Easy data capture of a surveyI (Detection/correction of bugs)

Rousseaux E. – The Dataset Project May 24, 2012 24/30

Page 86: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Finish up implementingI Weights managment toolsI Basic spatial toolsI Easy data capture of a surveyI (Detection/correction of bugs)

Rousseaux E. – The Dataset Project May 24, 2012 24/30

Page 87: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Finish up implementingI Weights managment toolsI Basic spatial toolsI Easy data capture of a surveyI (Detection/correction of bugs)

Rousseaux E. – The Dataset Project May 24, 2012 24/30

Page 88: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Finish up implementingI Weights managment toolsI Basic spatial toolsI Easy data capture of a surveyI (Detection/correction of bugs)

Rousseaux E. – The Dataset Project May 24, 2012 24/30

Page 89: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Taking spatial data into account

Figure: Poor/Good SRH ratio. PSM 2011, wave 2010 (no weighted)

Rousseaux E. – The Dataset Project May 24, 2012 25/30

Page 90: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

stDataset: handling longitudinal survey dataI Same design as the Dataset packageI Longitudinal summary (in PDF)I Manipulating trajectories directlyI Construction of an object "life course" ready for analysis

Rousseaux E. – The Dataset Project May 24, 2012 26/30

Page 91: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

stDataset: handling longitudinal survey dataI Same design as the Dataset packageI Longitudinal summary (in PDF)I Manipulating trajectories directlyI Construction of an object "life course" ready for analysis

Rousseaux E. – The Dataset Project May 24, 2012 26/30

Page 92: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

stDataset: handling longitudinal survey dataI Same design as the Dataset packageI Longitudinal summary (in PDF)I Manipulating trajectories directlyI Construction of an object "life course" ready for analysis

Rousseaux E. – The Dataset Project May 24, 2012 26/30

Page 93: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

stDataset: handling longitudinal survey dataI Same design as the Dataset packageI Longitudinal summary (in PDF)I Manipulating trajectories directlyI Construction of an object "life course" ready for analysis

Rousseaux E. – The Dataset Project May 24, 2012 26/30

Page 94: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

stDataset: handling longitudinal survey dataI Same design as the Dataset packageI Longitudinal summary (in PDF)I Manipulating trajectories directlyI Construction of an object "life course" ready for analysis

Rousseaux E. – The Dataset Project May 24, 2012 26/30

Page 95: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Presentations forthcomingI (LACOSA) The ’Dataset’ Project (poster)I (SRSSS) Manipulating panel data with stDatasetI (SRSSS) Handling weights with Dataset and stDataset,

illustration with the PSM

Rousseaux E. – The Dataset Project May 24, 2012 27/30

Page 96: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Presentations forthcomingI (LACOSA) The ’Dataset’ Project (poster)I (SRSSS) Manipulating panel data with stDatasetI (SRSSS) Handling weights with Dataset and stDataset,

illustration with the PSM

Rousseaux E. – The Dataset Project May 24, 2012 27/30

Page 97: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Presentations forthcomingI (LACOSA) The ’Dataset’ Project (poster)I (SRSSS) Manipulating panel data with stDatasetI (SRSSS) Handling weights with Dataset and stDataset,

illustration with the PSM

Rousseaux E. – The Dataset Project May 24, 2012 27/30

Page 98: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

For the Dataset packageReleasing the stDataset packagePresentations forthcoming

Presentations forthcomingI (LACOSA) The ’Dataset’ Project (poster)I (SRSSS) Manipulating panel data with stDatasetI (SRSSS) Handling weights with Dataset and stDataset,

illustration with the PSM

Rousseaux E. – The Dataset Project May 24, 2012 27/30

Page 99: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

Elements

of bibliography

Rousseaux E. – The Dataset Project May 24, 2012 28/30

Page 100: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

Elements of bibliography I

[Voorpostel et al.] Voorpostel, M., Tillmann, R, Lebert, F., Weaver, B., Kuhn, U., Lipps, O., Ryser,V.-A., Schmid, F., Rothenbühler, M., and Wernli, B. Swiss Household Panel Userguide(1999-2010), Wave 12. Lausanne: FORS.(October 2011).

Rousseaux E. – The Dataset Project May 24, 2012 29/30

Page 101: The Dataset Projectemmanuel.rousseaux.me/Rousseaux2012a.pdf · Introduction The Dataset package Perspectives About the speaker NCCR LIVES Goals of the thesis project What will be

IntroductionThe Dataset package

Perspectives

Thank you for your attention

Any question?

Rousseaux E. – The Dataset Project May 24, 2012 30/30