DDI and the Lifecycle of Longitudinal Surveys

39
DDI and the Lifecycle of Longitudinal Surveys Larry Hoyle, IPSR, Univ. of Kansas Joachim Wackerow, GESIS - Leibniz Institute for the Social Sciences

description

DDI and the Lifecycle of Longitudinal Surveys. Larry Hoyle, IPSR, Univ. of Kansas Joachim Wackerow, GESIS - Leibniz Institute for the Social Sciences. Google Images Search: "Data Lifecycle". DDI View – (Cross-sectional Study). Longitudinal Studies Modify this View. The cycle will repeat - PowerPoint PPT Presentation

Transcript of DDI and the Lifecycle of Longitudinal Surveys

Page 1: DDI and the Lifecycle of Longitudinal Surveys

DDI and the Lifecycle of Longitudinal Surveys

Larry Hoyle, IPSR, Univ. of KansasJoachim Wackerow, GESIS - Leibniz Institute for

the Social Sciences

Page 2: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 2

Google Images Search: "Data Lifecycle"

Page 3: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 3

DDI View – (Cross-sectional Study)

Page 4: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 4

Longitudinal Studies Modify this View

• The cycle will repeat• Any stage may influence another– E.g. analysis may lead to reconceptualization

• Archiving may be done at multiple points– E.g. to preserve the state of data as of publications

Page 5: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 5

An Extended Time Frame

• Things change over time– Procedures– Measures– Populations– Staff– External events– Laws

• Documenting these is important for data use, interpretation, and critical for study replication.

Page 6: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 6

A study is born

Page 7: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 7

Checksum of Study Design Document Could be Archived

Page 8: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 8

Multiple Collection Processes Begin

Page 9: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 9

Processing – (e.g. Data Cleaning, Restructuring, Recoding)

Page 10: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 10

Initial Data are Archived

Page 11: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 11

Initial Distribution

Page 12: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 12

Initial Distribution – Possibly From Archive

Page 13: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 13

Initial Data Discovery

Page 14: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 14

Initial Data Analysis

Page 15: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 15

Initial Data Analysis and Data Archived

Page 16: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 16

Publications – Reference and Referenced by Archive

Page 17: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 17

SECOND WAVE – Revised Concept

Page 18: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 18

SECOND WAVE – Data Collection

Page 19: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 19

SECOND WAVE – Data Processing

Page 20: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 20

SECOND WAVE – Processing Uses Feedback from Stage 1

Here something learned in the initial distribution affects future processing. This should be recorded.

Page 21: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 21

SECOND WAVE – Processing Uses Feedback from Stage 1

These metadata flows may happen between many stages, e.g. from processing to later collection.

Page 22: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 22

SECOND WAVE – Distribution

Page 23: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 23

SECOND WAVE – Discovery

Page 24: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 24

Final Analysis Archived

Page 25: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 25

A Kansan's Cyclone View

Page 26: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 26

Gantt View – Initial Design

Page 27: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 27

Gantt with Data Flow (Blue)

Much of this movement of data between stages is planned from the beginning of the project

Page 28: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 28

Gantt With Planned Data and Metadata Flow

Metadata are generated as data move through the project, as well as before any data are gathered.

Page 29: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 29

Gantt – Collection Changes Project Concept

Some metadata are unanticipated. Here something learned during the first collection phase causes a reconceptualization

Page 30: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 30

Gantt – Discovery Changes Future Collection

Here something learned during discovery changes future collection

Page 31: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 31

DDI Features for the Lifecyclein DDI Lifecycle (DDI 3 Branch)

• Metadata structure– Machine actionable: drive research, automate reuse– Comprehensive description, including external aids to

data collection (e.g. audio, video), sampling• Resource Package– Reusable elements– Documented comparability

• Comparison– Documents similarities and differences

• Lifecycle Events

Page 32: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 32

DDI Features for the Lifecycle

• Series– Relationships among waves

• Versioning– Because we know things change

• Access Controls• Control Constructs (Questionnaire Flow)• Summary Statistics• Embedded Data

Page 33: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 33

Metadata Structure

• Generate survey instruments, samples – Iverson and Amin Metadata Driven Survey Design

• Administer Surveys – E.g. DDI <-> Blaise, CASES, CSPro

• Tables and graphs• Automated data import

Page 34: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 34

Resource Package(i.e. study-independent information)

• Reuse of elements by reference– Harmonization across waves, studies

• Versioning and Comparison elements– Documents changes– Facilitates harmonization efforts

Page 35: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 35

Comparisons

• Commonality and difference between– Concepts– Variables– Questions– Categories– Codes– Universes (survey population)

Page 36: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 36

Lifecycle Events

• Recording events internal and external to the project aids in interpretability.

• Internal examples– Staff changes– Procedural changes

• External events– Major political or economic events, natural

disasters

Page 37: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 37

Other Resources

• Dagstuhl Workshop on "Best Practices for Longitudinal Data", 2010

http://www.ddialliance.org/resources/publications/working/BestPractices/LongitudinalData– Inclusion of non-survey data e.g. biomarkers– Comparison– Presentation

Page 38: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 38

Representing Longitudinal Data in DDI(Extract)

Level Dimension Description DDI Tag(s) Project/Study (highest level)

Management Executive control, scientific leadership, funding, etc.

Group Citation Purpose Abstract FundingInformation Archive Module

Organization Individual Role (research, management, funding, etc.) Location Email Telephone

Access How to obtain data and any restrictions on access

Group/Subgroup/StudyUnit Archive Module

AccessConditions AccessPermissions ConfidentialityStatement Restrictions LifecycleInformation

Longitudinal Survey

Sample Design and Procedures

Universe: Population being sampled: Refreshment strategy; Replacement strategy; Potential errors

Group ConceptualComponents

Universe Concept

DataCollection Methodology SamplingProcedure DeviationFromSampleDesign ActionToMinimizeLosses

Page 39: DDI and the Lifecycle of Longitudinal Surveys

ESRA 2011 Hoyle and Wackerow 39

http://www.ddialliance.org/