Software for data management: The contribution of Stata

Page 1: Software for data management: The contribution of Stata

Software for data management: The contribution of Stata

Dr Karen Robson, Senior Research Fellow, The Geary Institute, University College Dublin, Ireland

Page 2: Software for data management: The contribution of Stata

Getting acquainted with Stata

StataCorp develops and distributes Stata, software for statistical analysis.

Stata is available for Windows, Macintosh, and Unix computers.

Stata is used by medical researchers, biostatisticians, epidemiologists, economists, sociologists, political scientists, geographers, psychologists, social scientists, and other research professionals needing to analyze data.

Gaining popularity in the social and medical sciences

Particularly useful for handling large-scale longitudinal data

Page 3: Software for data management: The contribution of Stata

Stata SE (for large data sets)

can analyze datasets with as many as 32,766 variables, and the only limit on observations is the amount of RAM on your computer

can handle string variables with a maximum length of 244 characters

can handle matrices up to 11,000 x 11,000

requires at least 512 megabytes of RAM and 80 megabytes of disk space

Page 4: Software for data management: The contribution of Stata

Stata/Intercooled (the standard one)

can analyze datasets with as many as 2,047 variables, and the only limit on observations is the amount of RAM on your computer

can handle string variables with a maximum length of 244 characters

can handle matrices up to 800 x 800.

Page 5: Software for data management: The contribution of Stata

Small Stata

A smaller, student version of Stata (for educational purchases only)

Page 6: Software for data management: The contribution of Stata

Stata MP

The fastest version of Stata (for dual-core and multicore/multiprocessor computers)

Stata/MP is the fastest and largest version of Stata.

Page 7: Software for data management: The contribution of Stata

Resources

StataCorp website (www.stata.com)
Timberlake website (www.timberlake.co.uk)
UCLA Stata “portal” (http://www.ats.ucla.edu/stat/ or statcomp.ats.ucla.edu/stata)
Statalist (www.hsph.harvard.edu/statalist)
Stata Journal (www.stata-journal.com)

Page 16: Software for data management: The contribution of Stata
Page 17: Software for data management: The contribution of Stata
Page 18: Software for data management: The contribution of Stata

As well, available Dec 2008

Page 19: Software for data management: The contribution of Stata

Launching Stata

OS contingent
Default window preferences
Window preferences fully adjustable
Auto memory set

Page 20: Software for data management: The contribution of Stata
Page 21: Software for data management: The contribution of Stata
Page 22: Software for data management: The contribution of Stata
Page 23: Software for data management: The contribution of Stata

Comparing with SPSS

Start-up differences
With data file open
Viewing data: data viewer, data editor
Viewing variables
Viewing output/commands: output window buffer, log files
Syntax and “do files”

Page 37: Software for data management: The contribution of Stata
Page 38: Software for data management: The contribution of Stata

INPUT: Stata command window, Do file, Pull-down menu, Variable window, Review window

Computation

RESULTS: Output window, Log file

Page 39: Software for data management: The contribution of Stata

Advantages and disadvantages of Stata

User driven
Free STBs
Dedicated journal
Web active
Memory requirements
Backward compatible

Change!
SPSS dominance
Orientated to writing syntax/code
Pull-down windows debate! Now in version 8 forward

Page 40: Software for data management: The contribution of Stata

Advantages and disadvantages of Stata

Easier code
Easier data handling
Clarity of operations/feedback
Results table function

Before version 8, limited graphics
Now, complex graphics
Variable labelling
Editing of output

Page 41: Software for data management: The contribution of Stata

Advantages and disadvantages of Stata

Nested/master do files
Flexible terminology
Setting types of data
Interactive help
Switch output (log file) on/off

Copy and paste

Page 42: Software for data management: The contribution of Stata
Page 43: Software for data management: The contribution of Stata

Overview of analytic techniques

Too numerous to mention!
Comprehensive manuals
A selection:

All types of regression
Survey package
Epidemiological package
Multilevel modelling
Time series functions
Cluster analysis

Page 44: Software for data management: The contribution of Stata

Data

Data files: .dta
Stat/Transfer software

Page 45: Software for data management: The contribution of Stata
Page 46: Software for data management: The contribution of Stata

Stata – using wide and long file formats

Wide file formats (everything you add goes to the right of the existing data)

Long file formats (everything you add goes underneath the existing data)
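The difference between the two layouts can be sketched in plain Python. This is only an illustration of the idea, not Stata syntax: the records, the `income` column stub, and the helper `wide_to_long` are all hypothetical names invented for the example.

```python
# Hypothetical wide-format data: one row per person, one column per wave.
wide = [
    {"id": 1, "income1": 100, "income2": 110},
    {"id": 2, "income1": 200, "income2": 220},
]


def wide_to_long(rows, id_key, stub):
    """Turn columns named <stub><wave> into one row per (id, wave) pair."""
    long_rows = []
    for row in rows:
        for key, value in row.items():
            suffix = key[len(stub):]
            # Only columns such as income1, income2 are treated as waves.
            if key.startswith(stub) and suffix.isdigit():
                long_rows.append(
                    {id_key: row[id_key], "wave": int(suffix), stub: value}
                )
    return long_rows


long_data = wide_to_long(wide, "id", "income")
for record in long_data:
    print(record)
```

In the wide layout a new wave becomes a new column to the right; in the long layout the same information becomes additional rows underneath.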

Page 47: Software for data management: The contribution of Stata

MERGE: Data 1 and Data 2 joined side by side

APPEND: Data 2 added underneath Data 1
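As a rough sketch of the two operations, the plain Python below stacks one hypothetical dataset under another (append) and joins a second one on a shared identifier (merge). The variable names and values are assumptions made up for the example, and the code is not Stata syntax.

```python
# Hypothetical datasets; "id" is the assumed matching key.
data1 = [{"id": 1, "age": 30}, {"id": 2, "age": 41}]
data2_new_people = [{"id": 3, "age": 25}]
data2_new_variable = [{"id": 1, "sex": "f"}, {"id": 2, "sex": "m"}]

# APPEND: the second dataset goes underneath the first (more rows).
appended = data1 + data2_new_people

# MERGE: the second dataset goes alongside the first (more columns),
# matched on the shared key.
lookup = {row["id"]: row for row in data2_new_variable}
merged = [{**row, **lookup.get(row["id"], {})} for row in data1]

print(appended)
print(merged)
```

Appending leaves the set of variables unchanged and adds cases; merging leaves the cases unchanged and adds variables.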

Page 48: Software for data management: The contribution of Stata

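Data 1 (indi): ‘master’
Data 2 (indj): ‘using’

_merge values
1: observation found in the master data only
3: observation found in both master and using data
2: observation found in the using data only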
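The way the _merge codes arise can be sketched in plain Python. This is a minimal illustration that assumes each key appears at most once in each dataset; the function `merge_with_indicator`, the key `pid`, and the sample values are hypothetical, and the code is not Stata syntax.

```python
def merge_with_indicator(master, using, key):
    """Match rows on `key` and tag each result with a _merge-style code.

    1 = row found in master only, 2 = using only, 3 = found in both.
    Assumes `key` is unique within each dataset.
    """
    using_by_key = {row[key]: row for row in using}
    master_keys = set()
    result = []
    for row in master:
        master_keys.add(row[key])
        match = using_by_key.get(row[key])
        if match is None:
            result.append({**row, "_merge": 1})
        else:
            result.append({**row, **match, "_merge": 3})
    # Rows that exist only in the using data are added at the end.
    for row in using:
        if row[key] not in master_keys:
            result.append({**row, "_merge": 2})
    return result


master = [{"pid": 1, "wage": 10}, {"pid": 2, "wage": 12}]
using = [{"pid": 2, "hours": 38}, {"pid": 3, "hours": 20}]
merged = merge_with_indicator(master, using, "pid")
for record in merged:
    print(record)
```

Tabulating the indicator after a merge is a quick check on how many cases matched and how many came from only one of the two files.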