13.Analysis Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data.

30
13. Analysis Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Transcript of 13.Analysis Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data.

13. Analysis Demonstration: Descriptive/Comparative AnalysisUsing Longitudinal Data

2

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Prerequisites• Recommended modules to complete before

viewing this module 1. Introduction to the NLTS2 Training Modules 2. NLTS2 Study Overview 3. NLTS2 Study Design and Sampling NLTS2 Data Sources, either

• 4. Parent and Youth Surveys or• 5. School Surveys, Student Assessments, and Transcripts

9. Weighting and Weighted Standard Errors

3

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Prerequisites

• Recommended modules to complete before viewing this module (cont’d) NLTS2 Documentation

• 10. Overview• 11. Data Dictionaries• 12. Quick References

• Helpful Implications for analysis

• 6. Data Content• 7. Parent/Youth Survey Data

4

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Overview Posing questions to the NLTS2 database

• Types of questions; getting to the answers Comparative variables Analysis demonstration using longitudinal data Analysis plan for this demonstration Results output for this demonstration Interpretation of results for this demonstration Review steps required for this demonstration Closing Important information

5

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Posing Questions to the NLTS2database• Types of questions

Descriptive questions—e.g., means, frequency distributions for an identified group

Comparative questions—e.g., differences between subgroups of interest, tests of significance

Longitudinal questions—i.e., relationships between variables measured at different time points

Explanatory questions—i.e., questions regarding relationships between variables, usually requiring controls for covariates

6

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Examples of types of questions• Descriptive/comparative

How proficient are high school students with disabilities in reading? How does proficiency differ by disability category? By race/ethnicity?

• Longitudinal/comparative If youth has been employed, how does income change over time for

youth with disabilities as a group? How does it change at the individual level?

How do changes in wages over time differ by disability category?By race/ethnicity?

• Explanatory What factors relate to variations in the employment of youth with

disabilities?

7

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Getting to the answer1. Develop an analysis plan.

2. Identify/create variables. Identify data source. Specify variable names. Be aware of variable forms

(e.g., continuous, categorical,) and respondent subset.

Restructure variables if applicable (e.g., regroup into different categories).

Create variables as needed (e.g., scales from multiple items).

3. Select Analysis approach from

options appropriate to the question.

Software procedure(s) to execute approach.

4. Execute plan.

8

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Comparative variables• A set of demographic variables are commonly used for

comparative analyses in NLTS2 reports and publications. NLTS2 demographic variables are by wave and included in every

school and parent/youth sourced files.

• The demographic variables are Primary disability category Age category Race/ethnicity Gender Household income category

9

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Comparative variables• Grade level and urbanicity are used with school-based

data. Demographic variables are based on location of the

school district where youth attended secondary school.

School related demographic variables are included with school data files but not applicable to parent/youth survey files.

10

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Analysis demonstration usinglongitudinal data• Research questions for this demonstration

How do wages change over time for youth with disabilities as a group?

How do they change at the individual level? How do changes in youth wages over time differ by

parent’s household income?

11

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Analysis plan for this demonstration

• Select procedures that will describe these changes. In this demonstration, we will use procedures that

provide weighted frequencies, percentages, means, and standard errors.

• Select procedures to make comparisons. In this demonstration, we will produce

crosstabulations and means by a comparison variable and determine whether differences are statistically significant.

12

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Analysis plan for this demonstration

• Select variables that measure change. Compare items in Parent/Youth Wave 2 with Parent/Youth Wave 5. Wage youth earned at current or most recent job (Wave 2

np2HourlyWage compared with Wave 5 np5T4h_L4h).

• Select comparison variables. Comparison variables are often demographics such as primary

disability category, gender, and race/ethnicity. Parent/guardian’s household income will be the comparison

variable in this demonstration (W5_IncomeHdr2009).

13

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Analysis plan for this demonstration

• Compare time 1 and time 2. One approach to observe a difference between time 1

and time 2 is to compare a time 1 mean with a time 2 mean calculated across the entire sample.

To measure individual difference, an approach is to calculate the difference between that youth’s time 1 measure and his or her time 2 measure.

To calculate the difference between time 1 and time 2, there must be a value for both time 1 and time 2 to be included in this analysis.

14

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Analysis plan for this demonstration

• Missing values Expect to have fewer cases to analyze in a

longitudinal analysis than in a separate analysis of either time 1 or time 2.• Youth must have data in both waves to be included.

• Create new variables Difference = time 2 – time1 The order is important; subtracting time 1 from time 2

will result in a positive number if the change has been in a positive direction.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

15

Results output for this demonstration

• Means: hourly wages in Wave 2 and Wave 5• Mean: difference in wages between Wave 2 and

Wave 5• Percentage who had a decrease or increase in

wages• Crosstabs by parent/guardian household income

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

NLTS2 restricted-use data

• NLTS2 data are restricted.• Data used in these presentations are from a

randomly selected subset of the restricted-use NLTS2 data.

• Results in these presentations cannot be replicated with the NLTS2 data licensed by NCES.

16

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

17

Means of Hourly Wages in Wave 2 and Wave 5

EstimateStandard

ErrorUnweighted

Count

Mean Hourly pay youth earned at job (in or out of sch) in Wave 2 (np2HourlyWage) 7.34 .316 1229

Mean Hourly wage young adult earns at current or most recent job in Wave 5 (np5T4h_L4h) 10.17 .255 2189

Results output for this demonstration

These results cannot be replicated with full dataset; all outputin modules generated with a random subset of the full data.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

18

Mean Difference in Wages between Wave 2 and Wave 5

Estimate Standard ErrorUnweighted

Count

Mean Change in wages between Waves 2 and 5 (W25_Wage_chg) 3.40 .404 967

Results output for this demonstration

These results cannot be replicated with full dataset; all outputin modules generated with a random subset of the full data.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

19

Percentage Who Had a Decrease or Increase in Wages between Waves 2 and 5

W25_Wage_chgCatEstimate

Standard Error

Unweighted Count

% of Total (1) Decrease in wages(2) No change(3) Increase in wagesTotal

7.7%4.6%

87.7%100.0%

1.6%2.9%3.2%.0%

12530

812967

Results output for this demonstration

These results cannot be replicated with full dataset; all outputin modules generated with a random subset of the full data.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

20

Mean Difference in Wages, by Household Income between Wave 2 and Wave 5

Household Income (W5_IncomeHdr2009) EstimateStandard

ErrorUnweighted

Count(1) $25,000 or less Mean Change in wages between

Waves 2 and 5 (W25_Wage_chg)

2.70 .503 194

(1) $25,001 - $50,000 Mean (Change in wages between Waves 2 and 5 W25_Wage_chg)

3.62 .539 224

(1) More than $50,000

Mean (Change in wages between Waves 2 and 5 W25_Wage_chg)

3.63 .762 540

Results output for this demonstration

These results cannot be replicated with full dataset; all outputin modules generated with a random subset of the full data.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

21

Percentage Decrease or Increase in Wages by Household Income between Waves 2 and 5

(W25_Wage_chgCat)

(W5_IncomeHdr2009 Income)

(1) $25,000 or less

(2) $25,001–$50,000

(3) More than $50,000

Total

(1) Decrease in wages % within (W5_IncomeHdr2009) Income category for tables

Estimate 4.6% 6.8% 10.5% 7.7%

Standard Error 1.9% 2.0% 3.2% 1.6%

Unweighted Count 27 36 61 124

(2) No change % within (W5_IncomeHdr2009) Income category for tables

Estimate 11.1% 3.4% 1.1% 4.6%

Standard Error 9.3% 1.9% .5% 2.9%

Unweighted Count 8 6 16 30

(3) Increase in wages % within (W5_IncomeHdr2009) Income category for tables

Estimate 84.3% 89.8% 88.4% 87.6%

Standard Error 9.0% 2.9% 3.3% 3.2%

Unweighted Count 159 182 463 804

Total % within (W5_IncomeHdr2009) Income category for tables

Estimate 100.0% 100.0% 100.0% 100.0%

Standard Error .0% .0% .0% .0%

Unweighted Count 194 224 540 958

Results output for this demonstration

These results cannot be replicated with full dataset; all outputin modules generated with a random subset of the full data.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

22

Interpretation of results for thisdemonstration• Is it significant?

We use an Excel worksheet. Enter (1) the percentage and standard error for a

category and (2) the percentage and standard error for the comparison category.

The significance is calculated by a formula stored in Excel.

These results cannot be replicated with full dataset; all outputin modules generated with a random subset of the full data.

23

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Interpretation of results for this demonstration• Interpreting longitudinal results.

A positive number indicates a positive change between time 1 and time 2 for difference in wage.

A negative number indicates a negative change between time 1 and time 2.

Numbers close to 0 indicate little or no change.• What does it mean?

A positive or negative change may be simple to interpret, but what does little or no change mean?

• What explains the change? For example, postsecondary attendance might mean a

change in employment.

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

24

Interpretation of results for this demonstration• Was anything significant ?

Not with the randomly selected subset of data. The full data set might have a different result if the standard

errors were smaller. Or perhaps household income was not the best comparison

variable to use.• Things to think about

What other comparison variables might show significant differences between comparison groups?

Are there other difference in employment measures that might be worth exploring?

Were these the best waves of data to compare? Should any types of youth be included or excluded from this

analysis?

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

25

Interpretation of results for this demonstration• More things to think about

Did we have a good comparison group?• Would it have been better to select Waves 3 and 5 instead of 2

and 5 as more youth are out of secondary school in Wave 3 than in 2?

• Would it have been better to run the analysis on a subgroup of only those who were out of secondary school at each wave?

The answer is…it is important to select the appropriate subsample as well as to select the appropriate variables.

26

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Review steps required for thisdemonstration

• Plan for analysis Identify the research question. Find the variables needed. Determine if any variables need to recoded.

• Collapse categories.• Create a longitudinal measure.

Identify the appropriate subsample if applicable.

27

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

• Plan for analysis (cont’d) Identify the appropriate procedures.

• Descriptive• Crosstabs• Means• Means by comparison variables if calculating

means• Test significance of comparisons

Review steps required for thisdemonstration

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

28

Closing

• Topics discussed in this module Posing questions to the NLTS2 database Comparative variables Analysis demonstration using longitudinal data Analysis plan for this demonstration Results output for this demonstration Interpretation of results for this demonstration Review steps required for this demonstration

29

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Closing

• Next module 14a. Accessing Data Files in SPSS

or

14b. Accessing Data Files in SAS

30

13. Demonstration: Descriptive/Comparative Analysis Using Longitudinal Data

Important information

NLTS2 website contains reports, data tables, and other project-related information http://nlts2.org/

Information about obtaining the NLTS2 database and documentation can be found on the NCES website http://nces.ed.gov/statprog/rudman/

General information about restricted data licenses can be found on the NCES website http://nces.ed.gov/statprog/instruct.asp

E-mail address: [email protected]