Session III: Establishing the base population · 2016-04-19 · Session III: Establishing the base...

Session III: Establishing the base population

8 March 2016

Cheryl Sawyer, Lina Bassarsky
Population Estimates and Projections Section

www.unpopulation.org

Regional Workshop on the Production of Population Projections
Addis Ababa, 7-11 March 2016

Establishing the base population

o An accurate base population is essential to ensure accuracy of the projection

o Steps to establishing the base population:
  – Detecting errors in data
  – Correcting distorted or incomplete data
  – Moving the population to mid-year

Evaluation of age and sex distribution data

o Graphical analysis
  – Population pyramids
  – Graphical cohort analysis

o Age and sex ratios

o Summary indices of error in age-sex data

What to look for in the evaluation

o Possible data errors, e.g.
  – Age misreporting (age heaping and/or age exaggeration)
  – Coverage errors: net underenumeration (by age or sex)

o Significant discrepancies in age-sex structure due to extraordinary events
  – High migration, war, famine, HIV/AIDS epidemic, etc.

Population pyramid – Detecting errors

o Age misreporting errors (heaping) among adults
o Underenumeration of young children (< age 2)
o High fertility level
o Smaller population in 20-24 age group
  >> extraordinary events in 1950-55?
o Fewer men relative to women in 20-44 age group
  >> labor out-migration?

[Figure: population pyramids for Yemen, 1975 Census, male and female population by age: one panel with the age axis labelled 0, 5, ..., 85+ and one panel by 5-year age group (0-4, ..., 85+)]

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Population pyramid – Line instead of bars

Population pyramid (bar chart)
>> Not always easy to determine differences by sex

Use of line chart

[Figure: Bangladesh, 2001 Census, shown as a population pyramid (male and female bars by age, 0 to 85+) and as a line chart (male and female population by age, 0 to 80+)]

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Population pyramid – Line instead of bars (2)

[Figure: Bangladesh, 2001 Census, two line charts of male and female population: one by age (0 to 80+) and one by age group]

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Graphical Cohort Analysis (1)

o Tracking actual cohorts over multiple censuses

o The size of each cohort should decline from one census to the next due to mortality, if there is no significant international migration

o The age structures (the lines) for successive censuses should follow the same pattern in the absence of census errors

o An important advantage: the effects of extraordinary events and other distorting factors can be evaluated by following actual cohorts over time
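
The cohort comparison described above can be sketched in Python. This is only a minimal illustration, assuming two censuses exactly 10 years apart with counts in 5-year age groups; the function names and the sample counts are hypothetical and are not taken from any census.

```python
def birth_cohort(census_year, age_start, width=5):
    """Label the birth cohort of an age group, e.g. ages 10-14 in 2007 -> '1992-1997'."""
    return f"{census_year - age_start - width}-{census_year - age_start}"


def cohort_ratios(earlier, later, earlier_year, gap=10):
    """Ratio of the later to the earlier count for each cohort found in both censuses.

    earlier, later: dicts mapping the lower bound of each 5-year age group to a count.
    gap: years between the two censuses (assumed to be a multiple of 5).
    """
    ratios = {}
    for age, count in earlier.items():
        later_count = later.get(age + gap)
        if later_count is not None and count > 0:
            ratios[birth_cohort(earlier_year, age)] = later_count / count
    return ratios


# Hypothetical counts, for illustration only
census_1997 = {0: 1000, 5: 900, 10: 800}
census_2007 = {10: 950, 15: 910, 20: 700}

for cohort, ratio in cohort_ratios(census_1997, census_2007, 1997).items():
    # Without immigration a cohort should shrink, so a ratio above 1 deserves a closer look
    flag = "check: cohort grew" if ratio > 1 else "declined as expected"
    print(cohort, round(ratio, 3), flag)
```

A ratio above 1 does not say which census is wrong; it only points to a cohort where underenumeration, age misreporting or migration should be investigated.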

Graphical Cohort Analysis (2)

Mozambique, 1997 and 2007 Censuses

Columns: Age; 1997 Census (Male, Female); 2007 Census (Male, Female); Birth cohort

0-4 1,353,206 1,388,350 2002-2007
5-9 1,112,321 1,113,675 1997-2002
10-14 947,236 878,429 1,222,668 1,183,939 1992-1997
15-19 774,327 854,078 925,729 991,323 1987-1992
20-24 637,113 827,614 774,413 986,526 1982-1987
25-29 509,109 654,465 707,603 841,416 1977-1982
30-34 410,148 477,562 583,689 667,865 1972-1977
35-39 373,813 428,395 481,396 556,191 1967-1972
40-44 270,046 303,147 366,518 389,087 1962-1967
45-49 257,070 282,098 321,236 328,660 1957-1962
50-54 178,902 212,060 231,232 283,288 1952-1957
55-59 162,122 174,234 194,011 208,657 1947-1952
60-64 114,335 125,096 140,146 159,557 1942-1947
65-69 100,425 109,288 113,840 127,794 1937-1942
70-74 47,407 50,607 72,288 81,329 1932-1937
75-79 41,529 42,858 55,448 61,012 1927-1932
80-84 15,305 17,326 22,417 28,278 1922-1927
85-89 16,576 19,448 1917-1922
90-94 4,803 5,883 1912-1917

o Data is organized by birth cohort
o Exclude open-ended age category
o People who were born in the same years are compared in the analysis

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Graphical Cohort Analysis (3)

Mozambique, 1997 and 2007 Censuses

[Figure: two line charts of population by birth cohort (1992-1997 back to 1907-1912), each comparing the 1997 Census with the 2007 Census: one panel for males, one for females]

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Age ratios (1)

o In the absence of sharp changes in fertility or mortality, significant levels of migration or other distorting factors, the enumerated size of a particular cohort should be approximately equal to the average size of the immediately preceding and following cohorts

o The ratio of the count for a particular cohort to the average of the counts for the adjacent cohorts (the age ratio) should be approximately equal to 1 (or 100 if multiplied by 100)

o Significant departures from this “expected” ratio indicate either error in the census enumeration or the presence of other factors
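
The age ratio defined above can be computed as in the following sketch. It assumes counts for consecutive 5-year age groups held in a plain list; the function name and the sample counts are hypothetical.

```python
def age_ratios(counts):
    """Age ratio for each interior age group: count / mean of the two adjacent counts.

    counts: population counts for consecutive 5-year age groups.
    Returns a list with None for the first and last groups, which lack a neighbour.
    """
    ratios = [None] * len(counts)
    for i in range(1, len(counts) - 1):
        neighbours = (counts[i - 1] + counts[i + 1]) / 2
        ratios[i] = counts[i] / neighbours if neighbours else None
    return ratios


# Hypothetical counts for five consecutive age groups
counts = [1200, 1000, 1100, 700, 600]
for ratio in age_ratios(counts):
    print("n/a" if ratio is None else round(ratio, 3))
```

Values well above or below 1 (here the third group stands out) mark the age groups to examine for misreporting or for real events such as migration.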

Age ratios (2) – example

[Figure: age ratios by 5-year age group for males and females, in two panels: Mozambique – 1997 Census (age groups 5-9 to 75-79) and Mozambique – 2007 Census (age groups 5-9 to 85-89)]

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Sex ratios (1) ‐ calculation

Sex ratio by age group

Sex Ratio = $M_x / F_x$ or Sex Ratio = $(M_x / F_x) \times 100$

Where
$M_x$ = Male population enumerated in a specific age group
$F_x$ = Female population enumerated in the same age group

Value of sex ratio    Interpretation
1                     Same number of men and women in a given age group
Above 1               More men than women in a given age group
Below 1               Fewer men than women in a given age group
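
As a small illustration of the calculation, the sketch below computes sex ratios per 100 women for a list of age groups. The function name and the counts are hypothetical.

```python
def sex_ratios(males, females, per=100):
    """Men per `per` women in each age group; None where no women were enumerated."""
    return [per * m / f if f else None for m, f in zip(males, females)]


# Hypothetical counts for three age groups
males = [1050, 900, 600]
females = [1000, 1000, 800]
print(sex_ratios(males, females))
```

Passing `per=1` gives the ratio on the scale used in the interpretation table above.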

Sex ratios (2) – Example

[Figure: sex ratio (number of men per 100 women) by age group, Mozambique, 1997 and 2007 Censuses]

Child sex ratio below 100:
> Under-enumeration of boys?
> Higher male child mortality?

No decline at adult and older ages:
> Higher female mortality?
> Higher female out-migration?
> Under-enumeration of adult and older women?
> …

Low sex ratios in 20s-30s:
Where are the men?
Or are women’s ages misreported?

Source: United Nations Statistics Division, Demographic Yearbook Statistical Database

Tools for evaluation of age/sex data

o PYRAMIDS.xlsx (UNPD)
  – New template by UNPD to produce population pyramids and line graphs by single-year and 5-year age group

o SINGAGE.xls (PASEX)
  – Graphs data by single year of age
  – Calculates indicators of age misreporting (Whipple and Myers’ indices)

o AGESEX.xls (PASEX)
  – Calculates age ratios and sex ratios for 5-year age groups

o GRPOP-YB.xls (PASEX)
  – Plots cohorts from 2-3 censuses
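
To make the age-misreporting indicators more concrete, here is a sketch of Whipple's index in its usual textbook form: the population reported at ages ending in 0 or 5 between 25 and 60, compared with one fifth of the population aged 23 to 62. It is not taken from SINGAGE.xls, whose exact output may differ; the function name and the test data are hypothetical.

```python
def whipple_index(pop_by_age):
    """Whipple's index from counts by single year of age (index = age).

    100 means no preference for ages ending in 0 or 5;
    500 means every reported age in the range ends in 0 or 5.
    """
    heaped = sum(pop_by_age[a] for a in range(25, 61, 5))  # ages 25, 30, ..., 60
    total = sum(pop_by_age[a] for a in range(23, 63))      # ages 23 to 62
    return 100 * 5 * heaped / total


# Hypothetical single-year counts: flat, then with extra people at ages 30 and 40
flat = [100] * 100
heaped = list(flat)
heaped[30] += 50
heaped[40] += 50
print(whipple_index(flat), round(whipple_index(heaped), 1))
```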

Correcting for age misreporting (smoothing)

o It is important to smooth reported age distributions if they are erratic

o PASEX spreadsheet AGESMTH.xls implements 5 methods, of 3 types

1. Do not modify the total population: accept the population in each 10-year age group, then divide it into 5-year age groups
  – Carrier-Farrag
  – Karup-King-Newton
  – Arriaga’s formula (also the first and last group)

Age      Population
20‐29    a
30‐39    b
40‐49    c

Pop (35‐39) = f(a, b, c)
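
To illustrate what f(a, b, c) can look like, the sketch below splits the middle 10-year group using two of the formulas named above, in the form in which they are commonly documented for the PAS workbooks. It is an illustration rather than a copy of AGESMTH.xls, and it covers only interior groups (not Arriaga's separate formulas for the first and last groups). Function names and sample values are hypothetical.

```python
def split_arriaga(a, b, c):
    """Split 10-year group b into (first, second) 5-year groups, Arriaga's formula.

    a and c are the preceding and following 10-year groups.
    """
    second = (-a + 11 * b + 2 * c) / 24
    return b - second, second


def split_karup_king_newton(a, b, c):
    """Split 10-year group b into (first, second) 5-year groups, Karup-King-Newton."""
    first = b / 2 + (a - c) / 16
    return first, b - first


# Hypothetical 10-year totals for ages 20-29, 30-39 and 40-49
a, b, c = 12.0, 10.0, 8.0
print(split_arriaga(a, b, c))            # (Pop 30-34, Pop 35-39)
print(split_karup_king_newton(a, b, c))
```

Both functions return two values that add up to b, which is the sense in which this type of method leaves the 10-year totals, and therefore the total population, unchanged.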

Correcting for age misreporting (smoothing)

2. Slightly modifying the total population: smoothing the 5-year age groups
  – The United Nations Method

3. Strong smoothing: modifying totals based on consecutive 10-year age groups, then using Arriaga’s formula for the 5-year population
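
As an illustration of the second type, the sketch below applies the five-term weighted average usually cited as the United Nations formula to a list of 5-year age groups. The weights are quoted from the general literature, not from the slides, and the treatment of the two groups at each end (left unsmoothed here) is an assumption; the function name and data are hypothetical.

```python
def un_smooth(counts):
    """Smooth 5-year age groups with weights (-1, 4, 10, 4, -1) / 16.

    The first two and last two groups are returned unchanged because the
    formula needs two neighbours on each side.
    """
    smoothed = list(counts)
    for i in range(2, len(counts) - 2):
        smoothed[i] = (
            -counts[i - 2] + 4 * counts[i - 1] + 10 * counts[i]
            + 4 * counts[i + 1] - counts[i + 2]
        ) / 16
    return smoothed


# Hypothetical counts with an implausible bulge in the middle group
counts = [100, 100, 130, 100, 100]
print(un_smooth(counts))
print(sum(counts), sum(un_smooth(counts)))  # the total shifts slightly
```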

Smoothing Example

PASEX spreadsheet – AGESMTH.xls (using workshop sample data)

Smoothing age patterns – Caution needed

o No generalized solution for all populations
o Methods produce similar results
o Technique used depends on errors in age-sex distribution
o Be cautious in using strong smoothing
o If only part of the population distribution is problematic, there is no need to smooth the entire age distribution

Smoothing with correction for underreporting

o Smoothing methods above do not make any adjustment for underreporting

o Need to adjust the census population for projection input

o PASEX spreadsheet BASEPOP.xls
  – Implements Arriaga smoothing methods for ages 10+
  – Adjusts age groups 0-4 and 5-9 based on recent fertility and mortality
  – Can apply an overall adjustment (based on post-enumeration survey, etc.)
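
The idea behind adjusting the youngest age group from fertility and mortality can be sketched as follows: survive recent births to the census date with life-table person-years. This shows only the principle, not the BASEPOP procedure itself, which is more detailed. The function name, the assumed life-table radix of 100,000, the annual births and the enumerated count are all hypothetical; the two nLx values are the male values from the "2.5 years before" column of the workshop example table.

```python
def expected_under5(annual_births, L0_1, L1_4, radix=100_000):
    """Expected population aged 0-4 for one sex.

    annual_births: average yearly births of that sex over the 5 years before the census.
    L0_1, L1_4: life-table person-years lived at ages 0 and 1-4 (1L0 and 4L1).
    radix: life-table births (l0); 100,000 is an assumption here.
    """
    return annual_births * (L0_1 + L1_4) / radix


# Hypothetical inputs: 100,000 male births per year and 430,000 boys enumerated at ages 0-4
expected = expected_under5(100_000, 95_975, 375_137)
enumerated = 430_000
print(expected, round(expected / enumerated, 3))  # implied adjustment factor
```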

Smoothing with correction for underreporting (BASEPOP)‐ example

Dates converter.xlsx

Reported population

If there is a basis to adjust the total population, enter desired total by sex here

B. Life Table Stationary Population (nLx)

             Data for interpolation    Estimates used
Function     Earlier      Later        Past       2.5 years   7.5 years
and age      date         date         year       before      before
Dates        2008.00      2013.00      2012.15    2010.15     2005.15

MALE
1L0           95,484       96,627       96,433     95,975      94,832
4L1          371,250      380,289      378,752    375,137     366,099
5L5          455,542      469,824      467,396    461,683     447,401

FEMALE
1L0           96,031       97,117       96,933     96,498      95,412
4L1          374,362      382,972      381,509    378,064     369,454
5L5          460,198      474,036      471,683    466,148     452,310
5L15         448,683      465,294      462,470    455,825     439,214
5L20         443,265      460,995      457,981    450,889     433,159
5L25         433,937      454,382      450,907    442,729     422,284
5L30         417,879      444,143      439,678    429,173     402,909
5L35         394,484      429,273      423,359    409,443     374,654
5L40         368,375      411,117      403,851    386,754     344,011
5L45         345,430      393,572      385,388    366,131     317,989
5L50         324,513      376,571      367,721    346,898     294,840
5L55         304,609      358,897      349,668    327,953     273,665

User inputs nLx estimates from life tables for two nearby dates

Workbook interpolates these 3 columns
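
The interpolated columns in the table above look consistent with straight-line interpolation between the two input dates (and extrapolation for the date that falls before the earlier life table). The sketch below reproduces the male 1L0 row under that assumption; whether the workbook computes it exactly this way is not stated in the slides, and the displayed dates may be rounded, so small differences are possible. The function name is hypothetical.

```python
def interpolate(value_earlier, value_later, date_earlier, date_later, target_date):
    """Straight-line interpolation (or extrapolation) of a value between two dates."""
    slope = (value_later - value_earlier) / (date_later - date_earlier)
    return value_earlier + slope * (target_date - date_earlier)


# Male 1L0 from the table: 95,484 at 2008.00 and 96,627 at 2013.00
for target in (2012.15, 2010.15, 2005.15):
    print(target, round(interpolate(95_484, 96_627, 2008.0, 2013.0, target)))
```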

User inputs age‐specific fertility estimates for two nearby dates

Workbook interpolates these 3 columns

Smoothing with correction for underreporting (BASEPOP)‐ Results

Getting to a mid‐year estimate

o Projection software needs a mid-year population as base

o PASEX spreadsheet MOVEPOP.xls shifts population from one date to another

o Inputs:
  – Date of census
  – Date desired
  – Population by age and sex
  – Mortality pattern (mx)
  – Fertility pattern (ASFR)
  – Total net migration
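
A much simplified sketch of moving a population between two dates is given below. It derives one growth rate from births, deaths and net migration and applies the same exponential factor to every age group; this is a stand-in for illustration, and MOVEPOP.xls may treat the inputs differently. All function names and numbers are hypothetical.

```python
from datetime import date
from math import exp


def decimal_year(d):
    """Convert a calendar date to a decimal year, e.g. 1 January 2007 -> 2007.0."""
    start = date(d.year, 1, 1)
    end = date(d.year + 1, 1, 1)
    return d.year + (d - start).days / (end - start).days


def growth_rate(births, deaths, net_migration, population):
    """Annual growth rate implied by yearly births, deaths and net migration."""
    return (births - deaths + net_migration) / population


def move_population(pop, rate, census_date, target_date):
    """Scale every age group by exp(rate * years between the two dates)."""
    years = decimal_year(target_date) - decimal_year(census_date)
    factor = exp(rate * years)
    return {group: count * factor for group, count in pop.items()}


# Hypothetical census of 1 August 2007, moved back to mid-year (1 July 2007)
pop = {"0-4": 1000.0, "5-9": 900.0}
rate = growth_rate(births=80, deaths=30, net_migration=-5, population=1900)
print(move_population(pop, rate, date(2007, 8, 1), date(2007, 7, 1)))
```

Because the factor is the same for every group, this sketch preserves the age structure and changes only the size of the population.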

Getting to a mid‐year estimate (MOVEPOP)‐ example

Adjusted population from BASEPOP

Mx from life table

Age‐specific fertility rates estimate

Getting to a mid‐year estimate (MOVEPOP)‐ results

Thank you

Questions?

>> until 11 March:

>> After 11 March: [email protected]@un.org

Regional Workshop on the Production of Population Projections
Addis Ababa, 7-11 March 2016