GATRA Indonesia... · 2016. 3. 3. · ~ SIDOMlINClI[GATRA ~ r ~ ~!I ,; G :. L :.. -. The...

Paninfsank TC:JlAH\ oRNGI N,~ SIDOMlINClI[ GATRA

~ r ~ ~!I ,; G :. L :.. -.

The Determinants of Birthweight: Addressing Potential

Sample Selection Bias from Babies Who Are Not Weighed at

Birth!

Heni Wahyuni1

Introduction

The empirical literature reviewed with regard to the infant health production function has

focused on the issues relating to endogeneity and sample selection biases, caused by unobserved

health heterogeneity and the pregnancy-resolution decision (Liu 1998; ROllS, Jewell & Brown

2004). The first bias relates to endogeneity of prenatal care, while the second, in existing studies,

arises from a given woman's decision to abort or continue her pregnancy. Specifically,

unobservable factors that may influence a woman's decision to proceed with the pregnancy or

abort are factors that are also likely to influence her use of prenatal care and birth outcomes,

particularly birthweight. Sample selection bias relating to the decision to abort is unlikely to be a

problem in Indonesia, where abortion is socially unacceptable and only conducted for medical

reasons. There is a potential for selection bias, however, due to non-random missing information

on birthweight.

The potential sample selection bias that arises from birthweight being missing for some babies

(those not weighed at birth) is a common issue in developing countries and generally does not

occur in studies of birthweight in developed countries. If the birth weight information in the

sample is not missing at random, however, the analysis of the determinants of birthweight

(without considering unreported birthweight) will be biased (Heckman 1979). This represents a

possible sample selection issue, given that the data on a key variable (birthweight) are available

1 This paper has been presented at the National Seminar of the 60th anniversary of the Faculty of Economics andBusiness UGM Seminar, Balancing Indonesian Economy: Governance and Accountability, Ethics, and Strategytoward Inclusive Growth, 19 September 2015.2 Heni Wahyuni is a lecturer and researcher at Faculty of Economics and Business, Universitas Gadjah Mada. E-mail: [email protected]

only for a subset of the population, who are not weighed at birth. This is often referred to as

incidental truncation (Wooldridge 2002, p. 552).

Relatively few studies, however, have investigated the potential of sample selection bias from

unweighed babies in the relationship between prenatal care and infant health in developing

countries, including Indonesia. Among those few studies, two significant studies have considered

this issue (Habibov & Fan 2011; Mwabu 2009). Habibov and Fan (2011) used 73 percent of all

live births with birthweight in Azerbaijan to analyze the effect of prenatal care on birthweight.

Mwabu (2009) reports that only 17 percent of babies delivered at home and 75 percent of babies

delivered at modem facilities in Kenya have a reported birthweight. Those two studies tested the

potential bias, due to unweighed babies, and found no evidence of selection bias in their data.

One study has examined the impact of the village midwife program on birthweight in Indonesia

(Frankenberg & Thomas 2001), but it does not take into account the selection problem, arising

due to some babies not being weighed at birth.

This study will use the Indonesia Family Life Survey (lFLS) data, which are IFLS3 and IFLS4

data. The focus of the study is to test whether there is a sample selection bias on the determinants

of birthweight. Specifically, birthweight is the outcome of interest, but I will observe this

outcome, conditional upon whether or not the baby has been weighed at birth. The IFLS data for

live-birth babies, born during 2002-08, inclusively, indicates that approximately II percent of

babies were not weighed. It is not appropriate to eliminate unweighed babies from the sample

and only include an analysis of the pregnancy outcome (birthweight) of the subset of mothers

whose babies were weighed, unless the birthweight and whether or not the baby was weighed are

independent. Otherwise, it could lead to biased estimates. Furthermore, in Indonesia, the IFLS

data shows that babies not weighed at birth are more likely to be born at home or in the office of

midwives with a traditional birth assistant, as well as to low-income and less educated mothers.

Previous studies, for example Mwabu (2009), use various instruments such as money prices,

time prices, household assets and income, environmental factors (rainfall), interaction terms

between land and mean long-term rainfall, and between cattle and mean long-term rainfall.

However, these are not available in the IFLS data.

2

A Modeling Framework

Heckman Sample Selection Model

In an attempt to analyze the potential selection problem from unweighed babies, I have used a

Heckman selection model. The model consists of two equations. The selection equation (I)

represents whether the baby was weighed and the outcome equation (2) relates to birthweight,

(1)

_ {1 ifzi > 0Zi - O·f * - 0l Zi -

f xdJ + Eiifzi > 0Yi = l-unobservedifz; = 0 (2)

where z; is a latent variable measuring the propensity of a baby to be weighed at birth; Wi is a

vector of factors known to influence a baby's to be weighed at birth that includes prenatal care

usage, as well as the age of the mother when pregnancy ended, per capita expenditure, years of

schooling, household characteristics (HH index), health condition (general health and 8MI),

whether she smoked before or during pregnancy, and whether she experienced any complication

during the pregnancy, baby-specific characteristics (singleton or multiple birth, gender, order of

pregnancy, year of birth), and the cost of the delivery; Ui contains any unmeasured factors in

equation (I).

We do not actually observe z;. All we observe is a dichotomous variable z., where it is equal to

I if the baby was weighed and zero, otherwise; however, there is only a selected (censored)

sample for the birthweight equation, Yi.

The dependent variable in the selection equation (1) is a dummy variable, indicating whether the

baby was weighed or not and the dependent variable in the outcome equation (2) is the baby's

birthweight (in kilograms). The independent variables in the selection equation (1) are also

included, as independent variables (xa, in the outcome equation (2). In order for the model to be

identified, the selection equation should also have at least one independent variable that is not

3

included in the outcome equation. Otherwise, the model is identified only by functional form,

and the coefficients have no structural interpretation (see identification section).

The disturbance term is assumed to be normal with:

u(-N(O,l)

Er-N(O,l)

When p "* 0, standard OLS techniques applied to the birthweight equation will yield biased

results. If p = 0, then there is no selection problem and the standard OLS model is appropriate. It

is important to test if there is a potential selection problem - if babies that were not weighed at

birth to mothers who have different characteristics from those mothers whose babies were

weighed. The null hypothesis of no selection bias is Ho:P = O. The Heckman selection model is

used to determine whether there is a selection problem or not. The sample is split between babies

that were or were not weighed directly after delivery.

Identification

4

!

1

Identification is very important in the system of equations. As explained previously, most

unweighed babies were delivered at home with midwives or traditional birth attendants. One

reason for mothers to choose to deliver the baby at home is because it may be less costly than

doing so in a modem health facility with professional birth attendants. Any variable, therefore,

that represents the cost of delivery may potentially become an identifier for the selection ~f

whether the baby is delivered at home or at a modem facility. It also can be said that this

identifier is an exclusion restriction in the selection equation, which is whether the baby was

weighed or not. In this analysis, the cost of delivery as an identifier will be applied in the

selection equation and it will be excluded from the birthweight equation, since it is unlikely that

any measure of delivery cost will influence birthweight.

Data

The sample for the empirical analysis is restricted to live births for pregnancies which ended

between the years 2002 and 2008, inclusively, for ever-married women in IFLS4 (2007-08). A

pooled cross-section approach is used, but it utilizes the panel nature of the IFLS data to provide

information on a range of explanatory variables. After excluding observations with missing

responses;' the sample consists of 4,436 live births in which the birthweight is observed (the

baby is weighed). Including live births in which the birthweight is not observed (the baby is not

weighed), the number of observations is 5,023.

Descriptive Statistics

Table I presents the summary statistics for the sample, used in the empirical analysis, separately

tor mothers of babies that were weighed and not weighed. The average number of prenatal care

visits for mothers of babies weighed is 9.19, which is much greater than the 6.13 visits for those

unweighed. Similarly, the percentage of mothers who followed the WHO recommendation for

prenatal care visits is much higher for the weighed babies, approximately 84.49 percent,

compared with only about 54.86 percent for unweighed babies. The household index, as an

indicator of economic background, is higher for mothers of babies that were weighed than babies

that were unweighed. The average number of years of schooling was about 12 years or senior-

high school level for mothers whose babies were weighed, but only about 7 years or graduated

from primary school for those whose babies were not weighed. The lower level of maternal

education relating to unweighed babies may indicate a limited knowledge about health, which

may negatively impact pregnancy outcomes. Moreover, more than 55 percent of mothers whose

babies were weighed lived in urban areas, compared to approximately 17 percent of those with

unweighed babies. The average age at the end of the pregnancy was about the same for both

groups: 27 years of age. Similarly, there was not much difference in the 13M!of mothers and the

general health condition of the mothers. There were very low rates of smoking behavior and

pregnancy complications during pregnancy for both categories of mothers.

3 Responses were coded as missing are defined as the responses with an illogical answer; the surveyor could notmeet the respondents, such as in the case of 8M! (height &weight measurement) or when the respondent refused toanswer.

5

27.7143.17

1.2450.9794.2115.8955.95

892,835.304436

The inclusion of unweighed babies is important because not only are there a significant

percentage (approximately II percent) of babies not weighed in the sample, but the observed

characteristics of mothers of babies not weighed are different from those whose babies were

weighed, thus increasing a potential for bias. More specifically, these mothers are from a lower

socioeconomic background, compared to those whose babies were weighed, in terms of per

capita expenditure, years of education, owning a television, using electricity, having good

drinking water (the household index), and living in rural areas.

Table 1: Statistics Descriptive of Variables Per Category of Birthweight and Definitions

(in percent, unless otherwise indicated)

Variable Birthweight__________________________________________________ ~N~o~t_w~e~ig=h~ed~VVeighed

6.13 9.1954.86 84.493.71 5.457.29 11.99

27.47 27.7121.45 22.5387.56 88.8955.54 64.2957.92 29.13

Total number of prenatal care visit (visits)WHO recommendation"Household indexYears of education (years)Age of mother (years)Body mass index (index)Healthy mother (general health condition)First birthPer capita expenditure below 25th percentile of population levelPer capita expenditure between 25th and 50th percentile of populationlevelPer capita expenditure above 50th percentile of population levelSmoking behaviorMale babySingleton babyHaving pregnancy complicationLiving in urban areaCost of delivery of the baby (rupiahs)Number of observations

Source: IFLS3 (2000) and IFLS4 (2007-08).

24.1917.890.68

52.4792.8414.3117.38

126,338.20587

4 WHO recommends that the minimum number of prenatal care visits during pregnancy be four; with at least onevisit in the first trimester of pregnancy, at least once in the second trimester, and at least twice in the third trimester(World Health Organization 2005).

6

The delivery expense is the variable that differentiates mothers whose babies were and were not

weighed at birth. In general, difference in delivery cost is large between the two groups of

mother: 892,835.30 rupiahs (about A$100) for weighed babies and 126,338.20 rupiahs (A$14)

for unweighed babies. This is a key variable that will be included in the selection equation

(selection for weighed or not weighed), but will not be included in the outcome equation for

birthweight (in kilograms). Thus, it is assumed that the cost of delivery is an instrument in the

selection equation that will not influence birthweight.

Results and Discussion

Estimation results of the Heckman model with the selection problem are provided in Table 2.

The selection equation is the equation that represents whether the baby is weighed or not

weighed immediately following delivery. The estimate for rho indicates a weak correlation

between the selection and the birth outcome (jj = -0.0614). The negative estimate of rho may

appear counterintuitive; however, it is not statistically significant. The associated Wald-test of

independence of equations is not statistically significant (chi2 = 0.63) with p-value = 0.4289.

This suggests that the Heckman model with selection may not be appropriate in this case or, in

other words, there is no sample selection bias problem due to unweighed babies.

Although the Heckman model is not appropriate in this case, it is interesting to observe the

results of the selection equation. The estimated coefficient of money spent on delivery, as a key

variable, is significant in explaining whether the baby was weighed or not. Birthweights also are

more likely to be recorded for mothers who have more years of schooling, are from households

with a higher household socioeconomic index with per capita expenditure between 25th and so"percentiles of the population level. However, as there is no evidence of a selection bias, the

analysis can be continued on a single structural equation for birthweight, using Ordinary Least

Squares (OLS) regression.

7

I

,J

Variable Selection

Table 2: Heckman Selection Model Cor Weighed and Not Weighed Babies

Outcome

Coefficient (SE)Coefficient (SE)Total number of prenatal care visitTotal number of prenatal care visit squared

Household indexYears of education

Age of mother less than 25 yrs (baseline)Age of mother between 25-34 yrs

Age of mother 35 and olderBody mass indexBody mass index squared

Dummy if Body mass index is imputedHealthy mother (general health status)First birthPer capita expenditure below 25th percentile ofpopulation level (baseline)Per capita expenditure between 25th and so"percentile of population levelPer capita expenditure above 50th percentile ofpopulation level

Smoking behaviorMale baby

Singleton babyHaving pregnancy complicationYear of baby's birth

Cost of delivery of the babyConstant

o.oi 17 (0.0059)**

-0.0002 (0.0002)-0.0064 (0.0072)

0.0017 (0.0022)

0.0452 (0.0198)**

0.0832 (0.0304)***

0.0435 (0.0126)***-0.0006 (0.0002)**-0.0810 (0.1691)

-0.0098 (0.0296)

-0.0735 (0.0180)***

-0.0203 (0.0242)

-0.0349 (0.0237)

0.0931 (0.0948)

0.0838 (0.0169)"*

0.2911 (0.0489)"*0.0365 (0.0250)

-0.0158 (0.0051)"*

2.2066 (0.1719)***

0.1290 (0.0135)···-0.0034 (0.0005)··*

0.1848 (0.0229)**·

0.0497 (0.0075)···

0.1166 (0.0683)·

0.1391 (0.0997)

0.0145 (0.0638)0.0003 (0.0013)5.2062 (0.1852)···

0.0573 (0.0923)0.1096 (0.063)·

0.1377 (0.0722)*

0.0977 (0.0781)0.5679 (0.3416)*

0.0003 (0.0544)0.102 (0.1392)

-0.0456 (0.0886)-0.0009 (0.0181)

0.0000015 (0.0000005)"*-2.1115 (0.7648)*"

RhoWald test of independent equation (chi2)=0.63Probability> chi2 = 0.4289

Number of observation 4436 587

-0.0614 (0.0775)

••• Significant at a ) percent level; ** at a 5 percent level; * at a )0 percent level.

Additional (Sensitivity) Analysis

Variables, such as years of education, BMI, general health status, household index, and per

capita expenditure have missing information. Observations with missing information were

excluded from the regression analyses. As a sensitivity analysis, the model was re-estimated,

using imputed values for the missing data. The median value for continuous variables and zero

values for dummy variables were applied to replace the missing values. Dummy variables,

8

In this study, I use the Heckman selection model to test whether there is a potential selection bias

from babies that are not weighed. The results show that I did not find evidence of a selection

problem from some babies not having been weighed in Indonesia. The analyses, therefore, can

be continued on the sub-sample of live-birth babies with reported birthweight.

iildicnting that the data were missing, were included as additional explanatory variables. The

results of these sensitivity analyses were qualitatively the same as those reported. The estimate

lor rho indicates a weak correlation between selection and birth outcome (jj = -0.0718814 ).

The associated Wald-test of independence of equations is not statistically significant (chi ' =

0:.)'-;) with p-value = 0.3297. This indicates that the reported outcome equations are robust to

»ussingness and the evidence of the sample selection problem has not been found.

'Chapter Conclusion

This study has analyzed the potential selection problem that arises when some babies are not

'wcip,hed immediately after delivery in Indonesia. In other countries, there is a sample selection

oi[..; because some pregnancies do not end in live birth due to abortion decisions (pregnancy-

resolution bias), which are not random. In Indonesia, religious and cultural views indicate that

the pregnancy-resolution decision is less of a concern, but there is a similar sample selection

issue for unweighed babies.

One limitation has been noted in the analysis. The cost of delivery here is reported by the

women. It is difficult to use the cost of delivery that is measured at the community level because

the data on the community level is based on the sample of communities from IFLS 1993

(IFLSI). However, I use the data from IFLS 2007-08. There are many missing community data

for new respondents (not panel respondents) in IFLS 2007. Therefore, it is difficult to use any

variables from the community module and so the analysis uses the information of cost of

delivery that is reported by individual mothers,

9

- -

Reference

Frankenberg, E. & Thomas, D. 2001, 'Women's health and pregnancy outcomes: do servicesmake a difference?', Demography, vol. 38, no. 2, pp. 253-65.

Habibov, N.N. & Fan, L. 2011, 'Does prenatal healthcare improve child birthweight outcomes inAzerbaijan? Results of the national demographic and health survey', Economics andHuman Biology, vol. 9, no. 1, pp. 56-65.

Heckman, J.1. 1979, 'Sample selection bias as a specification error', Econometrica, vol. 47, no.1, pp. 153 - 61.

Liu, G.G. 1998, 'Birth outcomes and the effectiveness of prenatal care', Health ServicesResearch, vol. 32, no. 6, pp. 805-23.

Mwabu, G. 2009, 'The production of child in Kenya: a structural model of birth weight', Journalof African Economies, vol. 18, no. 2, pp. 212-60.

Rous, J.1., Jewell, R.T. & Brown, R.W. 2004, 'The effect of prenatal care on birthweight: a full-information maximum likelihood approach', Health Economics, vol. 13, no. 3, pp. 251-64.

Wooldridge, J.M. 2002, Econometric analysis of cross section and panel data, The MIT PressCambridge, Massachusetts, London.

World Health Organization 2005, The World Health Report 2005: Make every mother and childcount, World Health Organization, Geneva.

lO

,~

f,~r-,~,'

~ .-

Faculty of Economics and Business.UNIVERSITAS GADJAH MADA . -. .

This is certify that

Heni Wahyuni, Ph.D.

INNOVAIIINIIRACII NSPIR I

FEBUGM

1955-2015

/ /./,-;;="7._,< "'",/ /.,/ .«.•••

ti!;;"- ./ ~ (/"c....-1>-c..-v ~/' c..---.

I~ PaninBank TEl lJl1 1\ l1N61N'1f SIOOMUNCUI.

GATRA Indonesia... · 2016. 3. 3. · ~ SIDOMlINClI[GATRA ~ r ~ ~!I ,; G :. L :.. -. The...

Documents

Transcript of GATRA Indonesia... · 2016. 3. 3. · ~ SIDOMlINClI[GATRA ~ r ~ ~!I ,; G :. L :.. -. The...