Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household...

16
Contact Address: COMSATS Institute of Information Technology, Attock 43600, Pakistan. Phone: 0092 57 9316330. Panel Count Data Analysis of Determinants of Cigarette Smoking: Evidence from British Household Panel Survey Abstract Count Datasets contain relatively larger number of realizations of nonnegative integer- valued random variable. The nature necessitates application of regression models based on binomial probability distributions including Poisson, Zero-Inflated Poisson or Zero- Inflated Negative Binomial distributions to infer valid predictions. This paper will use micro-dataset from the British Household Panel Survey (Abbreviated and Well-known as BHPS) to identify factors determining the participation decisions to smoke at the individual level. It is evident from the results that the educated and married person’s participation to smoke is lower than the participation of divorced and separated individuals. Congruent participation to smoke of unemployed or newly employed is higher. The empirical results persuade a more details investigation in a comparable theme to contribute to scarcity of research in health economics to determine causal factors of smoking and formulate more effective health policies. Keywords: Panel Count Data, Binomial Distribution, Poisson distribution, Zero-Inflated Poisson Distribution, Zero-Inflated Negative Binomial Distribution, Smoking

Transcript of Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household...

Page 1: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Contact Address: COMSATS Institute of Information Technology, Attock 43600, Pakistan. Phone: 0092 57

9316330.

Panel Count Data Analysis of Determinants of Cigarette Smoking: Evidence from British Household Panel Survey

Abstract

Count Datasets contain relatively larger number of realizations of nonnegative integer-valued random variable. The nature necessitates application of regression models based on binomial probability distributions including Poisson, Zero-Inflated Poisson or Zero-Inflated Negative Binomial distributions to infer valid predictions. This paper will use micro-dataset from the British Household Panel Survey (Abbreviated and Well-known as BHPS) to identify factors determining the participation decisions to smoke at the individual level. It is evident from the results that the educated and married person’s participation to smoke is lower than the participation of divorced and separated individuals. Congruent participation to smoke of unemployed or newly employed is higher. The empirical results persuade a more details investigation in a comparable theme to contribute to scarcity of research in health economics to determine causal factors of smoking and formulate more effective health policies. Keywords: Panel Count Data, Binomial Distribution, Poisson distribution, Zero-Inflated Poisson Distribution, Zero-Inflated Negative Binomial Distribution, Smoking

Page 2: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

1. Introduction Numerous econometric models have been developed to analyse count data to identify factors affecting individual level participation decisions or incidence rates. Containing relatively a larger number of nonnegative integer-valued realizations is an interesting and a distinguished feature of the count data. Such characteristic of the count data necessitates modifying the classical ordinary least square regression models are based on normal distribution. Count data models make an allowance for conditioning the count on its mean to take specific values. Instead of simple linear regression models based on the normal distribution, count data models are based on Poisson, Zero-Inflated Poisson or Zero-Inflated Negative Binomial density functions to capture the conditioning nature of the data. Models based on Poisson and related distributions have been extended to time series, cross sectional or panel data to capture conditions and variations in many social and economic phenomena and its effects on the count dependent variable. Winkelmann (2008) has summarized a few important research areas where the use of count data techniques has been carried out including the analysis of accidents, issues in health economics, dynamics in the labour economics, studies in criminology and nature of tourism. In health economics the demand for health care in terms of determinants of visiting the doctors/clinic is the most commonly investigated field where count data techniques has been applied. Furthermore cigarette smoking has been studied using cross sectional or panel data from countries other than UK. Current paper is aimed to provide empirical evidence on what important labour market and socio-economic determinants of the individual’s smoke decision making. Panel count data techniques have been applied to deduce the results. The data has been derived from British Household Panel Survey. Further details are presented in the section devoted to data description and empirical results. To the best of authors’ knowledge, there is no piece of research employing the panel data from the UK to identify socio-economic determinants of smoking decisions. Rather most of the research is devoted to analyse demand for smoking in context of health consequences or in context of social or public policy perspective. The current research will contribute to the field of research by filling the gap in literature in terms of an application of fixed effects to the panel count data to identify the social and economic determinants of the participation decisions. It is thus complementing the research of Becker and Murphy (1988) analysing the addictive nature of smoking behaviour using panel dynamic data on the prices and consumption of cigarette. Considering the nature of smoking decisions, inclusion of socio-economic factors into the analysis is the special case where empirical work is very rare. Studies analysing individual level data to investigate determinants of the overall smoking decisions to identify different contexts of psychological behaviour, are important to develop anti-smoking policies (Bauer, et al. 2006). An overview of the available literature reveals that regression models based on either Poisson or Negative Binomial distribution will efficiently capture the impact significant socio-economic factors affecting smoking decisions of an individual. Models based on Poisson family of distributions have the problem of inconsistency if applied to over or under dispersed data as the Poisson regression is based on the assumption of

Page 3: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

equidispersion where mean equals variance of the count indicator. In case the assumption of equidispersion is violated, application of models based on the Negative Binomial regression can be implemented to estimate the parameter of the regression models. These models have the advantages over the standard Poisson regressions but are not free from limitations. It is discussed in more details in the section devoted to econometric modelling. 2. Literature review Literature on the use of count data models for estimation of the unknown parameters and inferences thereon has increased in the past decade. Count data models are applied to a range of data types including cross sectional, discrete longitudinal, dynamic panel and bivariate and univariate time series datasets. Molenberghs and Verbeke (2005) present a good overview of the models for discrete longitudinal data in a general context, discussing the application of count data models and present related computational issues to binary or even in incomplete longitudinal data. Xu et al. (2007) adds to literature listing the application of state space models to the longitudinal data with existence of serial correlation. Cameron and Travedi (1998) account the application to the univariate time series data while Denuit et al. (2007) presents a survey on the application of count data models for cross sectional data. Health economics and the related literature have rich examples of models developed for various types of count data models which are listed in the previous paragraphs. Cameron and Travidi (1998) has presented in a way that best describes the nature of count data techniques showing how to use such techniques to data in their well-known book titled “Regression analysis of the count data". Cameron and Travidi (1998) define that a non-negative valued variable might be considered as a count and that economic modelling of such type of data will necessitate estimation of the theoretical models and their parameters to draw inferences. Given the probability distribution for the count data, specification of such models involves nonetheless any other variable and the number of events is assumed to be independently and identically distributed (iid). This paper adds to the literature in health economics and public policy identifying important determinants of cigarette smoking in UK and utilizes panel data techniques to the data acquired from the British Household Survey from 2001 to 2009. The choice of technique is dependent on the nature of the count variable, if it has the characteristics describing the Poisson distribution in terms of equal mean and variance, then panel Poisson distribution could be a better choice. If any disparity exists in the mean and variance of the count variables, evidencing the existence of overdispersion or underdispersion, then negative binomial regression is applied to deduce the results. Further details on the issue are presented in the methodology section of the same paper. Individual smoking decisions are affected by socio-economic conditions which are also the major causes of stress to an individual in their personal and social lives. Castro et al. (1987) summarize the factors into four broad categories as potential, disruptive events, disposition or personality traits and social factors. Social factors lead to stress in the sense that individuals have less access to routine activities into their household and market productivity related activities (Cervantes and Castro, 1985). Disruptive life events include family or household characteristics like conflicts in parents’ marital lives,

Page 4: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

socio-economic status and religion which antecedent might consider before committing smoking decisions as adolescents relate such factors with their efforts in coping and relaxation (Russell, 1979). Furthermore smoking decisions might be a cause of low level of education, lower expectations from future achievements and short term future perspective. It is also expected that individual believing in self-image promoted through smoking and or smoking might also be the cause of some chronic diseases (Brunswick and Messeri, 1984; Horn, 1979 and Weinstein, 1982). Individual’s decisions to smoke is also motivated by one’s behaviour that can be summarized into rebelliousness and socially nonconforming activities with an attitude towards risk-taking attitude like using drugs and other pharmacological agents, also affects one decisions to smoke (Jessor and Jessor, 1977; Williams, 1973 and Huba and Bentler, 1982). Various examples are present in studies in Social Psychology on factors causing stress and depression. Smoking has been studied in a similar context to identify social, economic and health related factors. Health related determinants include some negative effects of smoking like diseases as a result of smoking and its effect on the passive smokers on the assumption that those who have acquired these diseases will be more cautious of the effects and will be less likely to smoke (Office of the U.S. Surgeon General, 2010). The effects and causal nature of health related issue is important from perspective of its direct and indirect economic costs to the smoker. The chances of higher costs to visit the medical practioners and other heads will contribute to the higher per month cost of living (Kopstein, 1984). Further smoking has been investigated as remedial in cases when the individuals are depressed are anxious and results indicates relationship between smoking and subclinical depression and anxiety (Morrell and Cohen, 2006). It has been developed that stress and anxiety is also developed by degraded social and economic conditions which the individuals presumably trying to relax while smoking. Such factors include financial condition, type and level of education, types of jobs, likeliness of the individuals of the neighbourhood (Duncan et al. 1999), family background and peers effect on the development of habitual smoking including fathers’ and also include mothers’ social and economic class, person’s own household behaviour including relationships with the spouse and other family members etc (Powell et al., 2005). These factors are included in our study to identify significance in context to estimate prevalence of smoking. Furthermore the public policies are affecting individuals smoking behaviour or the extent of smoking in terms of number of cigarettes smoked per day. It depends on where he performs his work as if he works at the employers premises, more chances are there that the rate will be low as compared to his work at home, travelling and other locations (Borland et al., 1990; Farkas et al., 1999 and Evan et al., 1998). Research has been conducted to investigate the efficacy of policies implementations checking the suitability of actions like smoking bans in public place with promotions of not to smoke in front of kids due to several social and health effects on them. These factors have been included in the current study and categories for workplace locations and presence of children in the family are utilized to check the smoking patterns in such conditions.

Page 5: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Although a tendency toward smoking exists as a quick relief from stress by lighting up, there is no empirical evidence that nicotine alleviates stress. In fact, studies have shown the opposite; that smokers have higher stress levels than non-smokers (Warburton, Revell, & Thompson, 1991; Jones & Parrott, 1997; West, 1992). Research shows that adult smokers experience periods of heightened stress between cigarettes, and that smoking only restores their stress levels to normal (Schachter, 1978; Hughes, Higgins, & Hatsukami, 1990; Parrott, Garnham, Wesnes, & Pincock, 1996 and Office of the U.S. Surgeon General, 1988). However, soon after smoking, they will require another cigarette to keep their stress at normal levels because if they don’t they will experience the stress that comes from nicotine withdrawal. Research suggests that smoking does nothing to relieve stress and can only contribute to increase its prevalence (Parrott, A.C.1999). Boucher & Guillen (2009), Cameron and Travidi (1998), Hausman et al. (1984) and Winkelmann (2008) summarises issues in the uses of panel count techniques as methods for panel count data differ from standard count data models in at least one of three ways. First, they address the non-standard form of the covariance matrix of the observations that arises since the assumption of independent observations is most likely invalid. Second, and relatedly, they provide a richer framework for addressing the issue of unobserved heterogeneity than do univariate count data. In particular, dependence between the unobserved heterogeneity and the regressors is no longer excluded. Third, models for panel count data allow the introduction of dynamic elements, such as a lagged dependent variable, into the regression part of the model. Further it is also observed that panel methods also differ from multivariate count data models in that panel methods are somewhat more restrictive in their covariance structure, as they frequently assume that dependence is generated by unobserved heterogeneity that is specific to the individual but constant over time. Secondly, panel data models explicitly consider the possibility that the unobserved individual heterogeneity factor is correlated with one or more explanatory variables. In this situation, conditional models are required. The current paper is restricted to the application of recently developed econometric techniques for panel data assuming that such models are consistent and efficient in estimating the parameters of the models, the only objective of current investigation to shed light on some important socio-economic determinants of smoking decisions. 3. Econometric Strategy As the nature of the data in the area is of special nature so the count data models were modified to take a deeper account of the nature of the data giving birth to the methods of generalized linear models including Poisson and Negative Binomial probability distributions. Poisson distribution is based on the probability of the dependent variable conditioned on some observed and/or unobserved characteristic of the ith observation. It turns naturally into a standard nonlinear regression with the count variable depending on some exogenous regressors in the system which results in a standard Poisson model. Panel data estimation of the count data is well documented by Boucher & Guillen (2009), Cameron and Travidi (1998), Hausman et al. (1984) and Winkelmann (2008). Hilbe (2011) documents the usage and extensions of negative binomial regression. The paper follows Stata commands from Hilbe (2011) for reporting results from the sample.

Page 6: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

3.1. Fixed Effects Poisson Cameron and Travidi (1998) describe the estimation and promotion of fixed effects Poisson regression models for the panel count data. Following standard explanations of fixed effects negative binomial from Allison and Waterman (2002) and Guimaraes (2008), the paper is based on the analysis of the dependent variable in the study

with data on individuals across time has a Poisson

distribution with and depends on the vector of included exogenous variables

Log-linear form of the function is given as: (1)

with is the fixed effect parameter.

The Poisson model can be estimated using conventional approaches using maximum likelihood with including dummies for all the individuals (minus one) to estimate fixed effects model directly. Alternatively conditional maximum likelihood approach can be applied to estimate the model conditioning on the count total for each individual

that yields into a conditional likelihood for the Poisson to be proportional to (2) and is an equivalent of multinomial logit model for grouped data.

(2)

The conditioning eliminates the the parameters of fixed effects from the likelihood

function. Like logistic regression modelling, the inclusion of dummy variables into the fixed effects models results in inconsistent estimates of (Hsiao, 1986) in form of

incidental parameters problem as given by Kalbfleisch and Sprott in 1970. Conditional estimation has been reported as free from such problems (Allison and Waterman, 2002) and the estimates in both the unconditional and conditional models are similar with

associated covariance matrix (Cameron and Travidi, 1998). The fixed effects model controls for unrestricted heterogeneity across the individuals and for the given individual restriction is put on the model to strictly use equality of mean and variance: (3)

with many datasets, additional heterogeneity may not be accounted for by such models. 3.2. Fixed Effects Negative Binomial Hausman, et al. (1984) suggested fixed effects model is not a true fixed effects model and that it deals with the problem of overdispersion on the assumption that the has

the negative binomial distribution (Allison and Waterman, 2002). The model of Hausman, et al. model is based on the density function given as:

(4)

Where represents a gamma function. is assumed to remain constant over time for

each individual while is dependent on the covariates given in the form of:

(5)

The decomposition decision of in form of a function of covariates is surprising, since

it is regarded as a parameter of overdispersion. The density function in (4) becomes Poisson density function as

Mean and variance of is given as:

Page 7: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

(6)

For any individual , is assumed independent over time, implying that has

also a negative binomial distribution with parameters and Condition on the

two parameters of total counts, the likelihood function for a single individual is given as:

(7)

which indicates the elimination of parameters and the likelihood for the complete

sample can be obtained from the multiplication of all individual terms as is in (7). Maximization of the likelihood can then be carried out with respect to parameters

using traditional numerical approaches. The current paper reports results obtained from the use of Stata version 10.1 from the Stata Corp (www.stata.com). The computation of the above technique is not free from pitfalls as it does not really fits the fixed effects methods. Problem lies with the conditioning out of the parameters

from the likelihood does not correspond to different intercepts in the following log linear decomposition of Hausman et al. suggest to write the , where

(5) and (6) implies

The model thus allow for an arbitrary intercept for each individual, where the

approach has problem in that the plays a different role than and changes in

affect mean variance directly and indirectly respectively through changes in mean, unless the is assumed omitted variable effect, then no compelling reason exists in the

variables having a different sort of effect that from

The use of Hausman et al. (1984) has a problem that such regression model can be estimating using both an intercept and time-invariant regressors, which is not permissible with conditional fixed effects models. Alternative parameterization has been suggested in the literature with more generalization of the Poisson model. The probability mass function for such generalizations has been given as:

(8)

with corresponding mean and variance as:

(9)

Here the mean is allowed to vary but the parameter of overdispersion , is assumed

to remain constant for each individual. Modelling dependence on covariates results:

It is the NB2 model of Cameron and Trivedi (1998) to be distinguished from previous NB1 in (6). If the event counts are assumed independent across time for each individual, then the above model is intractable for deriving conditional likelihood, because of

which has not the negative binomial distribution. A complete and sufficient

statistic is lacking in the current specification for the that is a function of the data

alone.

Page 8: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

The paper discusses results obtained from the xtgee and xtnbreg commands of Stata and rest of the analysis is omitted to preserve space. Complete results can be obtained from the authors upon request. More details are presented in the following sections. 4. Data Description The British Household Panel Survey (BHPS) is being carried out by the Longitudinal Studies Centre of the Economic and Social Research Council, UK in collaboration with the Institute for Social and Economic Research (ISER) at the University of Essex. First wave was completed in 1991 and afterwards it has been conducted annually with the latest completed wave 18 in 2009 and published in 2010. The main objective of the survey is to promote understanding of social and economic dynamics at individual and household level in Britain, to identify, model and forecast social and economic changes, to evaluate their causes and consequences in relation to a range of socio-economic indicators. The paper utilizes data on smoking and socio-economic and health related predictors from the British Household Panel Survey for the year 2002-2009 matching with corresponding waves 11 to 18 of the survey. The sample is an unbalanced panel of 6619 individuals observed from 1 to 9 years during the time period, 2002-2009 with total observations equal to 25181. Given the panel nature of the data, it is observed that mean age of sampled individual is just above 39 years when excluding individuals of ages less than 16 years and exceeding 65 year respectively on the occasion that included individual will not be active labour market participants. Individuals are found to smoke about 15 cigarettes per week with variance 8.69 where variance is lower than the average number of smoked cigarette and making the data possibly under dispersed as a presumption. The underdispersion nature of the data will be formally checked using the suggested procedures by Hilbe (2011). The sample include 48% male and remaining 52% are female, as our analysis is based on fixed effects analysis of the data, the inclusion of gender variable is seemingly inappropriate but it will help to compare the sample pattern across the two groups. Beside the above personal level characteristics, the analysis includes work related determinants like income, education, full time employment, management level role and workplace as proxy for checking the effects of public place smoking bans. The analysis includes indicators for social behaviour of the individuals. The indicators include neighbourhood, children at home, marital status and housing tenure. It has been observed that above 90% or more people likes to dwell in the current neighbourhood with around 60% of them have their own houses or mortgaged. 25% of the sample has a child in the household with 12% of the individuals have educational qualification at a degree level. Table (1) summarizes results for the pooled sample and conditional fixed effects negative binomial (NB-2). Complete results are available with the author.

Page 9: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Table 1: Descriptive Statistics

variable mean Standard Deviation

Overall Between Within

Smoking 15.0195 8.717 8.139 3.825 Age 39.3396 12.97 13.47 1.862 Squared Age/100 17.1580 10.59 10.83 1.58 Financial Status 3 0.1930 0.395 0.323 0.268 Financial Status 4 0.3818 0.486 0.371 0.368 Financial Status 5 0.2973 0.457 0.348 0.349 Financial Status 6 0.1278 0.334 0.277 0.237 Household Size 1 0.1192 0.324 0.287 0.145 Household Size 2 0.8808 0.324 0.287 0.145 male 0.4640 0.499 0.499 0

Likes Neighbourhood 0.9043 0.294 0.248 0.198 Children At Household 0.2546 0.436 0.393 0.198 Degree Level Qualification 0.1290 0.335 0.342 0.061 Anxious 0.1389 0.346 0.294 0.204 Feels Under Strain 4 0.1610 0.367 0.309 0.258 Feels Under Strain 5 0.5216 0.5 0.379 0.385 Feels Under Strain 6 0.2535 0.435 0.331 0.337 Feels Under Strain 7 0.0639 0.245 0.189 0.184 Private Sector Employee 0.4514 0.498 0.45 0.251 Fulltime Employee 0.5324 0.499 0.456 0.239 Work at Employer Premises 0.4802 0.5 0.447 0.264 House Ownership 0.5936 0.491 0.472 0.182

Hospital Visits 2 0.6085 0.488 0.386 0.356 Hospital Visits 3 0.2287 0.42 0.307 0.342 Hospital Visits 4 0.0918 0.289 0.214 0.232 Hospital Visits 5 0.0710 0.257 0.196 0.192 Financial Expectations 3 0.3513 0.477 0.392 0.338 Financial Expectations 4 0.0634 0.244 0.172 0.196 Financial Expectations 5 0.5454 0.498 0.393 0.367 Year 2002 0.1675 0.373 0.339 0.315 Year 2003 0.1409 0.348 0.203 0.321 Year 2004 0.1355 0.342 0.184 0.321 Year 2005 0.1250 0.331 0.159 0.313 Year 2006 0.1177 0.322 0.152 0.305 Year 2007 0.1133 0.317 0.159 0.298

Year 2008 0.1040 0.305 0.165 0.284 Year 2009 0.0961 0.295 0.184 0.268 Note: N is 25181, n is 6619 and T-bar is 3.80435

Page 10: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

5. Econometric Results: The analysis begins with testing the presence of over or underdispersion in the data. Following Hilbe (2011), first unconditional negative binomial regression is estimated to obtain the value of alpha (0.261), which is used to control for the dispersion. The dispersion parameters obtained from the product of (1/df) and Deviance or Pearson coefficient is 1.04 or 0.81, indicating the dispersion has been controlled to an acceptable extent, that around 5% above the dispersion equal to 1 (Hilbe, 2011). It is notable that from the summary statistics it seemed that the data might be underdispered but proper follow up tests indicated that it is actually overdispersed, and the dispersion has been controlled with including the value to of alpha into the GLM model. The results indicate using Negative Binomial is the appropriate technique that can be used to investigate the determinants of smoking. Literature reveals various versions of the negative binomial regression models. Most commonly applied version in panel data literature is the conditional fixed effects negative binomial (NB-2) a common choice of the econometricians and GLM version of negative binomial (NB-1) is the choice of statisticians (Cameron & Travedi, 1986, 1998 and Greene, 2008). Greene (2008) further notes that Cameron and Travedi (1986 & 1998) has no preference for the NB-1 or NB-2 models as these models are non-nested, no parametric test exists to suggest more preferred model from the NB-1 or NB-2. Moreover Greene (2008) suggest using NB-P model which has superiority over the NB-2 model but there is only little effects on the coefficient estimates using NB-P model, hence the choice depends on the computational ease. Furthermore as the current study employs population averaged and conditional fixed effects negative binomial (NB-2) as the appropriate and computationally easy model. Table 2 indicates the effect of age is quadratic on the smoking decisions in terms of incidence rates which changes non-linearly with increasing age. It is observed that the incidence rates is first increasing rapidly and then declines with higher ages which confirms the results that smoking is less prevalent among the elder portions of populations. One aspect of such evidence might be that older people are more mature and they care more about factors that might negatively affect their health. Another significant finding that is evident from the results is that anxiety causes more smoking prevalence in terms of incidence rate. One explanation that is present in case of higher smoking among anxious people that they are more likely feeling depressed and they relax theirselves while smoking which is usually preferred among those who want to relieve depression. To enforce this finding the study incorporates strain as a proxy for the depression parameter of smoking. Strained individuals are found smoking more than those who are less strained. The two indicators indicate that more depressed persons are usually found more smoking as compared to those who are less depressed. Further research will provide evidence what are the social or economic factors causing more depression. The results are confirming the evidence from Ikard, Green and Horn (1969) stating that cigarettes are either pleasurable or relieving. The results further confirm the finding that smoking changes the mode of smokers and calms the smokers by reducing the anger of smokers (Warburton, 1992, p. 57).

Page 11: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Table 2: Population Averaged and Fixed Effects Negative Binomial Regression Results PAa Poisson FEb Poisson PA Negative

Binomial FE Negative Binomial

Male 1.128*** 1.122*** (8.91) (8.56) Age 1.052*** 1.077* 1.050*** 1.082*** (17.92) (2.56) (17.66) (13.26) Squared Age/100 0.951*** 0.961*** 0.953*** 0.961*** (-14.36) (-10.50) (-13.97) (-9.46) Financial Status 0.992 1.000 0.990 1.001 (-1.34) (-0.09) (-1.77) (0.16) Financial Expectations 0.998 1.001 0.999 1.001 (-0.39) (0.16) (-0.15) (0.11) Living with spouse 0.991 0.991 0.991 0.994 (-0.79) (-0.92) (-0.84) (-0.59) Child Under 12 Years 1.013 1.004 1.013 1.004 (1.18) (0.48) (1.17) (0.36) Degree Level Qualification 0.778*** 0.943 0.777*** 0.963 (-12.05) (-1.89) (-12.09) (-1.07) Likes Neighbourhood 0.969** 0.985 0.970** 0.989 (-3.02) (-1.86) (-2.91) (-1.25) Anxiety 1.042*** 1.022** 1.045*** 1.017* (3.84) (2.83) (4.17) (1.96) Feels Under Strain 1.062*** 1.055*** 1.060*** 1.048*** (4.34) (6.35) (4.27) (4.95) Hospital visits 3 0.984** 0.982*** 0.984** 0.981*** (-2.78) (-3.61) (-2.74) (-3.51) Hospital visits 4 0.960*** 0.952*** 0.962*** 0.950*** (-4.35) (-6.63) (-4.12) (-6.27) Hospital visits 5 0.970** 0.959*** 0.971* 0.958*** (-2.62) (-4.81) (-2.47) (-4.44) Full Time 1.003 1.013 1.004 1.014 (0.33) (1.63) (0.43) (1.64) Work Premises 0.963*** 0.992 0.963*** 0.993 (-4.37) (-1.11) (-4.35) (-0.79) Private Sector 1.016 1.005 1.017 1.007 (1.73) (0.66) (1.90) (0.81) House Ownership 0.931*** 0.992 0.930*** 1.003 (-7.51) (-0.92) (-7.53) (0.30) 2003 0.962 0.956*** (-1.31) (-5.10) Intercept 4.798*** 4.922*** 7.717*** (29.26) (30.52) (12.13) AICc . 98107.131 . 97577.971 SBICd . 98300.376 . 97779.269 Wald Chi2 1065.203 275.120 1088.409 334.978 p-value 0.000 0.000 0.000 0.000 N 25181.000 23200.000 25181.000 23200.000 a. Population Averaged b. Fixed Effects c. Akaike Information Criteria d. Schwarz Bayesian Information Criteria

is aimed only to identify some important socio-economic and health related indicators which affect individual’s decisions to smoke in terms of incidence rates, the study

Page 12: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Moreover it is clear from the Table (2) that individuals who have more frequent visits to hospitals or health centres are found less smoking and main reason for their less smoking prevalence is that doctors usually prohibits from smoking whenever one has to complete a medical course for treatment. Also the rate is less among those visitors and they are usually more sensitive when they have some medical issue and it is one of the main reason that they smoking less than their usual smoking habits. One last significant finding from the analysis appears to be the effect of ban on smoking in public premises as indicated from the proxy for the determinant. The study includes working in a employers’ premises and there is ruling in the UK that smoking free work environment should be provided. The people who usually work at the employers’ premises are found less frequent smokers as they have to spend more time at work premises. This is usually because of the reason that employers have to maintain smoking free work environment and they do not allow smokers to smoke frequently with breaks from the job. Although the effect of financial variable on smoking is not statistically significant but the results are economically viable, better financial status has a negative effects on individual preferences to smoke and those who have no better financial expectations from the future are found more frequent smoking, possibly due to expected hardships in the future. It would be interesting if research identifies the relationship between future expectations, depression and anxiety and smoking. All this is logically viable and the results indicate importance of economic conditions on the individuals’ preferences to smoke or not to smoke. The incidence rate increases if one has worst expected future and the incidence rate increases if they have better financial status. It is further evident from the results that if a person has better financial condition, it is expected that they would have their own houses and the effect of house ownership is clearly negative on smoking prevalence in terms of less incidence rate of smoking. Conclusion The count nature of the data requires careful application of econometric techniques for the prediction of the effects of included determinants on dependent variables. It is clear from the results that aged persons have less prevalent smoking and smoking prevalence decreases in terms of lower incidence rate ratio when they are less depressed as indicated by the significant effect of anxiety and stain on smoking. Moreover the results indicate that individuals who have frequent visits to hospitals are found less frequently smoking as they usually are sensitive when they have some medical problems and visits hospitals for treatment. The study has implications for

Page 13: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

References: Andrea N. Kopstein, Dorothy P. Rice, Thomas A. Hodgson, Peter Sinsheimer, Warren Browner, 1984, The Economic Costs of the Health Effects of Smoking, The Milbank Quarterly, Vol. 64, No. 4, pp. 489-547 Farkas, A.J., Gilpin, E.A., Distefan, J.M., and Pierce, J.P., 1999, The effects of household and workplace smoking restrictions on quitting behaviours, Tobacco Control Vol. 8:261-265 Baltagi, B. H., and Griffin, J, M., 2001, "The Econometrics of Rational Addiction: The Case of Cigarettes," Journal of Business and Economic Statistics, American Statistical Association, vol. 194, pages 449-54 Bauer, T., Göhlmann, S., and Sinning, M., 2006, "Gender Differences in Smoking Behaviour, "CEPR Discussion Papers 5848, C.E.P.R. Discussion Papers. Becker, G. S., and Murphy, K. M., 1988, A Theory of Rational Addiction..Journal of Political Economy, Vol. 96 4, p 675-700 Borland, R., Chapman, S., Hill, D., Neville, O., 1990, Effects of workplace Smoking Bans on Cigarette Consumption, American Journal of Public Health, 178-180. Boucher, J.P., and Guillen, M., 2009, A survey on models for panel count data with applications to insurance, RACSAM. Rev. R. Acad. Cien. Serie A. Mat. VOL. 103 2, pp. 277–294 Brunswick, Ann F., and Peter Masseri. 1984. "Gender Differences in the Process Leading to Cigarette Smoking." Journal of Psychosocial Oncology 2:49-69. Cameron, A., Trivedi, P., 1986. Econometric models based on count data: comparisons and applications of some estimators and tests. Journal of Applied Econometrics 1, 29–54. Cameron, C., Trivedi, P., 1990. Regression-Based Tests for Overdispersion in the Poisson Regression Model. Journal of Econometrics, 46, pp. 347–364. Cameron, C., Trivedi, P., 1998. Regression Analysis of Count Data. Cambridge University Press, New York. Cameron, C., Trivedi, P., 2005. Microeconometrics: Methods and Applications. Cambridge University Press, Cambridge. Castro, F. G., Maddahian, E., Newcomb, M.D., Bentler, P. M., 1987, A Multivariate Model of the Determinants of Cigarette Smoking Among Adolescents, Journal of Health and Social Behavior, Vol. 28, pp. 273-289 Cervantes, R.C., and Castro, F.G., 1985. "Stress, Coping and Mexican American Mental Health: A Systematic Review." Hispanic Journal of Behavioral Sciences. 7:1-73. Chaloupka, F., 1991, Rational Addictive Behaviour and Cigarette Smoking, Journal of Political Economy, Vol. 99 4, p 722-42 DENUIT, M., MAR´E CHAL, X., PITREBOIS, S. AND WALHIN, J.-F., 2007. Actuarial Modelling of Claim Counts: Risk Classification, Credibility and Bonus-Malus Scales, Wiley, New York. Duarte, R. and Escario, J.J., 2006, "Alcohol abuse and truancy among Spanish adolescents: A count-data approach," Economics of Education Review, Elsevier, vol. 252, pages 179-187

Page 14: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Duncan, C., Jones, K. and Moon, G., 1999, Smoking and deprivation: are there neighbourhood e?ects?, Social Science and Medicine 48, pp. 497-505 Evans, W., Farrelly, M. and Montgomery, E., 1999. ‘Do Workplace Smoking Bans Reduce Smoking?’, American Economic Review 894: 728-47 Farrell, L., Tim R. L. F., and Harris, M. N., 2003, "A Pack A Day For Twenty Years: Smoking And Cigarette Pack Sizes,” Department of Economics - Working Papers Series 887, The University of Melbourne. Greene, W. H., 1994. "Accounting for Excess Zeros and Sample Selection in Poisson and Negative Binomial Regression Models," Working Papers 94-10, Leonard N. Stern School of Business, Department of Economics, New York University Greene, W., 2008. “Functional forms for the negative binomial model for count data,” Economics Letters Vol. 99, pp. 585–590 Greene,W., 2008. Econometric Analysis, 6th ed. Prentice Hall, Englewood Cliffs. Greene, W., 2007. LIMDEP 9.0 Reference Guide, Econometric Software, Inc. Plainview. Gruber, J., and Zinman, J., 2000, Youth Smoking in the U.S.: Evidence and Implications. National Bureau of Economic Research, Inc, NBER Working Papers: 7780 Hausman, J., Hall, B., Griliches, Z., 1984. Economic models for count data with an application to the patents–RandD relationship. Econometrica 52, 909–938. Hilbe, J., 2011. Negative Binomial Regression, 2nd Edition. Cambridge University Press, Cambridge, UK. Horn, D. 1979. "Psychological Analysis of Establish- ment and Maintenance of the Smoking Habit." Pp. 24-29 in Cigarette Smoking as a Dependence, Process edited by N.A. Krasnegor Research Monograph No. 23. Rockville, MD: National Institute on Drug Abuse. Huba, G.J., and Bentler, P.M., 1982. "A Developmental Theory of Drug Use: Derivation and Assessment of a Causal Modeling Approach." Pp. 147-203 in Life-Span Development and Behavior, Volume 4, edited by P.B. Baltes and O.G. Brim Jr. New York: Academic Press. Hughes, J. R., Higgins, S. T., and Hatsukami, D. K. 1990. Effects of abstinence from tobacco: A critical review. In: L. T. Kozlowski, H. Annis, H. D. Cappell, F. Glaser, M. Goodstadt, Y. Israel, H. Kalant, E. M. Sellers, and J. Vingilis Eds., Research advances in alcohol and drug problems Vol. 10, pp. 317–398. New York: Plenum. Ikard , F. F. , Green , D. E. , and Horn , D. 1969 . A scale to differentiate between types of smoking as related to the management of affect . International Journal of the Addictions , 4 , pp. 649 – 659 . Jessor, R. and Jessor, S.L., 1977. Problem Behavior and Psychosocial Development: A Longitudi- nal Study of Youth. New York: Academic Press Johnson R. A., and Hoffmann, J. P., 2000, Adolescent Cigarette Smoking in U.S. Racial/Ethnic Subgroups: Findings from the National Education Longitudinal Study, Journal of Health and Social Behaviour, Vol. 41, No. 4, pp. 392-407 JONES, M. E. E. and PARROTT, A. C. 1997, STRESS AND AROUSAL CIRCADIAN RHYTHMS IN SMOKERS AND NON-SMOKERS WORKING DAY AND NIGHT SHIFTS. Stress Medicine, 13: 91–97. Kan, K., 2007, Cigarette Smoking and Self-Control, Journal of Health Economics, Vol. 26 1, p 61-81. Lisa M. P., Tauras, J.A., and Ross, H., 2005, The importance of peer effects, cigarette prices and tobacco control policies for youth smoking behavior, Journal of Health Economics 24, pp. 950–968

Page 15: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

MOLENBERGHS, G. AND VERBEKE, G., 2005. Models for Discrete Longitudinal Data, Springer. Morrell, H. E. R., Cohen, L. M., and Dempsey, J. P. 2008. Smoking Prevalence and Awareness among Undergraduate and Health Professional Students. The American Journal on Addictions, 17, 181-186 Mullahy, J., 1997, "Instrumental-Variable Estimation Of Count Data Models: Applications To Models Of Cigarette Smoking Behaviour." The Review of Economics and Statistics, MIT Press, vol. 794, pages 586-593 Office of the U.S. Surgeon General, 1988, The Health Consequences of Smoking: Nicotine Addiction: A Report of the Surgeon General, DEPARTMENT OF HEALTH AND HUMAN SERVICES, Public Health Service, Office of the Surgeon General, Rockville, MD Office of the U.S. Surgeon General, 2010, A Report of the Surgeon General: How Tobacco Smoke Causes Disease: The Biology and Behavioral Basis for Smoking-Attributable Disease,U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES, Public Health Service, Office of the Surgeon General, Rockville, MD PARROTT, A. C., GARNHAM, N. J., WESNES, K. and PINCOCK, C. 1996, Cigarette Smoking and Abstinence: Comparative Effects Upon Cognitive Task Performance and Mood State over 24 Hours. Human Psychopharmacology: Clinical and Experimental, 11: 391–400. Parrott, Andy C., 1999, Does cigarette smoking cause stress? American Psychologist, Vol 5410, pp. 817-820 Patton GC, Carlin JB, Shao Q, et al. 1997, Adolescent dieting: health weight control or borderline eating disorder? J Child Psychol Psychiatry, Vol. 38, pp. 299-306. Patton, et al. 1998, Depression, Anxiety, and Smoking Initiation: A prospective study over 3 years, American Journal of Public Health, Vol. 88, No. 10, pp. 1518-1522 Russell, M.A.H. 1979. "Tobacco Dependence: Is Nicotine Rewarding or Aversive?" Pp. 100-22 in Cigarette Smoking as a Dependence Process, edited by N.A. Krasnegor. Research Monograph No. 23. Rockville, MD: National Institute on Drug Abuse. Sander, W., 1995, Schooling and Quitting Smoking, Review of Economics and Statistics. Vol. 77 1, p 191-99 Schachter, S., 1978, Pharmacological and physchological determinants of smoking, Annals of Internal Medicine, vol. 88 no. 1, pp. 104-114 Taylor, M.F., Ed. Brice, J., Buck, N., and Prentice-Lane, E., 2010. British Household Panel Survey, User Manual Volume A: Introduction, Technical Report and Appendices. Colchester: University of Essex. Terza, J. V., 1998, "Estimating count data models with endogenous switching: Sample selection and endogenous treatment effects," Journal of Econometrics, Elsevier, vol. 841, pages 129-154 WARBURTON, D. M., REVELL, A. D. and THOMPSON, D. H. 1991, Smokers of the future. British Journal of Addiction, 86: 621–625 Warburton, D.M., 1992, Smoking within reason, Journal of Smoking related disorders, Vol. 3, pp. 55-59 Wasserman, J., Manning, W. G., Newhouse, J. P., and Winkler, J. D., 1991, "The effects of excise taxes and regulations on cigarette smoking," Journal of Health Economics, Elsevier, vol. 101, pages 43-64 Weinstein, Neil D. 1982. "Unrealistic Optimism About Susceptibility to Health Problems." Journal of Behav- ioral Medicine 5:441-60 West, S., 1992, Does Smoke Get in Your Eyes? The journal of American medical Association, Vol 268, No. 8, pp. 1025-1026.

Page 16: Analysis of Cigarette Smoking Using Fixed Effects Count Data Model Evidence From British Household Panel Survey

Williams, A.F., 1973. "Personality and Other Characteristics Associated with Cigarette Smoking Among Young Teenagers". Journal of Health and Social Behavior 14:374-80. Winkelmann, R., 2003. Econometric Analysis of Count Data, 4th ed. Springer Verlag, Heidelberg. Winkelmann, R., Zimmermann, K., 1991. A new approach for modeling economic count data. Economics Letters 37, 139–143. Winkelmann, R., Zimmermann, K., 1995. Recent developments in count data modeling: theory and application. Journal of Economic Surveys 9 (1), 1–36. Winkelmann, R., Zimmermann, K., 1995. Recent developments in count data modeling: theory and application. Journal of Economic Surveys 9 1, 1–36. Wooldridge, Jeffrey, 2002. Econometric Analysis of Cross Section and Panel Data. MIT Press, Cambridge, MA. Wooldridge, Jeffrey, 2002. Econometric Analysis of Cross Section and Panel Data. MIT Press, Cambridge, MA. XU, S., JONES, R. AND GRUNWALD, G., 2007. Analysis of Longitudinal Count Data with Serial Correlation, Biom. J., 49(3), 416–428.