Hybrid discrete choice models: gained insights versus increasing...

Hybrid discrete choice models: gained insights versus

increasing effort

Petr Mariel∗

Department of Applied Economics III (Econometrics and Statistics), University of the Basque Country

Avda. Lehendakari Aguirre, 83

E48015 Bilbao, Spain

E-mail: [email protected]

Tel: +34.94.601.3848

Fax: +34.94.601.3754

Jürgen Meyerhoff a,b

a) Institute for Landscape Architecture and Environmental Planning

Technical University of Berlin

D-10623 Berlin, Germany

E-mail: [email protected]

b) The Kiel Institute for the World Economy, Duesternbrooker Weg 120, 24105 Kiel, Germany

∗ Corresponding author

1

mailto:[email protected]

mailto:[email protected]

Abstract

Hybrid choice models expand the standard models in discrete choice modelling by

incorporating psychological factors as latent variables. They could therefore provide further

insights into choice processes and underlying taste heterogeneity but the costs of estimating

these models often significantly increase. This paper aims at comparing the results from a

hybrid choice model and a classical random parameter logit. Point of departure for this

analysis is whether researchers and practitioners should add hybrid choice models to their

suite of models routinely estimated. Our comparison reveals, in line with the few prior studies,

that hybrid models gain in efficiency by the inclusion of additional information. The use of one

of the two proposed approaches, however, depends on the objective of the analysis. If

disentangling preference heterogeneity is most important, hybrid model seems to be

preferable. If the focus is on predictive power, a standard random parameter logit model

might be the better choice. Finally, we give recommendations for an adequate use of hybrid

choice models based on known principles of elementary scientific inference.

Keywords: discrete choice, hybrid choice model, land use, random parameter logit, marginal

willingness to pay, latent variable

JEL: Q51, C35

2

1. Introduction

Hybrid Choice Models (HCM) have recently become more popular in discrete choice

modelling as they expand standard choice models by incorporating psychological factors that

may affect decision making. Generally, HCMs extend the specification of the traditional

random utility model (RUM) by incorporating additional decision protocols in order to relax

the simplifying assumptions and enrich the underlying behavioural characterizations. These

extensions comprise, among others, flexible disturbances (e.g., factor analytic) to mimic more

complex error structures and to allow for the explicit modelling of latent psychological factors

such as attitudes (Ben-Akiva et al., 2002). This paper aims at contributing to the current

literature by investigating whether HCMs provide further insights that justify the higher costs

of estimation.

The occasion for this question is the experience gained from the development and

estimation of a couple of latent variable models (Bartczak et al., 2015; Hoyos et al., 2015;

Mariel et al., 2015). In all studies we found that the modelling process was very complex and

costly because of the high number of coefficients and the complex likelihood function with

numerous local maxima that make its maximization tricky. On the other hand, the models

resulted in new insights compared to more conventional models like the Random Parameter

Logit (RPL) or the Latent Class Model. Thus, more knowledge about the potential gains of

HCMs seems to be valuable as the majority of choice experiment applications nowadays apply

routinely approaches capable of capturing unobserved taste heterogeneity such as RPL or

latent class models. Given this, the question is whether researchers and practitioners mainly

interested in the outcome of choice experiments should also move ahead and include HCM to

the suite of models routinely estimated as discrete choice models are being increasingly used

in environmental valuation studies (see e.g. Can and Alp, 2012 or Justes et al.,2014).

3

Accordingly, in this paper we focus on a closer comparison of the results from an HCM

and those from an RPL. As case for the comparison we choose to analyse the effect of different

design dimensions on the propensity to select the status quo (SQ) option in discrete choice

tasks. The literature generally provides evidence that people have a tendency to choose the SQ

option disproportionately often and that this behaviour is, at least partially, triggered by the

design characteristics of the choice sets used in the survey (Boxall et al. 2009; Rolfe and

Bennett, 2009; Zhang and Adamowicz, 2011). The data are from a study applying a design-of-

designs approach (Caussade et al., 2005) resulting in 16 different choice designs. Across those

designs the following five design dimensions vary systematically: the number of choice sets,

the number of alternatives, the number of attributes, the number of levels and the range of

attribute levels (see Meyerhoff et al. 2015). Moreover, we use an attitudinal scale developed

for measuring impulsivity. We were motivated to add this scale by the recently increasing

interest in whether personal traits explain (stated) choice behaviour (e.g., Grebitus et al.,

2013).

In the following we compare the HCM and the RPL with respect to the impact of the

design dimensions on the frequency of SQ choices, the distribution of the marginal WTP

estimates, and the extent to which they allow insights into respondents’ decision making. The

paper is organized as follows. Section 2 discusses the definition of the latent variables used in

HCM, Section 3 describes the methodological framework and Section 4 the case study.

Afterwards, Section 5 presents the main results and, finally, Section 6 is devoted to discussions

and conclusions.

2. Latent variables and structural equation models

A latent variable is one of the foundation stones of a structural equation model, but

there is no general definition of a latent variable that includes all its applications. Non-formal

definitions consider latent variables as “hypothetical variables” that cannot be directly

4

measured (MacCallum and Austin, 2000). Among the formal definitions, we find the local

independence definition (Hambleton et al., 1991), the expected value definition (Lord and

Novick 1968, pp. 29-30), the definition of them as nondeterministic functions of observed

variables (Bentler, 1982) or the sample realization definition suggested by Bollen (2002, p.

612), which seems to be simple and flexible: “A latent random (or nonrandom) variable is a

random (or nonrandom) variable for which there is no sample realization for at least some

observations in a given sample”. Structural Choice Models (Rungie, Coote and Louvieree, 2011,

2012) combine structural equation modelling (SEM) with discrete choice models, assuming

that the latent variables have random coefficients with multivariate distributions with

unknown parameters. The model incorporates factors that influence the random coefficients

and can influence each other through links in the structural equations.

We focus in the following on related but different models called HCMs in which the

latent variables represent the characteristics of individuals, typically constructs like attitudes

(Ben-Akiva et al., 2002). These latent variables are treated as endogenous and related to socio-

demographic characteristics in structural equations, but, at the same time, they are

explanatory in measurement equations relating them to observed indicators. This type of

model has been increasingly used in all fields in which discrete choice models are applied.

Nevertheless, criticism of them has increased at the same pace.

The most frequently used latent variable models thus far have been applied and

supported in transportation by, among others, Abou-Zeid et al. (2010), Walker et al. (2010),

Daly et al. (2012), Prato et al. (2012), Glerum et al. (2014), Kamargianni and Polydoropoulou

(2014), Kim et al. (2014) and Paulssen et al. (2014). In a recent paper, also in transportation,

Hess et al. (2013) find better performance of HCMs in terms of efficiency, represented by

lower standard errors, and argue that this approach presents a theoretical advantage in terms

of endogeneity bias and measurement error, but its practical implications seem limited.

5

Chorus and Kroesen (2014) go even further in their criticism. They state that HCMs do not

support the derivation of travel demand policies that aim to change travel behaviour through

changes in a latent variable, because of the non-trivial endogeneity of the latent variable

regarding travel choice and the cross-sectional nature of the latent variable which does not

allow for claims concerning changes in the variable at the individual level. The first argument is

probably highly case specific as the endogeneity of the latent variable can be an empirically

non-relevant issue. The second argument definitely needs future research, as it is not obvious

how strongly the cross-sectional nature of the attitudinal information affects the performance

of the HCMs.

Recently, Dekker et al. (2014) investigated to what extent choices for leisure activities

and related travels are driven by the satisfaction of needs of a particular leisure activity. They

include in their choice model latent variables representing the anticipated level of individual

needs-satisfaction by a particular leisure activity. Using a stated choice-dataset involving

choices between leisure activities, they contrast regret-minimisation based discrete choice

models including and excluding the subjective measurements of need-satisfaction. Their

empirical results show that, not unexpected, a big portion of the unobserved heterogeneity

(around 40%) in the activity specific utility levels can be attributed to anticipated needs

satisfaction.

In environmental valuation, the HCM has been applied, among others, by Hess and

Beharry-Borg (2012), Bartczak et al. (2015), Hoyos et al. (2015), Mariel et al. (2015), and

Lundhede et al. (2015). In general, they all support the finding that HCMs provide greater

insights into attitudes as additional drivers of choices. Both Lundhede et al (2015) and Bartczak

et al (2015) found, for example, a significant influence of age on the latent variable and

subsequently on WTP estimates. In some case also gains in efficiency were achieved.

Nevertheless, Dekker et al. (2013), who additionally asked follow-up questions to record

6

respondents’ response certainty, note, in a rather critical way, that this additional information

does not significantly improve the explanation of the observed choices.

Kløjgaard and Hess (2014), applying the HCM approach in order to investigate data

from a health survey, also express scepticism about latent variable models. They found that

only a small share of the overall heterogeneity was linked to the latent variable. According to

their interpretation, an explanation for the weak link could be the fact that preference

heterogeneity is unrelated to attitudes and perceptions, or, more precisely, that the specific

attitudinal statements measured in the survey are not directly linked to preference

heterogeneity.

Some of the issues related to the use of latent variables in HCMs might be avoided by

learning from the SEM literature. Cliff (1983), for example, gives some warnings and advice to

structural modellers, reminding them of four principles of elementary scientific inference that

are perfectly applicable to discrete choice models with latent variables. The first principle is

that data do not confirm a model; they only fail to refute it. That is, an estimated model cannot

tell us about what is not in it. Generally, it is thus recommended to estimate multiple

specifications and functional forms of a model in order to better understand the underlying

generating process. The second principle is that post hoc does not imply propter hoc; that is, a

significant coefficient in an estimated model does not always mean causality. That principle

can be related to critique by Chorus and Kroesen (2014) regarding the cross-sectional nature of

the latent variable. Due to this characteristic it is not appropriate for analysis of changes in the

variable at the individual level.

The third principle is crucial in HCM as it states that just giving something a name does

not mean that we understand it. This is directly related to the definition of a latent variable,

which usually is defined through associations with a set of indicators. Cliff (1983, p. 121) states:

“... we can only interpret our results very cautiously unless or until we have included enough

7

indicators of a variable in our analysis, and have satisfied not only ourselves but sceptical

colleagues and critics that we have done so”. The meaning of the latent variable will always, to

some extent, be wrong, and our indicators will, to some extent, be unreliable. Moreover, in

HCMs the definition of the latent variables is usually neither based on theoretical foundations

nor proved through empirical work. There are, however, accepted scales to measures, for

example, attitudes with a tested set of questions, like locus of control (Rotter, 1975) or

environmental beliefs (Stern, 2000), which can easily be incorporated in choice models. If the

set of follow-up questions has not been based on theoretical findings, a preliminary

exploratory multivariate analysis should at least be applied to confirm the structure of the

underlying constructs.

The fourth principle is that ex post facto explanations are untrustworthy. If a model

has been adjusted on the basis of its fit or lack of fit to a particular data set, its statistical status

is precarious until it can be tested on a new data set. Regarding that principle, a simple

prediction, such as the one used in this application, can help in model comparison and can

shed light on the real performance of the model and on how close the model is to the true

data-generating process.

3. Model specification

We use two model specifications in this paper to investigate the influence of the

design dimensionality on stated choices. The first is a HCM consisting, apart from

measurement equations for attitudinal indicators, of two types of structural equation, one for

the choice model and one for the latent variable model. The structural equation for the choice

model is based on random utility theory (RUM), which is used to link the deterministic model

with a statistical model of human behaviour. Under this framework, the utility 𝑈𝑖𝑛𝑡 of

alternative 𝑖 for respondent 𝑛 in choice situation 𝑡 (from a total of 𝑇𝑛 choice occasions) is given

by:

8

𝑈𝑖𝑛𝑡 = 𝑉𝑖𝑛𝑡 + 𝜀𝑖𝑛𝑡 , (1)

where 𝑉𝑖𝑛𝑡 in a classical logit model depending on observable explanatory variables, which are

usually attributes (𝑥𝑖𝑛𝑡) and vectors of attribute parameters 𝛽. The term 𝜀𝑖𝑛𝑡 is a random

variable following an extreme value distribution with location parameter 0 and scale

parameter 1. In a HCM, 𝑉𝑖𝑛𝑡 also depends on the latent variable 𝐿𝑉𝑛 and a vector of

parameters 𝛼 usually representing the interaction terms of the latent and explanatory

variables. Now let 𝑗𝑛,𝑡 be the alternative chosen by consumer 𝑛 in choice situation 𝑡, such that

𝑃𝑛,𝑡(𝑗𝑛,𝑡) gives the logit probability of the observed choice for consumer 𝑛 in choice situation

𝑡. The logit probability of consumer 𝑛’s observed sequence of choices is 𝑃𝑛 = ∏ 𝑃𝑛,𝑡(𝑗𝑛,𝑡)𝑇𝑛𝑡=1 .

The second structural equation for the latent variable is given by

𝐿𝑉𝑛 = ℎ(𝑍𝑛, 𝛾) + 𝜔𝑛, (2)

where ℎ(𝑍𝑛, 𝛾) represents the determinist part of 𝐿𝑉𝑛 and the specification is ℎ(·), which is in

our case linear, with 𝑍𝑛 being a vector of the socio-demographic variables of respondent 𝑛,

and 𝛾 being a vector of parameters. Additionally, 𝜔𝑛 is a normally distributed random

disturbance with zero mean and standard deviation 𝜎𝜔. In our case, the latent variable should

represent the level of impulsivity of the respondents.

Measurement equations use the values of the attitudinal indicators as dependent

variables, and explain their values with the help of the latent variables. The ℓ𝑡ℎ indicator (of

the total of 𝐿 indicators) for respondent 𝑛 is therefore defined as:

𝐼ℓ𝑛 = 𝑚(𝐿𝑉𝑛, 𝜁) + 𝑣𝑛, (3)

where the indicator 𝐼ℓ𝑛 is a function of the latent variable 𝐿𝑉𝑛 and a vector of parameters 𝜁.

The specification of 𝑣𝑛 determines the behaviour of the measurement model and depends on

the nature of the indicator. Responses to impulsivity statements in our case study are collected

9

using a Likert type response scale, so that the measurement equations are given by typical

ordinal logit (Mariel et al., 2015) in which, apart from the parameters 𝜁, the corresponding

thresholds 𝜏 need to be estimated.

The model is finally estimated by maximum simulated likelihood. The estimation

involves maximizing the joint likelihood of the observed sequence of choices (𝑃𝑛) and the

observed answers to the attitudinal questions (𝐿𝐼ℓ𝑛). The two components are conditional on

the given realization of the latent variable 𝐿𝑉𝑛. Accordingly, the log-likelihood function of the

model is given by integration over 𝜔𝑛:

𝐿𝐿(𝛽, 𝛾, 𝜁, 𝜏) = ∑ 𝑙𝑛𝑁𝑛=1 ∫ (𝑃𝑛 ∏ 𝐿𝐼ℓ𝑛)𝐿

ℓ=1𝜔 𝑔(𝜔)𝑑𝜔. (4)

Thus, the joint likelihood function (4) depends on the parameters of the utility functions

included in (1), the parameters for the socio-demographic interactions in the latent variable

specification defined in (2), and the parameters for the measurement equations defined in (3).

Daly et al. (2012) describe different identification procedures. We follow the Bolduc

normalization by setting σω equal to 1. All model components are estimated simultaneously

and are contrasted using PythonBiogeme (Bierlaire, 2003, 2008) and Ox (Doornik, 2001).

The benchmark model for the hybrid setting described above is a typical RPL model in

which we assume that 𝛽𝑛 is a vector of the true, but unobserved, taste coefficients for

consumer 𝑛. We assume that 𝛽𝑛 is distributed over consumers with density 𝑔(𝛽,Ω). In this

case, if 𝑃𝑛,𝑡𝑅 (𝑗𝑛,𝑡|𝛽) gives the logit probability of the observed choice for consumer 𝑛 in choice

situation 𝑡, the logit probability of consumer 𝑛’s observed sequence of choices is:

𝑃𝑛𝑅(Ω) = ∫ ∏ 𝑃𝑛𝑅�𝑗𝑛,𝑡|𝛽�𝑇𝑛𝑡=1 𝑔(𝛽|Ω)𝑑𝛽 𝛽 . (5)

The log-likelihood function for the observed choices is then:

𝐿𝐿(Ω) = ∑ 𝑙𝑛𝑁𝑛=1 (𝑃𝑛𝑅(Ω)) . (6)

10

4. Case study

The survey aimed at measuring preferences for land use changes in Germany.1 Thus,

the selected choice attributes comprise share of forest, land consumption, biodiversity

conservation and a price attribute (Table 4.1). All attributes except those concerning

biodiversity conservation were presented in all designs, while the biodiversity attributes were

used to adjust the number of attributes according to the design plan proposed by Hensher

(2004). Following this approach, 16 separate efficient designs were created using C-efficiency

allowing for minimizing the variance of WTP (Scarpa and Rose 2008). The designs were

optimized for a MNL model.

Table 4.1: Attributes used in the Choice Experiment

Attribute Description FOREST Percentage changes in the share of forest (positive and negative) LAND Percentage changes in land conversion for housing development and

traffic (positive and negative) BIO Biodiversity in the whole landscape including all landscape types BIO_AGRAR Agricultural landscape biodiversity BIO_FOREST Forest landscape biodiversity BIO_URBAN Urban area biodiversity BIO_OTHER1 Biodiversity in other landscape types: forests, urban areas,

mountains, water BIO_OTHER2 Biodiversity in other landscape types: urban areas, mountains, water BIO_OTHER3 Biodiversity in other landscape types: mountains, water COST Contribution to a landscape fund in € per year

Table 4.2 provides an overview of the 16 designs and of how the dimensions of the

choice sets vary across designs. All choice tasks included an SQ alternative, i.e., a zero price

option with no environmental changes, plus two or more alternatives depending on the

design-of-designs plan. Choices in the choice experiment regarding landscape changes had to

be made by considering the landscape within a distance of about 15 kilometres from the

respondent’s place of residence. Respondents for the nationwide online survey were recruited

1 See Meyerhoff et al. (2015) for more details of the design of the choice experiment and the survey.

11

from a panel of a survey company. Each respondent was randomly allocated to one of the 16

designs.

Table 4.2: Design overview

Design Sets Alternatives Attributes Levels Range Interviews completed

1 24 4 5 3 Base 67 2 18 4 5 4 +20% 63 3 24 3 6 2 +20% 66 4 12 3 6 4 Base 59 5 6 3 4 3 +20% 82 6 24 3 4 4 -20% 45 7 6 4 7 2 -20% 181 8 12 5 4 4 +20% 65 9 24 5 4 4 Base 65

10 6 5 7 3 +20% 128 11 6 4 6 4 -20% 71 12 12 5 5 2 -20% 68 13 18 4 7 2 Base 83 14 18 3 4 3 -20% 65 15 12 3 5 2 Base 99 16 18 5 6 3 -20% 76

Note: The number of interviews does not include those respondents who always chose the SQ option.

The questionnaire also included scales to capture different attitudes or personality

traits of the respondents. One of these was a scale developed for measuring impulsivity. The

scale is meant to provide a measurement instrument that allows the psychological trait of

impulsivity to be recorded in an economic way, i.e., in a way that consumes only a small

amount of interview time. The scale follows the UPPS (Urgency Premeditation Perseverance

and Sensation Seeking Impulsive Behavior Scale) approach. Kovaleva et al. (2012) point out

that there is still no standard definition of impulsiveness but that it is assumed that the

construct is multidimensional and thus comprises various aspects of impulsive behaviour.

These include, among others, i) the tendency to act without thinking and without sufficient

information for a decision, ii) the tendency to prefer a smaller immediate reward, and iii) the

tendency to choose riskier alternatives or the inability to assess the risks associated with

decisions correctly. Therefore, the UPPS approach comprises the four subscales urgency,

12

intention, endurance and willingness to take risks. Each subscale is addressed using two items.

Table 4.3 reports the wording of the attitudinal statements and the direction of the association

with the latent construct impulsivity. Kovaleva et al. (2012) show that their scale performs well

and allows a reliable and valid measurement of impulsivity.

Table 4.3: Attitudinal questions

impul1 urgency + Sometimes I do things impulsively that I shouldn't do impul2 + I sometimes do things to cheer myself up that I later regret impul3 intention - I usually think carefully before I act impul4 - I usually consider things carefully and logically before I make up

my mind impul5 endurance - I always bring to an end what I have started impul6 - I plan my schedule so that I get everything done on time impul7 willingness to

take risks + I am willing to take risks

impul8 + I am happy to take chances

The scale was added to the survey in order to shed light on the link between

respondents’ psychological traits and their stated choices in the survey. We expect that

respondents who tend to be more impulsive are more likely to choose alternatives with a

positive price, i.e., not the SQ option, and that this intensifies when the choice sets become

more complex with a higher dimensionality. The reason for this is that people who are said to

be more impulsive are, among other things, expected to be more likely to act without

reflecting on the consequences and to be more likely to take risks (Kovaleva et al., 2012). To

some extent, however, the scale, which was provided by a leading social science research

centre in Germany (GESIS - Leibnitz Institute for the Social Sciences), was added in an

experimental manner as we expected it to be a reliable measurement instrument enabling us

to estimate HCMs. The literature applying latent variable models indicates that not using

reliable measurement instruments reduces the possibility of estimating an HCM.

13

5. Results

Table 5.1 describes the variables used in the econometric models, along with their

descriptive statistics. Non-responses to items mean that the useable sample comprises 23,118

responses from 1,661 individuals. Briefly, the mean age is 42.3 years, the share of female

respondents is 53% and the mean disposable income of the respondents’ households is 17,500

Euros. As the survey was conducted as an online survey, we did not expect the sample to be

representative for the population in Germany. Not all people have access to the Internet and,

above all, not all use it regularly. Obvious deviations exist for the variables education and

income. Compared to the German population, the share of respondents with higher education

is too large and thus the disposable incomes are also too high. However, as we did not plan to

aggregate, for example, welfare measures based on the model results, we assume for the

following that the model comparison is not affected by the sample composition.

Table 5.1: Summary statistics

Variable (Attribute) Description Mean Std.Dev. Min Max

AGE Age 42.31 13.53 19 84 MAN Gender: Male 0.47 0.50 0 1 HIGHEDUC Level of education > secondary 0.39 0.49 0 1 INCOME Income 17,500.00 34,959.09 450 100,000 POSITION Position of the choice set 9.18 6.22 1 24 ALTERNATIVES Number of alternatives 3.85 0.81 3 5 ATTRIBUTES Number of attributes 5.32 1.08 4 7 WIDE Wide level range 0.29 0.45 0 1 NARROW Narrow level range 0.33 0.47 0 1 LEVEL3 Three level range 0.28 0.45 0 1 LEVEL4 Four level range 0.33 0.47 0 1

In addition to the socio-economic information, the respondents were asked a series of

attitudinal questions regarding impulsivity, as presented in Table 4.3. Table 5.2 shows the

response distributions on a 5-point Likert scale. For each statement, values closer to five would

14

equate to stronger agreement while values closer to one would equate to stronger

disagreement.

Table 5.2: Responses to the impulsivity attitudinal questions

1 2 3 4 5

impul1 urgency 4% 35 % 24% 32% 5%

impul2 10% 40% 24% 24% 2% impul3 intention 1% 10% 17% 58% 14% impul4 1% 8% 18% 58% 15% impul5 endurance 1% 3% 11% 60% 25% impul6 2% 16% 16% 50% 16% impul7 willingness to take risks 3% 29% 26% 37% 5% impul8 1% 20% 30% 43% 6%

Note: 1 = doesn’t apply at all, 5 = applies completely

As a first step, an exploratory factor analysis was conducted on the responses to the

attitudinal questions. The exploratory factor analysis employed principal axis factor analysis.

According to Table 5.3, it seems reasonable to choose a two-factors solution, as the

percentage of variance explained decreases sharply in the third factor, and the highest factor

loadings appear in the columns for Factors 1 and 2. A HCM with all eight attitudinal questions

and two latent variables would have a very high number of parameters (82), which could lead

to numerical issues in the estimation procedure. As parsimony is also an important issue for

model development, we estimated numerous alternative model specifications and selected a

subset of questions using as criterion the significance of the parameters 𝜁 in the measurement

equations (3). This is the reason why only three attitudinal questions (impul1, impul7 and

impul8) have finally been included in the HCM incorporating therefore only one latent variable

representing the first factor (Table 5.3). This one latent variable solution is also in line with the

definition of our attitudinal questions, as impul1 is related to urgency and impul6 and impul7

to willingness to take risks. Our latent variable represents, therefore, urgency and risk

propensity. What is pursued here is the satisfaction of the third principle introduced by Cliff

(1983) stating that just giving something a name does not mean that we understand it. In our

15

case we chose three indicators of clearly stated theoretical concepts basing our decision on a

factor analysis of our data.

Table 5.3: Exploratory factor analysis

Eigenvalues and percentages

Factor loadings

Factor Eigenvalue Proportion Cumulative

Variable Factor1 Factor2 Factor3 Factor4

Factor1 1.92 0.66 0.66

impul1 0.56 0.00 0.36 0.10

Factor2 1.16 0.40 1.06

impul2 0.46 -0.07 0.43 0.06

Factor3 0.49 0.17 1.23

impul3 -0.61 0.27 0.28 -0.11

Factor4 0.16 0.06 1.28

impul4 -0.60 0.26 0.27 -0.13

Factor5 -0.18 -0.06 1.22

impul5 -0.26 0.45 0.00 0.21

Factor6 -0.19 -0.07 1.15

impul6 -0.33 0.35 -0.04 0.23

Factor7 -0.21 -0.07 1.08

impul7 0.52 0.56 -0.07 -0.11

Factor8 -0.23 -0.08 1.00

impul8 0.47 0.61 -0.11 -0.08

As outlined in Section 3, the specification of a HCM requires the specification of two

types of structural equations, one for the choice model and one for the latent variable model.

Following equation (1), the structural equation for the choice model has a deterministic term

𝑉𝑖𝑛𝑡, defined in our case as:

𝑉𝑖𝑛𝑡 = 𝛽′𝑋𝑖𝑛𝑡 = (𝐴𝑆𝐶𝑖 + 𝛼𝐴𝑆𝐶𝑖𝐿𝑉𝑛) + (𝛽𝐹𝑂𝑅𝐸𝑆𝑇 + 𝛼𝐹𝑂𝑅𝐸𝑆𝑇𝐿𝑉𝑛) 𝐹𝑂𝑅𝐸𝑆𝑇𝑖𝑛𝑡

+(𝛽𝐿𝐴𝑁𝐷 + 𝛼𝐿𝐴𝑁𝐷𝐿𝑉𝑛)𝐿𝐴𝑁𝐷𝑖𝑛𝑡 + 𝛽𝐵𝐼𝑂𝐵𝐼𝑂𝑖𝑛𝑡 − 𝑒𝑥𝑝(𝛽𝐶𝑂𝑆𝑇+𝛼𝐶𝑂𝑆𝑇𝐿𝑉𝑛) 𝐶𝑂𝑆𝑇𝑖𝑛𝑡, (7)

where 𝐹𝑂𝑅𝐸𝑆𝑇, 𝐿𝐴𝑁𝐷,𝐵𝐼𝑂 and 𝐶𝑂𝑆𝑇 are the choice attributes described in Table 4.1 and

𝛼𝐴𝑆𝐶𝑖 = 0 ∀ 𝑖 ≠ 𝑆𝑄. The attribute 𝐵𝐼𝑂 is substituted by the corresponding split attributes in

designs including 𝐵𝐼𝑂_𝐴𝐺𝑅𝐴𝑅,𝐵𝐼𝑂_𝐹𝑂𝑅𝐸𝑆𝑇,𝐵𝐼𝑂_𝑈𝑅𝐵𝐴𝑁,𝐵𝐼𝑂_𝑂𝑇𝐻𝐸𝑅1,𝐵𝐼𝑂_𝑂𝑇𝐻𝐸𝑅2

and 𝐵𝐼𝑂_𝑂𝑇𝐻𝐸𝑅3. In addition, we include alternative specific constants 𝐴𝑆𝐶𝑖 for all but one

of the alternatives. Note that the functional form of equation (7) resembles an RPL with the

key attributes (𝐹𝑂𝑅𝐸𝑆𝑇, 𝐿𝐴𝑁𝐷 and 𝐶𝑂𝑆𝑇) being random to allow for a more straightforward

comparison of the results obtained from the two models. According to (7), and apart from the

key attributes, 𝐴𝑆𝐶𝑆𝑄 is also assumed to be random, which allows a possible SQ effect caused

16

by impulsivity and/or complexity of the design to be analysed. In the RPL the coefficients

𝛽𝐹𝑂𝑅𝐸𝑆𝑇 ,𝛽𝐿𝐴𝑁𝐷 and 𝐴𝑆𝐶𝑆𝑄 are assumed to be normally, and 𝛽𝐶𝑂𝑆𝑇 to be log-normally,

distributed, which is in line with (7). Moreover, we assume that there is a vector of individual

characteristics and complexity variables that affects the mean of these random parameter

distributions. To make the two competing models similar, we include in the vector affecting

the mean of the random parameters the same variables as those included in the determinist

part of the latent variable 𝐿𝑉𝑛 defined in (2).

Figure 5.1: Empirical distributions of random parameters

Note: solid line represents the individual contributions of each random parameter and dashed line a normal density (log-normal

for Cost).

As the selection of the parameters’ distribution is a key issue in the RPL methodology

we applied the empirical approach proposed by Hensher and Greene (2003) to describe

graphically the empirical distributions for the random parameters. Due to this procedure the

same model for different data subsets are estimated. These subsets are created by leaving one

individual out. The differences in the parameter estimates obtained by the use of these

subsets and the parameter estimates of the whole sample provide the contribution

(incremental marginal utility) of a specific individual to the overall sample mean parameter

estimate and they can, therefore, indicate the type of underlying individual preference

heterogeneity. Figure 5.1 shows the shape of these individual contributions for each random

parameter (solid line) together with a normal density (dashed line). The cost coefficient,

however, is plotted with lognormal dashed density. The lognormal distribution (with a sign

Share of forest Land conversion Cost ASCsq

17

change), assumed for the cost parameter, assures finite moments for the WTP distributions

(Daly, Hess, and Train; 2012). Figure 5.1 shows that there are no sizeable deviations of the

individual contributions from the previously assumed density shapes for the random

parameters.

Table 5.4 presents the maximum simulated log-likelihood estimation obtained from

the RPL using 200 Halton draws. The high number of observations and the high number of

different utility function specifications due to the complex design do not allow for using more

Halton draws as this would increase estimation costs drastically. However, both models were

estimated by two different software packages (PythonBiogeme and Ox) and by using various

sets of starting values to prove the stability of presented results. The estimated means and

standard deviations of all random coefficients are presented in the upper part of Table 5.4,

together with estimated coefficients representing the heterogeneity in mean. The lower part

of the same table presents the estimations of the non-random coefficients.

Table 5.4: Random parameter logit estimation

Observations:

23118

Log_L:

-17669.9

Respondents:

1661

AIC:

35455.8

Parameters:

58

BIC:

35922.6

CAIC:

35980.6

Share of forest

Land conversion

Cost

ASC SQ

Value

p-value

Value

p-value

Value

p-value

Value

p-value

Mean 0.0299 *** <0.01

-0.0251

<0.01

-0.442

0.24

1.041

0.21 St. Dev. 0.0316 *** <0.01

0.0203

<0.01

-2.702

<0.01

3.548

<0.01

Mean heterogeneity: Choice task position 0.0002 ** 0.04

-0.0004 *** <0.01

0.0068 0.06

0.0446 *** <0.01

Number of alternatives 0.0002 0.87

0.0004 0.65

-0.1896 *** <0.01

-1.1274 *** <0.01 Number of attributes -0.0007 0.54

0.0024 *** <0.01

-0.4745 *** <0.01

0.0963 0.36

Wide level range -0.0107 *** <0.01

0.0034 ** 0.02

-0.6457 *** <0.01

0.8278 *** <0.01 Narrow level range 0.0079 *** <0.01

-0.0045 *** <0.01

-0.0918 0.27

-0.3945 0.06

Three levels 0.0053 0.06

0.0031 0.06

-2.3069 *** <0.01

0.6809 ** 0.01 Four levels 0.0076 ** 0.02

0.0012 0.54

-1.4857 *** <0.01

0.3379 0.24

Age -0.0001 0.46

0.0000 0.58

0.0091 *** <0.01

0.0168 ** 0.01 Male -0.0013 0.52

0.0009 0.47

-0.8476 *** <0.01

-0.0931 0.60

Higher Education 0.0016 0.47

-0.0053 *** <0.01

-0.0128 0.88

-0.1263 0.54 Other coefficients:

Biodiversity-Whole 0.0105 *** <0.01 Biodiversity-

Agricultural 0.0081 *** <0.01 Biodiversity-Forest 0.0123 *** <0.01 Biodiversity-Urban 0.0095 *** <0.01 Biodiversity-Other1 0.0096 *** <0.01 Biodiversity-Other2 0.0092 *** <0.01

18

Biodiversity-Other3 0.0064 *** <0.01 ASC2 0.1042 *** <0.01 ASC3 -0.1331 *** <0.01 ASC4 -0.3949 *** <0.01

Table 5.5 presents the maximum simulated log-likelihood estimation results of the

HCM obtained using also only 200 Halton draws. The upper part of the table presents the

estimations of the key attributes together with corresponding LV effect coefficient (𝛼). The

coefficients (𝛾) of the structural equation of the LV defined in (2) are on the left hand side of

the table and the coefficients of the measurement equations (𝜁) on the right hand side. These

are presented together with the thresholds estimated using the ordinal logit model (defined as

𝜏ℓ, 𝜏ℓ + 𝛿1ℓ, 𝜏ℓ + 𝛿2ℓ, 𝜏ℓ + 𝛿3ℓ) for the three attitudinal response scales.

Table 5.5: HCM model estimation

Observations:

23118

Log_L:

-26315.4

Respondents:

1661

AIC:

52716.8

Parameters:

43

BIC:

53062.9

CAIC:

53105.9

Share of forest

Land conversion

Cost

ASC SQ

Value p-value

Value

p-value

Value p-value

Value p-value

Coefficient 0.0366 *** <0.01

-0.0228 *** <0.01

-3.0761 *** <0.01

4.1742 *** <0.01

Effect of the LV -0.009 *** <0.01

0.008 *** <0.01

-1.335 *** <0.01

-3.857 *** <0.01 Structural equation

Measurement equation parameters

Choice task position -0.0093 *** <0.01

Thresholds and constants Number of alternatives 0.1441 *** <0.01

𝜏1 -3.686 *** <0.01

Number of attributes 0.0589 ** <0.01

𝛿11 2.571 *** <0.01 Wide level range -0.0101 0.91

𝛿21 1.076 *** <0.01

Narrow level range 0.0608 0.31

𝛿31 2.623 *** <0.01 Three levels 0.2387 *** <0.01

Four levels 0.1141 0.12

𝜏2 -4.005 *** <0.01 Age 0.0007 0.65

𝛿12 2.717 *** <0.01

Male 0.3284 *** <0.01

𝛿22 1.130 *** <0.01 Higher Education 0.1273 0.13

𝛿32 2.686 *** <0.01

Other coefficients: Biodiversity-Whole 0.0139 *** <0.01

𝜏3 -4.682 *** <0.01

Biodiversity-Agricultural 0.0070 *** <0.01

𝛿13 3.115 *** <0.01 Biodiversity-Forest 0.0090 *** <0.01

𝛿23 1.333 *** <0.01

Biodiversity-Urban 0.0077 *** <0.01

𝛿33 2.710 *** <0.01 Biodiversity-Other1 0.0087 *** <0.01

Biodiversity-Other2 0.0048 *** <0.01 Biodiversity-Other3 0.0072 *** <0.01

Coefficients of the LV ASC2 0.0609 *** 0.01

𝜁1 -0.533 ** 0.03

ASC3 -0.0985 *** <0.01

𝜁2 -0.492 ** 0.05 ASC4 -0.3309 *** <0.01

𝜁8 -0.279 ** 0.04

19

A comparison of Tables 5.4 and 5.5 leads to the following conclusions. The estimates of

the non-random coefficients, as well as the coefficient mean values of the Share of forest and

Land conversion coefficients, are very close. The main difference between the models can be

found only in the Cost and 𝐴𝑆𝐶𝑆𝑄 coefficients. Many design dimensions and socio-

demographic variables have significant effects on the mean of the random coefficients in RPL

and on the LV in the HCM model. A direct comparison, however, is not possible because the

RPL is more flexible, in the sense that it allows different impacts of these variables on the

mean of each random coefficient, whereas in the HCM this effect is modelled through a latent

concept of impulsivity and is therefore assumed to be the same for all coefficients.

Given that the 𝜁 coefficients in (3) are negative, high values of the latent variable

correspond to less impulsive, and more risk averse, individuals. Thus, as 𝛼𝐴𝑆𝐶𝑆𝑄 is negative,

more impulsive individuals with high risk propensity are more likely to choose an alternative

different from the SQ option, and that confirms our a priori expectations.

Next, based on the results from Tables 5.4 and 5.5, we simulate the marginal WTP

values and the distribution of the 𝐴𝑆𝐶𝑆𝑄 for the sample population of respondents, using

10,000 draws of the corresponding normal (𝐴𝑆𝐶𝑆𝑄, Share of forest, Land conversion) and log-

normal (Cost) distributions, taking into account the heterogeneity in mean coefficients.

Similarly, the simulated marginal WTP and the distribution of 𝐴𝑆𝐶𝑆𝑄 for the HCM model are

computed by using 10,000 draws for the LV of each respondent and taking into account the

coefficients of the structural equations for the LV. Table 5.6 presents the distribution of the

WTP of the two models obtained for the two attributes Share of forest and Land conversion. As

can easily be seen, the median values are similar, but the distribution of the WTP obtained by

RPL is much wider. This could indicate a better performance of the HCM in terms of less

variation of WTP values (Hess et al., 2013). However, we have to be cautious with the

comparison as the large intervals in the RPL are likely to be, at least partially, driven by the

20

heavy tailed lognormal distribution of the cost coefficient. Nevertheless, the cost coefficient in

the HCM is also modelled in lognormal-like way (−𝑒𝑥𝑝(𝛽𝐶𝑂𝑆𝑇+𝛼𝐶𝑂𝑆𝑇𝐿𝑉𝑛) 𝐶𝑂𝑆𝑇𝑖𝑛𝑡).

Table 5.6: Distribution of marginal WTP obtained by RPL and HCM models

RPL

25th percentile Median 75th percentile Share of forest 0.1 2.0 22.4 Land conversion -10.8 -0.9 0.0

HCM

25th percentile Median 75th percentile

Share of forest 1.3 2.5 4.8 Land conversion -2.0 -1.3 -0.7

As the means of the random coefficients in the RPL model, as well as the latent variables,

depend on various design dimensions and socio-demographic variables, the WTP values can be

simulated for specific subgroups of respondents. The Tables 5.7 and 5.8 demonstrate how the

distributions of WTP change under different scenarios characterized by different values of two

design dimension variables. This allows us to analyse the effect of these variables on the WTP

distributions.

Table 5.7: Effects of position in a series of choice occasions on the marginal WTP distribution

Share of forest - RPL

Land conversion - RPL Position 25th perc. Median 75th perc.

Position 25th perc. Median 75th perc.

Low (<5) 0.0 1.9 21.9

Low (<5) -9.3 -0.7 0.0 High (>13) 0.1 2.2 21.5

High (>13) -12.0 -1.3 -0.1

Share of forest - HCM

Land conversion - HCM Position 25th perc. Median 75th perc.

Position 25th perc. Median 75th perc.

Low (<5) 1.4 2.8 5.1

Low (<5) -2.0 -1.3 -0.7 High (>13) 1.1 2.3 4.4

High (>13) -1.9 -1.2 -0.6

21

Table 5.8: Effects of number of alternatives on the marginal WTP distribution

Share of forest - RPL

Land conversion - RPL

Alternatives 25th perc. Median

75th perc.

Alternatives

25th perc. Median

75th perc.

Low (3) 0.0 1.2 13.2

Low (3) -7.3 -0.7 0.0 High (5) 0.2 4.2 47.4

High (5) -17.9 -1.3 0.0

Share of forest - HCM

Land conversion - HCM

Alternatives 25th perc. Median

75th perc.

Alternatives

25th perc. Median

75th perc.

Low (3) 1.1 2.2 4.1

Low(3) -1.9 -1.1 -0.6 High (5) 1.5 3.0 5.5

High (5) -2.1 -1.4 -0.7

The effects in Tables 5.6 and 5.7 are, as expected, in the direction of the sign of the

corresponding heterogeneity in mean coefficient (RPL) or the structural equation coefficient

(HCM). These effects are not always in the same direction in both approaches as the models

rely on different assumptions. If we focus on shifts in the median values, we conclude that

these are not as large as we would expect in all cases presented above but, nevertheless, they

are too large to be ignored. For example, the mean WTP value for the Share of forest attribute

changes from 1.2 to 4.2 in RPL and from 2.2 to 3.0 in HCM as a consequence of the change in

the number of alternatives from 3 to 5.

Using the same procedure, the distribution of 𝐴𝑆𝐶𝑆𝑄 was simulated in the RPL and

HCM models under different scenarios. Table 5.9 characterizes the changes in those

distributions attributable to design dimension variables. For example, the two approaches

confirm that a choice task appearing later in the sequence of tasks increases the utility of the

SQ alternative, leading to a higher probability of it being chosen. This can be due to the fatigue

effect (e.g., Boxall et al., 2009). On the other hand, as expected, in the two approaches a

higher number of alternatives has an opposite effect – that is, more alternatives leads to a

lower probability for the SQ choice. The same result was obtained in Oehlmann et al. (2014).

22

Table 5.9: Distribution of 𝐴𝑆𝐶𝑆𝑄 under different scenarios

RPL

HCM

25th perc. Median 75th perc.

25th perc. Median 75th perc. Position

Position

Low -4.3 -1.8 0.8

Low -3.3 -0.6 2.1 High -3.3 -0.9 1.6

High -2.5 0.2 2.9

Alternatives

Alternatives Low -2.8 -0.3 2.0

Low -2.2 0.4 3.1

High -4.9 -2.6 -0.2

High -3.6 -1.0 1.7

Attributes

Attributes Low -3.9 -1.3 1.2

Low -2.7 -0.1 2.6

High -3.7 -1.2 1.3

High -3.1 -0.4 2.3

Wide level range

Wide level range No -4.2 -1.7 0.8

No -3.0 -0.3 2.4

Yes -3.0 -0.5 1.9

Yes -2.8 0.0 2.7

Narrow level range

Narrow level range No -4.2 -1.7 0.8

No -2.8 -0.1 2.6

Yes -4.3 -1.8 0.7

Yes -3.2 -0.5 2.2

Next, we compared the performance of the two models by two simple approaches.

First, in a similar way to the simulation of the marginal WTP distributions, we simulated the

probabilities of each alternative based on the sample population of respondents, using 10,000

draws. If we assume that the highest probability coincides with the choice prediction, we get

the classification tables of observed and predicted outcomes presented in Table 5.10. The

results are presented in percentages. As can be observed, there are only minor differences

between the results for both models. Both models, RPL and HCM, predict very similarly, but at

the same time also poorly.

Table 5.10: Classification table of observed and predicted outcomes

RPL

HCM

Predicted

Predicted

1 2 3 4 5

1 2 3 4 5

1 0.084 0.036 0.061 0.030 0.012

0.224

1 0.086 0.030 0.059 0.029 0.020

0.224

2 0.036 0.085 0.066 0.032 0.010

0.229

2 0.032 0.089 0.062 0.030 0.017

0.229

Observed 3 0.066 0.066 0.158 0.026 0.011

0.327

3 0.065 0.068 0.151 0.026 0.017

0.327

4 0.030 0.030 0.022 0.056 0.010

0.147

4 0.030 0.032 0.019 0.054 0.014

0.147

5 0.015 0.014 0.013 0.010 0.021

0.073

5 0.013 0.015 0.013 0.006 0.027

0.073

0.230 0.230 0.320 0.155 0.063

1.000

0.225 0.233 0.304 0.145 0.093

1.000

23

If we transform the information into one indicator, defined as 𝑅𝐶𝑜𝑢𝑛𝑡2 = (1𝑁

)∑ 𝑛𝑗𝑗𝑗 ,

where 𝑛𝑗𝑗 is the number of correct predictions for outcome 𝑗 that are located on the diagonal

cells of the two tables, we get 𝑅𝐶𝑜𝑢𝑛𝑡2 = 0.403 for the RPL model and 𝑅𝐶𝑜𝑢𝑛𝑡2 = 0.406 for the

HCM model. If we make our prediction more realistic and use in each simulation step a draw

from a uniform [0,1] to generate a choice prediction based on the predicted probabilities, then

the values 𝑅𝐶𝑜𝑢𝑛𝑡2 drop slightly to 0.371 and 0.373 in the RPL and HCM models respectively.

Unsurprisingly, the difference is also very small.

If we analyse the contribution of the attributes and attitudinal questions to the

prediction in more detail, we can subtract from the numerator and denominator of 𝑅𝐶𝑜𝑢𝑛𝑡2 the

number of cases in the outcome with the highest frequency (in our case outcome 3), and we

obtain an adjusted 𝑅𝐶𝑜𝑢𝑛𝑡2 which is, in our case, 0.114 for RPL and 0.118 for HCM. Our

knowledge of attributes and attitudinal questions, compared to a prediction based only on the

marginal distributions, reduces the error in prediction by only 11.4% and 11.8% respectively.

There are other simple indicators related to observed and unobserved heterogeneity

that can be used to compare the RPL and HCM. The random coefficients are an appealing part

of the RPL, but we would certainly prefer to interpret a model in which the unobserved

heterogeneity represents only a small part of the random coefficients. The same is true for the

HCMs. Actually, an RPL-like definition of the HCM coefficients (𝛽𝐹𝑂𝑅𝐸𝑆𝑇 + 𝛼𝐹𝑂𝑅𝐸𝑆𝑇𝐿𝑉𝑛) is a

nice way to disentangle the preference heterogeneity through the use of the underlying

construct. To achieve this goal, the coefficients 𝛾 in (2) should be sufficiently big so that

ℎ(𝑍𝑛, 𝛾) represents a high proportion of the total variation of the latent variable. Table 5.11

represents the ratios of the variances of observed and unobserved heterogeneity. For the HCM

model, the table represents the ratio of the variances of ℎ(𝑍𝑛, 𝛾) and 𝜔𝑛 defined in (2) and

computed by the use of the same simulations as those used in the above prediction exercise.

The values in the RPL column have been computed in a similar way.

24

Table 5.11: Observed/unobserved heterogeneity ratios

RPL HCM

Share of forest 0.076 0.067

Land conversion 0.077 0.067

Cost 0.155 0.067 𝐴𝑆𝐶𝑆𝑄 0.097 0.067

As can be observed from Table 5.11, the ratios are low but this finding is not unusual in the

literature (Dekker et al., 2013; Kløjgaard and Hess, 2014).

6. Discussion

The objective of this paper was to investigate whether the insights gained from HCMs,

which have been applied more frequently in the recent literature, justify the additional effort.

We used as a case a data set based on design-of-designs approach allowing for the analysis of

the influence of choice task complexity on model outcomes. Regarding the influence of the

design dimensions we find that both the HCM and the RPL model show that the design

dimensions influence the WTP distribution. The results are obviously not exactly the same for

the two models, as the more flexible RPL specification allows us to see different effects of the

design dimensions on WTP for each attribute. Both approaches, moreover, confirm that all the

design dimensions in the analysis influence the marginal WTP values, and, subsequently some

conclusions can be drawn. Firstly, it is important to choose the design dimensions of choice

sets carefully as they can significantly influence the outcomes. Our results show that the

highest influence corresponds to the number of alternatives and the number of attribute

levels. Secondly, the design dimensions are also related to the frequency of SQ choices.

According to our results, more alternatives for the choice set have a negative impact on the

frequency of SQ choices. This can be explained by the so-called preference matching effect

(Zhang and Adamowicz, 2011), i.e., giving respondents more alternatives on a choice set

increases the probability that they find an alternative that matches their preferences. By

25

contrast, the number of choice tasks faced by a respondent positively affects the frequency of

SQ choices, i.e., the later in the sequence of choice sets, the higher the propensity to choose

the SQ alternative. This might be caused by respondent fatigue at the end of the sequence of

choice sets. To what extent learning and fatigue take place while responding to a discrete

choice experiment is, however, still under investigation (see for a recent study Campbell et al.,

2015). In this study we have only focused on the design dimensions and have not incorporated

other aspects of complexity such as the total number of level changes or the similarity of

alternatives measured, for example, through entropy (e.g., Zhang and Adamowicz, 2011).

Therefore, we might not have captured all those aspects of complexity that influence the

propensity to choose the SQ alternative. The reason for this is that we wanted to focus here on

the comparison of the models. Readers interested in the relationship between the other

aspects complexity and SQ choices are thus referred to Oehlmann et al. (2014). Finally,

regarding the effect of impulsivity on the propensity to choose the SQ option, we conclude

that more impulsive and risk-seeking people are more likely to choose a non-SQ alternative.

The findings add to an increasing evidence about the relationship between personality traits

and choices (Farizo et al., 2016).

The main objective of this paper, as stated in the introduction, was to compare, more

closely than is usually done, an HCM with the more commonly used RPL model. The

comparison includes performance, the insights gained through the estimation and the

subsequent post-estimation analysis. We therefore believe that our results add new insights to

the ongoing debate regarding the performance and additional value of HCMs (Chorus and

Kroesen, 2014; Dekker et al., 2014, Kløjgaard and Hess, 2014; Vij and Walker, 2015). The two

competing models in our case study were specified in a similar way so that their comparison

would be relatively easy. The two models allow for preference heterogeneity of three key

attributes. One part of this heterogeneity is linked to the dimensionality of the choice tasks

and to socio-demographic variables. The other part remains random. The main difference

26

between the two approaches is that the taste heterogeneity in the RPL model is not linked to

any underlying latent attitudes. Thus, a comparison in terms of model fit is not

straightforward. Some authors compute the LogL-value of competing models corresponding

only to the choice part of the model. However, this procedure is debateable as the log-

likelihood function is maximized taking into account all the parameters of the model. This is

why in the literature the debate about the suitability of the HCM usually remains in the

discussion of the actual differences in the implied sensitivities of alternative model

specifications.

The work of Glerum et al. (2014) is an exception, presenting an interesting validation

of their model in relation to the fourth principle of the SEM literature that ex post facto

explanations are untrustworthy. They estimate the HCM on 80% of the data and compute the

choice probabilities for the remaining 20% of the data. Assuming that the highest predicted

probability corresponds to the chosen alternative, the authors compare this to the actual

choice. They also use the �̅�2 as an additional indicator of the validity of the HCM in comparison

to a plain MNL model, concluding, unsurprisingly, that the HCM performs better. A different

approach was presented by Kløjgaard and Hess (2014), who try to disentangle the influence of

the latent variable, but their conclusion is not very optimistic. Only a small share of the overall

heterogeneity is linked to the latent variable that explains only slightly more than 6% of the

total variance. The validation of the HCM should therefore be an important part of any

empirical application based on HCM methodology, as the criticism of this approach basing on

empirical evidence (Kløjgaard and Hess, 2014) and theoretical foundations (Chorus and

Kroesen, 2014, Dekker et al., 2013) has increased considerably.

If we focus on our comparison of the performance of the two models, the first

conclusion, based on the prediction exercise (Table 5.10), is that the two models perform very

similarly and that no great differences can be found in their prediction outcomes. The second

27

conclusion is that the two models perform very poorly in forecasting, as they are only able to

predict around 40% of the actual outcomes. This low percentage is not a big issue in our case,

as the main goal of the discrete choice models in the environmental field is usually not

prediction but policy making based on the WTP values obtained. Nevertheless, it shows that

the methodologically complex models like those presented in this application, which are

widely used and not only in environmental studies, can show complex forms of inter-variable

relations and how the variables relate to preference heterogeneity, but fail to give a good

representation of the underlying data-generating process.

Regarding the observed/unobserved heterogeneity ratios of the two models (Table

5.11), the low ratio for the HCM is perfectly in line with Kløjgaard and Hess (2014), as in their

case study only 6% of the overall heterogeneity is linked to the latent variable. This may

require a revision of the main idea behind the relationship between preference heterogeneity

and attitudes, i.e., that attitudes contribute substantially to explaining taste heterogeneity. As

can be seen in Table 5.10, the RPL model does not amend this shortcoming. The standard

deviations of the random parameters are too high, leaving little space for the design

dimensions and socio-demographic variables to represent the relevant portion of the overall

heterogeneity. It is obviously questionable whether the assumed distributions are the

appropriate ones in this case, but the same pattern of a low ratio is observed for all

coefficients. And this point is related to the first principle of elementary scientific inference:

that data do not confirm a model, they can only fail to refuse it. Low ratios of observed and

unobserved heterogeneity in many HCM applications lead us to revise our a priori assumption

that the unobserved constructs representing our latent attitudes are related to socio-

demographic characteristics usually included in a choice experiment, not only in environmental

valuation. The modelling approach should be also tested in detail. The significant reduction of

the unobserved heterogeneity obtained by Dekker et al. (2014) by the use of a mixture of the

RPL and HCM approach seems to be promising line for a future research.

28

As already stated in the literature, HCMs gain in efficiency by the inclusion of

additional information (attitudinal questions) in the choice model. This is why, in our case, the

WTP distributions derived from the HCM present lower variations. Our case is rather atypical,

because the plain RPL model has more coefficients than the HCM. The typical situation is the

opposite one, i.e., the number of parameters in a HCM rises rapidly with each attitudinal

debriefing question. Thus, our case is an untypical example where the HCM simplifies a

complex RPL model devoted to the capture of complex observed heterogeneity.

Our recommendation based on the present evidence is that people conducting choice

experiments can consider adding the HCM to their suite of models even if the costs are high

and these models are not yet available in standard econometric packages. Nevertheless, there

are many case specific external issues needed to be taken into account to be able to reduce

unobserved heterogeneity. This is in line with Vij and Walker (2016) who evaluate

systematically the benefits of the HCM framework in comparison with a more traditional

choice model without latent variables using a set of criteria based on statistical considerations

and relevance to practice and policy. The study finds the statistical benefits of the HCM to be

smaller than previously believed. According to their conclusions, HCM can improve predictions

and to reduce the variance of the parameter estimates only in some specific cases. However,

in terms of relevance for practice and policy, they recommend the use of HCM as it allows to

measure, test and quantify the impact of latent constructs on observable behaviour through,

for example, willingness to pay estimates or elasticity of demand.

The overall conclusion of our model comparison leads us to highlight the importance

of the final use of an estimated model. If we are interested in learning something new from

the model and disentangle the preference heterogeneity further than a standard RPL allows, a

HCM is a promising option. This view is supported by Vij and Walker (2016, pp.212): “Unlike

simpler choice models, ICLV models provide a mathematical framework for testing and

29

applying complex theories of behaviour, and lend structure and meaning to underlying sources

of heterogeneity.” If model fit and predictive power are the goals, more simple models can be

a more adequate choice. However, so far most studies in environmental valuation investigate

taste heterogeneity and its drivers.

Finally, we would like to highlight the importance of the SEM principles in the use of

HCMs summarized in Section 2. Specifically, the third and fourth principles of elementary

scientific inference are not given sufficient attention by practitioners. The third is related to

the theoretical foundation of the latent variable and the necessity to apply exploratory

multivariate analysis to the attitudinal responses before including them in the model. Further,

the last principle, related to model validation, is almost completely missing in the recent choice

modelling literature. The present application uses classical approaches to gain deeper insight

into the performance of the applied complex models. In addition to the methodological issues

there is, finally, a very important issue with respect to the estimation procedure as we found

that it is very important to use various starting values as the HCMs can quickly end up in local

maxima.

All in all, both the RPL and the HCM allow for an understanding of some of the complex

impacts of the complexity variables on the structural coefficients, but the two models confirm

that a large portion of the unobserved heterogeneity remains unexplained. Future research

should find the missing variables or alternative model specifications that could reduce this

unobserved heterogeneity.

30

Acknowledgments

The authors acknowledge financial support from the Department of the Environment of the

Basque Government and from the Department of Education of the Basque Government

through grant IT-642-13 (UPV/EHU Econometrics Research Group), Spanish Ministry of

Economy and Competitiveness through grant ECO2014-52587-R as well as from the German

Federal Ministry of Education and Research Funding (Fkz. 033L029G; Fkz 01LL0909A). This

paper was partially written when Petr Mariel was visiting the Durham University Business

School. He gratefully acknowledges the support provided by the Basque Government

(Ikermugikortasuna 2016) for this stay.

31

References

Abou-Zeid, M., Ben-Akiva, M., Bierlaire, M., Choudhury, C., Hess, S., 2010. Attitudes and value

of time heterogeneity, in: Van de Voorde, E., Vanelslander, T. (Eds.), Applied Transport

Economics – A Management and Policy Perspective. De Boeck Publishing, Antwerp, pp. 523-

545.

Bartczak, A., Mariel, P., Chilton, S., Meyerhoff, J., 2015. The impact of latent risk preferences

on valuing preservation of threatened lynx populations in Poland, forthcoming in Australian

Journal of Agricultural and Resource Economics, doi: 10.1111/1467-8489.12123.

Ben-Akiva, M., McFadden, D., Train, K., Walker, J., Bhat, C., Bierlaire, M., Bolduc, D., Boersch-

Supan, A., Brownstone, D., Bunch, D.S., Daly, A., De Palma, A., Gopinath, D., Karlstrom, A.,

Munizaga, M.A., 2002. Hybrid choice models: Progress and challenges. Marketing Letters

13(3), 163-175.

Bentler, P.M., 1982. Linear systems with multiple levels and types of latent variables, in:

Jöreskog, K.G. and Wold, H. (Eds.), Systems Under Indirect Observation, North-Holland,

Amsterdam, pp 101-130.

Bierlaire, M., 2003. BIOGEME: A free package for the estimation of discrete choice models, in:

Chevroulet, T., Sevestre, A. (Eds.), Proc. 3rd Swiss Transportation Research Conf., 19-21 March

2003, Monte-Verita, Ascona, Switzerland.

Bierlaire, M., 2008. An Introduction to BIOGEME Version 1.7. Available at: biogeme.epfl.ch.

Bollen, K., 2002. Latent variables in psychology and the social sciences. Annual Review of

Psychology 53, 605-634.

32

Boxall, P., Moon, A., Adamowicz, W.L., 2009. Complexity in decision making: The effect of task

structure and context on participant response. Australian Journal of Agricultural and Resource

Economics 53(4), 503-519.

Campbell, D., Boeri, M., Doherty, E., Hutchinson, W. G., 2015. Learning, fatigue and preference

formation in discrete choice experiments. Journal of Economic Behaviour & Organization 119,

345-363.

Can, O., Alp E., 2012. Valuing of environmental improvements in a specially protected marine

area: a choice experiment approach in Göcek Bay, Turkey. Science of Total Environment 8,

439-291.

Chorus, C., Kroesen, M., 2014. On the (im-)possibility of deriving transport policy implications

from hybrid choice models. Transport Policy 36, 217-222.

Cliff, N., 1983. Some cautions concerning the application of causal modeling methods.

Multivariate Behavioral Research 18, 115-126.

Daly, A., Hess, S., Patruni, B., Potoglou, D., Rohr, C., 2012. Using ordered attitudinal indicators

in a latent variable choice model: A study of the impact of security on rail travel behaviour.

Transportation 39, 267-297. DOI 10.1007/s11116-011-9351-z.

Daly, A., Hess, S., & Train, K. (2012). Assuring finite moments for willingness to pay in random

coefficient models. Transportation, 39(1), 19–31. http://doi.org/10.1007/s11116-011-9331-3

Dekker, T., Hess, S., Hofkes, M., Brouwer, R., 2013. Hybrid choice models for decision

uncertainty: Implicitly or explicitly uncertain?. The Sebel Pier One Sydney. 3 - 5 July 2013. The

3rd International Choice Modelling Conference.

Dekker, T., Hess, S., Arentze, T. and Chorus, C., 2014. Incorporating needs-satisfaction in a

discrete choice model of leisure activities. Journal of Transport Geography, 38. 66 - 74

33

Doornik, J., 2001. Ox: An Object-Oriented Matrix Language. London. Timberlake Consultants

Press.

Farizo, B. A., Oglethorpe, D., Soliño, M., 2016. Personality traits and environmental choices: On

the search for understanding. Science of the Total Environment 566-567, 157-167.

Glerum, A., Atasoy, B., Bierlaire, M., 2014. Using semi-open questions to integrate perceptions

in choice models. The Journal of Choice Modelling 10, 11-33.

Grebitus, C., Lusk, J.L., Nayga Jr, R.M., 2013. Explaining differences in real and hypothetical

experimental auctions and choice experiments with personality. Journal of Economics

Psychology 26, 11-16.

Hambleton, R.K., Swaminathan, H., Rogers, H.J., 1991. Fundamentals of Item Response Theory.

Sage, Newbury Park, CA.

Hensher, D. A., & Greene, W. H. (2003). The mixed logit model: The state of practice.

Transportation, 30(2), 133–176. http://doi.org/10.1023/A:10225587153502

Hess, S., Beharry-Borg, N., 2012. Accounting for latent attitudes in willingness-to-pay studies:

The case of coastal water quality improvements in Tobago. Environmental and Resource

Economics 52(1), 109-131. DOI 10.1007/s10640-011-9522-6.

Hess, S., Shires, J., Jopson, A., 2013. Accommodating underlying pro-environmental attitudes in

a rail travel context: Application of a latent variable latent class specification, Transportation

Research Part D: Transport and Environment, 25 (December 2013), 42-48

Hoyos, D., Mariel, P., Hess, S., 2015. Incorporating environmental attitudes in discrete choice

models: An exploration of the utility of the awareness of consequences scale. Science of the

Total Environment 505, 1100-1111.

34

Justes, A., Barberán R., Farizo B., 2014. Economic valuation of domestic water uses. Science of

Total Environment 8, 472-712.

Kamargianni, M., Polydoropoulou, A., 2014. Development of a hybrid choice model to

investigate the effects of teenagers’ attitudes towards walking and cycling on mode choice

behavior. Transportation Research Record 2382, 151-161.

Kim, J., Rasouli, S., Timmermans, H., 2014. Expanding scope of hybrid choice models allowing

for mixture of social influences and latent attitudes: Application to intended purchase of

electric cars. Transportation Research Part A 69, 71-85. DOI 10.1016/j.tra.2014.08.016.

Kløjgaard, M.E., Hess, S., 2014. Understanding the formation and influence of attitudes in

patients’ treatment choices for lower back pain: Testing the benefits of a hybrid choice model

approach. Social Science & Medicine, DOI 10.1016/j.socscimed.2014.05.058.

Kovaleva, A., Beierlein, C., Kemper, C.J., Rammstedt, B., 2012. Eine Kurzskala zur Messung von

Impulsivität nach dem UPPS-Ansatz: Die Skala Impulsives-Verhalten-8 (I-8) (GESIS Working

Papers 2012|20). Köln: GESIS.

Lord, F.M., Novick, M.R., 1968. Statistical theories of mental test scores. Addison-Wesley,

Reading, MA.

Lundhede, T.H., Jacobsen, J.B., Hanley, N., Strange, N., Thorsen, B.J., 2015. Incorporating

outcome uncertainty and prior outcome beliefs in stated preferences. Land Economics 91, 296-

316.

MacCallum, R.C., Austin, J.T., 2000. Applications of structural equation modeling in

psychological research. Annual Review of Psychology 51, 201-226.

35

Mariel, P., Meyerhoff, J., Hess, S., 2015. Heterogeneous preferences toward landscape

externalities of wind turbines – combining choices and attitudes in a hybrid model. Renewable

and Sustainable Energy Reviews 41, 647-657.

Meyerhoff, J., Oehlmann, M., Weller, P., 2015. The influence of design dimensions on stated

choices in an environmental context. Environmental and Resource Economics 61(3), 385-407

Oehlmann, M., Weller, P, Meyerhoff, J. 2014. Complexity-induced Status Quo Effects in

Discrete Choice Experiments for Environmental Valutation. Paper presented at Annual

Conference 2014 (Hamburg): Evidence-based Economic Policy from German Economic

Association. https://www.econstor.eu/bitstream/10419/100616/1/VfS_2014_pid_615.pdf

Paulssen, M., Temme, D., Vij, A., Walker, J., 2014. Values, attitudes and travel behavior: A

hierarchical latent variable mixed logit model of travel mode choice. Transportation 41, 873-

888.

Prato, G.G., Bekhor, S., Pronello, C., 2012. Latent variables and route choice behavior.

Transportation 39(2), 299-319.

Rolfe, J., Bennett, J., 2009. The impact of offering two versus three alternatives in choice

modelling experiments. Ecological Economics 68, 1140-1148

Rotter, J.B., 1975. Some problems and misconceptions related to the construct of internal

versus external control of reinforcement. Journal of Consulting and Clinical Psychology 43, 56-

67. DOI 10.1037/h0076301.

Rungie, C., Coote, L.V., Louvieree, J.J., 2011. Structural choice modelling: Theory and

applications to combining choice experiments. Journal of Choice Modelling 4(3), 1-29.

Rungie, C., Coote, L.V., Louvieree, J.J., 2012. Latent variables in discrete choice experiments.

Journal of Choice Modelling 5(3), 145-156.

36

Scarpa, R., Rose, J.M., 2008. Design efficiency for non-market valuation with choice modelling:

How to measure it, what to report and why. The Australian Journal of Agricultural and

Resource Economics 52, 253-282.

Stern, P.C., 2000. Towards a coherent theory of environmentally significant behavior. Journal

of Social Issues 56, 407-424.

Vij, A., and Walker, J. L. (2016). How, when and why integrated choice and latent variable

models are latently useful. Transportation Research Part B: Methodological, 90, 192–217.

http://doi.org/10.1016/j.trb.2016.04.021

Walker, J., Li, J., Srinivasan, S., Bolduc, D., 2010. Travel demand models in the developing

world: Correcting for measurement errors. Transportation Letters 2, 231-243.

Zhang, J. and Adamowicz, W.L., 2011. Unraveling the choice format effect: A context-

dependent random utility model. Land Economics 87(4), 730-743.

37

Hybrid discrete choice models: gained insights versus increasing...

Documents

Transcript of Hybrid discrete choice models: gained insights versus increasing...