Modeling Consumers’ Choice of Multiple Items Simultaneously: A Methodological Approach with an Application to Vehicle Holdings and Use : Modeling Consumers’ Choice of Multiple Items Simultaneously: A Methodological Approach with an Application to Vehicle Holdings and Use
Chandra Bhat
The University of Texas at Austin
Northwestern University, January 30, 2007
Overview : Overview Introduction
Functional form of utility function
Stochastic form of utility function
Specific model structures
Empirical illustration
Conclusions
Slide3 : Introduction Several consumer demand choices are characterized by multiple discreteness Vehicle type holdings and usage
Activity type choice and duration of participation
Airline fleet mix and usage
Carrier choice and transaction level
Brand choice and purchase quantity
Stock choice and investment amount
Slide4 : Multiple discreteness
Choice of multiple alternatives simultaneously
Modeling methodologies of multiple discrete situations
Traditional random utility-based (RUM) single discrete choice models
Number of composite alternatives explodes with the number of elemental alternatives
Multivariate probit (logit) methods
Not based on a rigorous underlying utility-maximizing framework of multiple discreteness
Other issues with these methods
Cannot accommodate the diminishing marginal returns (i.e., satiation) in the consumption of an alternative
Cumbersome to include a continuous dimension of choice
Slide5 : Modeling methodologies of multiple discrete situations
Two alternative methods proposed by Wales and Woodland (1983)
Amemiya-Tobin approach
Kuhn-Tucker approach
Both approaches assume a direct utility function U(x) that is assumed to be quasi-concave, increasing, and continuously differentiable with respect to the consumption quantity vector x
Approaches differ in how stochasticity, non-negativity of consumption, and corner solutions (i.e., zero consumption of some goods) are accommodated
Slide6 : Methods proposed by Wales and Woodland
Amemiya-Tobin approach
Extension of the classic microeconomic approach of adding normally distributed stochastic terms to the budget-constrained utility-maximizing share equations
Direct utility function U(x) assumed to be deterministic by the analyst, and stochasticity is introduced post-utility maximization
Kuhn-Tucker (KT) approach
Based on the Kuhn Tucker or KT (1951) first-order conditions for constrained random utility maximization
Employs a direct stochastic specification by assuming the utility function U(x) to be random (from the analyst’s perspective) over the population
Derives the consumption vector for the random utility specification subject to the linear budget constraint by using the KT conditions for constrained optimization
Stochastic nature of the consumption vector in the KT approach is based fundamentally on the stochastic nature of the utility function
Slide7 : Advantages of KT approach
Constitutes a more theoretically unified and consistent framework for dealing with multiple discreteness consumption patterns
Satisfies all the restrictions of utility theory
Stochastic KT first-order conditions provide the basis for deriving the probabilities for each possible combination of corner solutions (zero consumption) for some goods and interior solutions (strictly positive consumption) for other goods
Accommodates for the singularity imposed by the “adding-up” constraint
Problems with KT approach used by Wade and Woodland
Random utility distribution assumptions lead to a complicated likelihood function that entails multi-dimensional integration
Slide8 : Studies that used the KT approach for multiple discreteness
Kim et al. (2002)
Used the GHK simulator to evaluate the multivariate normal integral appearing in the likelihood function in the KT approach
Used a generalized variant of the well-known translated constant elasticity of substitution (CES) direct utility function
Not realistic for practical applications and is unnecessarily complicated
Bhat (2005)
Introduced a simple and parsimonious econometric approach to handle multiple discreteness
Based on the generalized variant of the translated CES utility function but with a multiplicative log-extreme value error term
Labeled as the multiple discrete-continuous extreme value (MDCEV) model
MDCEV model represents the multinomial logit (MNL) form-equivalent for multiple discrete-continuous choice analysis and collapses exactly to the MNL in the case that each (and every) decision-maker chooses only one alternative
Several studies in the environmental economics field
Phaneuf et al., 2000; von Haefen et al., 2004; von Haefen, 2003a; von Haefen, 2004; von Haefen and Phaneuf, 2005; Phaneuf and Smith, 2005
Used variants of the linear expenditure system (LES) and the translated CES for the utility functions, and used multiplicative log-extreme value errors
Functional form of utility function : Functional form of utility function
is a quasi-concave, increasing, and continuously differentiable function with respect to the consumption quantity vector x
, and are parameters associated with good k
Assumptions : Assumptions Additive separability
All the goods are strictly Hicksian substitutes
Marginal utility with respect to any good is independent of the level of consumption of other goods
Weak complementarity
Slide11 : Role of
: baseline (at zero consumption) marginal utility
: marginal rate of substitution at zero consumption
Higher baseline implies less likelihood of a corner solution for good k
Slide12 : Role of
Slide13 : Indifference Curves
Indifference Curves Corresponding to Different Values of
Slide14 : Role of
Effect of Value on Good k’s Subutility Function Profile
Slide15 : Role of
Effect of Value on Good k’s Subutility Function Profile
Slide16 : Empirical identification issues associated with utility form
Alternative Profiles for Moderate Satiation Effects with Low Value and High Value
Slide17 : Empirical identification issues associated with utility form Alternative Profiles for Moderate Satiation Effects with High Value and Low Value
Slide18 : Empirical identification issues associated with utility form Alternative Profiles for Low Satiation Effects with High Value and High Value
Slide19 : Empirical identification issues associated with utility form Alternative Profiles for High Satiation Effects with Low Value and Low Value
Stochastic form of utility function : Stochastic form of utility function Overall random utility function Random utility function for optimal expenditure allocations
Lagrangian and KT Conditions : Lagrangian and KT Conditions
Slide22 : KT conditions if (k = 2, 3,…, K) if (k = 2, 3,…, K), where (k = 1, 2, 3,…, K)
Slide23 : General econometric model structure and identification where J is the Jacobian whose elements are given by: ; i, h = 1, 2, …, M – 1
Specific Model Structures : Specific Model Structures The MDCEV model structure Can be derived from the differenced form also
Slide25 : The MDCEV model structure
Probability of the consumption pattern of the goods (rather than the expenditure pattern) is where
Slide26 : The MDCGEV model
Generalized Extreme Value error structure
A nested logit example with four alternatives
Start with the following expression for density function A B 1 2 3 4
Slide27 : The MDCGEV model (nested logit example)
Slide28 : The Mixed MDCEV model
Slide29 : The Mixed MDCEV model - Heteroscedastic structure
The heteroscedastic structure may be specified in the form of the following covariance matrix for :
Two ways to proceed with a normalization
Normalize and estimate the heteroscedastic covariance matrix of
Normalize one of the terms instead of the term
Slide30 : The Mixed MDCEV model - General error covariance structure
Allows correlation in unobserved factors influencing the baseline utility of alternatives
Requires appropriate identification normalizations to be placed on and the covariance matrix of
One way to achieve identification in the most general error covariance structure, and when there is price variation
Normalize the scale parameter to be a small value such that the variance of the minimum variance alternative exceeds
Normalize for the minimum variance alternative k to zero
Normalize all correlations of this minimum variance alternative with other alternatives to zero
These normalizations leave only parameters to be estimated, and are adequate for identification
Empirical Analysis : Empirical Analysis Increasing dependence on automobiles
Wide-ranging impacts of automobile dependency
Household level
Community level
Regional level
Slide32 : A widely used indicator of automobile dependency is vehicle holdings and use
92% of US households owned at least one motorized vehicle in 2003 (compared to 80% in the early 1970s)
Household VMT has increased 300% between 1997-2001 relative to a population increase of 30% during same period
Important to examine vehicle holdings and usage
Travel demand forecasting
Transportation policy analysis
Slide33 : Examine several dimensions of household vehicle holdings and usage decisions
Number of vehicles owned
Vehicle body type
Vehicle age (i.e., vintage)
Vehicle make and model
Vehicle usage
Slide34 : Incorporate a comprehensive set of determinants of vehicle holdings and usage decisions
Household demographics
Individual characteristics
Vehicle characteristics
Built environment characteristics
Develop a comprehensive econometric model to analyze the many dimensions of vehicle holdings and use that accommodates for
Multiple discreteness
Satiation effects
Data : Data 2000 San Francisco Bay Area travel survey (BATS)
Designed and administered by MORPACE International Inc.
2-day survey of 15000 households
Information on vehicle fleet mix of households, individual and household socio-demographics, individual characteristics and activity episodes
Data on vehicle make/model attributes from secondary data sources
Consumer Guides
EPA Fuel Economy Guide
Land use/Demographic coverage data from MTC of San Francisco Bay area
GIS layer of bicycle facilities from MTC of San Francisco Bay area
Census 2000 Tiger files
Sample Characteristics : Sample Characteristics Final sample: 8107 households
10 motorized vehicle types
Coupe
Mini/Subcompact Sedan
Compact Sedan
Mid-size Sedan
Large Sedan
Hatchback/Station Wagon
Sports Utility Vehicle (SUV)
Pickup Truck
Minivan
Van
2 vintages considered for each motorized vehicle type
New vehicles (age of the vehicle less than or equal than 5 years)
Old Vehicles (age of the vehicle is more than 5 years )
Twenty-one vehicle types/vintages studied including
20 motorized vehicle type/vintages
Non-motorized form of transportation
Slide37 : Classification of Vehicle type/vintage
Slide38 : Distribution of Vehicles
Slide39 : Descriptive Statistics of Vehicle Type/Vintage Holdings
Empirical Results : Empirical Results Variables considered
Household socio-demographics
Household income, presence of children in the household, presence of a senior adult in the household, household size and number of employed people in the household
Household location attributes
Area type variables (central business district, urban zone, suburban zone and rural zone), residential density and employment density variables
Built environment characteristics of the residential neighborhood
Percentages and absolute values of acreage in residential, commercial/industrial, and other land-use categories; fractions and number of single family and multi-family dwelling units, and fractions and number of households living in single family and multi-family dwelling units, bikeway density, street block density, highway density
Characteristics of the household head
Age (classified into less than 30 years of age, 31 to 45 years of age and greater than 45 years of age), gender and ethnicity (primarily, Caucasian, African-American, Hispanic, Asian and Other)
Vehicle Characteristics
Purchase price, fuel cost, seating capacity, luggage volume, engine size, number of cylinders, front headroom space, front legroom space, rear headroom space, rear legroom space, standard payload capacity (for pickup trucks only), wheelbase, length, height, width, horse power, vehicle weight, type of fuel used, amount of greenhouse gas emissions (tons/year), types of drive wheels, type of vehicle make
Slide41 : MDCEV model – Effects of Household Demographics
Medium income (35-90K) and high income (>90K) households have a high baseline preference for new SUVs as compared to low-income households and a low preference for old vans
High income households have a lower baseline preference for old vehicles compared to low/middle income households
High income households less likely to undertake activities using non-motorized forms of transportation
Households with very small children (less than 4 years of age) are more likely to use compact sedans, mid-size sedans, and SUVs than other households
Households with kids between 5 and 15 years of age have a high baseline preference for minivans than other households
Households with senior adults (greater than 65 years) are more likely to use compact, mid-size, and large sedans relative to coupes and subcompact sedans
As the size of the household increases, the household is more likely to use mid-size sedans, large sedans, station wagons, SUVs, pickup trucks, minivans and vans
Household with more number of employed members have a high baseline preference for new vehicle types such as subcompact sedans and compact sedans while a low baseline preference for large sedans and minivans
Slide42 : MDCEV model – Effects of Household Location Characteristics
Households residing in the suburban zones are less likely to own and use old vehicles relative to households in urban zones
Households residing in the suburban and rural zones are more likely to own and use pickup trucks relative to urban households
MDCEV model – Effects of Built Environment Characteristics of the Residential Neighborhood
Households located in highly residential/commercial areas are less likely to prefer large vehicle types such as pickup trucks and vans, irrespective of the age of the vehicle
Households located in a neighborhood with high bike lane density have a high baseline preference for non-motorized modes of transportation
Households located in a neighborhood with high street block density are more likely to prefer smaller vehicle types (such as subcompact and compact sedans), and older vehicles, relative to new vehicles
Slide43 : MDCEV model – Characteristics of the Household Head
Older households (i.e., households whose heads are greater than 30 years) are generally more likely to own vehicles of an older vintage compared to younger households (i.e., households whose heads are less than or equal to 30 years of age)
Older households are more likely to own minivans and old vans, and travel by non-motorized forms of transportation
Households have higher baseline preference for older and larger vehicles if the male is the oldest member (or only adult) in the household relative to households with the female being the oldest member (or only adult)
Asians more likely to own sedans and new minivans, and less likely to own pickup trucks, than other races.
MDCEV model – Random Error Components/Coefficients
Households preferring old coupes due to unobserved factors also prefer new coupes
Intangible unobserved factors that affect utilities of all old vehicles
Slide44 : MNL model for Vehicle Make/Model Choice
Slide45 : Satiation Effects
All the satiation parameters are very significantly different from 1
Middle and High income households are more likely to get satiated with the increasing use of any vehicle type/vintage compared to low income households
Low income households are least likely to get satiated with the increasing use of old subcompact sedans, new and old compact sedans, and old midsize sedans
Satiation effect is highest for non-motorized mode of transportation compared to all vehicle type/vintage categories
Logsum Parameters
Indicate the presence of common unobserved attributes that affect the utilities of all makes/models corresponding to old SUV, old minivan, new minivan, old van, and new van vehicle type/vintage categories
Application of the Model : Application of the Model
Slide47 : Directions for further research
Accommodating more than one constraint in the utility maximization problem (for example, recognizing both time and money constraints in activity type choice and duration models)
Incorporating latent consideration sets in a theoretically appropriate way within the MDCEV structure
Using more flexible utility structures that can handle both complementarity as well as substitution among goods, and that do not impose the constraints of additive separability
Model with an Outside Good : Model with an Outside Good
Comparison With Earlier Multiple Discrete-Continuous Models : Comparison With Earlier Multiple Discrete-Continuous Models Kim et al.’s model
Slide50 : Models in environmental economics
Three Good Case: Probability of Choice of Only Outside Good : Three Good Case: Probability of Choice of Only Outside Good
Slide54 : Utility of outside good is assumed to be deterministic (i.e., )
A three-good example
Empirical Illustrations : Empirical Illustrations Specifications for the “No Outside Good” case with no price variables
Slide56 : Specifications for the “No Outside Good” case with price variables
Slide57 : Specifications for case with outside good and with price variables
Slide58 : Specifications for case with outside good and with price variables (continued)
Conclusions : Conclusions Several issues associated with extant KT multiple discrete-continuous models were examined
A new utility function form was proposed that enables clarity in the role of each parameter in the utility specification
Identification considerations associated with the utility specification was presented and the MDCEV model was extended to the case of price variation across goods and to general error covariance structures
The relationship between earlier KT-based multiple discrete-continuous models was discussed
Several technical nuances and identification considerations of the multiple discrete-continuous model structure was illustrated through empirical examples
Slide60 : ?
Slide61 : General econometric model structure and identification KT conditions for optimal expenditure for this modified utility function
can be shown to be: if (k = 2, 3,…, K) if (k = 2, 3,…, K), where
Slide62 : Research objectives
Reformulate the utility specification used in earlier studies
Present identification considerations related to both the functional form as well as the stochastic nature of the utility specification
Derive the MDCEV model expression for the case when there is price variation across goods and extend the MDCEV model to accommodate generalized extreme value (GEV)-based and other correlation structures
Discuss the relationship between the models of Kim et al. (2002), the KT formulations used in Environmental Economics, and the MDCEV formulation
Illustrate the technical issues related to the properties and identification of the MDCEV model through empirical illustrations