Confounding between taste heterogeneity and error structure in discrete choice models.

Abstract

The past decade has seen substantial advances in the specification flexibility and estimation technology associated with discrete choice. In particular, it is now possible to accommodate both flexible patterns of inter-alternative correlation and random taste heterogeneity amongst agents. Although methods now exist to accommodate these two phenomena simultaneously (e.g. GEV mixture structures), it is still commonplace for these two phenomena to be considered separately. At the conceptual level, such a separationis entirely reasonable, however, in practice the distinction is by no means so clear-cut, and there is a significant risk of confounding. As an example, the random variations in the sensitivity to public transport fares will lead to correlation between the utility functions of various public transport alternatives, such that, in a misspecified model, this taste heterogeneity may be misrepresented as simple inter-alternative correlation such as that caused by the presence of shared unobserved attributes. The theoretical part of the paper demonstrates how confounding between inter-alternative correlation and random inter-agent taste variations can arise, showing how allowing for only one of the two phenomena can produce results that are biased by the presence of the unmodelled effect. Although, in some cases, it is possible for the wrongly specified model to attain similar model fit, the risk of misinterpretation of findings persists. This relates to the implied cross-elasticities as well as to implications in terms of behaviour in the tails of the population. It can be seen that two possible scenarios arise in which the issue of confounding can play a major role. The first case is one where, in the true model, only one of the two phenomena plays a role, but where the estimated model allows only for the other phenomenon to have an effect. Here, the effects of the unexplained phenomenon can lead to erroneous results showing an effect of the other phenomenon.In the second scenario, both phenomena play a role, but the model employed in estimation allows only for the presence of either of the two phenomena. Here, the presence of the second, unexplained phenomenon, can lead to biased estimates in relation to the other phenomenon. The applied part of the paper presents six separate case studies using simulated data, representing the various possible situations in which confounding can play a role.The first two case studies illustrate how the presence of unexplained inter-alternative correlation can lead to erroneous results with regards to the prevalence of random taste heterogeneity. The following three case studies illustrate the converse, showing how the presence of unexplained random taste heterogeneity can lead to erroneous results with regards to the presence of inter-alternative correlation. The final case study shows that, in the case where both phenomena play a role, not accounting for the effect of one of the two phenomena can lead to biased results in relation to the second phenomenon. Each of the six case studies also shows how, by usingmodels allowing jointly for the effects of the two phenomena, the risk ofconfounding is much reduced, although, in some cases, minor issues with confounding can still exist even with the use of such models. The next partof the paper illustrates the impact of the biased results in terms of model forecasts. As such, the use of models affected by confounding can lead to biased forecasts of market shares, which can in turn lead to misguided policy decisions. While the issue with misleading forecasts arises especially in the case of incorrect results in relation to inter-alternative correlation, problems are also caused in the case of incorrect results in terms of random taste heterogeneity, for example by giving an inadequate account of variations in willingness-to-pay measures across individuals, which can lead to major problems in cost-benefit analyses. The results discussedin this paper offer strong evidence that modellers should acknowledge thepotential risk of confounding, especially given the lack of a priori knowledge as to the true nature of the error structure. While testing separately for the two phenomena can alert the modeller to the relative performance of the two approaches, it does not remove the risk of biased findings. As such, the findings from this paper suggest that modellers should always allow for the effects of both phenomena in a joint fashion, either in a GEV mixture structure, or with the help of a combined ECL-RCL formulation. For the covering abstract see ITRD E135582.

Confounding between taste heterogeneity and error structure in discrete choice models.

Request publication

Publication

Our collection