Clustering and profiling traffic roads by means of accident data.

Author(s)
Brijs, T., Guerts, C., Vanhoof, K. & Wets, G.
Year
Abstract

Identifying geographical locations with high accident risk by means of clustering techniques, and profiling them in terms of accident-related data and location characteristics by means of data mining techniques, should provide valuable input for government actions towards traffic safety. In the first part of this research, an innovative method based on latent class clustering (also called model-based clustering or finite mixture modelling) is used to cluster traffic roads into distinct groups based on their similar accident frequencies. The data are obtained from the Belgian "Analysis Form for Traffic Accidents", which should be filled out by a police officer for each traffic accident on a public road in Belgium that results in killed or seriously injured casualties. More specifically, the analysis focuses on 19 central roads of the city of Hasselt for three consecutive time periods: 1992-1994, 1995-1997 and 1998-2000. The observed accident frequencies are assumed to originate from a mixture of density distributions for which the parameters of the distributions and the number and size of the segments are unknown. The objective of latent class clustering is to 'unmix' the distributions and to find the optimal distribution parameters and the number and size of the segments, given the underlying data. The development and use of the model are described. In the second part of this study, the data mining technique of association rules is used to profile each cluster of traffic roads in terms of the available traffic accident data. The strength of this approach lies in identifying relevant variables that contribute strongly to a better understanding of the accident circumstances for each group of traffic roads.
Since the clusters show different results for the overall accident 'risk' on the roads, one could expect that not every accident variable will be of equal importance when describing the different groups of traffic roads. Therefore, a comparative analysis between the accident characteristics of the different clusters is conducted, which provides new insights into the complexity and causes of road accidents. For the covering abstract see ITRD E126595.
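
The 'unmixing' step described in the abstract can be illustrated with a minimal sketch. It is not the authors' model: it assumes each road has a single accident count, that each segment follows a Poisson distribution (the abstract does not name the distribution), and that the number of segments is chosen by the Bayesian information criterion. The counts are invented for illustration, and all function names are hypothetical.

```python
import math

def poisson_logpmf(k, rate):
    """Log probability of observing count k under a Poisson with the given rate."""
    return k * math.log(rate) - rate - math.lgamma(k + 1)

def component_logs(k, weights, rates):
    """Log of (segment size * segment density) for every segment."""
    return [math.log(w) + poisson_logpmf(k, r) for w, r in zip(weights, rates)]

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v)))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def fit_poisson_mixture(counts, n_segments, n_iter=200):
    """Fit a Poisson mixture with EM; returns (weights, rates, bic)."""
    n = len(counts)
    ordered = sorted(counts)
    # Start the rates at evenly spaced quantiles of the data.
    rates = [max(ordered[int((j + 0.5) * n / n_segments)], 0.5)
             for j in range(n_segments)]
    weights = [1.0 / n_segments] * n_segments
    for _ in range(n_iter):
        # E-step: posterior probability that each road belongs to each segment.
        resp = []
        for k in counts:
            logs = component_logs(k, weights, rates)
            total = log_sum_exp(logs)
            resp.append([math.exp(v - total) for v in logs])
        # M-step: update segment sizes and Poisson rates.
        for j in range(n_segments):
            mass = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = mass / n
            rates[j] = max(sum(r[j] * k for r, k in zip(resp, counts)) / mass, 1e-6)
    loglik = sum(log_sum_exp(component_logs(k, weights, rates)) for k in counts)
    n_params = 2 * n_segments - 1  # rates plus free segment sizes
    bic = -2.0 * loglik + n_params * math.log(n)
    return weights, rates, bic

def assign(counts, weights, rates):
    """Most probable segment index for each road."""
    labels = []
    for k in counts:
        logs = component_logs(k, weights, rates)
        labels.append(logs.index(max(logs)))
    return labels

# Invented accident counts for 19 roads (illustrative only, not the Hasselt data).
counts = [1, 2, 1, 0, 2, 1, 3, 1, 2, 9, 11, 10, 12, 8, 10, 9, 11, 13, 10]
fits = {k: fit_poisson_mixture(counts, k) for k in (1, 2, 3)}
best_k = min(fits, key=lambda k: fits[k][2])  # lowest BIC wins
best_weights, best_rates, _ = fits[best_k]
labels = assign(counts, best_weights, best_rates)
print(best_k, best_rates, labels)
```

The study itself covers three time periods per road, so a faithful model would treat each road as a vector of counts; the univariate version above only shows the mechanics of estimating segment sizes, segment parameters and the number of segments from the data.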

Publication

Library number
C 34656 (In: C 33295 CD-ROM) /80 /82 / ITRD E127550
Source

In: Proceedings of the European Transport Conference ETC, Strasbourg, France, 8-10 October 2003, 16 p.
