Open this publication in new window or tab >>
2017 (English) In: Technical Gazette, ISSN 1330-3651, E-ISSN 1848-6339, Vol. 24, no 5, p. 1597-1607Article in journal (Refereed) Published
Abstract [en] A huge amount of traffic data is archived which can be used in data mining especially supervised learning. However, it is not being fully used due to lack of accurate accident information (labels). In this study, we improve a Mahalanobis distance based algorithm to be able to handle differential data to estimate flow fluctuations and detect accidents and use it to support correcting and complementing accident information. The outlier detection algorithm provides accurate suggestions for accident occurring time, duration and direction. We also develop a system with interactive user interface to realize this procedure. There are three contributions for data handling. Firstly, we propose to use multi-metric traffic data instead of single metric for traffic outlier detection. Secondly, we present a practical method to organise traffic data and to evaluate the organisation for Mahalanobis distance. Thirdly, we describe a general method to modify Mahalanobis distance algorithms to be updatable. © 2017, Strojarski Facultet. All rights reserved.
Place, publisher, year, edition, pages
Strojarski Facultet, 2017
Keywords Accident data, Data labelling, Differential distance, Mahalanobis distance, Outlier detection, Traffic data, Updatable algorithm, Accidents, Data mining, Statistics, User interfaces, Mahalanobis distances, Data handling
National Category
Communication Systems Computer and Information Sciences
Identifiers urn:nbn:se:bth-15472 (URN) 10.17559/TV-20150616163905 (DOI) 000417100300037 () 2-s2.0-85032512786 (Scopus ID)
Note Funded by National Natural Science Foundation of China
Funding nr. 61364019
2017-11-102017-11-102023-12-28 Bibliographically approved