Open this publication in new window or tab >>Show others...
2018 (English)In: 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA) / [ed] Wani M.A.,Sayed-Mouchaweh M.,Lughofer E.,Gama J.,Kantardzic M., IEEE, 2018, p. 1123-1130, article id 8614207Conference paper, Published paper (Refereed)
Abstract [en]
Outlier detection has been studied in many domains. Outliers arise due to different reasons such as mechanical issues, fraudulent behavior, and human error. In this paper, we propose an unsupervised approach for outlier detection in a sequence dataset. The proposed approach combines sequential pattern mining, cluster analysis, and a minimum spanning tree algorithm in order to identify clusters of outliers. Initially, the sequential pattern mining is used to extract frequent sequential patterns. Next, the extracted patterns are clustered into groups of similar patterns. Finally, the minimum spanning tree algorithm is used to find groups of outliers. The proposed approach has been evaluated on two different real datasets, i.e., smart meter data and video session data. The obtained results have shown that our approach can be applied to narrow down the space of events to a set of potential outliers and facilitate domain experts in further analysis and identification of system level issues.
Place, publisher, year, edition, pages
IEEE, 2018
Keywords
Clustering, Minimum spanning tree, Outlier detection, Sequential pattern mining
National Category
Computer Sciences
Identifiers
urn:nbn:se:bth-17100 (URN)10.1109/ICMLA.2018.00182 (DOI)000463034400174 ()9781538668047 (ISBN)
Conference
17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018; Orlando; United States; 17 December 2018 through 20 December
Funder
Knowledge Foundation, 20140032
2018-10-092018-10-092021-07-26Bibliographically approved