Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Modeling Evolving User Behavior via Sequential Clustering
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datavetenskap.
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datavetenskap.ORCID-id: 0000-0001-7199-8080
2019 (Engelska)Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

In this paper we address the problem of modeling the evolution of clusters over time by applying sequential clustering. We propose a sequential partitioning algorithm that can be applied for grouping distinct snapshots of streaming data so that a clustering model is built on each data snapshot. The algorithm is initialized by a clustering solution built on available historical data. Then a new clustering solution is generated on each data snapshot by applying a partitioning algorithm seeded with the centroids of the clustering model obtained at the previous time interval. At each step the algorithm also conducts model adapting operations in order to reflect the evolution in the clustering structure. In that way, it enables to deal with both incremental and dynamic aspects of modeling evolving behavior problems. In addition, the proposed approach is able to trace back evolution through the detection of clusters' transitions, such as splits and merges. We have illustrated and initially evaluated our ideas on household electricity consumption data. The results have shown that the proposed sequential clustering algorithm is robust to modeling evolving behavior by being enable to mine changes and update the model, respectively.

Ort, förlag, år, upplaga, sidor
Springer, 2019.
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:bth-18666OAI: oai:DiVA.org:bth-18666DiVA, id: diva2:1352327
Konferens
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECMLPKDD, Würzburg,16th to the 20th of September
Anmärkning

In progress of being published in Springer Lecture Notes.

Tillgänglig från: 2019-09-18 Skapad: 2019-09-18 Senast uppdaterad: 2019-11-18Bibliografiskt granskad
Ingår i avhandling
1. Data-Driven Techniques for Modeling and Analysis of User Behavior
Öppna denna publikation i ny flik eller fönster >>Data-Driven Techniques for Modeling and Analysis of User Behavior
2019 (Engelska)Licentiatavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Our society is becoming more digitalized for each day. Now, we are able to gather data from individual users with higher resolution than ever. With the increased amount of data on an individual user level, we can analyze their behavior. This is of interest in many different domains, for example service providers wanting to improve their service for their customers. If they know how their service is used, they have more insight in how they can improve. But, it also imposes additional difficulties. When we reach the individual user, the irregularities in the regular behavior makes it harder to model the normal behavior.

In this thesis, we explore data-driven techniques to model and analyze user behaviors. We aim to evaluate existing as well as develop novel technologies to identify approaches that are suitable for use on an individual user level. We use both supervised and unsupervised learning methods to model the user behavior and evaluate the approaches on real world electricity consumption data.

Firstly, we analyze household electricity consumption data and investigate the use of regression to model the household's behavior. We identify consumption trends, how data granularity affects modeling, and we show that regression is a viable approach to model user behavior. Secondly, we use clustering analysis to profile individual households in terms of their electricity consumption. We compare two dissimilarity measures, how they affect the clustering analysis, and we investigate how the produced clustering solutions differ. Thirdly, we propose a sequential clustering algorithm to model evolving user behavior. We evaluate the proposed algorithm on electricity consumption data and show how the produced model can be used to identify and trace changes in the user's behavior. The algorithm is robust to evolving behaviors and handles both dynamic and incremental aspects of streaming data.

Ort, förlag, år, upplaga, sidor
Karlskrona: Blekinge Tekniska Högskola, 2019
Serie
Blekinge Institute of Technology Licentiate Dissertation Series, ISSN 1650-2140 ; 15
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:bth-18667 (URN)978-91-7295-391-8 (ISBN)
Handledare
Tillgänglig från: 2019-11-15 Skapad: 2019-09-18 Senast uppdaterad: 2019-12-18Bibliografiskt granskad

Open Access i DiVA

fulltext(745 kB)12 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 745 kBChecksumma SHA-512
b438a7714c2ddd3a57b0cbcaf1c3886e9e830d5d82bf8fd1f7fd144b3ed10dd401e4a60912f6a376c2da7e1014d3fa8b9cee331a832e3696f1bdba401f517900
Typ fulltextMimetyp application/pdf

Sök vidare i DiVA

Av författaren/redaktören
Boeva, VeselkaNordahl, Christian
Av organisationen
Institutionen för datavetenskap
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 12 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 230 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf