Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Identification of Energy Hotspots: A Case Study of the Very Fast Decision Tree
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datalogi och datorsystemteknik.ORCID-id: 0000-0003-4973-9255
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datalogi och datorsystemteknik.
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datalogi och datorsystemteknik.ORCID-id: 0000-0001-9947-1088
2017 (engelsk)Inngår i: GPC 2017: Green, Pervasive, and Cloud Computing / [ed] Au M., Castiglione A., Choo KK., Palmieri F., Li KC., Cham, Switzerland: Springer, 2017, Vol. 10232, s. 267-281Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Large-scale data centers account for a significant share of the energy consumption in many countries. Machine learning technology requires intensive workloads and thus drives requirements for lots of power and cooling capacity in data centers. It is time to explore green machine learning. The aim of this paper is to profile a machine learning algorithm with respect to its energy consumption and to determine the causes behind this consumption. The first scalable machine learning algorithm able to handle large volumes of streaming data is the Very Fast Decision Tree (VFDT), which outputs competitive results in comparison to algorithms that analyze data from static datasets. Our objectives are to: (i) establish a methodology that profiles the energy consumption of decision trees at the function level, (ii) apply this methodology in an experiment to obtain the energy consumption of the VFDT, (iii) conduct a fine-grained analysis of the functions that consume most of the energy, providing an understanding of that consumption, (iv) analyze how different parameter settings can significantly reduce the energy consumption. The results show that by addressing the most energy intensive part of the VFDT, the energy consumption can be reduced up to a 74.3%.

sted, utgiver, år, opplag, sider
Cham, Switzerland: Springer, 2017. Vol. 10232, s. 267-281
Serie
Lecture Notes in Computer Science, ISSN 0302-9743
Emneord [en]
Machine learning, Big data, Very Fast Decision Tree, Green machine learning, Data mining, Data stream mining
HSV kategori
Identifikatorer
URN: urn:nbn:se:bth-15490DOI: 10.1007/978-3-319-57186-7_21ISI: 000426275000021ISBN: 978-3-319-57185-0 (tryckt)ISBN: 978-3-319-57186-7 (digital)OAI: oai:DiVA.org:bth-15490DiVA, id: diva2:1156958
Konferanse
International Conference on Green, Pervasive and Cloud Computing (GPC), Cetara, Amalfi Coast, Italy
Forskningsfinansiär
Knowledge Foundation, 20140032Tilgjengelig fra: 2017-11-14 Laget: 2017-11-14 Sist oppdatert: 2018-03-23bibliografisk kontrollert
Inngår i avhandling
1. Extraction and Energy Efficient Processing of Streaming Data
Åpne denne publikasjonen i ny fane eller vindu >>Extraction and Energy Efficient Processing of Streaming Data
2017 (engelsk)Licentiatavhandling, med artikler (Annet vitenskapelig)
Abstract [en]

The interest in machine learning algorithms is increasing, in parallel with the advancements in hardware and software required to mine large-scale datasets. Machine learning algorithms account for a significant amount of energy consumed in data centers, which impacts the global energy consumption. However, machine learning algorithms are optimized towards predictive performance and scalability. Algorithms with low energy consumption are necessary for embedded systems and other resource constrained devices; and desirable for platforms that require many computations, such as data centers. Data stream mining investigates how to process potentially infinite streams of data without the need to store all the data. This ability is particularly useful for companies that are generating data at a high rate, such as social networks.

This thesis investigates algorithms in the data stream mining domain from an energy efficiency perspective. The thesis comprises of two parts. The first part explores how to extract and analyze data from Twitter, with a pilot study that investigates a correlation between hashtags and followers. The second and main part investigates how energy is consumed and optimized in an online learning algorithm, suitable for data stream mining tasks.

The second part of the thesis focuses on analyzing, understanding, and reformulating the Very Fast Decision Tree (VFDT) algorithm, the original Hoeffding tree algorithm, into an energy efficient version. It presents three key contributions. First, it shows how energy varies in the VFDT from a high-level view by tuning different parameters. Second, it presents a methodology to identify energy bottlenecks in machine learning algorithms, by portraying the functions of the VFDT that consume the largest amount of energy. Third, it introduces dynamic parameter adaptation for Hoeffding trees, a method to dynamically adapt the parameters of Hoeffding trees to reduce their energy consumption. The results show an average energy reduction of 23% on the VFDT algorithm.

sted, utgiver, år, opplag, sider
Karlskrona: Blekinge Tekniska Högskola, 2017
Serie
Blekinge Institute of Technology Licentiate Dissertation Series, ISSN 1650-2140 ; 3
Emneord
machine learning, green computing, data mining, data stream mining, green machine learning
HSV kategori
Identifikatorer
urn:nbn:se:bth-15532 (URN)
Presentation
2017-12-18, J1640, Blekinge Tekniska Högskola, 371 79, Karlskrona, 13:00 (engelsk)
Opponent
Veileder
Prosjekter
Scalable resource-efficient systems for big data analytics
Forskningsfinansiär
Knowledge Foundation, 20140032
Tilgjengelig fra: 2017-11-22 Laget: 2017-11-22 Sist oppdatert: 2018-01-13bibliografisk kontrollert

Open Access i DiVA

fulltext(445 kB)275 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 445 kBChecksum SHA-512
3d6671840bda99d65280208d42e076a69c743dc40f8c0547beec9f8299d87bbad0cce811e9dfca814077c4d59b45df1a264e6ec95484b5017d96927fd18102c9
Type fulltextMimetype application/pdf

Andre lenker

Forlagets fulltekst

Personposter BETA

García Martín, EvaLavesson, NiklasGrahn, Håkan

Søk i DiVA

Av forfatter/redaktør
García Martín, EvaLavesson, NiklasGrahn, Håkan
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 275 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

doi
isbn
urn-nbn

Altmetric

doi
isbn
urn-nbn
Totalt: 278 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf