Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
FedCO: Communication-Efficient Federated Learning via Clustering Optimization †
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0001-6061-0861
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0003-3128-191x
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0002-3118-5058
2022 (English)In: Future Internet, E-ISSN 1999-5903, Vol. 14, no 12, article id 377Article in journal (Refereed) Published
Abstract [en]

Federated Learning (FL) provides a promising solution for preserving privacy in learning shared models on distributed devices without sharing local data on a central server. However, most existing work shows that FL incurs high communication costs. To address this challenge, we propose a clustering-based federated solution, entitled Federated Learning via Clustering Optimization (FedCO), which optimizes model aggregation and reduces communication costs. In order to reduce the communication costs, we first divide the participating workers into groups based on the similarity of their model parameters and then select only one representative, the best performing worker, from each group to communicate with the central server. Then, in each successive round, we apply the Silhouette validation technique to check whether each representative is still made tight with its current cluster. If not, the representative is either moved into a more appropriate cluster or forms a cluster singleton. Finally, we use split optimization to update and improve the whole clustering solution. The updated clustering is used to select new cluster representatives. In that way, the proposed FedCO approach updates clusters by repeatedly evaluating and splitting clusters if doing so is necessary to improve the workers’ partitioning. The potential of the proposed method is demonstrated on publicly available datasets and LEAF datasets under the IID and Non-IID data distribution settings. The experimental results indicate that our proposed FedCO approach is superior to the state-of-the-art FL approaches, i.e., FedAvg, FedProx, and CMFL, in reducing communication costs and achieving a better accuracy in both the IID and Non-IID cases. © 2022 by the authors.

Place, publisher, year, edition, pages
MDPI, 2022. Vol. 14, no 12, article id 377
Keywords [en]
clustering, communication efficiency, convolutional neural network, federated learning, Internet of Things, Convolutional neural networks, Cost reduction, Learning systems, Privacy-preserving techniques, Central servers, Clustering optimizations, Clusterings, Communication cost, Optimization approach, Shared model, Workers'
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-24176DOI: 10.3390/fi14120377ISI: 000901037100001Scopus ID: 2-s2.0-85144590253OAI: oai:DiVA.org:bth-24176DiVA, id: diva2:1725283
Note

open access

Available from: 2023-01-10 Created: 2023-01-10 Last updated: 2024-04-05Bibliographically approved
In thesis
1. Resource-Aware and Personalized Federated Learning via Clustering Analysis
Open this publication in new window or tab >>Resource-Aware and Personalized Federated Learning via Clustering Analysis
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Today’s advancement in Artificial Intelligence (AI) enables training Machine Learning (ML) models on the daily-produced data by connected edge devices. To make the most of the data stored on the device, conventional ML approaches require gathering all individual data sets and transferring them to a central location to train a common model. However, centralizing data incurs significant costs related to communication, network resource utilization, high volume of traffic, and privacy issues. To address the aforementioned challenges, Federated Learning (FL) is employed as a novel approach to train a shared model on decentralized edge devices while preserving privacy. Despite the significant potential of FL, it still requires considerable resources such as time, computational power, energy, and bandwidth availability. More importantly, the computational capabilities of the training devices may vary over time. Furthermore, the devices involved in the training process of FL may have distinct training datasets that differ in terms of their size and distribution. As a result of this, the convergence of the FL models may become unstable and slow. These differences can influence the FL process and ultimately lead to suboptimal model performance within a heterogeneous federated network.

In this thesis, we have tackled several of the aforementioned challenges. Initially, a FL algorithm is proposed that utilizes cluster analysis to address the problem of communication overhead. This issue poses a major bottleneck in FL, particularly for complex models, large-scale applications, and frequent updates. The next research conducted in this thesis involved extending the previous study to include wireless networks (WNs). In WSNs, achieving energy-efficient transmission is a significant challenge due to their limited resources. This has motivated us to continue with a comprehensive overview and classification of the latest advancements in context-aware edge-based AI models, with a specific emphasis on sensor networks. The review has also investigated the associated challenges and motivations for adopting AI techniques, along with an evaluation of current areas of research that need further investigation. To optimize the aggregation of the FL model and alleviate communication expenses, the initial study addressing communication overhead is extended to include a FL-based cluster optimization approach. Furthermore, to reduce the detrimental effect caused by data heterogeneity among edge devices on FL, a new study of group-personalized FL models has been conducted. Finally, taking inspiration from the previously mentioned FL models, techniques for assessing clients' contribution by monitoring and evaluating their behavior during training are proposed. In comparison with the most existing contribution evaluation solutions, the proposed techniques do not require significant computational resources.

The FL algorithms presented in this thesis are assessed on a range of real-world datasets. The extensive experiments demonstrated that the proposed FL techniques are effective and robust. These techniques improve communication efficiency, resource utilization, model convergence speed, and aggregation efficiency, and also reduce data heterogeneity when compared to other state-of-the-art methods.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Tekniska Högskola, 2024. p. 260
Series
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 2024:04
Keywords
Federated Learning, Clustering Analysis, Eccentricity Analysis, Non- IID Data, Model Personalization
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:bth-26081 (URN)978-91-7295-478-6 (ISBN)
Public defence
2024-05-17, C413A, Karlskrona, 10:00 (English)
Opponent
Supervisors
Available from: 2024-04-05 Created: 2024-04-05 Last updated: 2024-04-22Bibliographically approved

Open Access in DiVA

fulltext(2836 kB)484 downloads
File information
File name FULLTEXT01.pdfFile size 2836 kBChecksum SHA-512
d7b4c2ded5a960f3a477aac61aa621889e1af5086fbf6f4af70bb0949069d7a41ed3a63e6b57d5c61f3e1323d3471e7c817245c2124bfe6ae38377dd2fe4156a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Al-Saedi, Ahmed Abbas MohsinBoeva, VeselkaCasalicchio, Emiliano

Search in DiVA

By author/editor
Al-Saedi, Ahmed Abbas MohsinBoeva, VeselkaCasalicchio, Emiliano
By organisation
Department of Computer Science
In the same journal
Future Internet
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 484 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 645 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf