Handling Data Heterogeneity in Federated Learning via Knowledge Fusion

Sep 27, 2023·
Xu Zhou
,
Xinyu Lei
,
Cong Yang
,
Yichun Shi
,
Xiao Zhang
Jingwen Shi
Jingwen Shi
· 0 min read
Image credit:
Abstract
Federated learning (FL) supports distributed training of a global machine learning model across multiple Internet of Things (IoT) devices with the help of a central server. However, data heterogeneity across different IoT devices leads to the client model drift issue and results in model performance degradation and poor model fairness. To address the issue, we design federated learning with global–local knowledge fusion (FedKF) scheme in this article. The key idea in FedKF is to let the server return the global knowledge to be fused with the local knowledge in each training round so that the local model can be regularized toward the global optima. Therefore, the client model drift issue can be mitigated. In FedKF, we first propose the active–inactive model aggregation technique that supports a precise global knowledge representation. Then, we propose a data-free knowledge distillation (KD) approach to enable each client model to learn the global knowledge (embedded in the global model) while each client model can still learn the local knowledge (embedded in the local data set) simultaneously, thereby realizing the global–local knowledge fusion process. The theoretical analysis and intensive experiments demonstrate the superiority of FedKF over previous solutions.
Type
Publication
*IEEE Internet of Things Journal