Abstract
Federated Learning (FL) allows healthcare organizations to train models using diverse datasets while maintaining patient confidentiality collaboratively. While promising, FL faces challenges in optimizing model accuracy and communication efficiency. To address these, we propose an algorithm that combines feature selection with synthetic data generation, specifically targeting medical datasets. Our method eliminates irrelevant local features, identifies globally relevant ones, and uses synthetic data to initialize model parameters, improving convergence. It also employs a zero-trust model, ensuring that data remain on local devices and only learned weights are shared with the central server, enhancing security. The algorithm improves accuracy and computational efficiency, achieving communication efficiency gains of 4 to 14 through backward elimination and threshold variation techniques. Tested on a federated diabetic dataset, the approach demonstrates significant improvements in the performance and trustworthiness of FL systems for medical applications.
| Original language | English |
|---|---|
| Pages (from-to) | 2126-2140 |
| Number of pages | 15 |
| Journal | IEEE Journal on Selected Areas in Communications |
| Volume | 43 |
| Issue number | 6 |
| DOIs | |
| Publication status | Published - 2025 |
Keywords
- Federated learning
- embedded
- feature selection
- filter
- medical data
- privacy
- synthetic data
- wrapper
- zero-trust
ASJC Scopus subject areas
- Computer Networks and Communications
- Electrical and Electronic Engineering