[1]ZHAO Zehua,LIANG Meiyu,XUE Zhe,et al.Client dynamic sampling optimization of efficient reinforcement federated learning based on data quality assessment[J].CAAI Transactions on Intelligent Systems,2024,19(6):1552-1561.[doi:10.11992/tis.202305054]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 6
Page number:
1552-1561
Column:
学术论文—人工智能基础
Public date:
2024-12-05
- Title:
-
Client dynamic sampling optimization of efficient reinforcement federated learning based on data quality assessment
- Author(s):
-
ZHAO Zehua; LIANG Meiyu; XUE Zhe; LI Ang; ZHANG Min
-
Beijing Key Laboratory of Intelligent Communication Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
-
- Keywords:
-
federated learning; deep reinforcement learning; client dynamic sampling; contribution index; data quality; communication efficiency; Shapley value; model accuracy
- CLC:
-
TP295
- DOI:
-
10.11992/tis.202305054
- Abstract:
-
Communication cost and efficiency are the key bottlenecks of federated learning due to the existence of system and statistical heterogeneities. Selecting only a subset of clients to perform model updates and aggregation can effectively reduce communication costs among numerous participants. However, biased selection and uneven distribution of data quality across clients pose additional challenges to client sampling methods. Therefore, this paper proposes a method for client dynamic sampling optimization in efficient reinforcement federated learning based on data quality assessment (RQCS) to address the aforementioned issues. This method evaluates data quality on clients using a contribution index based on the Shapley value and intelligently selects clients with high data quality for each round of federated learning. By leveraging reinforcement learning, the method aims to offset the bias introduced by uneven data quality distribution, accelerate model convergence, and improve model accuracy. Experiments on the MNIST and CIFAR-10 datasets show that the proposed algorithm not only reduces communication costs but also further accelerates convergence speed and achieves better performance in model accuracy compared to other algorithms.