FedA4: Federated Learning with Anti-Bias Aggregation and TrAjectory-Based Adaptation
Guanyi Zhao, Juntao Hu, Zhengjie Yang, Dapeng Oliver Wu
Transactions on Artificial Intelligence, 2026, Vol. 2, Issue 1: 26-38.
Non-Independent and Identically Distributed (Non-IID) data pose a fundamental challenge in Federated Learning (FL). Such data usually cause severe client drift, in which client model updates point in divergent directions, and thus degrade the performance of the global model. Existing methods typically address this by assigning appropriate weights to client models or by optimizing model update directions. However, these methods overlook the trends in client model updates: they focus solely on the final client models aggregated at the server in each communication round and ignore the optimization trajectories, which may contain richer information to aid convergence. To address this issue, we propose FedA4, a novel FL framework with Anti-bias Aggregation and trAjectory-based Adaptation, which leverages clients’ optimization trajectories rather than only their final model snapshots. For anti-bias aggregation, we observe a phenomenon termed model collapse, in which biased clients tend to predict any input as one of the dominant classes in their own datasets; based on this observation, we quantify class dominance and analyze the level of client drift for each client. Specifically, we evaluate a prediction entropy, termed concentration, to assign an optimal weight to each client in each training round. To further mitigate the negative effect of clients with high levels of client drift (biased clients), we develop a gradient adaptation mechanism, termed trajectory-based adaptation, which analyzes clients’ trajectories to correct each client’s contribution to the aggregated global model. Extensive experiments on CIFAR-10, CIFAR-100, STL-10, and Fashion-MNIST demonstrate that FedA4 significantly outperforms state-of-the-art baselines, particularly in scenarios with extreme data heterogeneity (a high degree of Non-IID).
federated learning / non-IID data / client drift / anti-bias aggregation / optimization trajectory
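The abstract describes anti-bias aggregation only in outline: a collapsed client predicts almost everything as its dominant classes, and a prediction entropy is used to set its aggregation weight. It does not give the exact definition of concentration or the weighting rule, so the following is a minimal sketch under stated assumptions, not the authors' method. It assumes each client's model is evaluated on some shared probe inputs, that the score is the normalized Shannon entropy of the predicted-label histogram, and that weights are proportional to that score. The function names `prediction_entropy`, `anti_bias_weights`, and `aggregate` are hypothetical.

```python
import math
from collections import Counter


def prediction_entropy(predicted_labels, num_classes):
    """Normalized Shannon entropy of a predicted-label histogram, in [0, 1].

    Assumption: a low value means predictions concentrate on a few classes,
    which the abstract associates with a biased (collapsed) client.
    """
    if num_classes < 2 or not predicted_labels:
        raise ValueError("need at least two classes and one prediction")
    total = len(predicted_labels)
    entropy = 0.0
    for count in Counter(predicted_labels).values():
        p = count / total
        entropy -= p * math.log(p)
    return entropy / math.log(num_classes)


def anti_bias_weights(client_predictions, num_classes, eps=1e-8):
    """Aggregation weights proportional to each client's prediction entropy.

    Assumption: proportional weighting; eps keeps the sum non-zero when
    every client has fully collapsed.
    """
    scores = [prediction_entropy(p, num_classes) + eps for p in client_predictions]
    total = sum(scores)
    return [s / total for s in scores]


def aggregate(client_models, weights):
    """Weighted average of client models given as flat parameter lists."""
    dim = len(client_models[0])
    return [sum(w * m[i] for w, m in zip(weights, client_models)) for i in range(dim)]


# Toy example: one client spreads predictions over 4 classes, one has collapsed.
balanced = [0, 1, 2, 3]
collapsed = [2, 2, 2, 2]
weights = anti_bias_weights([balanced, collapsed], num_classes=4)
global_model = aggregate([[1.0, 1.0], [5.0, 5.0]], weights)
```

In this toy case the collapsed client receives a weight close to zero, so the aggregated parameters stay near those of the balanced client. The trajectory-based adaptation step is not sketched here because the abstract does not specify how trajectories are compared or how contributions are corrected.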