PDF
Abstract
The detection of rare diseases utilizing advanced artificial intelligence (AI) techniques has garnered considerable attention in recent years. Numerous approaches have been proposed to detect diverse rare diseases by leveraging a range of medical data, including medical images, electronic health records, and sensory data. In order to safeguard the privacy of health data, considerable investigation has been undertaken on a novel learning paradigm known as federated learning, which has been applied to the domain of rare disease detection. Nonetheless, this nascent research direction remains in its infancy, necessitating greater scrutiny and attention. Within this survey, our primary focus lies in providing fresh perspectives, deliberating the challenges, and enumerating potential research directions concerning the application of federated learning techniques in rare disease detection. Furthermore, we provide a succinct summary of existing advancements using AI techniques for rare disease detection, as well as the utilization of federated learning within healthcare informatics. Moreover, we furnish a compilation of publicly available datasets that can be employed to validate novel federated learning algorithms for the purpose of detecting rare diseases.
Keywords
Rare disease detection
/
federated learning
/
AI for healthcare
Cite this article
Download citation ▾
Jiaqi Wang, Fenglong Ma.
Federated learning for rare disease detection: a survey.
Rare Disease and Orphan Drugs Journal, 2023, 2(4): 22 DOI:10.20517/rdodj.2023.16
| [1] |
Rare diseases at FDA. Available from: https://www.fda.gov/patients/rare-diseases-fda [Last accessed on 9 Oct 2023].
|
| [2] |
EU research on rare diseases. Available from: https://research-and-innovation.ec.europa.eu/research-area/health/rare-diseases_en [Last accessed on 9 Oct 2023].
|
| [3] |
What is a rare disease? Available from: https://www.eurordis.org/information-support/what-is-a-rare-disease/ [Last accessed on 9 Oct 2023].
|
| [4] |
Public health challenges of rare diseases. Available from: https://rarediseases.info.nih.gov/ [Last accessed on 9 Oct 2023]
|
| [5] |
Siegel RL,Wagle NS.Cancer statistics, 2023.CA Cancer J Clin2023;73:17-48
|
| [6] |
Walter FM,Mendonça SC.Symptoms and patient factors associated with diagnostic intervals for pancreatic cancer (SYMPTOM pancreatic study): a prospective cohort study.Lancet Gastroenterol Hepatol2016;1:298-306. PMCID:PMC6358142
|
| [7] |
Xing H,Wang Y.Diagnostic value of CA 19-9 and carcinoembryonic antigen for pancreatic cancer: a meta-analysis.Gastroenterol Res Pract2018;2018:8704751 PMCID:PMC6280291
|
| [8] |
Luo G,Deng S.Roles of CA19-9 in pancreatic cancer: biomarker, predictor and promoter.Biochim Biophys Acta Rev Cancer2021;1875:188409.
|
| [9] |
Liu Y,An S,Palmer R.3D face reconstruction with mobile phone cameras for rare disease diagnosis. In: Aziz H, Corrêa D, French T, editors. AI 2022: advances in artificial intelligence. Cham: Springer International Publishing; 2022. pp. 544-56.
|
| [10] |
Lam C,Huang L.Retinal lesion detection with deep learning using image patches.Invest Ophthalmol Vis Sci2018;59:590-6 PMCID:PMC5788045
|
| [11] |
Sharma S.A systematic study of intelligent face scanning in rare disease detection. In. Proceedings of 2nd International Conference on Artificial Intelligence: Advances and Applications: ICAIAA 2021: Springer; 2022, p. 451-62.
|
| [12] |
Soni H,Singh U.Identify rare disease patients from electronic health records through machine learning approach. In. 2018 International Conference on Inventive Research in Computing Applications (ICIRCA): IEEE; 2018, p. 1390-5.
|
| [13] |
Garcelon N,Salomon R.Electronic health records for the diagnosis of rare diseases.Kidney Int2020;97:676-86
|
| [14] |
Dong H,Zhang H.Ontology-driven and weakly supervised rare disease identification from clinical notes.BMC Med Inform Decis Mak2023;23:86. PMCID:PMC10162001
|
| [15] |
Cheng Y,Chen T.Federated learning for privacy-preserving AI.Commun ACM2020;63:33-6
|
| [16] |
Li T,Talwalkar A.Federated learning: challenges, methods, and future directions.IEEE Signal Proc Mag2020;37:50-60.
|
| [17] |
Kariampuzha WZ,Qu S.Precision information extraction for rare disease epidemiology at scale.J Transl Med2023;21:157 PMCID:PMC9972634
|
| [18] |
Regulation (EC) No 141/2000 of the European Parliament and of the Council of 16 December 1999 on orphan medicinal products. Official Journal L 2000;18:1-5. Available from: https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex%3A32000R0141 [Last accessed on 9 Oct 2023]
|
| [19] |
Derayeh S,Rabiei R,Moghaddasi H.National information system for rare diseases with an approach to data architecture: a systematic review.Intractable Rare Dis Res2018;7:156-63 PMCID:PMC6119672
|
| [20] |
Stoller JK.The challenge of rare diseases.Chest2018;153:1309-14.
|
| [21] |
Cui L,Glass LM,Sun J.CONAN: complementary pattern augmentation for rare disease detection.AAAI2020;34:614-21.
|
| [22] |
Li W,Cai Y,Zhao E.Semi-supervised rare disease detection using generative adversarial network.arXiv2018;181200547.
|
| [23] |
Yu K,Cai Y.Rare disease detection by sequence modeling with generative adversarial networks.arXiv2019;190701022
|
| [24] |
Yu K,Cai Y.Modelling patient sequences for rare disease detection with semi-supervised generative adversarial nets. In: Lemaire V, Malinowski S, Bagnall A, Bondu A, Guyet T, Tavenard R, editors. Advanced analytics and learning on temporal data. Cham: Springer International Publishing; 2020. pp. 141-50.
|
| [25] |
Arvaniti E.Sensitive detection of rare disease-associated cell subsets via representation learning.Nat Commun2017;8:14825 PMCID:PMC5384229
|
| [26] |
Jia X,Zhang J,Zhu Y.Few-shot radiology report generation for rare diseases. In. 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM): IEEE; 2020, p. 601-8.
|
| [27] |
Besson R,Allassonniere S.A model-based reinforcement learning approach for a rare disease diagnostic task.arXiv2018;181110112
|
| [28] |
Segura-Bedmar I,Guerrero-Aspizua S.Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts.BMC Bioinformatics2022;23:263 PMCID:PMC9258216
|
| [29] |
Prakash P,Ranade N.RareBERT: transformer architecture for rare disease patient identification using administrative claims.AAAI2021;35:453-60
|
| [30] |
Li X,Jin Y,Xing L.Difficulty-aware meta-learning for rare disease diagnosis. In: Martel AL, Abolmaesumi P, Stoyanov D, Mateus D, Zuluaga MA, Zhou SK, Racoceanu D, Joskowicz L, editors. Medical image computing and computer assisted intervention-MICCAI 2020. Cham: Springer International Publishing; 2020. pp. 357-66.
|
| [31] |
Lopes RR,Ramos LA.Improving electrocardiogram-based detection of rare genetic heart disease using transfer learning: an application to phospholamban p.Arg14del mutation carriers.Comput Biol Med2021;131:104262.
|
| [32] |
Taroni JN,Hu Q.MultiPLIER: a transfer learning framework for transcriptomics reveals systemic features of rare disease.Cell Syst2019;8:380-94.e4. PMCID:PMC6538307
|
| [33] |
Li J,Zhang M.Estimating rare disease incidences with large-scale internet search data: development and evaluation of a two-step machine learning method.JMIR Infodemiology2023;3:e42721 PMCID:PMC10182453
|
| [34] |
Spiga O,Fiorini C.Machine learning application for development of a data-driven predictive model able to investigate quality of life scores in a rare disease.Orphanet J Rare Dis2020;15:46 PMCID:PMC7017449
|
| [35] |
Lee J,Kim J.Deep learning for rare disease: a scoping review.J Biomed Inform2022;135:104227
|
| [36] |
Schaefer J,Schepers J,Thun S.The use of machine learning in rare diseases: a scoping review.Orphanet J Rare Dis2020;15:145 PMCID:PMC7285453
|
| [37] |
Banerjee J,Allaway RJ,Guinney J.Machine learning in rare disease.Nat Methods2023;20:803-14
|
| [38] |
Decherchi S,Mordenti M,Sangiorgi L.Opportunities and challenges for machine learning in rare diseases.Front Med2021;8:747612. PMCID:PMC8523988
|
| [39] |
Brasil S,Francisco R,Videira PA.Artificial intelligence (AI) in rare diseases: is the future brighter?.Genes2019;10:978. PMCID:PMC6947640
|
| [40] |
Groft SC,Taruscio D.Progress, challenges and global approaches to rare diseases.Acta Paediatr2021;110:2711-6
|
| [41] |
Brasil S,Rijoff T.Artificial intelligence in epigenetic studies: shedding light on rare diseases.Front Mol Biosci2021;8:648012 PMCID:PMC8131862
|
| [42] |
Hurvitz N,Kesler A.Establishing a second-generation artificial intelligence-based system for improving diagnosis, treatment, and monitoring of patients with rare diseases.Eur J Hum Genet2021;29:1485-90 PMCID:PMC8484657
|
| [43] |
Skweres-Kuchta M,Szaruga E.Literature review on health emigration in rare diseases-a machine learning perspective.Int J Environ Res Public Health2023;20:2483 PMCID:PMC9915846
|
| [44] |
Visibelli A,Spiga O.The impact of artificial intelligence in the odyssey of rare diseases.Biomedicines2023;11:887 PMCID:PMC10045927
|
| [45] |
Roman-Naranjo P,Lopez-Escamez JA.A systematic review on machine learning approaches in the diagnosis of rare genetic diseases.medRxiv2023;23285203
|
| [46] |
Hasani N,Morris MA.Artificial intelligence in medical imaging and its impact on the rare disease community: threats, challenges and opportunities.PET Clin2022;17:13-29 PMCID:PMC8764708
|
| [47] |
McMahan B,Ramage D,y Arcas BA.Communication-efficient learning of deep networks from decentralized data.PMLR2017;54:1273-82.
|
| [48] |
Wang J,Long Z,Xiao H.Knowledge-enhanced semi-supervised federated learning for aggregating heterogeneous lightweight clients in IoT. In: Shekhar S, Zhou Z, Chiang Y, Stiglic G, editors. Proceedings of the 2023 SIAM International Conference on Data Mining (SDM). Philadelphia: Society for Industrial and Applied Mathematics; 2023. pp. 496-504.
|
| [49] |
Tan Y,Liu L.FedProto: federated prototype learning across heterogeneous clients.AAAI2022;36:8432-40
|
| [50] |
Jere MS,Koushanfar F.A taxonomy of attacks on federated learning.IEEE Secur Privacy2021;19:20-8
|
| [51] |
Solanki S,Damle S.Differentially private federated combinatorial bandits with constraints. In: Amini M, Canu S, Fischer A, Guns T, Kralj Novak P, Tsoumakas G, editors. Machine learning and knowledge discovery in databases. Cham: Springer Nature Switzerland; 2023. pp. 620-37.
|
| [52] |
Lycklama H,Viand A,Hithnawi A.RoFL: robustness of secure federated learning. In. 2023 IEEE Symposium on Security and Privacy (SP): IEEE Computer Society; 2023, p. 453-76.
|
| [53] |
Lyu L,Wang Q.Collaborative Fairness in federated learning. In: Yang Q, Fan L, Yu H, editors. Federated learning. Cham: Springer International Publishing; 2020. pp. 189-204.
|
| [54] |
Wang J,Cui S,Ma F.Towards federated COVID-19 vaccine side effect prediction. In: Amini M, Canu S, Fischer A, Guns T, Kralj Novak P, Tsoumakas G, editors. Machine learning and knowledge discovery in databases. Cham: Springer Nature Switzerland; 2023. pp. 437-52.
|
| [55] |
Liu Y,Luo Y.FedVision: an online visual object detection platform powered by federated learning.AAAI2020;34:13172-9
|
| [56] |
Huang A,Chen T.StarFL: hybrid federated learning architecture for smart urban computing.ACM Trans Intell Syst Technol2021;12:1-23.
|
| [57] |
Long G,Jiang J.Federated learning for open banking. In: Yang Q, Fan L, Yu H, editors. Federated learning. Cham: Springer International Publishing; 2020. pp. 240-54.
|
| [58] |
IBM federated learning. Available from: https://www.ibm.com/docs/en/cloud-paks/cp-data/4.7.x?topic=models-federated-learning [Last accessed on 9 Oct 2023].
|
| [59] |
Federated learning on Google Cloud. Available from: https://cloud.google.com/architecture/federated-learning-google-cloud [Last accessed on 9 Oct 2023].
|
| [60] |
TensorFlow federated: machine learning on decentralized data. Available from: https://www.tensorflow.org/federated [Last accessed on 9 Oct 2023].
|
| [61] |
Dang TK,Weng J.Federated learning for electronic health records.ACM Trans Intell Syst Technol2022;13:1-17
|
| [62] |
Xu J,Su C,Bian J.Federated learning for healthcare informatics.J Healthc Inform Res2021;5:1-19 PMCID:PMC7659898
|
| [63] |
Antunes RS,Küderle A,Eskofier B.Federated learning for healthcare: systematic review and architecture proposal.ACM Trans Intell Syst Technol2022;13:1-23
|
| [64] |
Oh W.Federated learning in health care using structured medical data.Adv Kidney Dis Health2023;30:4-16 PMCID:PMC10208416
|
| [65] |
Pfitzner B,Arnrich B.Federated learning in a medical context: a systematic literature review.ACM Trans Internet Technol2021;21:1-31.
|
| [66] |
Rieke N,Li W.The future of digital health with federated learning.NPJ Digit Med2020;3:119 PMCID:PMC7490367
|
| [67] |
Reddy K, Gadekallu TR. A comprehensive survey on federated learning techniques for healthcare informatics.Comput Intell Neurosci2023;2023:8393990 PMCID:PMC9995203
|
| [68] |
Sheller MJ,Edwards B,Bakas S.Multi-institutional deep learning modeling without sharing patient data: a feasibility study on brain tumor segmentation. In: Crimi A, Bakas S, Kuijf H, Keyvan F, Reyes M, van Walsum T, editors. Brainlesion: glioma, multiple sclerosis, stroke and traumatic brain injuries. Cham: Springer International Publishing; 2019. pp. 92-104. PMCID:PMC6589345
|
| [69] |
Feki I,Kessentini Y.Federated learning for COVID-19 screening from Chest X-ray images.Appl Soft Comput2021;106:107330. PMCID:PMC7979273
|
| [70] |
Linardos A,Walsh S,Lekadir K.Federated learning for multi-center imaging diagnostics: a simulation study in cardiovascular disease.Sci Rep2022;12:3551 PMCID:PMC8894335
|
| [71] |
Vaid A,Xu J.Federated learning of electronic health records to improve mortality prediction in hospitalized patients with COVID-19: machine learning approach.JMIR Med Inform2021;9:e24207 PMCID:PMC7842859
|
| [72] |
Salim MM.Federated learning-based secure electronic health record sharing scheme in medical informatics.IEEE J Biomed Health Inform2023;27:617-24.
|
| [73] |
Islam SM, Kwak D, Kabir H, Hossain M, Kwak KS. The internet of things for health care: a comprehensive survey.IEEE Access2015;3:678-708
|
| [74] |
Elayan H,Guizani M.Sustainability of healthcare data analysis IoT-based systems using deep federated learning.IEEE Internet Things J2022;9:7338-46.
|
| [75] |
Wu Q,Zhou Z.FedHome: cloud-edge based personalized federated learning for in-home health monitoring.IEEE Trans on Mobile Comput2022;21:2818-32
|
| [76] |
Arikumar KS,Alazab M.FL-PMI: federated learning-based person movement identification through wearable devices in smart healthcare systems.Sensors2022;22:1377 PMCID:PMC8962969
|
| [77] |
Li J,Ma L.A federated learning based privacy-preserving smart healthcare system.IEEE T Ind Inform2021;18:2021-31
|
| [78] |
Kurniawan H.Homomorphic encryption-based federated privacy preservation for deep active learning.Entropy2022;24:1545 PMCID:PMC9689508
|
| [79] |
Alam T.Federated learning and its role in the privacy preservation of iot devices.Future Internet2022;14:246
|
| [80] |
Singh S,Alfarraj O,Yoon B.A framework for privacy-preservation of IoT healthcare data using federated learning and blockchain technology.Future Gener Comp Sy2022;129:380-8
|
| [81] |
Wu N,Yang X,Yan Z.FedIIC: towards robust federated learning for class-imbalanced medical image classification.arXiv2022;220613803
|
| [82] |
Pati S,Edwards B.Federated learning enables big data for rare cancer boundary detection.Nat Commun2022;13:7346
|
| [83] |
Chen B,Zeng X.DFML: dynamic federated meta-learning for rare disease prediction.IEEE/ACM Trans Comput Biol Bioinform2023;1-11.
|
| [84] |
Goto S,John JE.Multinational federated learning approach to train ecg and echocardiogram models for hypertrophic cardiomyopathy detection.Circulation2022;146:755-69 PMCID:PMC9439630
|
| [85] |
Lu MY,Kong D.Federated learning for computational pathology on gigapixel whole slide images.Med Image Anal2022;76:102298 PMCID:PMC9340569
|
| [86] |
Wang J. Rare disease detection with federated learning. Available from: https://github.com/JackqqWang/rare_disease_detection_fl/blob/main/main.md [Last accessed on 10 Oct 2023].
|
| [87] |
Thrun S.Lifelong learning algorithms. In: Thrun S, Pratt L, editors. Learning to learn. Boston: Springer US; 1998. pp. 181-209.
|
| [88] |
Fontenla-Romero Ó,Martinez-Rego D,Peteiro-Barral D.Online machine learning. In: editor^editors, editor. Efficiency and scalability methods for computational intellect:IGI global;2013.p.27-54.
|
| [89] |
Chen C,Ma F,Sun J.Unite: uncertainty-based health risk prediction leveraging multi-sourced data. In. Proceedings of the Web Conference 2021; 2021, p. 217-26.
|
| [90] |
The portal for rare diseases and orphan drugs. Available from: https://www.orpha.net/ [Last accessed on 9 Oct 2023].
|
| [91] |
An online catalog of human genes and genetic disorders. Available from: https://www.omim.org/ [Last accessed on 9 Oct 2023]
|
| [92] |
Ehrhart F,Kutmon M,Curfs LMG.A resource to explore the discovery of rare diseases and their causative genes.Sci Data2021;8:124 PMCID:PMC8096966
|