
A clinical trial termination prediction model based on denoising autoencoder and deep survival regression
Huamei Qi, Wenhui Yang, Wenqin Zou, Yuxuan Hu
Quant. Biol. ›› 2024, Vol. 12 ›› Issue (2) : 205-214.
A clinical trial termination prediction model based on denoising autoencoder and deep survival regression
Effective clinical trials are necessary for understanding medical advances but early termination of trials can result in unnecessary waste of resources. Survival models can be used to predict survival probabilities in such trials. However, survival data from clinical trials are sparse, and DeepSurv cannot accurately capture their effective features, making the models weak in generalization and decreasing their prediction accuracy. In this paper, we propose a survival prediction model for clinical trial completion based on the combination of denoising autoencoder (DAE) and DeepSurv models. The DAE is used to obtain a robust representation of features by breaking the loop of raw features after autoencoder training, and then the robust features are provided to DeepSurv as input for training. The clinical trial dataset for training the model was obtained from the ClinicalTrials.gov dataset. A study of clinical trial completion in pregnant women was conducted in response to the fact that many current clinical trials exclude pregnant women. The experimental results showed that the denoising autoencoder and deep survival regression (DAE-DSR) model was able to extract meaningful and robust features for survival analysis; the C-index of the training and test datasets were 0.74 and 0.75 respectively. Compared with the Cox proportional hazards model and DeepSurv model, the survival analysis curves obtained by using DAE-DSR model had more prominent features, and the model was more robust and performed better in actual prediction.
clinical trials / denoising autoencoder / DeepSurv / experimental termination / survival analysis
[1] |
Friedman LM , Furberg CD , DeMets DL , Reboussin DM , Granger CB . Fundamentals of clinical trialss. 5th ed Berlin: Springer; 2015.
CrossRef
Google scholar
|
[2] |
Chen TT . Predicting analysis times in randomized clinical trials with cancer immunotherapy. BMC Med Res Methodol. 2016; 16 (1): 1- 10.
CrossRef
Google scholar
|
[3] |
Geletta S , Follett L , Laugerman M . Latent Dirichlet Allocation in predicting clinical trial terminations. BMC Med Inf Decis Making. 2019; 19 (1): 1- 12.
CrossRef
Google scholar
|
[4] |
Follett L , Geletta S , Laugerman M . Quantifying risk associated with clinical trial termination: a text mining approach. Inf Process Manag. 2019; 56 (3): 516- 25.
CrossRef
Google scholar
|
[5] |
Lupattelli A , Spigset O , Twigg MJ , Zagorodnikova K , Mårdby AC , Moretti ME , et al. Medication use in pregnancy: a cross-sectional, multinational web-based study. BMJ Open. 2014; 4 (2): e004365.
CrossRef
Google scholar
|
[6] |
Shields KE , Lyerly AD . Exclusion of pregnant women from industry-sponsored clinical trials. Obstet Gynecol. 2013; 122 (5): 1077- 81.
CrossRef
Google scholar
|
[7] |
Cox DR . Partial likelihood. Biometrika. 1975; 62 (2): 269- 762.
CrossRef
Google scholar
|
[8] |
Grønnesby JK , Borgan Ø . A method for checking regression models in survival analysis based on the risk score. Lifetime Data Anal. 1996; 2 (4): 315- 28.
CrossRef
Google scholar
|
[9] |
Kim B , Jang YJ , Cho HR , Kim SY , Jeong JE , Shim MK , et al. Predicting completion of clinical trials in pregnant women: Cox proportional hazard and neural network models. Clinical and Translational Science. 2022; 15 (3): 691- 9.
CrossRef
Google scholar
|
[10] |
Elkin ME , Zhu X . Understanding and predicting COVID-19 clinical trial completion vs. cessation. PLoS One. 2021; 16 (7): e0253789.
CrossRef
Google scholar
|
[11] |
Elkin ME , Zhu X . Predictive modeling of clinical trial terminations using feature engineering and embedding learning. Sci Rep. 2021; 11 (1): 3446.
CrossRef
Google scholar
|
[12] |
Wang J , Xie X , Shi J , He W , Chen Q , Chen L , et al. Denoising autoencoder, a deep learning algorithm, aids the identification of a novel molecular signature of lung adenocarcinoma. Dev Reprod Biol. 2020; 18 (4): 468- 80.
CrossRef
Google scholar
|
[13] |
Vincent P , Larochelle H , Lajoie I , Bengio Y , Manzagol PA , Bottou L . Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. Journal of machine learning research. 2010; 11 (12).
|
[14] |
Xiao N , Qiang Y , Bilal Zia M , Wang S , Lian J . Ensemble classification for predicting the malignancy level of pulmonary nodules on chest computed tomography images. Oncol Lett. 2020; 20 (1): 401- 8.
CrossRef
Google scholar
|
[15] |
Atlam M , Torkey H , El-Fishawy N , Salem H . Coronavirus disease 2019 (COVID-19): survival analysis using deep learning and Cox regression model. Pattern Anal Appl. 2021; 24 (3): 993- 1005.
CrossRef
Google scholar
|
[16] |
Guo LY , Wu AH , Wang YX , Zhang LP , Chai H , Liang XF . Deep learning-based ovarian cancer subtypes identification using multi-omics data. BioData Min. 2020; 13 (1): 1- 12.
CrossRef
Google scholar
|
[17] |
Liu Q , Hu P . Association analysis of deep genomic features extracted by denoising autoencoders in breast cancer. Cancers. 2019; 11 (4): 494.
CrossRef
Google scholar
|
[18] |
Katzman JL , Shaham U , Cloninger A , Bates J , Jiang T , Kluger Y . DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med Res Methodol. 2018; 18 (1): 1- 12.
CrossRef
Google scholar
|
[19] |
Vincent P , Larochelle H , Bengio Y , Manzagol PA . Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on Machine learning; 2008. p. 1096- 103.
CrossRef
Google scholar
|
[20] |
Zhou G , Sohn K , Lee H . Online incremental feature learning with denoising autoencoders. In: Artificial intelligence and statistics. PMLR; 2012. p. 1453- 61.
|
[21] |
Chai H , Zhou X , Zhang Z , Rao J , Yang Y . Integrating multi-omics data through deep learning for accurate cancer prognosis prediction. Comput Biol Med. 2021; 134: 104481.
CrossRef
Google scholar
|
[22] |
Zhong Y , Jia S , Hu Y . Denoising auto-encoder network combined classfication module for brain tumors detection. In: 2022 3rd IWECAI. IEEE; 2022. p. 540- 3.
CrossRef
Google scholar
|
[23] |
International ethical guidelines for biomedical research involving human subjects . Council for international Organizations of medical Sciences. Bull Med Ethics. 2002; 182: 17- 23.
|
[24] |
US Food and Drug Administration . Pregnant women: scientific and ethical considerations for inclusion in clinical trials guidance for industry; 2021.
|
[25] |
Antolini L , Boracchi P , Biganzoli E . A time-dependent discrimination index for survival data. Stat Med. 2005; 24 (24): 3927- 44.
CrossRef
Google scholar
|
[26] |
The pandas development team . pandas-dev/pandas: Pandas; 2020.
|
[27] |
Lemaître G , Nogueira F , Aridas CK . Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res. 2017; 18 (1): 559- 63.
|
[28] |
Bello GA , Dawes TJ , Duan J , Biffi C , de Marvao A , Howard LSGE , et al. Deep-learning cardiac motion analysis for human survival prediction. Nat Mach Intell. 2019; 1 (2): 95- 104.
CrossRef
Google scholar
|
[29] |
Harrell FE , Califf RM , Pryor DB , Lee KL , Rosati RA . Evaluating the yield of medical tests. JAMA. 1982; 247 (18): 2543- 6.
CrossRef
Google scholar
|
[30] |
Graf E , Schmoor C , Sauerbrei W , Schumacher M . Assessment and comparison of prognostic classification schemes for survival data. Stat Med. 1999; 18 (17-18): 2529- 45.
CrossRef
Google scholar
|
/
〈 |
|
〉 |