SKALE: An Interpretable Multiscale Machine Learning Model for Decoding Phase-Specific Protein Aggregation in Neurodegenerative Proteinopathies
Wei Xuan Wilson Loo, Jia Shen Sio, Keyin Yap, Yan Shan Loo, Hui Xuan Lim, Shuangyue Zhang, Huitao Liu, Chen Seng Ng
Aggregate, 2026, Vol. 7, Issue 2: e70280
Protein aggregation drives proteinopathies ranging from ALS to systemic amyloidosis, yet the multiscale determinants bridging sequence, structure, and kinetics remain elusive. We present SKALE, an interpretable machine learning framework that integrates sequence motifs, AlphaFold-derived structural descriptors, and experimental kinetics to decode aggregation mechanisms. SKALE identifies latent hotspots that evade conventional tools and matches high-performing neural baselines while preserving computational efficiency. In ALS-linked SOD1 G86R, the model isolates a risk region at residues 72–91 where preserved β-sheet geometry coincides with weakened hydrogen bonding to drive nucleation. Similarly, analysis of TDP-43 S332N reveals that a locally unwound helix increases surface exposure, a prediction validated by showing that targeted deletion of model-identified regions significantly reduces cellular aggregation. The framework generalizes to Tau P301L and PRNP variants where it uncovers distal aggregation-prone regions to discriminate pathogenic drivers from neutral mutations. Interpretability analysis further disentangles global from mutation-local mechanisms to reveal that β-sheet propensity acts as a shared determinant while hydrogen bond dynamics define specific routes to nucleation. These findings establish SKALE as a scalable, disease-agnostic engine that combines high-fidelity prediction with biophysical resolution to decode the molecular logic of misfolding and guide therapeutic design.
amyotrophic lateral sclerosis / machine learning / protein aggregation / superoxide dismutase 1 / TAR DNA-binding protein 43
2026 The Author(s). Aggregate published by SCUT, AIEI, and John Wiley & Sons Australia, Ltd.