Structured Learning in Biological Domain
Canh Hao Nguyen
Journal of Systems Science and Systems Engineering ›› 2020, Vol. 29 ›› Issue (4) : 440 -453.
Biological domain has been blessed with more and more data from biotechnologies as well as data integration tools. In the renaissance of machine learning and artificial intelligence, there is so much promise of data-driven biological knowledge discovery. However, it is not straight forward due to the complexity of the domain knowledge hidden in the data. At any level, be it atoms, molecules, cells or organisms, there are rich interdependencies among biological components. Machine learning approaches in this domain usually involves analyzing interdependency structures encoded in graphs and related formalisms. In this report, we review our work in developing new Machine Learning methods for these applications with improved performances in comparison with state-of-the-art methods. We show how the networks among biological components can be used to predict properties.
Structured learning / sparse modeling / systems biology / deep learning
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
Gilmer J et al. (2017). Neural message passing for quantum chemistry. Proceedings of the 34th International Conference on Machine Learning(PMLR): 1263–1272. Sydney, Australia. |
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
Nguyen DH, Nguyen CH, Mamitsuka H (2018). SIMPLE: Sparse interaction model over peaks of MoLEcules for fast, interpretable metabolite identification from tandem mass spectra. Bioinformatics: Proceedings of the 26th International Conference on Intelligent Systems for Molecular Biology (ISMB 2018): i323-i332. |
| [21] |
Nguyen DH, Nguyen CH, Mamitsuka, H (2019). ADAPTIVE: learning Data-dependent, concise molecular VEctors for fast, accurate metabolite identification from tandem mass spectra. Bioinformatics 35: Proceedings of the 26th International Conference on Intelligent Systems for Molecular Biology (ISMB 2019): i164-i172. |
| [22] |
|
| [23] |
|
| [24] |
Smola AJ, Kondor RI (2003). Kernels and regularization on graphs. In Proceedings of Conference on Learning Theory: 144–158. |
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
Zhu X, Ghahramani Z, Lafferty J (2003). Semi-supervised learning using Gaussian fields and harmonic functions. The 20th International Conference on Machine Learning (ICML): 912–919. |
/
| 〈 |
|
〉 |