DTIBFAI: drug-target interaction prediction based on BERT and feature augment of Informer
Naichao WANG , Yihe DIWU , Mingchen FENG , Yuchen ZHANG , Xiujuan LEI
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (7) : 2007904
DTIBFAI: drug-target interaction prediction based on BERT and feature augment of Informer
Drug-target interaction (DTI) is critical for drug discovery, providing insights into novel therapies. The development of language models has provided strong support for DTI prediction. However, Transformer-based self-attention mechanisms often fail to capture fine-grained drug-target interactions, and single-mode feature representations limit the ability to fully characterize DTI. To address these issues, this study proposes a novel prediction method DTIBFAI based on BERT and Informer model. DTIBFAI preprocesses drug and protein sequences using ChemBERTa and BioBERT, while incorporating molecular fingerprints and dipeptide composition features to augment the richness and representativeness of the features. Additionally, this study integrates a modified Informer for the DTI prediction problem. The modified Informer augments feature embeddings and effectively captures the complex interaction patterns within sequence data. The performance of the DTIBFAI model is evaluated by comparing it with several state-of-the-art methods for drug-target interaction prediction. Experimental results demonstrate that DTIBFAI significantly outperforms these methods on the evaluated datasets, achieving AUROC and AUPRC scores of 0.9661 and 0.9673, respectively. Case studies reveal the model’s ability to identify novel DTI, including previously unrecorded interactions, validated for their biological plausibility. These results demonstrate the potential of DTIBFAI in advancing DTI prediction and its application in drug discovery. The code and data of DTIBFAI are available at the website of github.com/KNDF001/DTIBFAI-Drug-target-Interaction-Prediction-Based-on-BERT-and-Feature-Augment-of-Informer.
DTI prediction / BioBERT / ChemBERTa / multi-feature fusion / Informer
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Leskovec J, Sosič R. SNAP: A General Purpose Network Analysis and Graph Mining Library. ACM Transactions on Intelligent Systems and Technology, 2016, 8(1): 1 |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
Sennrich R, Haddow B, Birch A. Neural machine translation of rare words with subword units. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 2016, 1715–1725 |
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
Markus A Lill, Matthew L Danielson. Computer-aided drug design platform using PyMOL. Journal of Computer-Aided Molecular Design, 2011, 25(1): 13−19 |
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
Higher Education Press
/
| 〈 |
|
〉 |