Incorporating contextual evidence to improve implicit discourse relation recognition in Chinese
Sheng XU, Peifeng LI, Qiaoming ZHU
Front. Comput. Sci., 2024, Vol. 18, Issue 3: 183312
Discourse analysis, which focuses on understanding the semantics of long text spans, has received increasing attention in recent years. As a critical component of discourse analysis, discourse relation recognition aims to identify the rhetorical relations between adjacent discourse units (e.g., clauses, sentences, and sentence groups), called arguments, in a document. Previous work focused on capturing the semantic interactions between arguments to recognize their discourse relations, ignoring important textual information in the surrounding context. However, in many cases, the semantic interactions captured from the texts of the two arguments alone are not enough to identify their rhetorical relation, and additional contextual clues must be mined. In this paper, we propose a method that converts the RST-style discourse trees in the training set into dependency-based trees and trains a contextual evidence selector on these transformed structures. In this way, the selector learns to automatically pick critical textual information from the context (i.e., evidence) for the arguments, which helps discriminate their relations. We then encode the arguments concatenated with their corresponding evidence to obtain enhanced argument representations. Finally, we combine the original and enhanced argument representations to recognize the relations between arguments. In addition, we introduce auxiliary tasks that guide the training of the evidence selector and strengthen its selection ability. Experimental results on the Chinese CDTB dataset show that our method outperforms several state-of-the-art baselines in both micro and macro F1 scores.
discourse parsing / discourse relation recognition / contextual evidence selection
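The abstract mentions converting RST-style discourse trees into dependency-based trees but does not give the conversion rule. The following is a minimal sketch of one common nuclearity-based convention, not necessarily the authors' procedure: the head of a span is taken to be the head of its leftmost nucleus child, and the heads of all other children attach to it. The `Node` class, the `to_dependencies` function, and the "N"/"S" labels are hypothetical names introduced only for this illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """A node in a constituency-style (RST-like) discourse tree.

    Leaves carry the index of an elementary discourse unit (EDU).
    Internal nodes carry children plus a parallel list of nuclearity
    labels: "N" for nucleus, "S" for satellite.
    """
    edu: Optional[int] = None
    children: List["Node"] = field(default_factory=list)
    nuclearity: List[str] = field(default_factory=list)


def to_dependencies(root: Node) -> Dict[int, int]:
    """Return a map {dependent EDU: head EDU}; the root EDU maps to -1.

    Assumed rule (an illustration, not taken from the paper): the head
    of a span is the head of its leftmost nucleus child, and the head
    of every other child depends on that span head.
    """
    heads: Dict[int, int] = {}

    def head_of(node: Node) -> int:
        if node.edu is not None:  # leaf: the EDU is its own head
            return node.edu
        child_heads = [head_of(child) for child in node.children]
        nucleus_pos = node.nuclearity.index("N")  # leftmost nucleus
        span_head = child_heads[nucleus_pos]
        for pos, child_head in enumerate(child_heads):
            if pos != nucleus_pos:
                heads[child_head] = span_head
        return span_head

    heads[head_of(root)] = -1
    return heads


# Toy tree: ((EDU0:N, EDU1:S):N, EDU2:S)
inner = Node(children=[Node(edu=0), Node(edu=1)], nuclearity=["N", "S"])
root = Node(children=[inner, Node(edu=2)], nuclearity=["N", "S"])
deps = to_dependencies(root)
print(deps)  # {1: 0, 2: 0, 0: -1}
```

In such a structure, the EDUs directly linked to an argument's head are natural candidates for contextual evidence, which is one plausible reason a selector might be trained on dependency-based trees rather than on the original constituency form.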
Higher Education Press