Achieving >97% on GSM8K: deeply understanding the problems makes LLMs better solvers for math word problems

Qihuang ZHONG , Kang WANG , Ziyang XU , Liang DING , Juhua LIU , Bo DU

Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (1) : 2001310

PDF (765KB)
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (1) : 2001310 DOI: 10.1007/s11704-025-41102-z
Artificial Intelligence
LETTER

Achieving >97% on GSM8K: deeply understanding the problems makes LLMs better solvers for math word problems

Author information +
History +
PDF (765KB)

Graphical abstract

Cite this article

Download citation ▾
Qihuang ZHONG, Kang WANG, Ziyang XU, Liang DING, Juhua LIU, Bo DU. Achieving >97% on GSM8K: deeply understanding the problems makes LLMs better solvers for math word problems. Front. Comput. Sci., 2026, 20(1): 2001310 DOI:10.1007/s11704-025-41102-z

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Wei J, Wang X, Schuurmans D, Bosma M, Ichter B, Xia F, Chi E H, Le Q V, Zhou D. Chain-of-thought prompting elicits reasoning in large language models. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. 2022, 1800

[2]

Wang L, Xu W, Lan Y, Hu Z, Lan Y, Lee R K W, Lim E P. Plan-and-Solve prompting: improving zero-shot chain-of-thought reasoning by large language models. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. 2023, 2609−2634

[3]

Kojima T, Gu S S, Reid M, Matsuo Y, Iwasawa Y. Large language models are zero-shot reasoners. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. 2022, 1613

[4]

Zhou D, Schärli N, Hou L, Wei J, Scales N, Wang X, Schuurmans D, Cui C, Bousquet O, Le Q V, Chi E H. Least-to-most prompting enables complex reasoning in large language models. In: Proceedings of the 11th International Conference on Learning Representations. 2023

[5]

Tian Q, Zhu H, Wang L, Li Y, Lan Y. R3 prompting: review, rephrase and resolve for chain-of-thought reasoning in large language models under noisy context. In: Proceedings of the Association for Computational Linguistics: EMNLP 2023. 2023, 1670−1685

[6]

Zhang Z, Zhang A, Li M, Smola A. Automatic chain of thought prompting in large language models. In: Proceedings of the 11th International Conference on Learning Representations. 2023

RIGHTS & PERMISSIONS

Higher Education Press

AI Summary AI Mindmap
PDF (765KB)

Supplementary files

Highlights

Supplementary materials

495

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/