A Reinforcement Learning-Based Decision-Making Framework for Complex Industry Process
Yufei Zhang , Enjie Ma , Jie Hua , Zhongyuan Wang
In complex process industries, raw material prices and compositions can fluctuate beyond historical ranges, challenging traditional, expert-driven decision-making that relies heavily on past experience. This paper presents a novel reinforcement learning (RL) framework to address this issue and achieve dynamic, plant-wide economic optimization. We first develop a hybrid model that serves as a realistic simulation environment, effectively overcoming the limitations of sparse or out-of-distribution historical data and enabling safe policy exploration. We then formulate the plant’s operational optimization as a high-dimensional, continuous-action decision problem. To solve this, we propose the Asynchronous Three-Delay Deep Deterministic (A3D3) policy gradient algorithm, which offers greater adaptability to industrial settings. A3D3 improves training stability through asynchronous delayed updates and enhances the robustness of industrial optimization processes by incorporating noise learning and expert knowledge guidance. The proposed method was validated in an industrial alumina refinery. The operational strategy derived by A3D3 significantly outperformed the plan devised by scheduling experts, achieving a 3,167-ton increase in production, a 2% reduction in cost, and a 7.6% improvement in profitability. Comparative experiments further demonstrated that A3D3 converges faster and delivers higher economic benefits than classical reinforcement learning algorithms, while ablation studies validated the unique contributions of its core components.
Complex process industry / Production operation strategy / Hybrid model / Maximizing profit / Reinforcement learning / Alumina industry
Higher Education Press 2026