Safe integration of Large Language Models into industrial process control: a multi-agent architecture with P&ID-grounded validation
Daniel Schall
Autonomous Intelligent Systems ›› 2026, Vol. 6 ›› Issue (1) : 14
Large Language Models (LLMs) offer powerful reasoning capabilities for industrial process control, yet their non-deterministic nature, susceptibility to hallucination, and lack of intrinsic physical understanding make direct deployment in safety-critical environments unacceptable. This paper addresses five research questions on safely integrating LLM-based reasoning into industrial process automation through the Autonomous Action Execution (AAE) framework. For safe architectural integration (RQ1), we present a four-layer multi-agent architecture that confines LLM inference to an observation-only Monitor layer while safety-critical decisions are made by deterministic Verification and Execution agents. For structuring heterogeneous plant data (RQ2), we introduce a text-level aggregation framework with pluggable analyzers that transforms SCADA states, time-series measurements, Piping and Instrumentation Diagrams (P&IDs), and Standard Operating Procedures (SOPs) into contextually rich documents for LLM consumption. For automated validation (RQ3), a P&ID-grounded method uses graph traversal over the P&ID topology to verify physical consistency of LLM-generated proposals, checking tag existence, actuatability, fail-state consistency, and downstream impact. For quantifiable context enrichment (RQ4), a graduated baseline comparison (B0–B3) demonstrates the incremental value of each pipeline component. For cross-domain generalisability (RQ5), evaluation across five industrial scenarios—three derived from the Tennessee Eastman Process (TEP) benchmark (Downs & Vogel, 1993) providing community-standard validation, plus two retained scenarios (PolyReactor, Dryer) that establish performance boundaries from best-case (zero hallucination) to worst-case (70% safety violations)—demonstrates portability of the framework across continuous and batch processes of varying P&ID complexity within the evaluated synthetic scenarios. An error injection study across 43 crafted proposals demonstrates that the validation layer achieves 100% recall (zero false negatives) on the covered failure modes (P&ID-grounded structural checks together with the scenario-specific forbidden-action rubric). A statistical robustness study (N = 50 LLM runs) shows that even when LLMs propose unsafe actions in 10%–70% of runs, the deterministic validation layer catches every invalid proposal in the covered categories (structurally invalid or forbidden-action) under the evaluated scenarios. The core finding is that architecturally constrained advisory integration of LLMs in process industries is fundamentally an architectural challenge, and that established systems engineering principles of separation of concerns, independent protection layers, and deterministic safety logic provide deterministic checks against the covered validation-layer failure modes (structural and rubric-defined) when deploying LLM-based advisory systems in process industries.
Multi-agent systems / Large Language Models / Industrial process control / Functional safety / P&ID validation / SCADA / Tennessee Eastman Process
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
International Electrotechnical Commission. IEC 61511 Functional Safety — Safety Instrumented Systems for the Process Industry Sector, 2016, 2, Geneva, IEC |
| [6] |
|
| [7] |
OASIS, MQTT Version 5.0. OASIS Standard (2019). https://docs.oasis-open.org/mqtt/mqtt/v5.0/mqtt-v5.0.html |
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
International Electrotechnical Commission. IEC 62264: Enterprise-Control System Integration, 2013. Geneva, IEC |
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
International Society of Automation. ISA-5.1: Instrumentation Symbols and Identification, 2009ISA, Research Triangle Park |
| [16] |
International Organization for Standardization. ISO 15926: Industrial Automation Systems and Integration — Integration of Life-Cycle Data for Process Plants, 2003. Geneva, ISO |
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
Qwen Team, Qwen2.5-VL: a frontier multimodal model for understanding and interacting with the world (2024). arXiv preprint. arXiv:2502.13923 |
| [27] |
|
| [28] |
International Electrotechnical Commission. IEC 61508 Functional Safety of Electrical/Electronic/Programmable Electronic Safety-Related Systems, 2010, 2, Geneva, IEC |
The Author(s)
/
| 〈 |
|
〉 |