1. College of Management and Economics, Tianjin University, Tianjin 300072, China
2. School of Management, Tianjin University of Technology, Tianjin 300384, China
3. School of Economics and Management, Inner Mongolia University of Technology, Hohhot 010051, China
shuguanghe@tju.edu.cn
Show less
History+
Received
Accepted
Published Online
2025-07-09
2025-10-11
2026-03-26
PDF
(6401KB)
Abstract
Serial multistage manufacturing systems (SMMS), comprising multiple consecutive stages, are widely adopted in modern industry. At each stage, quality-related components (QRCs) refer to machine parts that directly impact product quality, while key product characteristics (KPCs) reflect both product performance and customer requirements. The quality of KPCs serves as an indicator of both product quality and machine condition, whereas the condition of QRCs reflects component health and provides early warnings of potential quality issues. Simultaneously monitoring both KPCs and QRCs across all stages is vital for ensuring system reliability and maintaining consistent product quality. To best of our knowledge, in SMMS, existing studies have primarily focused on the economic design of condition-based maintenance (CBM) strategies for either monitoring QRCs or KPCs individually, while their joint monitoring has received limited attention. To address this gap, this study proposes a cost-effective joint CBM strategy for SMMS over a finite horizon. A multivariate generalized likelihood ratio (MGLR) chart is employed to monitor the variations in KPCs, and the hazard rate of QRCs is evaluated using a proportional hazards model (PHM). Alarm scenarios and cost formulations are developed over a finite horizon, and a simulation framework is established to calculate the expected total cost (ETC). Subsequently, a simulation-based genetic algorithm is employed to minimize the ETC. Finally, the proposed strategy is validated on the four-stage scroll machining process in a compressor manufacturing system, demonstrating its effectiveness.
Jin LI, Qi LI, Wei ZHANG, Zhen HE, Shuguang HE.
Condition-based maintenance for serial multistage manufacturing system considering reliability and quality over a finite horizon.
Eng. Manag, 2026, 13(1): 17-41 DOI:10.1007/s42524-026-5184-5
Serial multistage manufacturing system (SMMS) is a common type of discrete manufacturing system and plays a critical role in modern industry. The SMMS consists of sequentially connected stages that facilitate the flow of materials and information. The collective performance of these stages ultimately determines the quality of final product (Bera and Mukherjee, 2016; Feng et al., 2025). In SMMS, the condition of machine at each stage has a direct impact on product quality, while non-conforming items generated at one stage may adversely affect the operational condition of downstream machine. Moreover, the widespread application of sensors in manufacturing processes has greatly enhanced data acquisition, thereby providing robust support for the joint monitoring of machine condition and product quality. With the integration of machine condition data and product quality data, early indicators of quality deterioration or machine failure can be effectively identified. This enables manufacturers to implement both feedforward and feedback control strategies for timely and targeted maintenance actions (Shi, 2023). This joint monitoring approach reduces defects, rework, and scrap, leading to significant cost savings and higher economic efficiency, while improving machine stability and utilization to enhance overall production performance and system reliability (Boumallessa et al., 2023; Wang et al., 2025).
In each stage of SMMS, the quality-related components (QRCs) refer to components of machine that impact product quality, while the key product characteristics (KPCs) represent the critical indicators reflecting product performance and customer requirements (Lu and Zhou, 2017). In real production, relying solely on monitoring the condition of QRCs may overlook real-time variations in product quality, potentially resulting in delayed detection of non-conforming items. This weakens both the responsiveness and effectiveness of quality control. Conversely, monitoring only KPCs may fail to detect machine abnormalities in a timely manner, thereby increasing the risk of sudden failures and production interruptions. For example, in the scroll machining process of compressors, the quality control of scroll primarily relies on product sampling, while maintenance decisions are typically based on either sampled results or cumulative production counts of scroll. However, this approach overlooks the real-time condition of QRCs (such as cutting tools and fixtures). As a result, these components may deteriorate to a high-risk state even when sampled products still meet specifications, potentially leading to unexpected machine failures and unplanned downtime. The joint monitoring scheme can effectively overcome these limitations by integrating QRCs and KPCs across all stages. Real-time monitoring of KPCs ensures process stability and reduces the occurrence of non-conforming products, whereas real-time monitoring of QRCs enables the detection of degradation trends and mitigates risks in SMMS. The integration of these monitoring insights supports more precise process control and continuous optimization, ultimately enhancing system reliability and operational efficiency.
When the SMMS stops due to machine condition or product quality issues, effective maintenance of the machines is essential to restore their reliability and ensure consistent product quality. Since machine condition directly affects product quality at each stage, and the production of non-conforming items can further impair the operational condition of downstream machines, a scientific maintenance strategy is essential to prevent quality degradation and avoid cascading failures across stages. Condition-based maintenance (CBM), as a vital and widely implemented maintenance strategy, plays a crucial role in enhancing system reliability, reducing maintenance costs, and promoting intelligent manufacturing. CBM leverages real-time condition data such as vibration, acoustic emission and oil analysis to continuously assess machine health, facilitating more accurate maintenance decisions and improving cost efficiency (Alaswad and Xiang, 2017; Zheng and Zhou, 2021). Within a joint monitoring framework, simultaneous acquisition and analysis of machine condition data and product quality data allow for a more comprehensive understanding of deterioration process and quality variation across stages. Based on this framework, a joint CBM strategy is formulated to integrate machine health with product quality. By leveraging real-time information on machine condition and product quality, this approach enables proactive and targeted maintenance, effectively mitigating unexpected failures. Consequently, it reduces operational costs, enhances machine reliability and utilization, ensures consistent product quality, and significantly improves the overall economic performance of intelligent manufacturing systems.
1.2 Literature review
In recent years, numerous studies have investigated CBM strategies that incorporate product quality considerations. These studies are generally categorized into two main approaches: (1) Equipment condition assessment through product quality monitoring. This approach leverages real-time quality monitoring to infer machine condition, allowing maintenance decisions to be triggered by deterioration in product quality. (2) Quality inference from machine condition monitoring. This method establishes a quantitative relationship between machine condition and product quality. By directly monitoring the machine condition, the corresponding product quality level is inferred, and maintenance actions are subsequently determined.
In practical manufacturing settings, product quality is often prioritized as it directly influences customer satisfaction and compliance with industry standards (Liu et al., 2023). As a result, quality data, such as dimensional measurements and non-conforming rates, are routinely collected using cost-effective tools and are often required by regulations. Thus, many studies have developed CBM strategies based on quality monitoring. Statistical process control (SPC) is the most commonly used method in the literature for real-time monitoring of product quality (Montgomery, 2019; Qiu, 2013). For example, in the early years, Cassady et al. (2000) combining proposed an integrated control chart and maintenance strategy by combining an X-bar chart with an age-replacement policy, and demonstrated through simulation-based optimization that this joint approach achieved greater cost reduction than employing either method independently. Yeung et al. (2007) simultaneous proposed integrating preventive maintenance (PM) with SPC by formulating it as a partially observable Markov decision process to derive near-optimal joint policies that minimize maintenance, sampling, and quality-related costs. Further studies on early efforts to integrate CBM and SPC can be found in these works (Mehrafrooz and Noorossana, 2011; Panagiotidou and Tagaras, 2010; Xiang, 2013). In the past decade, Naderkhani and Makis (2016) proposed an economic design framework for multivariate Bayesian control charts using a semi-Markov decision process. This method employs a dual-sampling interval strategy and jointly optimizes sample size, sampling intervals, control limits, and maintenance decisions to minimize the long-run expected cost per unit time. Bahria et al. (2019) utilized X-bar control charts to monitor the production process and developed an integrated strategy model for coordinating production-inventory management, maintenance scheduling, and process quality control in a single-unit manufacturing system. Kampitsis and Panagiotidou (2022) proposed a new double-sampling Bayesian control chart with state-dependent variable inspection frequency, which considered a single-unit process prone to operational deterioration and catastrophic failures. Tasias (2022) introduced a Bayesian model that could jointly optimize processes such as production, maintenance and quality, which consider the process has multiple correlated quality characteristics. Wan et al. (2023) proposed an integrated approach that simultaneously addresses economic production quantity, maintenance policy and quality control for an imperfect continuous flow process. Shojaee et al. (2024) proposed an integrated model combining Hotelling control chart for linear profiles, production cost evaluation, and maintenance policies to minimize total cost. For more information on assessing the machine condition through product quality monitoring, refer to the following works (Cao et al., 2025; He et al., 2019; Lv et al., 2024; Rasay et al., 2022; Shi et al., 2024; Zhang et al., 2024).
Over the past two decades, extensive research has been devoted to CBM of degrading systems through real-time machine condition monitoring (Alaswad and Xiang, 2017; Li et al., 2020; Mohamed-Larbi and Daoud, 2024; Olde Keizer et al., 2017; Zhao et al., 2025). Among these efforts, significant work has also been devoted to the joint evaluation of machine condition and product quality based on machine condition monitoring. Such evaluation typically relies on the prior establishment of a quantitative relationship between machine condition and product quality. For example, Chen and Jin (2005) proposed a general linear model to construct the effect of QRCs degradation on KPCs and the KPCs is described as a covariate to assess the failure rates of QRCs. Jin et al. (2019) proposed a dynamic quality model that characterizes the varying influence of machine degradation on product quality, with process effects represented as piecewise linear functions. Boumallessa et al. (2023) proposed a coupling function links the product quality indicator and machine degradation level for joint control of production, quality, and maintenance. Based on the general linear model proposed in Chen and Jin (2005), aiming to simultaneously improve process reliability and final product quality, Lu and Zhou (2017) introduced a novel opportunistic maintenance scheduling method for SMMS. Ye et al. (2019) considered the effect of imperfect inspection process and a new reliability model is developed based on the interaction between machine performance and product quality. Shi et al. (2023) considered a maintenance strategy that simultaneously considers machine availability, reliability and product quality. More recently, Lu and Luo (2024) modeled the influence of tool wear on product quality deterioration and proposed a dynamic CBM policy for heterogeneous-wearing tools. Zhang et al. (2025) joint proposed three joint optimization models integrating PM and product quality improvement for deteriorating manufacturing systems with quality–reliability dependency. For more on assessing machine condition and product quality by monitoring machine degradation, one can refer to these works (Cheng and Li, 2020; Han et al., 2021; Khatab et al., 2019; Li et al., 2023; Lu and Zhou, 2019; Majdouline et al., 2022; Wang et al., 2020; Wang et al., 2024).
Moreover, when it comes to the monitoring of multistage manufacturing systems, Zou and Tsung (2008) integrated the multivariate change point detection scheme with specific directional information based on a multistage state-space model. Shang et al. (2013) proposed a multivariate generalized linear mixed-effects model to characterize multistage processes with categorical data, effectively integrating physical laws and engineering knowledge. Yeganeh et al. (2024) proposed a multistage control chart tailored for healthcare data, combining statistical control charts with machine learning techniques to enhance early detection, confirmation, and patient safety. For more on multistage monitoring, refer to the these works (Ebadi and Ahmadi-Javid, 2020; Liu et al., 2025; Odom et al., 2018; Park and Lee, 2022). However, to best of our knowledge, no existing studies have simultaneously integrated product quality and machine condition monitoring within the framework of CBM for SMMS. This study aims to bridge this research gap by developing an integrated monitoring and early warning framework over a finite horizon, with the goal of enhancing machine reliability and ensuring consistent product quality in SMMS.
Table 1 presents a detailed comparison between the proposed strategy and representative studies, highlighting a more refined classification of the key differences. Evidently, CBM strategies jointly incorporating machine condition and product quality data in SMMS remain unexplored in existing literature.
1.3 Overview
This study investigates the monitoring of product quality variation and the assessment of machine hazard rates in SMMS over a finite horizon. Based on the monitoring results, it identifies the stage requiring maintenance and accordingly optimizes the maintenance strategy over a finite horizon. The main contributions of this study are summarized as follows: (1) Existing models often oversimplify the interdependence between QRCs degradation and KPCs quality in SMMS (Chen and Jin, 2005; Lu and Luo, 2024; Shi et al., 2023). To address this limitation, we develop a coupled degradation-quality framework that explicitly incorporates the effect of non-conforming items on QRCs degradation and links the impact of QRCs degradation on KPCs quality through a generalized linear model. (2) Conventional monitoring approaches in SMMS typically only consider KPCs quality and QRCs degradation separately, which limits their ability to accurately capture QRCs hazard rates and KPCs quality variations simultaneously (Khatab et al., 2019; Liu et al., 2023). To overcome this shortcoming, we propose an integrated joint monitoring and diagnosis scheme that enables simultaneous monitoring and stage diagnosis, thereby improving responsiveness to abnormal changes in both QRCs hazard rates and KPCs quality variations. (3) Existing CBM strategies often overlook the joint feedback effects of maintenance actions on both KPCs quality and QRCs condition (Bahria et al., 2019; Zhang et al., 2025). In contrast, we formulate a joint CBM strategy in which maintenance actions are triggered by joint monitoring results, and their effects on QRCs hazard rates and KPCs quality variations are quantitatively evaluated. (4) Previous studies have rarely addressed the optimization of joint CBM strategies for SMMS over a finite horizon (Lu and Zhou, 2019; Lu and Luo, 2024; Shi et al., 2023). This study fills that gap by simulating the total cost over a finite horizon and optimizing the joint CBM strategy, thereby ensuring cost-effective decision-making and demonstrating the practical feasibility of the proposed framework for SMMS. This research offers a new approach to quality control and machine management in manufacturing. It enables intelligent monitoring, accurate diagnosis, and optimized decision-making, thereby enhancing system reliability and cost efficiency.
The remainder of this paper is organized as follows. Section 2 presents the problem statement, including the presentation of SMMS and the assumptions used in this study. Section 3 models the degradation of QRCs and KPCs quality interactions of SMMS. Section 4 models the proposed joint CBM scheme over a finite horizon, including the modeling of system hazard rate, MGLR chart and maintenance effect. In Section 5, we give the different scenarios of alarm results and derive the expression of expected total cost (ETC). The simulation-based optimisation of ETC over a finite horizon is introduced. In Section 6, the applicability of proposed strategy is validated by a four-stage scroll machining process. Finally, some concluding remarks are given in Section 7.
2 Problem statement
We consider an SMMS operating over a finite time horizon , consisting of consecutive stages, where each stage is equipped with a single machine. In each stage, the machine comprises QRCs, while the product produced exhibits KPCs. The degradation of QRCs directly impacts the corresponding KPCs at the same stage, and this relationship is referred to as the degradation-quality (D-Q) effect. Additionally, non-conforming items from upstream stages can adversely affect the condition of downstream QRCs, and accelerating the degradation of QRCs. This relationship is referred to as the quality-degradation (Q-D) effect. Figure 1 schematically illustrates the interactions between degradation of QRCs and KPCs across the stages of SMMS. To monitor both QRCs condition and KPCs quality, a periodic inspection scheme is employed with a fixed interval . At each inspection epoch , the degradation of QRCs and the stability of KPCs at each stage are systematically assessed. A PM action is initiated whenever the QRCs at any stage enter an undesired condition or the stability of KPCs exceeds the defined tolerance levels.
In SMMS, checking and repairing machines at each stage when an alarm occurs is time-consuming and costly, and it often leads to unnecessary resource consumption. To address this issue, this study proposes a diagnostic strategy based on alarm signals. By leveraging these signals, the strategy identifies the machine that has the greatest impact on system stability and reliability, enabling targeted maintenance action. This diagnostic strategy facilitates the timely restoration of the health of QRCs and mitigates further quality deterioration, thereby preserving the performance of the SMMS. Moreover, some assumptions are presented to facilitate further elaborations:
Assumption 1: It is assumed that the number of non-conforming items propagated to the downstream stage follows a Poisson process. The effects of the dimensional deviations of these non-conforming items on QRCs degradation are characterized by mutually independent truncated Normal distributions. This assumption is consistent with that adopted in (Ye et al., 2019).
Assumption 2: The inherent degradation process driven by their operational conditions is assumed to be independent of the degradation process caused by the influence of non-conforming items, and all the degradation level is initialized to zero at time . This assumption is consistent with the literature (Cao et al., 2020; Kong et al., 2017; Ye et al., 2019).
Assumption 3: It is assumed that the hazards rate of QRCs at each stage is mutually independent, and that the effect of QRCs degradation caused by non-conforming items on the hazards rate of downstream QRCs is negligible. This assumption of hazards rate independence in SMMS is widely adopted in the literature (Jafari et al., 2018; Lu and Zhou, 2019; Shi et al., 2023).
Assumption 4: It is assumed that the state of KPCs at each stage can be categorized into two distinct conditions: in control (IC) and out of control (OC). This classification aligns with the piecewise characterization of process quality states proposed by (Jin et al., 2019), which effectively captures the dynamic shifts in quality state during manufacturing.
Assumption 5: When the control chart signals an alarm while the process is actually IC, the alarm is considered a false alarm, which incurs false cost but does not cause downtime. After the process state is verified, production continues without interruption. This assumption reflects practical operational procedures aimed at minimizing unnecessary stoppages and is widely adopted in the SPC (Bahria et al., 2019; He et al., 2019; Kampitsis and Panagiotidou, 2022; Shojaee et al., 2024).
The notations involved in this study are summarized in Table 2.
3 Modeling degradation-quality interactions
3.1 Influence of non-conforming items on QRCs degradation
In this section, the QRCs degradation process is decomposed into two contributing factors: the inherent degradation driven by their operational conditions and the degradation caused by the influence of non-conforming items propagated from upstream stages. At stage , the inherent degradation of QRCs driven by their operational conditions, are represented by . The inherent degradation of QRCs is modeled by independent Gamma processes. The Gamma process is appropriate for degradations such as tool wear, bearing fatigue, and thermal aging, which are typically irreversible and cumulative, as its strictly increasing paths can effectively capture such deterioration (Lu and Zhou, 2017; Wang et al., 2024; Ye et al., 2019; Zhang et al., 2025). Specifically, the degradation of th QRC is described as follows
1) ,
2) For , are independent,
3) For , ,
where and are shape and scale parameters for the th QRC of stage , respectively.
Let denote the dimensional deviation of the th non-conforming item from the specification limit, which is propagated to stage . The is given by
where and are the upper and lower specification limits, respectively, and is the th observation of stage within the inspection interval . In SMMS, the upstream dimensional deviations of non-conforming items will accelerate the degradation of downstream machine. Let represent the degradation increment at stage caused by the th non-conforming item from stage . A truncated Normal distribution is employed to model the influence of the dimensional deviation of the th non-conforming item on the degradation of QRCs, i.e., , where and are the mean and variance parameters, respectively.
Remark: In Eq. (1), the definition of assumes that the deviation is zero when the observed value falls within the specification limits. Once the observation exceeds these limits, the deviation is measured by the difference between the observation and the specification limit. However, in a more general context, can be defined as the deviation of the observation from the target value, which is consistent with the formulation of the Taguchi quality loss function (Taguchi, 1986). Considering a target value , the Taguchi loss function can be expressed as
where is the loss coefficient. The definition of based on the Taguchi quality loss function is more general, as it implies that any deviation of KPCs from the target value contributes to QRCs degradation. Figure 2 illustrates the different definitions of the dimensional deviation . In this study, we adopt the definition in Eq. (1), which is reasonable in certain machining scenarios. For example, in turning operations, a workpiece is considered acceptable as long as its diameter remains within the specified tolerance range, even if it deviates from the nominal target value, since such deviations have a negligible impact on downstream assembly (Ye et al., 2019).
Let denote the number of non-conforming items propagated to stage during the time interval , and represent the accumulated discrete degradation at stage caused by these non-conforming items. In this study, the counting process is modeled as a Poisson process with a rate parameter , i.e., . For stage , the degradation of QRCs caused by the influence of non-conforming items propagated from upstream stages, denoted as . The th QRC degradation which account for the influence of non-conforming items is defined as
where the coefficient represents the weight capturing the contribution of upstream non-conforming items to the degradation of the th QRC, with .
Considering both the inherent degradation and the degradation induced by the influence of non-conforming items, and under the Assumption 2, the total degradation of QRCs is given by
where and .
3.2 Influence of QRCs degradation on downstream KPCs
In modern manufacturing systems, the condition of machines has become an increasingly critical factor in determining product quality. As machine operate over time, its QRCs gradually degrade, which often affects the corresponding KPCs quality. Let denote the KPCs value vector in stage at time , with its observation abbreviated as . The observation of is abbreviated as . The relationship between and can be modeled by
where is the vector of random errors for stage and . The function characterizes the relationship between and , and for notational simplicity, is abbreviated as . Specifically, represents the conditional expectation of given , where each element is defined as .
Moreover, with the development of intelligent manufacturing technology, the role of these non-machine factors (e.g., man, material, method, measurement and environment) on the quality is gradually weakened, and they are referred to as the noise factors . Considering the interaction between the QRCs degradation and the noise factors (Chen and Jin, 2005), the general linear model is assumed as
where , , and . and represent the effect coefficient of and , respectively; represents the effect coefficient of interactions between and .
Proposition 1 When , we have the following results:
(1) For ith KPC of stage , the conditional expectation .
(2) For stage , the random errors follow the multivariate normal distribution, i.e., , where and .
Proof: For (1), according to Eq. (6), we get . Due to , we get . This proves (1). According to (1), Eqs. (5) and (6), we have . Thus, . Since the are independent of each other, we get . This proves (2).
Remark: In SMMS, the quality of KPCs at each stage is continuously influenced by the degradation of QRCs. First, according to (1) in Proposition 1, as the degradation of QRCs accumulates across stages, the conditional expectation tends to increase. This may lead to greater quality uncertainty and a higher likelihood of exceeding tolerance limits, thereby increasing the number of non-conforming items. Second, according to (2) in Proposition 1, the progressive degradation of QRCs also leads to increasing variation of KPCs. As a result, the KPCs gradually deviate from their nominal targets, ultimately undermining the stability and consistency of the final product.
As established in (2), the term within the quadratic form is a key role in increasing the variance . Suppose that is the acceptable variance threshold of . When , the shift magnitude of variance is small and does not differ much each. When , the shift magnitude of variance gradually becomes larger. When the variance of all the random errors in each stage does not exceed the corresponding acceptable threshold, i.e., , the process is considered to be IC and the IC covariance matrix of the random errors is denoted by . Conversely, if there exists at least one random error term whose variance exceeds the acceptable threshold in its corresponding stage, i.e., , the process is considered as the OC state and the OC covariance matrix of the random errors is denoted by .
4 Modeling of the joint CBM strategy
In this section, a novel quality and reliability oriented joint CBM strategy is proposed to facilitate the PM decision-making. The PHM is developed to monitor the hazard rate of QRCs, and the MGLR control chart is employed to capture the variation of KPCs in the SMMS. Based on the monitoring results, a diagnostic strategy is developed to identify the stages requiring PM over a finite horizon, and the effect of PM actions on QRCs is quantitatively assessed.
4.1 Modeling of system hazard rate
The PHM is widely adopted in reliability engineering for its capability to capture the effects of time-varying and condition-dependent factors on machine reliability (Zheng and Zhou, 2021). This flexibility makes the PHM particularly suitable for analyzing how different degradation indicators contribute to the increasing hazard rate over time. As a result, it allows for more accurate failure predictions and the development of effective maintenance strategies (Hu and Chen, 2020; Pedersen et al., 2023; Zheng et al., 2023). In each stage of the SMMS, the degradation states are incorporated as covariates to enable a quantitative evaluation of their impact on failure risk. Accordingly, the hazard rate function of stage k is given by
where . The represents the Weibull baselinehazard function, where and are the shape and scaleparameters for stage , respectively. The represents the expected impact of QRCs degradation on hazard, where is regression coefficient vector. The parameters of hazard function in Eq. (7) can be estimated using the maximum likelihood estimation method with lifetime histories (Vlok et al., 2002). From Eq. (7), it can be seen that is the function of , and its derivation is given in Appendix A.
According to Assumption 3, each stage operates independently, meaning the hazard rate of one machine does not influence another. This independence simplifies the modeling process by enabling the hazard rates of individual machines to be evaluated separately (Lu and Zhou, 2019). Consequently, the system hazard rate function of the SMMS is given by
The system PHM signals an alarm when , where is the threshold for system hazard rate.
4.2 MGLR chart for monitoring the covariance
In SMMS of intelligent manufacturing, the KPCs data at each stage can be efficiently collected through advanced sensing and data acquisition technologies. The random errors of stage is computed according to Proposition 1, i.e., . We regard as the -dimensional multivariate normal vectors and . If a shift of stage occurs at time , then the covariance matrix will change from to . The hypothesis test of stage is designed by
Let a sample of size be collected at time . The corresponding log-likelihood function of , based on the sample , is given by (Jing et al., 2022)
where is the determinant of matrix , represents the trace of , and . If , all the samples up to time follow the IC distribution and the log-likelihood is
If , then the log-likelihood is
The shift time can be estimated by maximizing the log-likelihood and is given by
Considering the log-likelihood ratio for testing whether there has been a change in the process prior to the current time point, the MGLR test statistic at time is
where can be computed under the hypothesis . The proof of Eq. (14) is given by Appendix B. Furthermore, the monitoring statistic for the covariance in SMMS at time is given by
The MGLR chart signals an alarm if , where is a pre-specified control limit.
4.3 The joint CBM strategy for finite horizon
Over the finite horizon , an alarm may be triggered by the MGLR chart, by the system PHM, or by both simultaneously. If the first alarm is triggered by the MGLR chart, it reflects a significant variation in the KPCs of the SMMS. If the first alarm is triggered by the system PHM, it indicates that the hazard rate of the QRCs in the SMMS has reached an unacceptable level. If both alarms are triggered simultaneously, it implies that both a significant variation and an unacceptable hazard rate have occurred. In all cases, according to Eq. (7) and Proposition 1, these alarms indicate that the QRCs have degraded to an undesired level, thereby requiring maintenance action.
This study considers two types of maintenance action: minimal repair and PM. When machine fails, the minimal repair action is performed to restore it to a normal operational condition. The minimal repair action neither changes the condition of the QRCs nor affects their deterioration states, and its duration is assumed to be negligible. When an alarm is triggered, the PM action is conducted proactively to mitigate QRCs deterioration. The PM action restores the QRCs to an as-good-as-new condition in terms of degradation levels, which are reset to zero, while leaving the age of machine unchanged. It is assumed that the degradation levels of QRCs are zero at the beginning of their service life.
Let denote the index of th PM action over a finite horizon, and let represent the time at which this PM action is performed. Accordingly, the degradation of the QRCs after the th PM action is given by
According to Proposition 1, after maintenance, both the conditional expectation and the variance of random errors at stage are updated according to Eq. (16). Let denote as the virtual age after the th PM action, which is given by
where is the age reduction coefficient and . and indicate that the PM action restores the QRCs condition to as good as new and does not change the QRCs condition, respectively. The evolution of hazard rate at stage under PM action can be given by
where .
However, in practical production processes, simultaneously inspecting and repairing all stages of the machine is both time-consuming and costly. To address this issue, we propose a diagnostic strategy that identifies the specific stage requiring maintenance, thereby ensuring the overall reliability and stability of the SMMS while reducing unnecessary maintenance efforts.
Proposition 2 When the joint monitoring alarms at time in SMMS, a diagnostic strategy to determine which stage needs PM action is given by
where and . is the hazard rate when the PM action is taken at stage and is the stage set of system hazard restoring to the desired value when PM action is taken at stage .
In Proposition 2, the diagnostic scheme of the control chart identifies the stage with the highest statistic , which reflects the most pronounced KPCs variation and thus requires PM action. By contrast, the diagnostic scheme of the system PHM aims to keep the system hazard rate within acceptable thresholds, leading to the maintenance of the stage that yields the largest hazard rate improvement, i.e., . When alarms are simultaneously raised by both the MGLR chart and the system PHM at time , PM action is concurrently taken on the stage with the maximum and on the stage providing the greatest reduction in hazard rate.
Over the finite horizon, three possible scenarios of joint monitoring alarm results may occur and are illustrated in Fig. 3(a). Specifically:
● Scenario corresponds to the MGLR chart alarms and arises under the condition . When no shift occurs over a finite horizon, i.e., the process remains IC, no action is taken and production continues as usual. Conversely, if a shift occurs prior to an alarm from the MGLR chart, a PM action is carried out.
● Scenario corresponds to the system PHM alarms and arises under the condition . PM action is performed regardless of whether a shift occurs.
● Scenario corresponds to the case where no alarm is triggered before the horizon ends and arises under the condition , and a final replacement action is taken.
Within the finite horizon, scenarios and may occur multiple times, whereas scenario occurs at most once, typically at the end of finite horizon. Let , , denote the index of the th PM action at stage associated with scenario , where is the total number of such actions at stage over a finite horizon. Let , , denote the index of the th PM action at stage associated with scenario , where is the total number of such actions at stage over a finite horizon. The total number of PM actions for and across all stages are , . The total number of PM actions over a finite horizon is .
Figure 3(b) illustrates an example of a two-stage SMMS () with a total of four PM actions () executed over a finite horizon. Among them, two PM actions are triggered by alarms from the MGLR chart (), and the other two by the system PHM (). In Stage 1, two PM actions are performed: one () occurs at the first PM action (), and one () occurs at the fourth PM action (). In Stage 2, two PM actions are also executed, including one () at the second PM action () and one () at the third PM action ().
5 Simulation-based optimization for finite horizon
In this section, we systematically characterize the cost associated with the joint monitoring strategy over a finite horizon. Specifically, the total cost comprises four major components: inspection cost, quality-related cost, maintenance cost, and false alarm cost. We provide detailed mathematical formulations for each of these cost elements incurred over a finite horizon.
5.1 Inspection cost
When employing the proposed monitoring scheme, each inspection incurs a cost associated with acquiring the degradation data of QRCs and KPCs data. Let denote the cost per inspection. In scenario , for the th PM interval of stage , , the number of inspection within the th PM interval of stage is . Hence, the total inspection cost over a finite horizon of is .
Similarly, the total inspection cost over a finite horizon of scenario is . Scenario represents the period from last maintenance time until the end of the finite horizon , with the number of inspection for is . Since occurs only once over a finite horizon, the total inspection cost of is .
Thus, the total inspection cost over a finite horizon is given by
where and denote the occurrence probabilities of scenarios and , respectively.
5.2 Maintenance cost
As the maintenance strategy proposed in Section 4.3, between two consecutive PM actions, two types of maintenance activities may be performed: the first is minimal repair, which is initiated upon failure and incurs a minimal repair cost ; the second is PM, which is triggered by alarms from either the MGLR chart or the system PHM, with an associated PM cost . At the end of the finite horizon, we will replace all the QRCs at each stage, with a replacement cost .
For scenario , the expected number of failures during the interval is , where and denote the inspection time points corresponding to the th and th PM actions, respectively. Therefore, the total maintenance cost over a finite horizon of is given by
Similarly, the total maintenance cost over a finite horizon of scenario is given by
where and denote the inspection time points corresponding to the th and th PM actions, respectively.
For scenario , the maintenance cost consists of the minimal repair cost , as well as the replacement cost . Therefore, the total maintenance cost over a finite horizon of is given by
where denotes the inspection time points corresponding to the th PM action.
Thus, the total maintenance cost over a finite horizon is given by
where and denote the occurrence probabilities of scenarios and , respectively.
5.3 Quality cost
In the production process, the system typically transitions from IC to OC state, with the associated costs for each state being distinctly different. We assume the unit time cost of IC state is , while the unit time cost of OC state is .
For scenario , the shift time of stage corresponding to its th PM action is denoted by . The total quality cost over a finite horizon of is given by
and according to Eq. (13), the shift time is estimated as
Similarly, for scenarios and , the corresponding quality costs over a finite horizon, denoted by and , are given by
and the shift time and are estimated as
Thus, the total quality cost over a finite horizon is given by
where and denote the occurrence probabilities of scenarios and , respectively.
5.4 False alarm cost
When the control chart signals an alarm while the process is IC state, such an event is identified as a type I error, characterized by the false alarm rate (Montgomery, 2019; Qiu, 2013). Although production can continue after confirming that the process is indeed IC, the associated inspections may still incur costs. Let denote the cost incurred by each false alarm.
For scenario , the number of inspections conducted while the process is IC state during the interval is given by . Accordingly, the number of false alarms during this interval is . Thus, the total false alarm cost over a finite horizon is given by
Similarly, for scenarios and , the false alarm costs and over a finite horizon are given by
Thus, the total false alarm cost over a finite horizon is given by
where and denote the occurrence probabilities of scenarios and , respectively.
5.5 Optimization of ETC over a finite horizon
According to the cost formulations presented in Sections 5.1–5.4, the total cost over a finite horizon is expressed by the following equation
where ,,, and denote the inspection cost, quality cost, maintenance cost, and false alarm cost, respectively. Since , , , and are stochastic variables over a finite horizon, the total cost is also stochastic. Due to the randomness and complexity inherent in these variables, it is difficult to compute the total cost analytically. Therefore, a simulation-based optimization approach is employed to approximate the total cost and determine the optimal decision (Cheng et al., 2018).
For each candidate set of decision variables , multiple simulation runs are conducted to estimate the ETC. Regarding the simulation method for calculating the ETC, we give it in Algorithm 1. In Algorithm 1, the is the total cost for th simulation. The optimal thresholds are obtained by minimizing the ETC, denoted as , as follows
Here, and denote the lower and upper bounds of the continuous threshold , and is selected from a discrete control limit set . The set is related to the type I error , and this relationship can be established through Monte Carlo simulation. This approach is widely used in control chart research to assess the statistical performance of monitoring schemes, especially in balancing detection sensitivity against the false alarm frequency (Qiu, 2013).
To enhance the efficiency of identifying the optimal policy, a simulation-based optimization approach using a genetic algorithm is employed. Genetic algorithms are heuristic methods designed to explore large solution spaces efficiently. The algorithm begins by generating an initial population of candidate solutions, followed by iterative evaluation of their fitness to guide the construction of subsequent generations. Genetic operations, such as crossover and mutation, are employed to evolve the population toward better solutions. The specific implementation and modifications of the genetic algorithm in this study are detailed as follows:
(1) Coding: Solutions are represented using real-number encoding.
(2) Generation of initial population: The initial population, consisting of candidate solutions, is generated randomly.
(3) Fitness evaluation: The fitness of a solution is computed as , where is obtained by Algorithm 1. To improve computational efficiency, once is calculated for a solution, its value is stored and directly retrieved in later generations if the same solution appears.
(4) Crossover: Parent solutions are selected using the roulette wheel selection method. Arithmetical crossover is applied to generate new candidate solutions as follows:
where and are two parent solutions, is a randomly chosen coefficient, and .
(5) Mutation: To preserve valuable genetic information while maintaining diversity, a non-uniform mutation method is adopted. The mutation is applied to a randomly selected variable of a solution , generating a new solution . The mutated value is determined as follows
where and are the upper and lower bounds of , respectively. The is the non-uniform perturbation function.
6 Numerical analysis
To validate the applicability of the proposed strategy, this paper conducts a numerical analysis based on the scroll machining process in compressor manufacturing. The analysis evaluates the proposed strategy’s impact on both manufacturing efficiency and product quality, thereby demonstrating its effectiveness in optimizing overall production performance. The scroll machining process is a four-stage serial operation conducted over a finite time horizon of hours. It involves several critical procedures, including milling, turning, drilling, and slotting, all of which are essential to ensuring the accuracy and performance of scroll components. At each stage, the QRCs refer to key machine components that influence product quality, such as cutting tools, fixtures, the spindle, and guideways. Meanwhile, the KPCs focus on the dimensional accuracy of critical features on the workpiece, which directly determine the functionality and overall quality of the scroll components. The schematic of compressor and scroll, along with the scroll machining process, are illustrated in Fig. 4. The relationship between QRCs degradation and the corresponding KPCs at each stage is described by the following expression, where the coefficients are obtained from field production records and refined through designed experiments (Taguchi, 1986; Wu and Hamada, 2011).
where and
In SMMS, the occurrence rate of non-conforming items and their effects on QRCs often vary across stages. This study considers three trends of the occurrence rate with stage progression: increasing, constant, and decreasing. The corresponding results for different levels of the QRCs impact parameter are illustrated in Fig. 5. As shown in Fig. 5(a), the first stage is not considered, as it is assumed that the initial incoming product quality is stable, with zero occurrence rate of non-conforming items. When the occurrence rate of non-conforming items increases across the subsequent stages, QRCs degradation also rises, and higher impact levels of non-conforming items further amplify this effect. In Fig. 5(b), under a constant occurrence rate, QRCs degradation remains relatively stable across stages, although higher impact levels still lead to a gradual increase. In Fig. 5(c), when the occurrence rate decreases across stages, QRCs degradation correspondingly declines; however, higher impact levels continue to intensify the degradation.
In this study, we assume the non-conforming rate at each stage is . The influence of each non-conforming item on the downstream QRCs follows a truncated Normal distribution with and . The weighting coefficient representing the effect of non-conforming items on the degradation of QRCs is . Additionally, assuming that the IC covariance matrix of stage is and the OC covariance matrix of stage is , where represents the shift magnitude. Let
and the above parameter values are provided in Table 3.
6.1 Simulation-based optimization results
Next, we simulate the degradation process of QRCs and the variation of KPCs over a finite horizon based on the parameter settings of Table 3. Figure 6 compares the total degradation process of QRCs with inherent degradation process. It can be observed that both the inherent degradation and the total degradation increase continuously. Except for the first stage, the degradation in the other three stages is affected by upstream non-conforming items, which further contributes to the growth of total degradation. This indicates that neglecting the cumulative effect of upstream non-conforming items on QRCs degradation would lead to an underestimation of the actual degradation level of QRCs. In Fig. 7, the variance of random errors of each stage are further analyzed. The variance of KPCs across stages exhibits a decreasing-then-increasing trend. This suggests that in the early stages, KPCs variations are relatively small and product quality remains stable, whereas in the later stages, the variations become more pronounced and quality deteriorates. Therefore, accurately identifying the transition point of this variance is critical for maintaining the stability of KPCs. It can be observed that as QRCs degrade, when the variance of random errors satisfy , the variation of KPCs initially decreases and then increases, with relatively small overall variation. However, when , the variation of KPCs continuously increases. In this study, we assume the covariance matrix of random errors as and when and , respectively.
According to Algorithm 1, the ETC over a finite horizon is estimated through simulation with 5000 replications. Let ,, and the control limit set {20, 20.6, 21.2, 21.8, 22.6, 23.5, 24.5, 25.6, 29.5}, the control limit set corresponds to the type I error set {0.01, 0.009, 0.008, 0.007, 0.006, 0.005, 0.004, 0.003, 0.002}. The genetic algorithm with population size is employed to optimize the ETC. As illustrated in Fig. 8, after 50 generations of iteration, the ETC tends to converge,with its variation amplitude remaining below 400. The variation in convergence results can be attributed to the number of simulation replications and remains within the acceptable range in practice. Specifically, at the 50th generation, converges to corresponding to the optimal decision variables and .
To verify the cost-saving effectiveness of the proposed strategy, we compare it with tradition single monitoring strategies. Strategy I: The proposed joint monitoring strategy; Strategy II: The strategy which monitors only the QRCs condition; Strategy III: The strategy which monitors only KPCs. Table 4 presents the optimization results of three strategies and denotes null. It is evident from Table 4 that Strategy I achieves a lower compared to Strategies II and III. Specifically, Strategy I incurs higher maintenance costs due to more frequent maintenances; however, timely maintenance stabilizes product quality, thereby reducing quality-related costs. In contrast, Strategies II and III make maintenance decisions based solely on either the condition of QRCs or the quality of KPCs, overlooking certain critical maintenance opportunities. This oversight prolongs the OC time of the process, resulting in increased quality-related costs.
Subsequently, the real-time monitoring of SMMS is conducted based on the optimal decision variables and , the monitoring results are presented in Table 5 and Fig. 9. In Table 5, denotes null. As shown in Table 5 and Fig. 9, over a finite horizon, four alarms () occur. The MGLR chart triggers two alarms (); The system PHM triggers two alarms (). Specifically, the MGLR chart triggers alarms at the th and the th hours at stages and , respectively. The system PHM triggers alarms at the th and the th hours at stages and , respectively.
6.2 Sensitivity analysis
To further support decision-making, managers may wish to examine the influence of parameter settings on both the optimal decision and the . Therefore, sensitivity analyses are performed by adjusting each parameter individually within a range of to , while keeping all other parameters constant. The results are shown in the Figs. 10 and 11.
Figure 10 depicts the impact of cost parameters on . The results show that increases with rising values of all cost parameters. In particular, exhibits high sensitivity to and , whereas its sensitivity to , , and is comparatively moderate. Figure 11(a) shows the effect of cost parameters on the optimal control limit . The results reveal that decreases with increasing and , but increases with increasing , , and . Among these parameters, the variation of with respect to and is the most pronounced, whereas its response to , , and remains relatively moderate. Figure 11(b) depicts the effect of cost parameters on the hazard threshold . The analysis reveals that decreases as increases, while it increases with increasing ,, , and . Notably, exhibits high sensitivity to ,, , and , whereas its sensitivity to is relatively moderate. Based on the above analysis, the cost parameters and are the most influential factors affecting both and . Their joint impact highlights the necessity of considering these two parameters as the primary drivers in decision-making. Accordingly, managers are advised to dynamically adjust the control limit and the hazard threshold in response to variations in and . Such adaptive adjustment not only mitigates quality-related losses stemming from process instability but also minimizes maintenance costs associated with preventive interventions, thereby supporting more effective optimal decision-making.
In Fig. 12, we analyze how the sample size , shift magnitude , age reduction coefficient , and non-conforming rate affect the . The Fig. 12 considers three levels for each parameter: , and . (1) As the shift magnitude becomes larger, also declines. This is because the monitoring system is more responsive to larger shifts, which facilitates quicker identification of instability. Therefore, in systems prone to large shifts, joint monitoring can be particularly cost-effective. (2) An increase in the age reduction coefficient leads to a modest decline in . Although a higher reflects faster deterioration after maintenance, it also increases the frequency of hazard warnings. This prompts more frequent PM actions, which shortens periods of instability and reduces the overall cost. Hence, managers should account for the machine recovery rate when designing maintenance strategies. (3) As the sample size increases, decreases steadily. A larger sample size improves the sensitivity of the monitoring scheme, allowing earlier detection of process instability and reducing quality-related and maintenance costs. This suggests that managers should consider increasing the sample size when cost and operational constraints allow. (4) Conversely, an increase in the non-conforming rate leads to a rise in . A higher elevates the probability of producing non-conforming items within each inspection interval, thereby accelerating the degradation of downstream components. To mitigate the expected total cost, it is imperative for managers to strengthen upstream quality control measures aimed at reducing the non-conforming rate. Overall, is jointly affected by ,,, and . Among these, and have the strongest impact, while and play moderate roles. The joint effects of sample size and shift magnitude on indicate that a larger improves monitoring sensitivity, while a larger enables faster detection of process instability. Considering both together, the optimal strategy is to choose a moderate-to-large and set the control limit according to the expected . In addition, by accounting for the non-conforming rate and selecting an appropriate age reduction coefficient , managers can determine the optimal system hazard rate threshold , achieving and timely interventions.
7 Conclusions
This paper considers a CBM strategy that simultaneously monitors two data sources in the SMMS. First, the inherent degradation processes of QRCs are modeled using the Gamma process. To capture the influence of KPCs on QRCs degradation, the impact of upstream non-conforming items is incorporated. This degradation is modeled by a truncated normal distribution, from which the cumulative effect of all non-conforming items is derived. By integrating this with the inherent degradation, the total degradation of QRCs is obtained. The degradation of QRCs is further incorporated as a covariate into the constructed PHM. Meanwhile, the MGLR statistic is employed to monitor variations in the KPCs. By simultaneously considering the PHM and the MGLR control chart, a joint monitoring scheme is proposed. Based on the monitoring results, a diagnostic strategy is designed to identify the stages requiring maintenance, while taking into account the effects of different PM actions on the degradation of the QRCs. Subsequently, the possible alarm scenarios over a finite horizon are enumerated, and the corresponding cost formulations are derived. Finally, simulations are conducted to replicate the alarm scenarios and estimate the ETC. To minimize the ETC, a simulation-based genetic algorithm is employed to optimize the control limits and hazard rate thresholds.
In the numerical analysis, an example involving a four-stage serial scroll machining process is implemented, and we simulate the degradation process of QRCs and the variation of KPCs. Compared to existing approaches, the proposed joint CBM strategy achieves a lower total cost. Furthermore, we conducted a comprehensive sensitivity analysis to examine the influence of cost parameters on , and . The results indicate that and exert the most significant influence on these metrics. Based on the sensitivity analysis, managers should prioritize reducing the OC cost and the PM cost by negotiating more favorable service agreements and optimizing maintenance schedules. Additionally, we analyzed the effects of various process parameters on . The results reveal that a larger sample size , greater shift magnitude , higher age reduction coefficient generally lead to a lower , and a larger non-conforming rate leads to a larger .
Future work can move in two directions. First, this study only considers changes in variance of random errors, while mean shifts may also occur and jointly affect product quality; extending the framework to monitor both mean and variance of random errors would allow earlier and more accurate detection of quality problems. Second, the current approach reflects a reactive intervention, whereas predictive maintenance offers a proactive way to reduce downtime and quality risks; combining data on QRC degradation and product quality for joint predictive maintenance in SMMS would be a promising direction to improve reliability and quality.
8 Appendixes
8.1 Appendix A Proof of Eq. (7)
According to Eq. (4), we have
Considering the independence of and , we can get
Next, we prove the and respectively. Since and according to the moment generating function of the Gamma distribution, we can get
where is the probability density function of . Since is a compound Poisson process, we have
According to Assumption 1, the sequence is mutually independent, we have . It is worth mentioning that this study assumes that follows the truncated normal distribution, and the following derivations for other distributions are similar. Let , according to the moment generating function of the truncated normal distribution on , we have
where is the probability density function of . Since is a Poisson process, we have and
According to the Taylor series expansion of the exponential function, we have . Consequently, . Combing the Eqs. (A.3) and (A.6), the Eq. (7) is given by
where . This proves Eq. (7).
8.2 Appendix B The proof of Eq. (14)
At time , a series of observations has been collected, where denotes the vector of random errors observed at the th sampling of stage . Considering a test statistic of stage at time , which is given by
where is the pdf of and is the change point from to . Let
Based on Eq. (10), and , where . Taking and into Eq. (A.8), the is given by
Alaswad S,Xiang Y, (2017). A review on condition-based maintenance optimization models for stochastically deteriorating system. Reliability Engineering & System Safety, 157: 54–63
[2]
AndersonT W (1958). An Introduction to Multivariate Statistical Analysis (Vol. 2). Wiley New York
[3]
Bahrami H,Niaki S T A,Khedmati M, (2021). Monitoring multivariate profiles in multistage processes. Communications in Statistics. Simulation and Computation, 50( 11): 3436–3464
[4]
Bahria N,Chelbi A,Bouchriha H,Dridi I H, (2019). Integrated production, statistical process control, and maintenance policy for unreliable manufacturing systems. International Journal of Production Research, 57( 8): 2548–2570
[5]
Bera S,Mukherjee I, (2016). A multistage and multiple response optimization approach for serial manufacturing system. European Journal of Operational Research, 248( 2): 444–452
[6]
Boumallessa Z,Chouikhi H,Elleuch M,Bentaher H, (2023). Modeling and optimizing the maintenance schedule using dynamic quality and machine condition monitors in an unreliable single production system. Reliability Engineering & System Safety, 235: 109216
[7]
Cao Y,Liu S,Fang Z,Dong W, (2020). Modeling ageing effects in the context of continuous degradation and random shock. Computers & Industrial Engineering, 145: 106539
[8]
CaoYWangPXvWDongW (2025). Joint optimization of quality-based multi-level maintenance and buffer stock within multi-specification and small-batch production. Frontiers of Engineering Management, 1–20
[9]
Cassady C R,Bowden R O,Liew L,Pohl E A, (2000). Combining preventive maintenance and statistical process control: A preliminary investigation. IIE Transactions, 32( 6): 471–478
[10]
Chen Y,Jin J, (2005). Quality-reliability chain modeling for system-reliability analysis of complex manufacturing processes. IEEE Transactions on Reliability, 54( 3): 475–488
[11]
Cheng G,Li L, (2020). Joint optimization of production, quality control and maintenance for serial-parallel multistage production systems. Reliability Engineering & System Safety, 204: 107146
[12]
Cheng G Q,Zhou B H,Li L, (2018). Integrated production, quality control and condition-based maintenance for imperfect production systems. Reliability Engineering & System Safety, 175: 251–264
[13]
Ebadi M,Ahmadi-Javid A, (2020). Control charts for monitoring multi-stage service processes with optimal queue performance. Communications in Statistics. Simulation and Computation, 49( 9): 2472–2484
[14]
Feng T,He Y,Shi R,Li J,Dai W,Yu S, (2025). Functional risk modeling and selective maintenance optimization approach for multi-stage manufacturing system considering operational robustness. Reliability Engineering & System Safety, 256: 110775
[15]
Hadian S M,Farughi H,Rasay H, (2021). Joint planning of maintenance, buffer stock and quality control for unreliable, imperfect manufacturing systems. Computers & Industrial Engineering, 157: 107304
[16]
Han X,Wang Z,Xie M,He Y,Li Y,Wang W, (2021). Remaining useful life prediction and predictive maintenance strategies for multi-state manufacturing systems considering functional dependence. Reliability Engineering & System Safety, 210: 107560
[17]
He Y,Liu F,Cui J,Han X,Zhao Y,Chen Z,Zhou D,Zhang A, (2019). Reliability-oriented design of integrated model of preventive maintenance and quality control policy with time-between-events control chart. Computers & Industrial Engineering, 129: 228–238
[18]
Hu J,Chen P, (2020). Predictive maintenance of systems subject to hard failure based on proportional hazards model. Reliability Engineering & System Safety, 196: 106707
[19]
Huang S,Yang J,Xie M, (2020). A double-sampling SPM scheme for simultaneously monitoring of location and scale shifts and its joint design with maintenance strategies. Journal of Manufacturing Systems, 54: 94–102
[20]
Jafari L,Naderkhani F,Makis V, (2018). Joint optimization of maintenance policy and inspection interval for a multi-unit series system using proportional hazards model. Journal of the Operational Research Society, 69( 1): 36–48
[21]
Jin R,Deng X,Chen X,Zhu L,Zhang J, (2019). Dynamic quality-process model in consideration of equipment degradation. Journal of Quality Technology, 51( 3): 217–229
[22]
Jing H,Li J,Bai K, (2022). Directional monitoring and diagnosis for covariance matrices. Journal of Applied Statistics, 49( 6): 1449–1464
[23]
Kampitsis D,Panagiotidou S, (2022). A Bayesian condition-based maintenance and monitoring policy with variable sampling intervals. Reliability Engineering & System Safety, 218: 108159
[24]
Khatab A,Diallo C,Aghezzaf E H,Venkatadri U, (2019). Integrated production quality and condition-based maintenance optimisation for a stochastically deteriorating manufacturing system. International Journal of Production Research, 57( 8): 2480–2497
[25]
Kong D,Balakrishnan N,Cui L, (2017). Two-phase degradation process model with abrupt jump at change point governed by Wiener process. IEEE Transactions on Reliability, 66( 4): 1345–1360
[26]
Li S,Yang Z,He J,Li G,Yang H,Liu T,Li J, (2023). A novel maintenance strategy for manufacturing system considering working schedule and imperfect maintenance. Computers & Industrial Engineering, 185: 109656
[27]
Li Y,Peng S,Li Y,Jiang W, (2020). A review of condition-based maintenance: Its prognostic and operational aspects. Frontiers of Engineering Management, 7( 3): 323–334
[28]
Liu H C,Liu R,Gu X,Yang M, (2023). From total quality management to Quality 4.0: A systematic literature review and future research agenda. Frontiers of Engineering Management, 10( 2): 191–205
[29]
Liu P,Ma Y,Zhang C, (2025). Stream of variation modeling and monitoring for heterogeneous profiles in multi-stage manufacturing processes. IISE Transactions, 57( 11): 1263–1276
[30]
Lu B,Luo Y, (2024). A dynamic condition-based maintenance policy for heterogeneous-wearing tools with considering product quality deterioration. International Journal of Production Research, 62( 19): 7096–7113
[31]
Lu B,Zhou X, (2017). Opportunistic preventive maintenance scheduling for serial-parallel multistage manufacturing systems with multiple streams of deterioration. Reliability Engineering & System Safety, 168: 116–127
[32]
Lu B,Zhou X, (2019). Quality and reliability oriented maintenance for multistage manufacturing systems subject to condition monitoring. Journal of Manufacturing Systems, 52: 76–85
[33]
Lv X,Shi L,He Y,He Z,Lin D K, (2024). Joint optimization of production, maintenance, and quality control considering the product quality variance of a degraded system. Frontiers of Engineering Management, 11( 3): 413–429
[34]
Majdouline I,Dellagi S,Mifdal L,Kibbou E,Moufki A, (2022). Integrated production-maintenance strategy considering quality constraints in dry machining. International Journal of Production Research, 60( 9): 2850–2864
[35]
Mehrafrooz Z,Noorossana R, (2011). An integrated model based on statistical process control and maintenance. Computers & Industrial Engineering, 61( 4): 1245–1255
[36]
Mohamed-Larbi R,Daoud A K, (2024). Condition-based maintenance optimisation for multi-component systems using mean residual life. International Journal of Production Research, 62( 13): 4831–4855
[37]
MontgomeryD C (2019). Introduction to Statistical Quality Control. John wiley & sons. ISBN: 9781119646453
[38]
Naderkhani F,Makis V, (2016). Economic design of multivariate Bayesian control chart with two sampling intervals. International Journal of Production Economics, 174: 29–42
[39]
Odom G J,Newhart K B,Cath T Y,Hering A S, (2018). Multistate multivariate statistical process control. Applied Stochastic Models in Business and Industry, 34( 6): 880–892
[40]
Olde Keizer M C A,Flapper S D P,Teunter R H, (2017). Condition-based maintenance policies for systems with multiple dependent components: A review. European Journal of Operational Research, 261( 2): 405–420
[41]
Panagiotidou S,Tagaras G, (2010). Statistical process control and condition-based maintenance: a meaningful relationship through data sharing. Production and Operations Management, 19( 2): 156–171
[42]
Park C,Lee J, (2022). Monitoring profiles in multistage processes using the multivariate multiple regression model. Quality and Reliability Engineering International, 38( 7): 3437–3450
[43]
Pedersen T I,Liu X,Vatn J, (2023). Maintenance optimization of a system subject to two-stage degradation, hard failure, and imperfect repair. Reliability Engineering & System Safety, 237: 109313
[44]
QiuP (2013). Introduction to statistical process control. Boca Raton, FL: Chapman Hall/CRC
[45]
Rasay H,Taghipour S,Sharifi M, (2022). An integrated maintenance and statistical process control model for a deteriorating production process. Reliability Engineering & System Safety, 228: 108774
[46]
Sabri-Laghaie K,Fathi M,Zio E,Mazhar M, (2022). A novel reliability monitoring scheme based on the monitoring of manufacturing quality error rates. Reliability Engineering & System Safety, 217: 108065
[47]
Salmasnia A,Abdzadeh B,Namdar M, (2017). A joint design of production run length, maintenance policy and control chart with multiple assignable causes. Journal of Manufacturing Systems, 42: 44–56
[48]
Shang Y,Tsung F,Zou C, (2013). Statistical process control for multistage processes with binary outputs. IIE Transactions, 45( 9): 1008–1023
[49]
Shi H,Zhang J,Zio E,Zhao X, (2023). Opportunistic maintenance policies for multi-machine production systems with quality and availability improvement. Reliability Engineering & System Safety, 234: 109183
[50]
Shi J, (2023). In-process quality improvement: Concepts, methodologies, and applications. IISE Transactions, 55( 1): 2–21
[51]
Shi L,Lv X,He Y,He Z, (2024). Optimising production, maintenance, and quality control for imperfect manufacturing systems considering timely replenishment. International Journal of Production Research, 62( 10): 3504–3525
[52]
Shi L,Lv X,Wang Y,He Y,He Z, (2025). Joint optimization strategy based on a three-state manufacturing system considering dynamic buffers. International Journal of Production Economics, 283: 109587
[53]
Shojaee M,Noori S,Jafarian-Namin S,Johannssen A, (2024). Integration of production–maintenance planning and monitoring simple linear profiles via Hotelling’s T2. Computers & Industrial Engineering, 188: 109864
[54]
TaguchiG (1986). Introduction to quality engineering: Designing quality into products and processes
[55]
Tasias K A, (2022). Integrated quality, maintenance and production model for multivariate processes: A Bayesian approach. Journal of Manufacturing Systems, 63: 35–51
[56]
Tasias K A, (2024). Simultaneous optimization of inventory, maintenance, and quality for production systems subject to multiple mean and variance shifts. Communications in Statistics-Theory and Methods, 53( 9): 3078–3101
[57]
Vlok P,Coetzee J,Banjevic D,Jardine A K,Makis V, (2002). Optimal component replacement decisions using vibration monitoring and the proportional-hazards model. Journal of the Operational Research Society, 53( 2): 193–202
[58]
Wan Q,Chen L,Zhu M, (2023). A reliability-oriented integration model of production control, adaptive quality control policy and maintenance planning for continuous flow processes. Computers & Industrial Engineering, 176: 108985
[59]
Wang J,Luo L,Mu G,Ma Y,Ni C, (2025). Joint optimization of quality control and maintenance policy for a production system with quality-dependent failures. Expert Systems with Applications, 272: 126800
[60]
Wang L,Lu Z,Ren Y, (2020). Joint production control and maintenance policy for a serial system with quality deterioration and stochastic demand. Reliability Engineering & System Safety, 199: 106918
[61]
Wang Y,Xia T,Xu Y,Si G,Wang D,Pan E,Xi L, (2024). Quality-centered production and maintenance scheduling for multi-machine manufacturing systems under variable operating conditions. Reliability Engineering & System Safety, 250: 110264
[62]
WuC JHamadaM S (2011). Experiments: Planning, Analysis, and Optimization. John Wiley & Sons. ISBN: 9781119786891
[63]
Xiang Y, (2013). Joint optimization of X control chart and preventive maintenance policies: A discrete-time Markov chain approach. European Journal of Operational Research, 229( 2): 382–390
[64]
Ye Z,Cai Z,Zhou F,Zhao J,Zhang P, (2019). Reliability analysis for series manufacturing system with imperfect inspection considering the interaction between quality and degradation. Reliability Engineering & System Safety, 189: 345–356
[65]
Yeganeh A,Johannssen A,Chukhrova N,Rasouli M, (2024). Monitoring multistage healthcare processes using state space models and a machine learning based framework. Artificial Intelligence in Medicine, 151: 102826
[66]
Yeung T G,Cassady C R,Schneider K, (2007). Simultaneous optimization of [Xbar] control chart and age-based preventive maintenance policies under an economic objective. IIE Transactions, 40( 2): 147–159
[67]
Zhang J,Chen X,An Y,Zhang L,Shi H,Xu W, (2025). Joint optimization of preventive maintenance and product quality improvement policies for deteriorating manufacturing systems with quality-reliability dependency. Reliability Engineering & System Safety, 255: 110661
[68]
Zhang Y,Yan M,Ke C, (2024). Joint design of VSI Tr control chart and equipment maintenance in high quality process. Journal of Process Control, 136: 103177
[69]
Zhao X,Chen P,Tang L C, (2025). Condition-based maintenance via Markov decision processes: A review. Frontiers of Engineering Management, 12( 2): 330–342
[70]
Zheng M,Lin J,Xia T,Liu Y,Pan E, (2023). Joint condition-based maintenance and spare provisioning policy for a K-out-of-N system with failures during inspection intervals. European Journal of Operational Research, 308( 3): 1220–1232
[71]
Zheng R,Zhou Y, (2021). Comparison of three preventive maintenance warranty policies for products deteriorating with age and a time-varying covariate. Reliability Engineering & System Safety, 213: 107676
[72]
Zou C,Tsung F, (2008). Directional MEWMA schemes for multistage process monitoring and diagnosis. Journal of Quality Technology, 40( 4): 407–427
RIGHTS & PERMISSIONS
The Author(s). This article is published with open access at link.springer.com and journal.hep.com.cn