Machine learning for membrane bioreactor research: principles, methods, applications, and a tutorial

Yizhe Lai, Kang Xiao, Yifan He, Xian Liu, Jihua Tan, Wenchao Xue, Aiqian Zhang, Xia Huang

Front. Environ. Sci. Eng. ›› 2025, Vol. 19 ›› Issue (3) : 34. DOI: 10.1007/s11783-025-1954-2
REVIEW ARTICLE


Highlights

● Principles and methods of machine learning for MBR application are summarized.

● Available models for MBR pollutant removal and membrane fouling are reviewed.

● A tutorial is given to illustrate machine learning models for fouling prediction.

● Limitations and future improvements for MBR intelligent operation are discussed.

Abstract

Membrane fouling poses a significant challenge to the sustainable development of membrane bioreactor (MBR) technologies for wastewater treatment. The accurate prediction of the membrane filtration process is of great importance for identifying and controlling fouling. Machine learning methods address the limitations of traditional statistical approaches, such as low accuracy, poor generalization ability, and slow convergence, particularly in predicting complex filtration and fouling processes within the realm of big data. This article provides an in-depth exposition of machine learning theory. The study then reviews advances in MBRs that utilize machine learning methods, including artificial neural networks (ANN), support vector machines (SVM), decision trees, and ensemble learning. Based on current literature, this study summarizes and compares the model input and output characteristics (including foulant characteristics, solution environments, filtration conditions, operating conditions, and time factors), as well as the selection of models and optimization algorithms. The modeling procedures of SVM, random forest (RF), back propagation neural network (BPNN), long short-term memory (LSTM), and genetic algorithm-back propagation (GA-BP) methods are elucidated through a tutorial example. The simulation results demonstrated that all five methods yielded accurate predictions with R2 > 0.8. Finally, the existing challenges in the implementation of machine learning models in MBRs were analyzed. It is notable that integration of deep learning, automated machine learning (AutoML) and explainable artificial intelligence (XAI) may facilitate the deployment of models in practical engineering applications. The insights presented here are expected to facilitate the establishment of an intelligent control framework for MBR processes in future endeavors.


Keywords

Membrane bioreactor / Machine learning / Pollutant removal / Membrane fouling / Model prediction

Cite this article

Yizhe Lai, Kang Xiao, Yifan He, Xian Liu, Jihua Tan, Wenchao Xue, Aiqian Zhang, Xia Huang. Machine learning for membrane bioreactor research: principles, methods, applications, and a tutorial. Front. Environ. Sci. Eng., 2025, 19(3): 34 https://doi.org/10.1007/s11783-025-1954-2

1 Introduction

Wastewater treatment and reclamation are essential strategies for alleviating the water crisis caused by water scarcity and pollution. The membrane bioreactor (MBR) technology, a combination of membrane separation and biological treatment (Yamamoto et al., 1989), has been widely used in recent years owing to its advantages of excellent and stable effluent quality, small footprint, and low residual sludge production (Xiao et al., 2014; Krzeminski et al., 2017; Xiao et al., 2019; Qu et al., 2022). However, membrane fouling during MBR operation leads to decreased flux, deteriorated separation efficiency, increased energy consumption, and shortened membrane lifespan, limiting the techno-economic sustainability (Xiao et al., 2019; Qu et al., 2022).
Membrane fouling is closely related to membrane properties, mixed liquor properties, and operating conditions. The fouling behavior of membrane materials, such as polyvinylidene fluoride, polyethersulfone, polyethylene, and polyacrylonitrile, varies depending on their hydrophilicity/hydrophobicity, pore structure, and surface roughness, all of which influence the membrane-foulant interaction (Yamato et al., 2006; Zhang et al., 2008). Generally, a hydrophobic membrane is more prone to fouling than a hydrophilic one (Choi et al., 2002). Membrane pore and foulant size play interactive roles in fouling. Foulants of comparable size to pores can tightly clog pores owing to size exclusion (Meireles et al., 1991), whereas small-size foulants can cause adsorptive fouling within the pores (Kawakatsu et al., 1993). A narrower pore size distribution is favorable for maintaining a stable flux (Shimizu et al., 1990; Meireles et al., 1991). The membrane surface roughness influences the fluid dynamics for particle deposition at the micrometer scale and the intermolecular contact for foulant adsorption at the nanometer scale (Xu et al., 2020). Empirically, a smoother membrane surface tends to impede the progress of cake layer formation (Vatanpour et al., 2011; Sadeghi et al., 2013; Panda et al., 2015). Mixed liquor suspended solids (MLSS), extracellular polymeric substances (EPS), and soluble microbial products (SMP) all contribute to membrane fouling in MBRs. MLSS is responsible for overall fouling, especially at concentrations above 10 g/L. EPS and SMP, on the other hand, are critical contributors to physically irreversible fouling. MLSS, EPS, and SMP are closely related to the operating conditions of MBR such as hydraulic retention time (HRT), sludge retention time (SRT), and food-to-microorganism ratio (Huang et al., 2011). For instance, an excessively short HRT or long SRT can lead to high concentrations of MLSS and SMP, which can exacerbate fouling. Nonetheless, unusually low MLSS concentrations can also exacerbate fouling, probably due to the excessive release of EPS (Yoon, 2015).
Fouling control strategies can be mainly categorized into three groups: mixed liquor conditioning, filtration conditions adjustment, and physical/chemical cleaning of the fouled membranes (Meng et al., 2017). Mixed liquor conditioning involves the addition of materials such as suspended carriers, particles, coagulants, ozone, and other chemicals (Wu and Huang, 2008; Kurita et al., 2014, 2015; Juntawang et al., 2017; Zhang et al., 2017; Zhang et al., 2022). These additives can tailor the properties of SMP and EPS and affect the floc structure (Wu et al., 2006; Wu and Huang, 2008; Juntawang et al., 2017; Zhang et al., 2017). The overall fouling rate and cake/gel layer reversibility are influenced by aeration intensity and filtration/relaxation intervals (Liu et al., 2020b). Aeration or air scouring removes foulants from the membrane surface by increasing the cross-flow shear. Physical cleaning (e.g., tap water rinsing, membrane effluent rinsing, staggered flow rinsing, backwash, and ultrasonic cleaning) can mitigate reversible fouling, whereas chemical cleaning (using acid, alkali, oxidant, chelating agent, and surfactants) can further alleviate irreversible fouling. However, these fouling control measures are mainly taken a posteriori based on observation of fouling phenomena (such as the increase in transmembrane pressure, TMP) that have already occurred, resulting in a certain lag between fouling and fouling control. Incorrect control measures can reduce effectiveness, waste energy and chemicals, and lead to membrane damage. Therefore, it is essential to apply suitable control measures with accurate timing and dosage. To achieve early warning and timely control of membrane fouling, it is crucial to develop a model to predict the fouling tendency in its infancy.
Data-driven prediction and regulation models provide a new approach for precise control of membrane fouling. Researchers have progressed in fouling prediction using conventional statistical models. For instance, Zhang et al. (2012) established a partial least squares model (R2 = 0.84) to predict membrane flux from mixed liquor characteristics, including MLSS/EPS/SMP concentrations, relative hydrophobicity, average particle size, and osmotic pressure. Chen et al. (2022) developed a time series model taking temperature as a covariate and online cleaning events as switching variables to predict the trend of TMP in an anaerobic MBR (AnMBR) across seasons (R2 = 0.91). Conventional statistical models can well reflect the relationships between input and output variables. However, these models rely on a priori knowledge of these relationships, which inevitably leads to the following shortcomings: (a) insufficient fitting accuracy, weak generalization ability, and poor adaptability to fresh samples due to underestimation of the complexity of the real relationships and (b) low computational efficiency for big data analysis and slow convergence when dealing with multiple variables with complex interactions because of the limitations of the model structure. The diversity and complexity of multiple interactive variables are inherent in actual membrane fouling systems, making the use of conventional statistical models challenging.
Machine learning is a recent statistical development that complements conventional statistical models. Machine learning has gained increasing attention in environmental engineering as a general artificial intelligence technology (Zhong et al., 2021). Born out of but different from statistical models, machine learning focuses on accurately estimating complex functions using a computer, rather than providing statistical confidence intervals for these functions. Machine learning algorithms automatically “learn” from experience (data) to improve the system performance (Bishop, 2006). As a black-box model, it has a strong fitting ability, good adaptability, and prediction accuracy in dealing with problems with unknown response functions, complex variable relationships, and large amount of data. These advantages provide good prospects for fouling prediction in MBR systems. In recent years, machine learning has been gradually applied to predicting membrane filtration performances, such as flux, resistance, and permeability, and lent new quantitative support to the analysis of fouling mechanisms (Niu et al., 2022).
This study first introduces four common machine learning models, then reviews the application of machine learning models in predicting pollutant removal and fouling performance, and finally analyzes the limitations of existing models to help future researchers develop new machine learning models for improved fouling prediction in MBRs.

2 Principles and methods of machine learning

Machine learning constructs a model using “training data” to make predictions or decisions without explicit a priori assumptions or programming. Fig.1(a) and 1(b) depict the general and detailed steps of machine learning, respectively. Machine learning methods are commonly categorized into supervised learning, unsupervised learning, and reinforcement learning (RL), as illustrated in Fig.2. To select an appropriate model, it is necessary to consider several factors, including model principles, problem type (classification, regression, time series, etc.), data volume, feature dimensions, and interpretation requirements. When there are a variety of models to choose from, the optimal model can be selected by comparing model performance, computational cost, and interpretability. Stemming from artificial intelligence, machine learning has found extensive application in the field of environmental science and engineering, enabling the construction of predictive models using diverse data sets, the assessment of feature importance via model interpretation, anomaly detection through historical data comparison, and the advancement of new materials (Zhong et al., 2022).
Fig.1 Machine learning process: (a) general procedure; (b) detailed steps.

Fig.2 Classification of machine learning methods.

2.1 Machine learning methods

This section mainly introduces three common machine learning methods: support vector machine (SVM), artificial neural network (ANN), and decision tree. Additionally, a concise overview of the k-nearest neighbor (KNN) algorithm is provided. These four methods have demonstrated their efficacy in addressing classification problems with discrete variables and regression problems with continuous variables. Tab.1 summarizes the four methods and their respective scopes of application.
Tab.1 Summary and comparison of machine learning methods
| Method | Advantages | Disadvantages | Scope of environmental application | Potential application scenarios and examples in MBRs |
| --- | --- | --- | --- | --- |
| SVM | Supervised learning; clear mathematical basis; robust performance with limited sample size; rapid prediction capabilities; strong interpretability | Susceptibility to missing data; high computational complexity; inappropriate for large data sets | Classification or regression problems where the sample size is not particularly large | Prediction of TMP and resistance (Liu et al., 2020a) |
| ANN | Supervised learning; diversified variants and innovations; wide range of applicable sample sizes; appropriateness for a variety of input data forms; robust nonlinear mapping capability | Complex parameter setting; difficulty in determining the optimal network structure; gradient vanishing and exploding issues | Classification, regression, time series, and feature self-extraction for various sample sizes | Prediction of effluent quality and biogas yield (Li et al., 2022); prediction of flux and flux recovery rate (Zhao et al., 2020); identification of fouling type (Shi et al., 2022) |
| Tree-based model | Supervised learning; if-then logical basis; strong interpretability | Susceptibility to noise or missing data; difficulty in handling exclusive-OR logic; proneness to overfitting | Interpretable classification or regression problems where the sample size is not particularly large | Prediction of effluent quality (Zhuang et al., 2021); prediction of flux (Li et al., 2020) |
| KNN | Supervised learning; insensitivity to outliers; appropriateness for multimodal and multi-label classification problems | High computational complexity; susceptibility to sample imbalance; low interpretability | Primarily applicable to numerical and nominal data | Outlier identification and screening |

2.1.1 Support vector machine

SVM is a binary classification method rooted in statistical learning theory, implementing a structural risk minimization strategy. It accomplishes data transformation from a low-dimensional space to a high-dimensional space by constructing a kernel function. SVM can be utilized for solving both classification (support vector classification, SVC) and regression (support vector regression, SVR) problems by finding the optimal dividing hyperplane that classifies the sample data into different categories with the widest “margin” at the border. The sample points closest to the border determine the position of the hyperplane and are thus called support vectors.
SVM is a suitable method for solving nonlinear mapping problems in high-dimensional space with limited samples, and has a certain generalization ability. The input to SVM typically comprises a series of data points containing multiple features. The number of features is typically ≥ 2, depending on the modeling problem. When dealing with a small sample size (e.g., a training sample size less than 125 (Qian et al., 2015)), the SVM model generally exhibits superior generalization performance and prediction accuracy. This advantage arises from its relatively low complexity and solid statistical theoretical foundation. However, the computational efficiency decreases when the sample size is large, which may require a longer computation time. SVM is also susceptible to missing data. Therefore, it is imperative to screen the data set for outliers and to address missing values prior to SVM modeling. Furthermore, it is important to consider the interpretability of the high-dimensional mapping.
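As a minimal illustration of support vector regression with an RBF kernel on a small data set, the following MATLAB sketch (not taken from the original paper; the data, variable names, and parameter values are arbitrary assumptions) uses fitrsvm from the Statistics and Machine Learning Toolbox:

```matlab
% Minimal SVR sketch with an RBF kernel (synthetic data, hypothetical parameter values).
rng(1);                                          % reproducibility
X = rand(100, 3);                                % 100 samples, 3 features
y = 5*X(:,1) - 2*X(:,2).^2 + 0.1*randn(100, 1);  % nonlinear target with noise

% Fit epsilon-SVR; BoxConstraint plays the role of the penalty C,
% KernelScale the role of the RBF kernel width sigma.
mdl = fitrsvm(X, y, 'KernelFunction', 'rbf', ...
              'BoxConstraint', 10, 'KernelScale', 'auto', 'Standardize', true);

yhat = predict(mdl, X);                          % in-sample prediction
fprintf('Training RMSE = %.3f\n', sqrt(mean((y - yhat).^2)));
```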
SVM has a variety of applications in water engineering and water environments, including leakage detection in water supply systems (McMillan et al., 2024), structural analysis or characterization of pollutants (Zhong and Guan, 2023), spectral analysis of water quality (Mallet et al., 2022), monitoring of reactor operation (Vasilaki et al., 2020), ecological monitoring and control in watersheds (Kim et al., 2021a), and early warning of water pollution events (Oliker and Ostfeld, 2014). SVM has also been used to monitor and assess air quality (Li et al., 2017) and to rapidly estimate soil organic carbon (Li et al., 2015).

2.1.2 Artificial neural network

An ANN comprises a multitude of fundamental neurons that mimic the learning process of the human brain for solving diverse practical issues. As a representative example of connectionist learning, the ANN is one of the most frequently used models in machine learning. An ANN typically contains input, hidden, and output layers. A layer can have a few to millions of neurons, and the number of neurons determines the complexity of the hidden relationships among the data to be learned. The input layer receives the training data, which are nonlinearly transformed by one or more hidden layers, and the output layer produces the transformed results, thereby realizing an input-to-output mapping. An ANN’s ability to deal with complex nonlinear problems can be increased by increasing the “depth” (adding more layers), increasing the “width” (adding more neurons to a single layer), and optimizing the activation function. ANNs can be employed to address classification, regression, and time-series problems, and through their multiple variants they are suitable for problems with varying sample sizes. The input data are multivariate and can be continuous, discrete, or matrix/image data.
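A minimal MATLAB sketch of this idea (assuming the Deep Learning Toolbox; the layer sizes, activation choices, and data are hypothetical) shows how width, depth, and activation functions of a small MLP are set:

```matlab
% Minimal MLP sketch: adjusting "width", "depth", and activation functions (synthetic data).
X = rand(8, 200);                        % 8 input features x 200 samples (columns are samples)
y = sum(X(1:3, :)) + 0.05*randn(1, 200); % synthetic target depending on the first 3 features

net = feedforwardnet([10 5]);            % two hidden layers of widths 10 and 5
net.layers{1}.transferFcn = 'tansig';    % hidden-layer activation functions
net.layers{2}.transferFcn = 'logsig';
net.trainFcn = 'trainlm';                % Levenberg-Marquardt back-propagation training

[net, tr] = train(net, X, y);            % train with the toolbox's default data division
yhat = net(X);                           % forward pass: the learned input-to-output mapping
```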
Multilayer perceptron (MLP, also known as back-propagation neural network (BPNN)) (Fig.3(a)) and radial basis function neural network (RBFNN) (Fig.3(b)) are the simplest neural networks, typically employed to address regression problems; they can also be applied to classification and time-series problems. Compared to MLP and RBFNN, deep neural networks (DNNs), such as the convolutional neural network (CNN) (Fig.3(c)), recurrent neural network (RNN) (Fig.3(d)), and graph neural network (GNN), possess more intricate network architectures and are capable of autonomously extracting features, which reduces the need for human intervention and enhances the quality of feature extraction (Zhang et al., 2018). CNNs frequently utilize spatial or image data as inputs to achieve image recognition or spatial feature extraction through convolutional computation (Zeiler and Fergus, 2014; Gao et al., 2019; Kiranyaz et al., 2021). There have been many innovations in CNNs, such as GoogLeNet (Szegedy et al., 2015), depth-based CNNs (Szegedy et al., 2016), DenseNet (Huang et al., 2017), wide residual networks (Zagoruyko and Komodakis, 2016), and the dual-channel CNN (Ma et al., 2023). RNNs, in particular long short-term memory (LSTM) networks (Greff et al., 2017), demonstrate advantages when the input data exhibit temporal characteristics. It is important to note that DNNs present a challenge in terms of increased model complexity and reduced interpretability.
Fig.3 Schematic diagrams of artificial neural networks: (a) MLP, (b) RBFNN, (c) CNN, (d) RNN.

It should be noted that different neural networks may apply different hidden layer activation functions. Table S1 in Appendix A provides a summary of the hidden layer activation functions that may be employed. In contrast to the conventional sigmoid-type activation functions, the wavelet neural network (WNN) employs a wavelet function as the hidden layer activation function (Alexandridis and Zapranis, 2013). Compared to the traditional MLP, the WNN has several benefits, such as faster network convergence, avoidance of local optima, and the ability to conduct local time-frequency analysis.
The application of ANN is pervasive in the environmental field, encompassing a multitude of domains. These include the operation or optimization of membrane process systems (Wang et al., 2024b), wastewater treatment (Al-Ghazawi and Alawneh, 2021), prediction of novel pollutants such as disinfection by-products (Kulkarni and Chellam, 2010), prediction of surface or groundwater quality, biogas production (Liu et al., 2021), and environmental adsorption.

2.1.3 Decision tree and ensemble learning

The decision tree is a machine learning method characterized by a tree-like structure. Three common decision tree algorithms (Table S2 in Appendix A), namely ID3, C4.5 (Quinlan, 1993), and the classification and regression tree (CART) (Breiman et al., 1984), use different metrics to split samples. A single decision tree is typically applied to classification problems, whereas the CART method also supports regression problems. The decision tree algorithm can be considered as a set of if-then rules or a conditional probability distribution defined in both the feature and class spaces. This results in low model complexity and good interpretability. However, overfitting may occur, potentially leading to weak generalization. To mitigate this, techniques such as pruning, cross-validation (CV), and ensemble learning can be employed.
Ensemble learning is a machine learning algorithm that combines a group of base learners (e.g., decision trees, BPNN, etc.) according to a specific strategy to improve the generalization performance of these base learners. Bagging and boosting are two typical ensemble learning implementation algorithms. Random forest (RF) (Breiman, 2001) is an extension of the bagging method. Adaptive boosting, gradient boosting decision tree (GBDT), and eXtreme gradient boosting (Chen and Guestrin, 2016) are extensions of the boosting method. Ensemble learning offers several advantages over base learner approaches, including high accuracy, good generalization performance, fast training speed, good robustness, minimal feature engineering, and a wide range of application scenarios.
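As a minimal illustration of bagging, the MATLAB sketch below (hypothetical data and settings, assuming the Statistics and Machine Learning Toolbox) grows a random forest of regression trees with TreeBagger and reports the out-of-bag error:

```matlab
% Minimal bagging/random-forest sketch with TreeBagger (hypothetical data and settings).
rng(2);
X = rand(300, 5);                                  % 300 samples, 5 features
y = X(:,1).*X(:,2) + 0.5*X(:,3) + 0.05*randn(300, 1);

rf = TreeBagger(200, X, y, ...                     % 200 bootstrapped CART base learners
                'Method', 'regression', ...
                'MinLeafSize', 5, ...
                'OOBPrediction', 'on');            % keep out-of-bag samples for error estimation

yhat   = predict(rf, X);                           % ensemble (averaged) prediction
oobErr = oobError(rf);                             % out-of-bag MSE as trees are added
fprintf('OOB MSE with all trees = %.4f\n', oobErr(end));
```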
Similar to ANN and SVM, decision trees and their ensemble learning extensions are widely used, for example, in water quality monitoring or classification (Xu et al., 2021), nanoplastic identification (Xie et al., 2023), pollutant degradation (Zhang et al., 2023), air quality prediction (Chen et al., 2020), groundwater contamination prediction (Bindal and Singh, 2019), soil contamination source identification (Zhou and Li, 2024), and ecological and environmental evaluation (Espel et al., 2020). With better interpretability, tree models are employed not only for environmental prediction but also for environmental management and decision-making (Jiang et al., 2021).

2.1.4 k-nearest neighbors

KNN is a supervised machine learning method that uses the distances between feature vectors as the basis for classification or regression: the k nearest neighbors of a sample to be predicted determine its class (by majority vote) or its value (by averaging). KNN has obvious advantages, such as high precision, insensitivity to outliers, and no assumptions about the underlying data distribution. However, it is important to note that KNN has certain drawbacks, including high computational and space complexity. Hence, KNN is primarily applicable for handling numerical and nominal data.
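A minimal MATLAB sketch of KNN regression (hypothetical data; k chosen arbitrarily) uses knnsearch from the Statistics and Machine Learning Toolbox to find the neighbors and averages their labels:

```matlab
% Minimal k-nearest-neighbor regression sketch (synthetic data; k is an arbitrary choice).
rng(3);
Xtrain = rand(150, 4);
ytrain = sum(Xtrain, 2) + 0.1*randn(150, 1);
Xquery = rand(10, 4);
k = 5;

% Find the k nearest training samples of each query point (Euclidean distance).
idx = knnsearch(Xtrain, Xquery, 'K', k);

% Regression: average the labels of the k neighbors; classification would use a majority vote.
ypred = mean(ytrain(idx), 2);
disp(ypred');
```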
KNN has a more circumscribed range of applications in the environmental domain than SVM, ANN, and tree models. Nevertheless, it can still be employed to solve a wide range of complex environmental problems, including water quality monitoring (Uddin et al., 2023), wastewater treatment control (Xu et al., 2022), adsorption evaluation (Nguyen et al., 2022), and air quality prediction (Tella et al., 2021).

2.1.5 Other methods

In addition to supervised and unsupervised learning, RL is another class of machine learning in which an agent learns an optimal policy for mapping states to actions through interaction with the environment, with the objective of maximizing the cumulative reward (Byeon, 2023). The Markov decision process is a fundamental RL framework. RL is a commonly employed methodology for addressing decision-making and control issues. Several studies have employed RL to optimize wastewater treatment control, including the removal of phosphorus (Mohammadi et al., 2024) and the reduction of energy consumption.
The development of large models (also known as foundation models) represents a significant advancement in the field of artificial intelligence. These models are commonly employed in the domains of natural language processing, computer vision, and multimodal problems. Large models are distinguished by their large parameter scale, complex computational structure, multitask learning, and emergent capabilities. ChatGPT is currently one of the most prominent examples (Ahmed et al., 2024). At present, large models are employed primarily for addressing large-scale or long-term environmental problems, such as weather forecasting (Bi et al., 2023) and global methane emissions (Rouet-Leduc and Hulbert, 2024), given the necessity of a substantial number of samples to serve as a foundation for training.
Another rapidly evolving field is automated machine learning (AutoML), which aims to automate the process of building machine learning models. A series of automated procedures, including data processing, feature engineering, model/algorithm selection, hyperparameter optimization, and model evaluation, has been established to minimize the need for intervention by model developers and to enhance model quality (Salehin et al., 2024). AutoML is a valuable tool in computer vision and natural language processing and has been employed in the environmental sector to predict potential energy surfaces (Abbott et al., 2019) and water quality (Senthil Kumar et al., 2024).

2.1.6 Related algorithms

Fuzzy logic and Monte Carlo methods are frequently utilized algorithms in machine learning. Fuzzy logic is a multi-valued logic approach that handles uncertainty and imprecise information by simulating human reasoning (Zadeh, 2023), and it finds extensive application in control systems (Castillo et al., 2008). The combination of fuzzy logic and neural networks gives rise to the fuzzy neural network, which leverages pre-existing data to generate an expert knowledge base and predicts outcomes through fuzzy logic inference (de Campos Souza, 2020). The Monte Carlo method is a numerical calculation method grounded in probability and statistics theory (Raychaudhuri, 2008). Relying on the law of large numbers, it achieves random approximation by repeatedly sampling a data set and performing randomized tests. Monte Carlo methods have been employed in machine learning, particularly in reinforcement learning; for example, AlphaGo applied Monte Carlo tree search in reinforcement learning to improve the decision-making ability of its neural networks (Silver et al., 2016). The integration of these two algorithms with machine learning enables the resolution of complex and uncertain problems in the context of big data.
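The random-approximation idea behind the Monte Carlo method can be illustrated with a minimal MATLAB sketch (the integrand and sample size are arbitrary choices for illustration):

```matlab
% Minimal Monte Carlo sketch: estimate an expectation by repeated random sampling
% (law of large numbers); here the integral of exp(-x^2) over [0, 1].
rng(4);
N  = 1e5;                        % number of random samples
x  = rand(N, 1);                 % X ~ Uniform(0, 1)
fx = exp(-x.^2);                 % f(X)

estimate = mean(fx);             % Monte Carlo estimate of the integral
stderr   = std(fx)/sqrt(N);      % standard error shrinks as 1/sqrt(N)
fprintf('Integral ~= %.4f +/- %.4f\n', estimate, stderr);
```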

2.2 Model optimization

Machine learning involves the optimization of multiple parameters during the learning process. A straightforward method for model optimization is CV (including k-fold CV, for which there is no general rule for choosing k), where the training data are first split into k subsets, which are then assigned to two parts (Browne, 2000): the training part (k−1 subsets) to train the model, and the validation part (the remaining subset) to check the error. These two parts of the data are used to obtain the model with the least generalization error. CV requires a substantial amount of time owing to repeated re-training and re-validation.
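A minimal MATLAB sketch of k-fold CV (k = 5 chosen arbitrarily; data and model are hypothetical) is given below; cvpartition handles the repeated train/validation splits:

```matlab
% Minimal k-fold cross-validation sketch (synthetic data; SVR used as an example model).
rng(5);
X = rand(200, 4);
y = X*[1; -2; 0.5; 3] + 0.1*randn(200, 1);
k = 5;
cvp = cvpartition(size(X, 1), 'KFold', k);

mse = zeros(k, 1);
for i = 1:k
    trIdx = training(cvp, i);      % k-1 subsets for training
    teIdx = test(cvp, i);          % the remaining subset for validation
    mdl   = fitrsvm(X(trIdx,:), y(trIdx), 'KernelFunction', 'rbf', 'KernelScale', 'auto');
    mse(i) = mean((y(teIdx) - predict(mdl, X(teIdx,:))).^2);
end
fprintf('Mean CV MSE = %.4f\n', mean(mse));   % estimate of the generalization error
```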
Heuristic intelligent optimization algorithms can significantly improve the model performance and convergence rate during the learning process. Intelligent optimization algorithms usually draw inspiration from natural, biological, or physical phenomena. Common intelligent optimization algorithms include genetic algorithms (GA) (Fig.4(a)) (Katoch et al., 2021), particle swarm optimization (PSO) (Fig.4(b)), simulated annealing (SA) (Fig.4(c)) (Suman and Kumar, 2006), artificial bee colony (Karaboga and Basturk, 2007), ant colony optimization (Dorigo et al., 2006), the firefly algorithm (FFA) (Yang, 2009), the bat algorithm (BA) (Yang, 2010), and the gray wolf optimizer (GWO) (Mirjalili et al., 2014). These algorithms have been widely used in various fields to solve optimization problems with high efficiency and accuracy. It is important to note that each algorithm has its own strengths and weaknesses.
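As one hedged example of such use (assuming the Global Optimization Toolbox; the data, search bounds, and GA settings are hypothetical), a GA can tune SVR hyperparameters by minimizing the cross-validated error:

```matlab
% Minimal sketch: GA searching SVR hyperparameters (log10 C, log10 sigma) to minimize CV error.
rng(6);
X = rand(150, 4);
y = sin(2*pi*X(:,1)) + X(:,2) + 0.1*randn(150, 1);

% Fitness = 5-fold cross-validated MSE of an RBF-kernel SVR with candidate hyperparameters.
fitness = @(p) kfoldLoss(crossval( ...
    fitrsvm(X, y, 'KernelFunction', 'rbf', ...
            'BoxConstraint', 10^p(1), 'KernelScale', 10^p(2)), 'KFold', 5));

lb = [-2 -2];  ub = [3 2];          % search 1e-2..1e3 for C and 1e-2..1e2 for sigma
opts = optimoptions('ga', 'PopulationSize', 20, 'MaxGenerations', 15, 'Display', 'off');
best = ga(fitness, 2, [], [], [], [], lb, ub, [], opts);
fprintf('Best C = %.3g, best sigma = %.3g\n', 10^best(1), 10^best(2));
```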
Fig.4 Intelligent optimization algorithms: (a) genetic algorithms (GA), (b) particle swarm optimization (PSO), (c) simulated annealing (SA).

2.3 Assessment of model performance

A statistical model’s performance can usually be evaluated through indicators, such as the coefficient of determination (R2), mean square error (MSE), root mean square error (RMSE), and mean absolute percentage error (MAPE) (Niu et al., 2022). The receiver operating characteristic (ROC), area under curve (AUC), accuracy, precision, and recall are frequently employed to assess the efficacy of a binary classifier. R2, MSE, RMSE, MAPE, ROC, AUC, accuracy, precision, and recall can evaluate the performance of a model, but they are not designed to consider the model complexity (related to the number of unknown parameters).
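For reference, the standard definitions of the regression metrics used throughout this review (textbook forms, not quoted from any particular study) are

```latex
R^2 = 1 - \frac{\sum_{i=1}^{n}(y_i - \hat{y}_i)^2}{\sum_{i=1}^{n}(y_i - \bar{y})^2}, \qquad
\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(y_i - \hat{y}_i)^2}, \qquad
\mathrm{MAPE} = \frac{1}{n}\sum_{i=1}^{n}\left|\frac{y_i - \hat{y}_i}{y_i}\right|
```

where y_i, ŷ_i, and ȳ denote the observed values, predicted values, and mean of the observations, respectively, and n is the number of samples.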
Information criteria, such as the Akaike information criterion (AIC) (Akaike, 1974), Bayesian information criterion (BIC) (Gideon, 1978), and Hannan-Quinn criterion (HQC) (Hannan and Quinn, 1979), can be employed to achieve optimal model selection by balancing model complexity and model performance. For a large sample size, the penalty on the number of model parameters increases in the order AIC < HQC < BIC (Tu and Xu, 2012). The stronger the penalty, the more the criterion favors a low-dimensional model. In general, smaller AIC, BIC, or HQC values indicate better model fitting and greater accuracy.
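Their standard forms (textbook definitions, with \hat{L} the maximized likelihood, k the number of estimated parameters, and n the sample size) are

```latex
\mathrm{AIC} = 2k - 2\ln\hat{L}, \qquad
\mathrm{BIC} = k\ln n - 2\ln\hat{L}, \qquad
\mathrm{HQC} = 2k\ln(\ln n) - 2\ln\hat{L}
```

which makes explicit how the parameter penalty grows with the sample size for BIC and HQC but not for AIC.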

2.4 Assessment of model interpretation

Variable importance metrics aid researchers in better understanding the data generation process by evaluating the relative importance of independent variables with respect to a dependent variable (Kruskal, 1987).
Common tree model interpretation methods include the Shapley value (Samek, 2020) and the TreeExplainer method (Lundberg et al., 2020). However, model interpretability decreases when the decision-making involves multiple trees; advanced tree ensembles therefore fall into the category of black boxes (Samek, 2020). For random forests, variable importance is commonly measured by the change in out-of-bag prediction accuracy (e.g., MSE or classification accuracy) or in the Gini index when a given variable is permuted (Grömping, 2015).
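A minimal MATLAB sketch of this out-of-bag permutation importance (hypothetical data; assuming the Statistics and Machine Learning Toolbox) is:

```matlab
% Minimal sketch: out-of-bag permutation importance for a random forest (synthetic data).
rng(7);
X = rand(300, 5);
y = 3*X(:,1) + X(:,2).^2 + 0.05*randn(300, 1);     % features 1-2 matter, 3-5 are noise

rf = TreeBagger(300, X, y, 'Method', 'regression', ...
                'OOBPredictorImportance', 'on');   % track permutation importance

% Increase in out-of-bag error when each predictor is permuted; larger = more important.
imp = rf.OOBPermutedPredictorDeltaError;
bar(imp);  xlabel('Predictor index');  ylabel('\Delta OOB error');
```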
In models like ANN and SVM, the relative importance of input features can be assessed using the simple R2 (Hosseinzadeh et al., 2020). Variable importance can be evaluated through sensitivity analysis (Cortez and Embrechts, 2013). For a neural network, it is possible to calculate the variable importance of input variables to the output using methods based on the connection weights of neurons. Several methods, such as those of Garson (1991), Goh (1995), Gedeon (1997) and Olden (Olden et al., 2004), allow for this assessment. Notably, Gedeon’s and Olden’s methods use the weights of all connections, with Gedeon's method being tailored specially for deep learning.
Recently, explainable artificial intelligence (XAI) has seen application in various scientific fields, including water quality prediction (Madni et al., 2023) and understanding MBR process (Chang et al., 2022). Partial dependence plot, individual conditional expectation, local interpretable model-agnostic explanations, and Shapley additive explanations (SHAP) are suitable for model interpretation (Molnar, 2019). In particular, SHAP has the desirable properties of local accuracy, missingness, and consistency, thus can be used to interpret the model from global, local, and feature interaction perspectives (Aldrees et al., 2024b). These methods can be used to interpret or explain all kinds of machine learning models, including deep learning models.

2.5 Model selection guide

The selection of an appropriate modeling method is of paramount importance. Unsupervised learning methods are employed when the data set lacks output labels, whereas supervised learning methods are used when the data set contains output labels. The first step is to determine the type of problem. For problems with numerical output labels (i.e., regression or time-series problems), models such as SVR, RF, and ANN can be selected, whereas for problems with discrete or nominal output data (i.e., classification problems), SVC, RF, and ANN are suitable. In cases involving multiple decision-making processes, reinforcement learning may prove an advantageous solution. Subsequently, the magnitude of the data set must be taken into consideration. When the sample size is limited, SVM, RF, and MLP models with relatively simple structures can achieve adequately robust results, whereas DNNs perform better when the sample size is large. The characteristics of the input data are also of great importance: LSTM is appropriate for time-series data, CNN is suitable for matrix data with image characteristics, and GNN is suitable for directly processing graph-structured data.

3 Application of machine learning in MBRs

3.1 Overview of application

In recent years, machine learning, as an artificial intelligence technology, has gained widespread usage for predicting MBR performances. Since 2015, there has been a noticeable increase in the number of papers on machine learning models in MBRs, with a sharp rise after 2018 (Fig. S1 in Appendix A). The statistical analysis of models, features, and optimization provides a more comprehensive reflection of the research progress in MBR performance prediction. Among the reviewed papers, 34.3% focused on predicting pollutant removal and 67.1% were dedicated to predicting membrane fouling.
The characteristics/parameters of the membrane process can be categorized into the following main groups: the time parameter (t); conventional concentration indices (CCI), including chemical oxygen demand (COD), total nitrogen (TN), and total phosphorus (TP) in the influent and effluent, total suspended solids (TSS) in the influent, MLSS in the mixed liquor, and other parameters; membrane filtration indices (MFI), including TMP, membrane flux, filtration resistance, ΔTMP/Δt, membrane permeability, and other parameters; environment indices (EI), including temperature (T), dissolved oxygen (DO), pH, oxidation reduction potential (ORP), organic loading rate (OLR), and other parameters; operation indices (OI), including SRT, HRT, filtration-relaxation ratio, aeration intensity, backwash strength/time, and other parameters; and characteristic foulant indices (CFI), including the concentrations of SMP, loosely-bound EPS, and tightly-bound EPS, among others. These characteristics can be used as model inputs, as can other parameters such as sludge particle size, viscosity, zeta potential, blocking coefficient, and membrane pore size. Spectroscopic measurement results, such as spectral data and spectral grayscale maps, may also be used as model inputs to facilitate automatic spectral feature extraction. With the aforementioned model inputs, pollutant removal and membrane fouling performances are typically obtained as the model outputs.
Fig.5(a) shows the models for MBR fouling prediction. For this purpose, CCI (60.4%) and MFI (68.8%) were used as the majority of input characteristics, and membrane flux (52.7%) was selected as the primary indicator of membrane fouling performance. Of the established models, ANN models accounted for 72.9%, with a focus on relatively simple model structures, including MLP at 39.6% and RBFNN at 18.8%. Additionally, DNN and SVM models were utilized, accounting for 8.3% and 18.8%, respectively. For model tuning, over half (56.3%) of the models did not employ any optimization algorithms, 33.3% chose intelligent optimization algorithms, and 8.3% used simple CV to optimize the parameters. Among the intelligent optimization algorithms, GA was predominant at 56.5%, followed by PSO at 18.9%.
Fig.5 Distribution of different machine learning models used for MBR research: (a) membrane fouling prediction models, (b) pollutant removal prediction models.

The prediction models for MBR pollutant removal are illustrated in Fig.5(b). None of the model inputs employed CFI. Instead, the majority of the models utilized CCI (75.2%), OI (62.5%), and EI (54.2%) as input characteristics. Only 20.8% of the models incorporated MFI into the input characteristics. Regarding the model outputs, 79.2% of the models aimed to predict the crucial carbon-related indices (such as effluent COD and COD removal rate). The nitrogen-related indices (such as the removal rates and effluent concentrations of TN, ammonium nitrogen (NH3-N), and nitrate nitrogen (NO3-N)) followed at 58.3%, while a lower percentage of models (16.7%) were targeted at the prediction of trace organic pollutants. In terms of model structure, a significant proportion of the models employed ANN models, with MLP models being the most favored option at 66.7%. Compared with membrane fouling prediction models, pollutant removal prediction models are usually simpler, with fewer models using optimization algorithms.
Figure S2 in Appendix A shows the relationship between R2 and the number of parameters of different ANN models, with model capacity denoted by the number of parameters. As illustrated in Fig. S2, the stability of the MLP model is inferior, which is potentially influenced by the data set or model parameter settings. The introduction of optimization algorithms, such as GA, leads to an overall improvement in model performance, potentially resulting from the algorithm optimization or the refinement of the model parameter training process. Conversely, the WNN model shows commendable results with a modest parameter count, which is likely attributable to the construction of a more complex activation function.
In general, machine learning is a prevalent tool for MBRs with satisfactory performance. Nevertheless, it is important to recognize the limitations of an incomplete indicator system, the lack of full-scale engineering validation, the difficulty of real-time prediction, and the limited contribution to process understanding.

3.2 Machine learning models to predict pollutant removal performances

The application of machine learning models to predict the pollutant removal performance of MBRs is summarized in Table S3 in Appendix A. These applications are mainly based on ANNs. The MLP model serves as the foundation for most studies, with efforts focused on optimizing the network structure and activation functions. Based on MLP, Kim et al. (2021b) used near-infrared spectroscopy to predict the concentrations of effluent pollutants (COD, TN, NH3-N, nitrite nitrogen, NO3-N, and phosphate) as well as SMP and EPS in the mixed liquor with R2 > 0.97. The MLP topologies for predicting these three types of pollutants/foulants were 5-11-6, 5-9-1, and 5-9-2, respectively.
Some MLP models were specifically focused on predicting the removal of trace organic pollutants, in addition to the detection of routine water quality (Wolf et al., 2001; Wolf et al., 2003). Researchers have attempted to optimize the activation function of the hidden layer using radial basis function (RBF) and wavelet functions. Mirbagheri et al. (2015b) established an RBFNN model with a topology of 5-5-1 to evaluate the performance of a submerged MBR in the treatment of a combined urban and industrial wastewater, using the influent concentrations (biochemical oxygen demand (BOD), COD, NH3-N, TP), influent total dissolved solids, HRT, volatile MLSS (MLVSS), and mixed liquor pH as input characteristics to predict the effluent concentrations (BOD, COD, NH3-N, and TP) with R2 > 0.98. Cai et al. (2019b) established a WNN model with a topology of 3-2-1 to predict the effluent quality (COD: R2 = 0.999; NH3-N: R2 = 0.997) with the influent COD, NH3-N, and salinity as input characteristics. They also found that the WNN model had better performance than the MLP model in predicting effluent COD and TN (Cai et al., 2019a).
Researchers have also optimized the model hierarchy with structures that are more complex than MLPs, such as CNN, DenseNet, and LSTM. Li et al. (2022) established three DNN models, including fully connected network (FCN), CNN, and DenseNet, to predict the effluent pH, effluent COD, COD removal rate, biogas (CH4, N2, and CO2) yield, and redox potential of an AnMBR by considering the ambient temperature, influent water temperature, influent pH, influent COD, mixed liquor temperature, and membrane flux as input characteristics. The prediction accuracy of the DenseNet model reached 97.4%, whereas those of FCN and CNN were 92.6% and 91.8%, respectively. Yaqub et al. (2020) established an LSTM to predict the removal of TN, TP, and NH3-N by an anaerobic/anoxic/aerobic-MBR process using the influent water quality (total organic carbon (TOC), TN, TP, COD, NH3-N, and suspended solids) and operating parameters (DO, ORP, and MLSS) as input characteristics, and the results showed that the model performed the best in predicting NH3-N removal rate with MSE = 0.0047.
Based on the above analysis, previous studies have predominantly used MLP models to predict pollutant removal performances. More complex ANN models, such as WNN, CNN, and LSTM, are occasionally utilized. In addition to enhancing the model structure, some researchers have optimized machine learning models by applying intelligent optimization algorithms (e.g., FFA, PSO, and GWO) to improve model accuracy and achieve more effective solutions to complex problems (Aldrees et al., 2024a). However, some models did not include MFI as an input variable and thus may have underestimated the role of membranes in pollutant removal. Furthermore, with regard to output variables, previous research primarily concentrated on the removal of C, N, and P, with insufficient attention paid to trace pollutants. ANNs seemed to be less effective in predicting trace organic pollutants owing to the diversified behavior of trace pollutants during degradation. Optimizing the activation function can improve the performance of ANNs, and tree models may outperform ANNs. It is worth noting that complicating the model structure may not straightforwardly improve the performance of ANNs, which may be due to an insufficient amount of data, errors in the measured data, or complex relationships between variables.

3.3 Machine learning models to predict membrane fouling

Membrane fouling often involves organic, inorganic, biological, and composite fouling. Organic compounds, such as polysaccharides, proteins, and humic substances, contribute to various stages of membrane fouling resulting in reversible or irreversible fouling (Lin et al., 2014; Xu et al., 2020). Pollutant removal in MBRs involves a combination of biological processes and membrane retention, implying that the formation of membrane fouling is inherently linked to pollutant removal. Compared to considering solely pollutant removal, membrane fouling is a complex process with higher nonlinearity between parameters. Such complexity offers enormous potential for the implementation of machine learning models.

3.3.1 ANN for membrane fouling prediction

ANN is a popular machine learning model with strong nonlinear fitting capabilities and has demonstrated good performance in predicting membrane fouling. The prediction of membrane fouling can be classified into two categories: filtration state prediction (such as flux, TMP, and permeability) and membrane fouling analysis (such as fouling type, flux recovery rate, and membrane interfacial energy). Tab.2 summarizes the ANN models used to predict the MBR filtration state. To improve the prediction, some researchers have modified the MLP model by applying optimization/training algorithms, changing the hidden layer activation function, and adjusting the model hierarchy. Sensitivity factor analysis was used to interpret the models or identify significant parameters.
Tab.2 Examples of ANN models to predict membrane filtration state in MBRs
| Model | Optimization | Hidden layer activation function | Structural features | Input parameters | Output parameters | Training algorithm | Fitting performance | Ref. |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ENN | – | – | 9-55-1 | T, SRT, TSS, ODR, TMP, dTMP/dt, filtration and backwash time | Flux | – | AD = 2.7% | Geissler et al., 2005 |
| MLP | – | – | 3-5-1 | Backwash time, operation time, flux | Flux | LM | R2 = 0.99 | Aidan et al., 2008 |
| MLP | GA | log-sigmoid | – | MLSS, TMP, resistance | Flux | LM | MAPE = 0.0331 | Li et al., 2014 |
| MLP | GA | tan-sigmoid | 5-10-1 | Time, MLSS, COD, SRT, TSS | TMP, permeability | LM | R2 = 0.98 (TMP); R2 = 0.98 (permeability) | Mirbagheri et al., 2015b |
| RBFNN | GA | RBF | 5-5-1 | Time, MLSS, COD, SRT, TSS | TMP, permeability | LM | R2 = 0.98 (TMP); R2 = 0.99 (permeability) | Mirbagheri et al., 2015b |
| MLP | GA | tan-sigmoid | 6-8-1 | Flux, aeration ratio, concentrations of SMP and EPS, initial TMP, running time | TMP | Bayesian rule | Relative MSE = 0.024 | Wang and Wu, 2015 |
| RBFNN | CV | RBF | 2-2-1 | Aeration volume, TMP | Flux | – | R2 = 0.80 | 2017 |
| MLP | – | log-sigmoid | 6-5-1 | Influent (TN, NO3-N, TP), effluent (TN, NO3-N, TP) | TMP | LM | R2 = 0.85 | Schmitt et al., 2018 |
| Fuzzy-RBFNN | PSO | log-sigmoid | 2-14-49-1 | Flux, membrane flux variation | Flux | – | MAPE = 0.0287 | Tao and Li, 2018 |
| MLP | PSO | – | – | Temperature, flux, TMP, MLSS | Resistance | LM | R2 = 0.97 | Hamedi et al., 2019 |
| MLP | – | tan-sigmoid | 4-8-1 | MLSS, EC, DO, time | Flux | LM | R2 = 0.98 | Hosseinzadeh et al., 2020 |
| RBFNN | – | RBF | 1-3-1 | Permeate pump pressure | Flux, TMP | LM | R2 > 0.90 | Abdul Wahab et al. |
| MLP | – | tan-sigmoid | 1-5(7)-1 | Permeate pump pressure | Flux, TMP | LM | R2 > 0.88 | Abdul Wahab et al. |
| RNN | – | – | – | EC, flux | EC, flux | – | RMSE = 18 mS/cm (EC); RMSE = 1.1 LMH (flux) | Viet et al., 2021 |
| MLP | – | tan-sigmoid | 4-30-30-14-5-5-5-5-5-5-1 | pH, EC, influent TN and NH3-N | Flux, resistance | LM | R2 = 0.88 (flux); R2 = 0.86 (resistance) | Viet and Jang, 2021 |
| WNN | BA | Bandelet function | 5-12-2 | MLSS, sludge particle size, EPS, SMP, sludge viscosity, RH, zeta potential | Flux, membrane flux recovery rate | Gradient descent method | MAPE = 0.032 | Zhao et al., 2020 |
| MLP | – | – | 3-17-2 | MLSS, HRT, time | Flux, COD removal rate | LM | R2 = 0.9996 | Hazrati et al., 2017 |
| ANFIS | – | – | – | OLR, effluent pH, MLSS, MLVSS | TMP | LM | R2 = 0.98 | Taheri et al., 2021 |
| MLP | – | log-sigmoid | 6-9-1 | Time, flux, influent COD, pH, MLSS, TMP rate of change | Permeability | – | R2 = 0.9985 | Yao et al., 2022 |
| MLP | – | tan-sigmoid | 3-9-1 | Disc rotational speed, membrane-to-disc gap, OLR | Permeability | LM | R2 = 0.999 | Irfan et al., 2022 |
| MLP | CV | tan-sigmoid | 6-6-1 | Sludge filterability, MLVSS, pH, influent COD, T, cleaning cycle | Permeability | BFGS | R2 = 0.93 | Alkmim et al., 2020 |

Note: ENN = Elman neural network; ODR = oxygen decay rate; EC = electrical conductivity; ANFIS = adaptive network-based fuzzy inference system; BFGS = Broyden-Fletcher-Goldfarb-Shanno; RH = relative hydrophobicity; AD = average deviation; LM = Levenberg-Marquardt; “–” = not reported.

GA and fuzzy logic have been used in the modified MLP models. Levenberg-Marquardt (LM) is a widely used training algorithm; some researchers have also used the Bayesian rule, the gradient descent algorithm, or the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm for model training. Wang and Wu (2015) predicted TMP by inputting the flow rate, aeration ratio, initial TMP, running time, and the concentrations of characteristic foulants (SMP and EPS) to obtain the jump point of TMP with a relative MSE of 0.024. They established an MLP model with a topology of 6-8-1, optimized the weights and biases of the model by the GA algorithm, and trained the model with the Bayesian rule. Additionally, they showed that the performance of MLP with small sample sizes was less stable than that of traditional mathematical models. Alkmim et al. (2020) established an MLP model with a topology of 6-6-1, trained it with the BFGS algorithm, and optimized the parameters by CV. They considered sludge filterability, MLVSS, pH, influent COD, temperature, and cleaning cycle as input characteristics to predict membrane permeability with R2 = 0.93. The optimization/training algorithms appear useful for improving the models. Meanwhile, the impact of sample size on machine learning performance in addressing membrane fouling remains a concern.
RBF and wavelet functions have been used to optimize the hidden layer activation function in fouling prediction models. Mirbagheri et al. (2015a) used an RBFNN model optimized by the GA algorithm to predict TMP and membrane permeability from five input indicators, including operating time, TSS, COD, SRT, and MLSS, with R2 > 0.98. Sensitivity analysis highlighted the importance of operating time and mixed liquor MLSS as influencing factors. Zhao et al. (2020) used the Bandelet function (approximate to the wavelet function) as the hidden layer activation function to establish a Bandelet neural network, trained the network using the gradient descent method, and introduced the BA to optimize the parameters. They realized the prediction of membrane flux and membrane flux recovery rate using mixed liquor properties such as MLSS, sludge particle size, EPS, SMP, sludge viscosity, relative hydrophobicity, and zeta potential, with a relative error of 3.2% on the entire data set.
RNNs are suitable for predicting time-varying membrane fouling because of their advantage in processing sequential data. The Elman neural network (ENN) is a rudimentary form of RNN with local memory cells and local feedback connections, and it was used for membrane fouling prediction in early studies. Geissler et al. (2005) established an ENN model with a network topology of 9-55-1 to predict membrane flux with an average deviation of 2.7%. Through sensitivity analysis, they revealed that the best membrane backwash condition was high-pressure backwash with short intervals. More recently, Viet et al. (2021) established an RNN model to predict the mixed liquor conductivity and membrane flux of an osmotic membrane bioreactor over 40 d, showing acceptable RMSEs of 18 mS/cm and 1.1 LMH, respectively. The above studies have shown that consideration of time-series factors can help to better predict membrane fouling, while model interpretation can help to design more optimal operating conditions or process flows. These findings may contribute to the refined operation of MBRs.
Many researchers have used ANNs, such as MLP, RBFNN, RNN, and CNN, for membrane fouling analysis. Chen et al. (2012) developed an MLP model that used the bioprocess aeration capacity, membrane aeration capacity, mixed liquor recirculation flowrate, and membrane flux as input variables to predict the energy consumption per unit water production in a full-scale MBR with R2 exceeding 0.55. Zhao et al. (2019) established an RBFNN model to quantify the MBR membrane interfacial energy by considering the contact angles of water/glycerol/diiodomethane on the sludge/membrane surface, the zeta potentials of the sludge/membrane surface, and the distance between particulate sludge and the membrane surface as input variables. The computation time required by the RBFNN was only approximately 1/50 of that of the analytically extended Derjaguin-Landau-Verwey-Overbeek method. Shi et al. (2022) established a CNN model based on the attention mechanism. The model utilized a processed grayscale image set as input to classify membrane fouling with a diagnostic accuracy of 98%. Besides supervised learning methods, unsupervised learning methods have also been used to analyze membrane fouling for operational process optimization (Woo et al., 2022). Although online monitoring is not available for most model features in the above applications, these models still possess strong analytical capabilities. These applications offer new insights for researchers to analyze the mechanisms of membrane fouling and provide a modeling basis for the targeted control of membrane fouling.

3.3.2 Other model applications

Other models, such as SVM (including SVR), least squares SVM (LSSVM), RF, eXtreme gradient boosting, and GBDT, have also been applied for membrane fouling prediction in MBRs.
Table S4 in Appendix A presents examples of using SVM or tree-based models to predict membrane fouling. Although the application of these models is not as widespread as ANN, they still demonstrate good predictive abilities in various scenarios. Hamedi et al. (2019) established an LSSVM to simplify the optimization of a linear equation system. MLSS, TMP, flux, and temperature were selected as input parameters to predict the filtration resistance. LSSVM model outperformed both the PSO-MLP model (R2 = 0.96) and the gene expression programming model (R2 = 0.98), achieving an R2 of 0.99. Li et al. (2020) developed an RF model with 300 trees and 2 node variables. MLSS, TMP, and membrane resistance were selected as the main input features, as pre-evaluated via principal component analysis (PCA), to predict the membrane flux. The results showed that the RF model (R2 = 0.95) outperformed the SVM model (R2 = 0.92) and the MLP model (R2 = 0.89). As seen from this example, the RF models may be more suitable than SVM and MLP models for predicting membrane fouling. The use of PCA for feature selection in these applications indicates the possibility of multicollinearity between factors that affect membrane fouling. Although the factors contributing to membrane fouling are complex, a few representative indicators can be selected for online monitoring and prediction.

4 Tutorial example

4.1 Method

According to the above literature review, BPNN, SVM, and RF have emerged as prevalent machine learning models, among which the BPNN and SVM models have been most widely used in MBRs. The GA is the predominant algorithm used for model optimization. In addition, the introduction of the long short-term memory mechanism in LSTM (as a variant of RNN) has demonstrated significant promise in addressing time-series problems. Given the complex nature of fouling formation compared with pollutant removal, the real-time prediction of membrane fouling is crucial, particularly considering the dynamic and time-sensitive interactions between foulants and membranes.
This tutorial aims to apply five typical machine learning methods (SVM, RF, BPNN, LSTM, and GA-BP) to predict membrane fouling in MBR systems. A virtual data set (see Appendix B for the data sheet) was created for the machine learning practice in this example, which was conducted on the MATLAB platform.

4.1.1 Data preprocessing

One should first specify the independent variables (as input characteristics) and the dependent variables (as output characteristics) from the raw data set. In this example, TMP was selected as the target output to characterize the fouling state. Potentially important influencing factors for fouling were selected as input characteristics according to common understanding and preliminary investigations. In this example, variables such as temperature, MLSS, pH, DO in the aerobic zone, influent water quality (COD, TN, and TP), and membrane flux were selected as input characteristics. To eliminate the adverse effects caused by singular data and expedite convergence, the “mapminmax” function in MATLAB was applied to normalize the input and output data to the range of [0,1]. Subsequently, 70% of the sample data was randomly selected as the training set, and the remaining 30% was used as the test set.
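A minimal sketch of this preprocessing step is given below (it is not the Appendix C code; the variable name data and the column order are assumptions, with a random placeholder standing in for the virtual data set of Appendix B):

```matlab
% Preprocessing sketch following the steps described above (not the Appendix C code).
% "data": N x 9 matrix assumed as [T, MLSS, pH, DO, COD_in, TN_in, TP_in, flux, TMP].
data = rand(2000, 9);              % placeholder for the virtual data set in Appendix B

X = data(:, 1:8)';                 % inputs as rows of features (mapminmax works row-wise)
Y = data(:, 9)';                   % output: TMP
[Xn, psX] = mapminmax(X, 0, 1);    % normalize each feature to [0, 1]; keep the settings
[Yn, psY] = mapminmax(Y, 0, 1);

n   = size(Xn, 2);
idx = randperm(n);                 % random 70%/30% train/test split
nTr = round(0.7*n);
XTrain = Xn(:, idx(1:nTr));     YTrain = Yn(:, idx(1:nTr));
XTest  = Xn(:, idx(nTr+1:end)); YTest  = Yn(:, idx(nTr+1:end));
% Predictions in original TMP units can be recovered later with mapminmax('reverse', ., psY).
```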

4.1.2 Selection of model and policy

The MATLAB software was used for the modeling and programming (see Appendix C for the exemplary codes). For the SVM modeling, the LibSVM toolkit was employed for regression prediction, with the RBF kernel chosen as the kernel function. The parameters C (the penalty coefficient, representing the tolerance for error) and G (a parameter of the RBF kernel function, calculated as G = 1/(2σ_RBF²), which implicitly determines the distribution of the data mapped to the new feature space) were optimized through CV. To construct the BPNN, LSTM, and RF models, the neural network toolbox and random forest toolbox were selected from the MATLAB packages. The back-propagation training algorithm was employed for neural network model development. For GA-BP, a hybrid model of the GA and BPNN algorithms was established, which involves a three-step process of selection, crossover, and mutation to globally optimize the weights and biases of the neural network.
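The following MATLAB sketch illustrates the GA-BP idea described above (it is not the authors' Appendix C code; the hidden-layer size, GA settings, and weight bounds are assumptions, and XTrain/YTrain/XTest come from the preprocessing sketch). The GA provides a globally optimized starting point, which Levenberg-Marquardt back-propagation then refines:

```matlab
% GA-BP sketch: GA pre-optimizes the BPNN weights/biases, then LM back-propagation refines them.
net = feedforwardnet(10, 'trainlm');          % one hidden layer with 10 neurons (assumed size)
net = configure(net, XTrain, YTrain);         % fix the network dimensions to the data
nWb = length(getwb(net));                     % total number of weights and biases

% Fitness = training MSE of the network with a candidate weight/bias vector installed.
fitness = @(wb) mean((YTrain - sim(setwb(net, wb), XTrain)).^2);

opts = optimoptions('ga', 'PopulationSize', 30, 'MaxGenerations', 20, 'Display', 'off');
wb0  = ga(fitness, nWb, [], [], [], [], -ones(1, nWb), ones(1, nWb), [], opts);

net   = setwb(net, wb0);                      % selection/crossover/mutation result as start point
net   = train(net, XTrain, YTrain);           % fine-tune with LM back-propagation
YPred = net(XTest);                           % normalized TMP predictions on the test set
```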

4.1.3 Model evaluation

The reliability and accuracy of each model were evaluated using three statistical parameters: R2, RMSE, and MAPE. In general, MAPE and RMSE values closer to 0 (and R2 closer to 1) indicate more accurate prediction and better model performance.
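The three metrics can be computed directly from the test-set predictions, as in the brief sketch below (yhat and ytest are assumed to be prediction and observation vectors on the same scale, e.g., after reverse normalization).

```matlab
% Minimal sketch (assumed names): computing R2, RMSE, and MAPE from test-set
% observations ytest and predictions yhat on the same (original) scale.
res  = ytest(:) - yhat(:);
R2   = 1 - sum(res.^2) / sum((ytest(:) - mean(ytest(:))).^2);
RMSE = sqrt(mean(res.^2));
MAPE = mean(abs(res ./ ytest(:)));        % reported as a fraction, as in Tab.3
fprintf('R2 = %.4f, RMSE = %.4f, MAPE = %.4f\n', R2, RMSE, MAPE);
```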

4.2 Results and discussion

4.2.1 Presentation of raw data

Table S5 displays the mean, standard deviation, and maximum and minimum values of the input and output data. The input variables consisted of T, MLSS, pH, aerobic-zone DO, influent COD, influent TN, influent TP, and flux. The output variable was TMP. Machine learning models establish a mapping relationship between input and output variables to predict fouling. The raw data set comprises 2000 samples, with 70% randomly selected for training and the remaining 30% for testing.

4.2.2 Model performance

The set of model parameters is presented in Section 1 of Appendix A. Tab.3 compares the prediction results of the SVM, RF, BPNN, LSTM, and GA-BP methods. All five models demonstrated good fitting ability for the full data set. The SVM, BPNN, LSTM, and GA-BP performed similarly on the training and testing sets, whereas RF showed good fitting ability but weaker generalization. Although RFs are generally considered resistant to overfitting (Peter et al., 1998), the limited generalization observed here may originate from correlation and redundancy among the randomly drawn independent variables (Wu et al., 2012). The generalization ability of the model can be improved by increasing the number of trees, performing feature selection, and optimizing the pruning algorithm (Yang et al., 2012); a simple diagnostic sketch is given after Tab.3. The LSTM showed slightly better predictive ability than the BPNN, and the GA-BP improved the prediction compared with the plain BPNN.
Tab.3 Performance of different machine learning models for the tutorial example
Metric  Data set    SVM     RF      BPNN    LSTM    GA-BP
R2      Training    0.8208  0.9017  0.8199  0.8206  0.8201
R2      Testing     0.8124  0.7344  0.8096  0.8175  0.8128
R2      All data    0.8184  0.8537  0.8170  0.8197  0.8180
RMSE    Training    1.4075  1.0424  1.4107  1.4080  1.4100
RMSE    Testing     1.3955  1.6605  1.4058  1.3765  1.3940
RMSE    All data    1.4039  1.2601  1.4092  1.3987  1.4052
MAPE    Training    0.0616  0.0463  0.0629  0.0630  0.0628
MAPE    Testing     0.0624  0.0737  0.0636  0.0619  0.0625
MAPE    All data    0.0618  0.0545  0.0631  0.0626  0.0627
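As noted above, one quick way to check whether a larger forest would improve the RF's generalization is to track the out-of-bag error as trees are added; the sketch below assumes the training matrices prepared in the preprocessing step and is illustrative only.

```matlab
% Minimal sketch (assumed names from the preprocessing step): track the
% out-of-bag (OOB) error as trees are added to judge whether a larger forest
% would improve the RF's generalization.
rf = TreeBagger(500, Xtrain', ytrain', ...
    'Method', 'regression', 'OOBPrediction', 'on');
plot(oobError(rf));                       % OOB mean squared error vs. number of trees
xlabel('Number of grown trees');
ylabel('Out-of-bag mean squared error');
```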

5 Summary and prospect

Machine learning models, including ANNs, SVMs, and decision trees, have been used to predict membrane filtration performance in MBRs. Several models have been reported to exhibit fairly good fitting performance and generalization ability. The model parameters can be optimized using intelligent optimization algorithms such as GA and PSO to enhance predictive performance. The application of SHAP to the interpretation of MBR prediction models has developed rapidly (Aldrees et al., 2024a; Niu et al., 2024). However, the present models face challenges in terms of: (a) inadequate input features and monitoring indices, (b) limited interpretability and generalizability of model predictions, and (c) lack of utilization in automated process control. This calls for further advancements in modeling techniques and deeper integration of modeling into the physical world (e.g., monitoring and control systems). Therefore, we recommend exploring the constraints and enhancements of machine learning in MBR engineering validation applications at four levels: constitution of model features, online monitoring of model features, application toward process control, and data sharing and generalization.
(1) The model input parameters can be extended to achieve a more accurate and comprehensive description of key features. Compared with the rough indices currently used to describe overall membrane fouling, more specific indicators can be refined to describe fouling behavior (e.g., fouling potential for specific fouling stages) and foulant properties (e.g., polysaccharide/protein/humus concentrations and molecular characteristics). Another requirement is to extend the current index system toward a more complete coverage of possible factors. Previous models for pollutant removal efficiency have paid less attention to the effects of membrane operating state (TMP, flux, permeability, MFI, etc.) and the resultant pollutant interception by the membrane. In fouling prediction models, MFI and CCI have been frequently utilized as input parameters, whereas OI, which is related to membrane backwash and scouring, has not been adequately included. Traditional concentration parameters can be integrated with membrane status and operating conditions in future models. The inclusion of these more accurate and complete monitoring indices as input features would thus facilitate machine learning for MBR processes.
(2) Online monitoring of the key parameters is crucial for developing “smart” models that respond in a timely manner to the real-time operating status of MBRs. Conventional online monitoring items, such as temperature, pH, DO, turbidity, and COD, are inadequate to provide accurate details of pollutants/foulants for modeling. For instance, COD only reflects the overall concentration of organic matter, without revealing its chemical composition and molecular structure. Elaborate measurements of these properties are often laborious, time-consuming, and unsuitable for online monitoring. Alternatively, spectroscopic methods (such as ultraviolet, visible, and fluorescence spectroscopy) offer new possibilities for real-time reflection of molecular details. These techniques are fast, sensitive, and informative for exploring molecular fingerprints, and are promising supplements to the online monitoring system. Combining online spectral indicators with membrane operating status and control conditions is beneficial for developing models that are practically implementable for process control.
(3) Feedforward models are required to support the early warning and proactive control of MBR processes. For optimized operation of MBRs in terms of pollutant removal and fouling mitigation, preventive actions should be taken in advance rather than reacting to adverse events. However, current MBRs lack feedforward control. Although most models have demonstrated fairly good performance in simulating existing data, their capability to predict future tendencies remains insufficient. Owing to incomplete input features and the lack of predictive models, intelligent feedforward fouling control has not yet been realized. In future modeling, attention should be paid to tendency indices (e.g., fouling potential) as model outputs, or to incorporating time-series concepts to forecast future performance from the current operating status. Moreover, the model results should be coupled with an automatic control system to make them practical.
(4) A widely shared database can be constructed to enhance the interpretability and generalizability of the models. The interpretability of “black box” models remains a longstanding issue, and the complexity of physical/chemical/biological processes in MBR systems poses new requirements and obstacles for the interpretability of predictive models. Currently, the application of model interpretation and deep learning models in MBR prediction remains limited. Various methods, such as decision rule analysis, correlation coefficient comparison, sensitivity factor analysis, and SHAP, can help explain the models (as sketched below), but have not been sufficiently used. The incompleteness of input features makes interpreting a model in terms of its real physical meaning even harder. To support a robust deep learning model with sufficient input features and wide generalizability, a large amount of representative data is critical, which requires a significant workload for data collection and preprocessing. To address this challenge, an open database similar to ImageNet would be of great value, where different researchers can share data and develop predictive models with better generalization and broader application by using large and diverse experimental or engineering data.
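For instance, a SHAP-style explanation of a trained fouling model can be obtained with only a few lines in MATLAB; the sketch below assumes the shapley object of the Statistics and Machine Learning Toolbox (R2021a or later) and a regression model mdl trained on a predictor table Xtrain, all of which are illustrative assumptions rather than part of this study.

```matlab
% Minimal sketch (assumed names): SHAP-style explanation of one operating
% condition for a trained regression model mdl (e.g., a fouling predictor),
% using MATLAB's shapley object (Statistics and Machine Learning Toolbox).
explainer = shapley(mdl, Xtrain);          % build the explainer over the training predictors
explainer = fit(explainer, Xtest(1, :));   % compute Shapley values for one query point
plot(explainer);                           % bar chart of per-feature contributions
```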
In summary, the future trajectory of machine learning in MBRs encompasses multiple dimensions, including monitoring multidimensional features with online potential, extracting spectroscopy-derived molecular details, integrating intelligent control systems based on real-time warning models, and establishing open and shared data systems. Model innovation is also noteworthy. Cutting-edge AI methods, such as explainable reinforcement learning (Yu et al., 2023) and large models based on the Transformer framework or multimodality (Vasu et al., 2023), have demonstrated considerable potential in certain fields, including remote sensing (Sun et al., 2023), spatio-temporal early prediction (Wei et al., 2024), and 3D object detection (Li et al., 2024b). It is also worthwhile for researchers to consider simple cutting-edge models such as KAN (Kolmogorov-Arnold network, with learnable activation functions on weights rather than fixed activation functions on neurons) (Liu et al., 2024) and xLSTM (extended LSTM with exponential gating and enhanced memory structures) (Beck et al., 2024). The incorporation of physics-informed, physics-aware, or data-knowledge-driven methodologies can also improve model performance and/or interpretability (Nguyen et al., 2023; Li et al., 2024a; Wang et al., 2024a). However, these new models and methods remain significantly underutilized in the fields of water management and wastewater engineering. It is recommended that MBR researchers direct their attention to these cutting-edge directions. Machine learning (including deep learning) has found numerous applications in materials science, biology, medicine, and remote sensing. The field of water treatment may benefit from the interdisciplinary or cross-disciplinary inspiration of machine learning techniques, which have the potential to enhance existing approaches to water treatment. Profiling data types, solution goals, and application requirements facilitates the selection and optimization of models. Further comprehensive exploration of the prospective applications of intelligent optimization algorithms (also known as metaheuristic algorithms), AutoML, and XAI in MBR processes may facilitate the development of effective models for process prediction and explanation.

6 Conclusions

This paper summarized recent advances in machine learning for predicting MBR pollutant removal performance and membrane fouling. Based on the literature review, a range of machine learning methods, including ANN, SVM, and decision trees, have been utilized in this scope. Ordinary ANNs have dominated the prediction models, whereas deep learning models remain comparatively underexplored. This paper not only reviewed the basic principles of machine learning but also presented a tutorial example for readers to practice five models for the prediction of TMP. Alongside the great potential of machine learning in MBR research, further development and application of the models in full-scale engineering are challenged by inadequacies in input features and monitoring metrics, insufficient model interpretability and generalizability, and a lack of practicability in automated process control. A more complete input index system reinforced by online monitoring would be crucial for developing real-time responsive or feedforward models that bridge the gap between model prediction and practical control. An open and shared MBR operation database would largely benefit advancements in model generalizability and interpretability. The application of deep learning and intelligent optimization algorithms may enhance model performance. In addition, the integration of AutoML and XAI may facilitate the deployment of models in practical engineering applications. The information presented in this paper is expected to provide implications for future research toward machine learning-based intelligent operation and maintenance of MBR processes.

7 Abbreviations

Abbreviation Description
AIC Akaike information criterion
ANFIS Adaptive network-based fuzzy inference system
AnMBR Anaerobic membrane bioreactor
ANN Artificial neural networks
AUC Area under curve
AutoML Automated machine learning
BA Bat algorithm
BFGS Broyden-Fletcher-Goldfarb-Shanno
BIC Bayesian information criterion
BOD Biochemical oxygen demand
BPNN Back propagation neural network
CART Classification and regression tree
CCI Conventional concentration indices
CFI Characteristic foulant indices
CNN Convolutional neural network
COD Chemical oxygen demand
CV Cross-validation
DNN Deep neural network
DO Dissolved oxygen
EC Electrical conductivity
EI Environment indices
ENN Elman neural network
EPS Extracellular polymeric substances
FFA Firefly algorithm
FCN Fully connected network
GA Genetic algorithms
GA-BP Genetic algorithm-back propagation
GBDT Gradient boosting decision tree
GNN Graph neural network
GWO Grey wolf optimizer
HQC Hannan-Quinn criterion
HRT Hydraulic retention time
KAN Kolmogorov-Arnold network
KNN K-nearest neighbors
LM Levenberg-Marquardt
LSSVM Least-squares support vector machine
LSTM Long short-term memory
MAPE Mean absolute percentage error
MBR Membrane bioreactor
MFI Membrane filtration indices
MLP Multilayer perceptron
MLSS Mixed liquor suspended solids
MLVSS Volatile MLSS
MSE Mean square error
NH3-N Ammonium nitrogen
NO3-N Nitrate nitrogen
ODR Oxygen decay rate
OI Operation indices
OLR Organic loading rate
ORP Oxidation reduction potential
PCA Principal component analysis
PSO Particle swarm optimization
RBF Radial basis function
RBFNN Radial basis function neural network
RF Random forest
RH Relative hydrophobicity
RL Reinforcement learning
RMSE Root mean square error
RNN Recurrent neural network
ROC Receiver operating characteristic
SA Simulated annealing
SHAP Shapley additive explanations
SMP Soluble microbial products
SRT Sludge retention time
SVC Support vector classification
SVM Support vector machines
SVR Support vector regression
t Time parameter
T Temperature
TMP Transmembrane pressure
TN Total nitrogen
TOC Total organic carbon
TP Total phosphorus
TSS Total suspended solids
WNN Wavelet neural network
XAI Explainable artificial intelligence

References

[1]
Abbott A S, Turney J M, Zhang B, Smith D G A, Altarawy D, Schaefer H F III. (2019). PES-learn: an open-source software package for the automated generation of machine learning models of molecular potential energy surfaces. Journal of Chemical Theory and Computation, 15(8): 4386–4398
CrossRef Google scholar
[2]
Abdul Wahab N, Mahmod N, Vilanova R. (2020). Permeate flux control in SMBR system by using neural network internal model control. Processes, 8(12): 1672
CrossRef Google scholar
[3]
Ahmed I, Kajol M, Hasan U, Datta P P, Roy A, Reza M R. (2024). ChatGPT versus Bard: a comparative study. Engineering Reports, 6(11): 12890
CrossRef Google scholar
[4]
Aidan A, Abdel-Jabbar N, Ibrahim T H, Nenov V, Mjalli F. (2008). Neural network modeling and optimization of scheduling backwash for membrane bioreactor. Clean Technologies and Environmental Policy, 10(4): 389–395
CrossRef Google scholar
[5]
Akaike H. (1974). New look at statistical-model identification. IEEE Transactions on Automatic Control, 19(6): 716–723
CrossRef Google scholar
[6]
Al-Ghazawi Z, Alawneh R. (2021). Use of artificial neural network for predicting effluent quality parameters and enabling wastewater reuse for climate change resilience: a case from Jordan. Journal of Water Process Engineering, 44: 102423
CrossRef Google scholar
[7]
Aldrees A, Javed M F, Khan M, Siddiq B. (2024a). Optimized prediction modeling of micropollutant removal efficiency in forward osmosis membrane systems using explainable machine learning algorithms. Journal of Water Process Engineering, 66: 105937
CrossRef Google scholar
[8]
Aldrees A, Khan M, Taha A T B, Ali M. (2024b). Evaluation of water quality indexes with novel machine learning and SHapley Additive ExPlanation (SHAP) approaches. Journal of Water Process Engineering, 58: 104789
CrossRef Google scholar
[9]
Alexandridis A K, Zapranis A D. (2013). Wavelet neural networks: a practical guide. Neural Networks, 42: 1–27
CrossRef Google scholar
[10]
Alkmim A R, De Almeida G M, De Carvalho D M, Amaral M C S, Oliveira S. (2020). Improving knowledge about permeability in membrane bioreactors through sensitivity analysis using artificial neural networks. Environmental Technology, 41(19): 2424–2438
CrossRef Google scholar
[11]
Beck M, Poppel K, Spanring M, Auer A, Prudnikova O, Kopp M K, Klambauer G, Brandstetter J, Hochreiter S (2024). xLSTM: extended long short-term memory. arXiv, abs/2405.04517
[12]
Bi K, Xie L, Zhang H, Chen X, Gu X, Tian Q. (2023). Accurate medium-range global weather forecasting with 3D neural networks. Nature, 619(7970): 533–538
CrossRef Google scholar
[13]
Bindal S, Singh C K. (2019). Predicting groundwater arsenic contamination: regions at risk in highest populated state of India. Water Research, 159: 65–76
CrossRef Google scholar
[14]
BishopC M (2006). Pattern recognition and machine learning. Springer New York, NY
[15]
Breiman L. (2001). Random forests. Machine Learning, 45(1): 5–32
CrossRef Google scholar
[16]
BreimanL, Friedman J, OlshenR A, StoneC J (1984). Classification and Regression Trees. New York: Chapman and Hall/CRC
[17]
Browne M W. (2000). Cross-validation methods. Journal of Mathematical Psychology, 44(1): 108–132
CrossRef Google scholar
[18]
Byeon H. (2023). Advances in value-based, policy-based, and deep learning-based reinforcement learning. International Journal of Advanced Computer Science and Applications, 14(8): 348–354
CrossRef Google scholar
[19]
Cai Y H, Ben T, Zaidi A A, Shi Y, Zhang K, Lin A Q, Liu C. (2019a). Effect of pH on pollutants removal of ship sewage treatment in an innovative aerobic-anaerobic micro-sludge MBR system. Water, Air, and Soil Pollution, 230(7): 163
CrossRef Google scholar
[20]
Cai Y H, Zaidi A A, Shi Y, Zhang K, Li X, Xiao S H, Lin A Q. (2019b). Influence of salinity on the biological treatment of domestic ship sewage using an air-lift multilevel circulation membrane reactor. Environmental Science and Pollution Research International, 26(36): 37026–37036
CrossRef Google scholar
[21]
Castillo O, Aguilar L, Cazarez N, Cardenas S. (2008). Systematic design of a stable type-2 fuzzy logic controller. Applied Soft Computing, 8(3): 1274–1279
CrossRef Google scholar
[22]
Chang H M, Xu Y, Chen S S, He Z. (2022). Enhanced understanding of osmotic membrane bioreactors through machine learning modeling of water flux and salinity. Science of the Total Environment, 838: 156009
CrossRef Google scholar
[23]
Chen C, Sun M Z, Chang J, Liu Z W, Zhu X Z, Xiao K, Song G Q, Wang H, Liu G L, Huang X. (2022). Unravelling temperature-dependent fouling mechanism in a pilot-scale anaerobic membrane bioreactor via statistical modelling. Journal of Membrane Science, 644: 120145
CrossRef Google scholar
[24]
Chen J C, Ng W J, Luo R, Mu S, Zhang Z, Andersen M, Jorgensen P E. (2012). Membrane bioreactor process modeling and optimization: ULU Pandan water reclamation plant. Journal of Environmental Engineering, 138(12): 1218–1226
CrossRef Google scholar
[25]
ChenT, Guestrin C (2016). XGBoost: A Scalable Tree Boosting System. San Francisco: Association for Computing Machinery, 785–794
[26]
Chen W, Ran H, Cao X, Wang J, Teng D, Chen J, Zheng X. (2020). Estimating PM2.5 with high-resolution 1-km AOD data and an improved machine learning model over Shenzhen, China. Science of the Total Environment, 746: 141093
CrossRef Google scholar
[27]
Choi J G, Bae T H, Kim J H, Tak T M, Randall A A. (2002). The behavior of membrane fouling initiation on the crossflow membrane bioreactor system. Journal of Membrane Science, 203(1−2): 103–113
CrossRef Google scholar
[28]
Cortez P, Embrechts M J. (2013). Using sensitivity analysis and visualization techniques to open black box data mining models. Information Sciences, 225: 1–17
CrossRef Google scholar
[29]
de Campos Souza P V. (2020). Fuzzy neural networks and neuro-fuzzy networks: a review the main techniques and applications used in the literature. Applied Soft Computing, 92: 106275
CrossRef Google scholar
[30]
Dorigo M, Birattari M, Stützle T. (2006). Ant colony optimization. IEEE Computational Intelligence Magazine, 1(4): 28–39
CrossRef Google scholar
[31]
Espel D, Courty S, Auda Y, Sheeren D, Elger A. (2020). Submerged macrophyte assessment in rivers: an automatic mapping method using Pleiades imagery. Water Research, 186: 116353
CrossRef Google scholar
[32]
Gao Q, Li Z, Pan J. (2019). A convolutional neural network for airport security inspection of dangerous goods. IOP Conference Series. Earth and Environmental Science, December 27–29, 2019, Guangzhou , China, 252(4): 042042
CrossRef Google scholar
[33]
Garson G D. (1991). Interpreting neural-network connection weights. AI Expert, 6(4): 46–51
[34]
Gedeon T D. (1997). Data mining of inputs: analysing magnitude and functional measures. International Journal of Neural Systems, 8(2): 209–218
CrossRef Google scholar
[35]
Geissler S, Wintgens T, Melin T, Vossenkaul K, Kullmann C. (2005). Modelling approaches for filtration processes with novel submerged capillary modules in membrane bioreactors for wastewater treatment. Desalination, 178(1−3): 125–134
CrossRef Google scholar
[36]
Gideon S. (1978). Estimating the dimension of a model. Annals of Statistics, 6(2): 461–464
[37]
Goh A T C. (1995). Backpropagation neural networks for modeling complex-systems. Artificial Intelligence in Engineering, 9(3): 143–151
CrossRef Google scholar
[38]
Greff K, Srivastava R K, Koutník J, Steunebrink B R, Schmidhuber J. (2017). LSTM: a search space odyssey. IEEE Transactions on Neural Networks and Learning Systems, 28(10): 2222–2232
CrossRef Google scholar
[39]
Grömping U. (2015). Variable importance in regression models. Wiley Interdisciplinary Reviews: Computational Statistics, 7(2): 137–152
CrossRef Google scholar
[40]
Hamedi H, Ehteshami M, Mirbagheri S A, Zendehboudi S. (2019). New deterministic tools to systematically investigate fouling occurrence in membrane bioreactors. Chemical Engineering Research & Design, 144: 334–353
CrossRef Google scholar
[41]
Hannan E J, Quinn B G. (1979). The determination of the order of an autoregression. Journal of the Royal Statistical Society. Series B, Statistical Methodology, 41(2): 190–195
CrossRef Google scholar
[42]
Hazrati H, Moghaddam A H, Rostamizadeh M. (2017). The influence of hydraulic retention time on cake layer specifications in the membrane bioreactor: experimental and artificial neural network modeling. Journal of Environmental Chemical Engineering, 5(3): 3005–3013
CrossRef Google scholar
[43]
Hosseinzadeh A, Zhou J L, Altaee A, Baziar M, Li X. (2020). Modeling water flux in osmotic membrane bioreactor by adaptive network-based fuzzy inference system and artificial neural network. Bioresource Technology, 310: 123391
CrossRef Google scholar
[44]
HuangG, Liu Z, MaatenL V D, WeinbergerK Q (2017). Densely connected convolutional networks. Honolulu, HI, USA: IEEE, 2261–2269
[45]
Huang Z, Ong S L, Ng H Y. (2011). Submerged anaerobic membrane bioreactor for low-strength wastewater treatment: effect of HRT and SRT on treatment performance and membrane fouling. Water Research, 45(2): 705–713
CrossRef Google scholar
[46]
Irfan M, Waqas S, Arshad U, Khan J A, Legutko S, Kruszelnicka I, Ginter-Kramarczyk D, Rahman S, Skrzypczak A. (2022). Response surface methodology and artificial neural network modelling of membrane rotating biological contactors for wastewater treatment. Materials, 15(5): 1932
CrossRef Google scholar
[47]
Jiang M, He Y, Song C, Pan Y, Qiu T, Tian S. (2021). Disaggregating climatic and anthropogenic influences on vegetation changes in Beijing-Tianjin-Hebei region of China. Science of the Total Environment, 786: 147574
CrossRef Google scholar
[48]
Juntawang C, Rongsayamanont C, Khan E. (2017). Entrapped cells-based-anaerobic membrane bioreactor treating domestic wastewater: performances, fouling, and bacterial community structure. Chemosphere, 187: 147–155
CrossRef Google scholar
[49]
Karaboga D, Basturk B. (2007). A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm. Journal of Global Optimization, 39(3): 459–471
CrossRef Google scholar
[50]
Katoch S, Chauhan S S, Kumar V. (2021). A review on genetic algorithm: past, present, and future. Multimedia Tools and Applications, 80(5): 8091–8126
CrossRef Google scholar
[51]
Kawakatsu T, Nakao S, Kimura S. (1993). Effects of size and compressibility of suspended particles and surface pore-size of membrane on flux in cross-flow filtration. Journal of Membrane Science, 81(1−2): 173–190
CrossRef Google scholar
[52]
Kim J H, Shin J K, Lee H, Lee D H, Kang J H, Cho K H, Lee Y G, Chon K, Baek S S, Park Y. (2021a). Improving the performance of machine learning models for early warning of harmful algal blooms using an adaptive synthetic sampling method. Water Research, 207: 117821
CrossRef Google scholar
[53]
Kim S Y, Ćurko J, Gajdoš Kljusurić J, Matošić M, Crnek V, López-Vázquez C M, Garcia H A, Brdjanović D, Valinger D. (2021b). Use of near-infrared spectroscopy on predicting wastewater constituents to facilitate the operation of a membrane bioreactor. Chemosphere, 272: 129899
CrossRef Google scholar
[54]
Kiranyaz S, Avci O, Abdeljaber O, Ince T, Gabbouj M, Inman D J. (2021). 1D convolutional neural networks and applications: a survey. Mechanical Systems and Signal Processing, 151: 107398
CrossRef Google scholar
[55]
Kruskal W. (1987). Relative importance by averaging over orderings. American Statistician, 41(1): 6–10
CrossRef Google scholar
[56]
Krzeminski P, Leverette L, Malamis S, Katsou E. (2017). Membrane bioreactors: a review on recent developments in energy reduction, fouling control, novel configurations, LCA and market prospects. Journal of Membrane Science, 527: 207–227
CrossRef Google scholar
[57]
Kulkarni P, Chellam S. (2010). Disinfection by-product formation following chlorination of drinking water: artificial neural network models and changes in speciation with treatment. Science of the Total Environment, 408(19): 4202–4210
CrossRef Google scholar
[58]
Kurita T, Kimura K, Watanabe Y. (2014). The influence of granular materials on the operation and membrane fouling characteristics of submerged MBRs. Journal of Membrane Science, 469: 292–299
CrossRef Google scholar
[59]
Kurita T, Kimura K, Watanabe Y. (2015). Energy saving in the operation of submerged MBRs by the insertion of baffles and the introduction of granular materials. Separation and Purification Technology, 141: 207–213
CrossRef Google scholar
[60]
Li C Q, Yang Z X, Yan H Y, Wang T. (2014). The application and research of the GA-BP neural network algorithm in the MBR membrane fouling. Abstract and Applied Analysis, 2014(1): 673156
CrossRef Google scholar
[61]
Li G Y, Ji J Y, Ni J L, Wang S R, Guo Y T, Hu Y S, Liu S W, Huang S F, Li Y Y. (2022). Application of deep learning for predicting the treatment performance of real municipal wastewater based on one-year operation of two anaerobic membrane bioreactors. Science of the Total Environment, 813: 151920
CrossRef Google scholar
[62]
Li H M, Wang J H, Wang Q G, Tian C H, Qian X, Leng X Z. (2017). Magnetic properties as a proxy for predicting fine-particle-bound heavy metals in a support vector machine approach. Environmental Science & Technology, 51(12): 6927–6935
CrossRef Google scholar
[63]
Li S, Shi Z, Chen S C, Ji W J, Zhou L Q, Yu W, Webster R. (2015). In situ measurements of organic carbon in soil profiles using vis-NIR spectroscopy on the Qinghai-Tibet Plateau. Environmental Science & Technology, 49(8): 4980–4987
CrossRef Google scholar
[64]
Li W W, Li C Q, Wang T. (2020). Application of machine learning algorithms in MBR simulation under big data platform. Water Practice & Technology, 15(4): 1238–1247
CrossRef Google scholar
[65]
Li X A, Wu J, Tai X, Xu J, Wang Y G. (2024a). Solving a class of multi-scale elliptic PDEs by Fourier-based mixed physics informed neural networks. Journal of Computational Physics, 508: 113012
CrossRef Google scholar
[66]
Li Y, Fan L, Liu Y, Huang Z, Chen Y, Wang N, Zhang Z. (2024b). Fully sparse fusion for 3D object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(11): 7217–7231
CrossRef Google scholar
[67]
Lin H J, Zhang M J, Wang F Y, Meng F G, Liao B Q, Hong H C, Chen J R, Gao W J. (2014). A critical review of extracellular polymeric substances (EPSs) in membrane bioreactors: characteristics, roles in membrane fouling and control strategies. Journal of Membrane Science, 460: 110–125
CrossRef Google scholar
[68]
Liu C Q, Xiao J W, Li H Y, Chen Q, Sun D Z, Cheng X, Li P S, Dang Y, Smith J A, Holmes D E. (2021). High efficiency in-situ biogas upgrading in a bioelectrochemical system with low energy input. Water Research, 197: 117055
CrossRef Google scholar
[69]
Liu J, Kang X, Luan X, Gao L, Tian H, Liu X. (2020a). Performance and membrane fouling behaviors analysis with SVR-LibSVM model in a submerged anaerobic membrane bioreactor treating low-strength domestic sewage. Environmental Technology & Innovation, 19: 100844
CrossRef Google scholar
[70]
Liu Z, Wang Y, Vaidya S, Ruehle F, Halverson J, Soljacic M, Hou T Y, Tegmark M (2024). KAN: Kolmogorov-Arnold networks. arXiv
[71]
Liu Z W, Yu J L, Xiao K, Chen C, Ma H, Liang P, Zhang X Y, Huang X. (2020b). Quantitative relationships for the impact of gas sparging conditions on membrane fouling in anaerobic membrane bioreactor. Journal of Cleaner Production, 276: 123139
CrossRef Google scholar
[72]
Lundberg S M, Erion G, Chen H, Degrave A, Prutkin J M, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1): 56–67
CrossRef Google scholar
[73]
Ma K, Tang C, Zhang W, Cui B, Ji K, Chen Z, Abraham A. (2023). DC-CNN: Dual-channel Convolutional Neural Networks with attention-pooling for fake news detection. Applied Intelligence, 53(7): 8354–8369
CrossRef Google scholar
[74]
MadniH A, Umer M, IshaqA, AbuzinadahN, Saidani O, AlsubaiS, HamdiM, AshrafI (2023). Water-quality prediction based on H2O autoML and explainable AI techniques, Water, 15(3): 475
[75]
Mallet A, Charnier C, Latrille E, Bendoula R, Roger J M, Steyer J P. (2022). Fast and robust NIRS-based characterization of raw organic waste: using non-linear methods to handle water effects. Water Research, 227: 119308
CrossRef Google scholar
[76]
McMillan L, Fayaz J, Varga L. (2024). Domain-informed variational neural networks and support vector machines based leakage detection framework to augment self-healing in water distribution networks. Water Research, 249: 120983
CrossRef Google scholar
[77]
Meireles M, Aimar P, Sanchez V. (1991). Effects of protein fouling on the apparent pore-size distribution of sieving membranes. Journal of Membrane Science, 56(1): 13–28
CrossRef Google scholar
[78]
Meng F G, Zhang S Q, Oh Y, Zhou Z B, Shin H S, Chae S R. (2017). Fouling in membrane bioreactors: an updated review. Water Research, 114: 151–180
CrossRef Google scholar
[79]
Mirbagheri S A, Bagheri M, Bagheri Z, Kamarkhani A M. (2015a). Evaluation and prediction of membrane fouling in a submerged membrane bioreactor with simultaneous upward and downward aeration using artificial neural network-genetic algorithm. Process Safety and Environmental Protection, 96: 111–124
CrossRef Google scholar
[80]
Mirbagheri S A, Bagheri M, Boudaghpour S, Ehteshami M, Bagheri Z. (2015b). Performance evaluation and modeling of a submerged membrane bioreactor treating combined municipal and industrial wastewater using radial basis function artificial neural networks. Journal of Environmental Health Science & Engineering, 13(1): 17
CrossRef Google scholar
[81]
Mirjalili S, Mirjalili S M, Lewis A. (2014). Grey Wolf Optimizer. Advances in Engineering Software, 69: 46–61
CrossRef Google scholar
[82]
Mohammadi E, Stokholm-Bjerregaard M, Hansen A A, Nielsen P H, Ortiz-Arroyo D, Durdevic P. (2024). Deep learning based simulators for the phosphorus removal process control in wastewater treatment via deep reinforcement learning algorithms. Engineering Applications of Artificial Intelligence, 133: 107992
CrossRef Google scholar
[83]
MolnarC (2019). Interpretable Machine Learning: a Guide for Making Black Box Models Explainable. München: Digital Reserch Academy
[84]
Nguyen P C H, Nguyen Y T, Choi J B, Seshadri P K, Udaykumar H S, Baek S S. (2023). PARC: physics-aware recurrent convolutional neural networks to assimilate meso scale reactive mechanics of energetic materials. Science Advances, 9(17): eadd6868
CrossRef Google scholar
[85]
Nguyen X C, Ly Q V, Nguyen T T H, Ngo H T T, Hu Y X, Zhang Z H. (2022). Potential application of machine learning for exploring adsorption mechanisms of pharmaceuticals onto biochars. Chemosphere, 287: 132203
CrossRef Google scholar
[86]
Niu C, Zhang Z, Cai T, Pan Y, Lu X, Zhen G. (2024). Sludge bound-EPS solubilization enhance CH4 bioconversion and membrane fouling mitigation in electrochemical anaerobic membrane bioreactor: insights from continuous operation and interpretable machine learning algorithms. Water Research, 264: 122243
CrossRef Google scholar
[87]
Niu C X, Li X S, Dai R B, Wang Z W. (2022). Artificial intelligence-incorporated membrane fouling prediction for membrane-based processes in the past 20 years: a critical review. Water Research, 216: 118299
CrossRef Google scholar
[88]
Olden J D, Joy M K, Death R G. (2004). An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecological Modelling, 178(3−4): 389–397
CrossRef Google scholar
[89]
Oliker N, Ostfeld A. (2014). A coupled classification: evolutionary optimization model for contamination event detection in water distribution systems. Water Research, 51: 234–245
CrossRef Google scholar
[90]
Panda S R, Bhandaru N, Mukherjee R, De S. (2015). Ultrafiltration of oily waste water: contribution of surface roughness in membrane properties and fouling characteristics of polyacrylonitrile membranes. Canadian Journal of Chemical Engineering, 93(11): 2031–2042
CrossRef Google scholar
[91]
Peter B, Yoav F, Wee Sun L, Robert E S. (1998). Boosting the margin: a new explanation for the effectiveness of voting methods. Annals of Statistics, 26(5): 1651–1686
[92]
Qian Y G, Zhou W Q, Yan J L, Li W F, Han L J. (2015). Comparing machine learning classifiers for object-based land cover classification using very high resolution imagery. Remote Sensing (Basel), 7(1): 153–168
CrossRef Google scholar
[93]
Qu J H, Dai X H, Hu H Y, Huang X, Chen Z, Li T, Cao Y S, Daigger G T. (2022). Emerging trends and prospects for municipal wastewater management in China. ACS ES&T Engineering, 2(3): 323–336
CrossRef Google scholar
[94]
QuinlanJ R (1993). C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann Publishers Inc.
[95]
RaychaudhuriS (2008). Introduction to Monte Carlo simulation, 2008 Winter Simulation Conference, 91–100, December 7–10, 2008, Miami, FL, USA,
[96]
Rouet-Leduc B, Hulbert C. (2024). Automatic detection of methane emissions in multispectral satellite imagery using a vision transformer. Nature Communications, 15(1): 3801
CrossRef Google scholar
[97]
Sadeghi I, Aroujalian A, Raisi A, Dabir B, Fathizadeh M. (2013). Surface modification of polyethersulfone ultrafiltration membranes by corona air plasma for separation of oil/water emulsions. Journal of Membrane Science, 430: 24–36
CrossRef Google scholar
[98]
Salehin I, Islam M S, Saha P, Noman S M, Tuni A, Hasan M M, Baten M A. (2024). AutoML: a systematic review on automated machine learning with neural architecture search. Journal of Information and Intelligence, 2(1): 52–81
CrossRef Google scholar
[99]
Samek W. (2020). Learning with explainable trees. Nature Machine Intelligence, 2(1): 16–17
CrossRef Google scholar
[100]
Schmitt F, Banu R, Yeom I T, Do K U. (2018). Development of artificial neural networks to predict membrane fouling in an anoxic-aerobic membrane bioreactor treating domestic wastewater. Biochemical Engineering Journal, 133: 47–58
CrossRef Google scholar
[101]
Senthil Kumar D, Arumugam S S, Lordwin Cecil Prabhaker M, Daisy Merina R. (2024). Eamlm: Enhanced automated machine learning model for IoT based water quality analysis with real-time dataset. Automatic Control and Computer Sciences, 58(1): 66–77
CrossRef Google scholar
[102]
Shi Y K, Wang Z W, Du X J, Ling G B, Jia W C, Lu Y R. (2022). Research on the membrane fouling diagnosis of MBR membrane module based on ECA-CNN. Journal of Environmental Chemical Engineering, 10(3): 107649
CrossRef Google scholar
[103]
Shimizu Y, Rokudai M, Tohya S, Kayawake E, Yazawa T, Tanaka H, Eguchi K. (1990). Effect of membrane resistance on filtration characteristics for methanogenic waste. Kagaku Kogaku Ronbunshu, 16(1): 145–151
CrossRef Google scholar
[104]
Silver D, Huang A, Maddison C J, Guez A, Sifre L, Van Den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M. . (2016). Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587): 484–489
CrossRef Google scholar
[105]
Suman B, Kumar P. (2006). A survey of simulated annealing as a tool for single and multiobjective optimization. Journal of the Operational Research Society, 57(10): 1143–1160
CrossRef Google scholar
[106]
Sun X, Wang P, Lu W, Zhu Z, Lu X, He Q, Li J, Rong X, Yang Z, Chang H. . (2023). RingMo: a remote sensing foundation model with masked image modeling. IEEE Transactions on Geoscience and Remote Sensing, 61: 1–22
CrossRef Google scholar
[107]
SzegedyC, Ioffe S, VanhouckeV, Alemi A a J A (2016). Inception-v4, Inception-ResNet and the impact of residual connections on learning. ArXiv,
[108]
SzegedyC, Wei L, YangqingJ, SermanetP, ReedS, AnguelovD, Erhan D, VanhouckeV, RabinovichA (2015). Going deeper with convolutions. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1–9, June 7–12, 2015, Boston, MA, USA
[109]
Taheri E, Amin M M, Fatehizadeh A, Rezakazemi M, Aminabhavi T M. (2021). Artificial intelligence modeling to predict transmembrane pressure in anaerobic membrane bioreactor-sequencing batch reactor during biohydrogen production. Journal of Environmental Management, 292: 112759
CrossRef Google scholar
[110]
TaoY, LiC (2018). Application of PSO-RBF neural network in MBR membrane pollution prediction. 2018 Eighth International Conference on Instrumentation & Measurement, Computer, Communication and Control (IMCCC), 873–877, July 19–21, 2018, Harbin, China
[111]
TellaA, Balogun A L, AdebisiN, AbdullahS (2021). Spatial assessment of PM10 hotspots using random forest, k-nearest neighbour and Naive Bayes. Atmospheric Pollution Research, 12(10): 101202
[112]
Tu S, Xu L. (2012). A theoretical investigation of several model selection criteria for dimensionality reduction. Pattern Recognition Letters, 33(9): 1117–1126
CrossRef Google scholar
[113]
Uddin M G, Nash S, Rahman A, Olbert A I. (2023). Performance analysis of the water quality index model for predicting water state using machine learning techniques. Process Safety and Environmental Protection, 169: 808–828
CrossRef Google scholar
[114]
Vasilaki V, Conca V, Frison N, Eusebi A L, Fatone F, Katsou E. (2020). A knowledge discovery framework to predict the N2O emissions in the wastewater sector. Water Research, 178: 115799
CrossRef Google scholar
[115]
Vasu P K A, Gabriel J, Zhu J, Tuzel O, Ranjan A (2023). FastViT: a fast hybrid vision transformer using structural reparameterization. Paris, France: IEEE, 5762–5772
[116]
Vatanpour V, Madaeni S S, Moradian R, Zinadini S, Astinchap B. (2011). Fabrication and characterization of novel antifouling nanofiltration membrane prepared from oxidized multiwalled carbon nanotube/polyethersulfone nanocomposite. Journal of Membrane Science, 375(1−2): 284–294
CrossRef Google scholar
[117]
Viet N D, Im S J, Kim C M, Jang A. (2021). An osmotic membrane bioreactor-clarifier system with a deep learning model for simultaneous reduction of salt accumulation and membrane fouling. Chemosphere, 272: 129872
CrossRef Google scholar
[118]
Viet N D, Jang A. (2021). Development of artificial intelligence-based models for the prediction of filtration performance and membrane fouling in an osmotic membrane bioreactor. Journal of Environmental Chemical Engineering, 9(4): 105337
CrossRef Google scholar
[119]
Wang H, Zeng J, Dai R, Wang Z. (2024a). Understanding rejection mechanisms of trace organic contaminants by polyamide membranes via data-knowledge codriven machine learning. Environmental Science & Technology, 58(13): 5878–5888
CrossRef Google scholar
[120]
Wang X T, Sun X, Wu Y B, Gao F, Yang Y. (2024b). Optimizing reverse osmosis desalination from brackish waters: predictive approach employing response surface methodology and artificial neural network models. Journal of Membrane Science, 704: 122883
CrossRef Google scholar
[121]
Wang Z W, Wu X H. (2015). Mathematical and artificial neural network models to predict the membrane fouling behavior of an intermittently-aerated membrane bioreactor under sub-critical flux. Clean, 43(7): 1002–1009
CrossRef Google scholar
[122]
Wei S, Kang Y, Peng Z, Xiao X, Wang L, Yang Y, Salim F D (2024). STEMO: early spatio-temporal forecasting with multi-objective reinforcement learning. arXiv
[123]
Wolf G, Almeida J S, Crespo J G, Reis M A M. (2003). Monitoring of biofilm reactors using natural fluorescence fingerprints. Water Science and Technology, 47(5): 161–167
CrossRef Google scholar
[124]
Wolf G, Almeida J S, Pinheiro C, Correia V, Rodrigues C, Reis M A M, Crespo J G. (2001). Two-dimensional fluorometry coupled with artificial neural networks: a novel method for on-line monitoring of complex biological processes. Biotechnology and Bioengineering, 72(3): 297–306
CrossRef Google scholar
[125]
Woo T, Nam K, Heo S, Lim J Y, Kim S, Yoo C. (2022). Predictive maintenance system for membrane replacement time detection using AI-based functional profile monitoring: application to a full-scale MBR plant. Journal of Membrane Science, 649: 120400
CrossRef Google scholar
[126]
Wu J L, Chen F T, Huang X, Geng W Y, Wen X H. (2006). Using inorganic coagulants to control membrane fouling in a submerged membrane bioreactor. Desalination, 197(1−3): 124–136
CrossRef Google scholar
[127]
Wu J L, Huang X. (2008). Effect of dosing polymeric ferric sulfate on fouling characteristics, mixed liquor properties and performance in a long-term running membrane bioreactor. Separation and Purification Technology, 63(1): 45–52
CrossRef Google scholar
[128]
Wu Q, Ye Y, Liu Y, Ng M K. (2012). SNP selection and classification of genome-wide SNP data using stratified sampling random forests. IEEE Transactions on Nanobioscience, 11(3): 216–227
CrossRef Google scholar
[129]
Xiao K, Liang S, Wang X M, Chen C S, Huang X. (2019). Current state and challenges of full-scale membrane bioreactor applications: a critical review. Bioresource Technology, 271: 473–481
CrossRef Google scholar
[130]
Xiao K, Xu Y, Liang S, Lei T, Sun J Y, Wen X H, Zhang H X, Chen C S, Huang X. (2014). Engineering application of membrane bioreactor for wastewater treatment in China: current state and future prospect. Frontiers of Environmental Science & Engineering, 8(6): 805–819
CrossRef Google scholar
[131]
Xie L, Luo S, Liu Y, Ruan X, Gong K, Ge Q, Li K, Valev V K, Liu G, Zhang L. (2023). Automatic identification of individual nanoplastics by Raman spectroscopy based on machine learning. Environmental Science & Technology, 57(46): 18203–18214
CrossRef Google scholar
[132]
Xu H, Xiao K, Wang X M, Liang S, Wei C H, Wen X H, Huang X. (2020). Outlining the roles of membrane-foulant and foulant-foulant interactions in organic fouling during microfiltration and ultrafiltration: a mini-review. Frontiers in Chemistry, 8: 417
CrossRef Google scholar
[133]
Xu J, Xu Z, Kuang J, Lin C, Xiao L, Huang X, Zhang Y. (2021). An alternative to laboratory testing: random forest-based water quality prediction framework for inland and nearshore water bodies. Water, 13(22): 3262
CrossRef Google scholar
[134]
Xu Y R, Zeng X H, Bernard S, He Z. (2022). Data-driven prediction of neutralizer pH and valve position towards precise control of chemical dosage in a wastewater treatment plant. Journal of Cleaner Production, 348: 131360
CrossRef Google scholar
[135]
Yamamoto K, Hiasa M, Mahmood T, Matsuo T. (1989). Direct solid-liquid separation using hollow fiber membrane in an activated-sludge aeration tank. Water Science and Technology, 21(4−5): 43–54
CrossRef Google scholar
[136]
Yamato N, Kimura K, Miyoshi T, Watanabe Y. (2006). Difference in membrane fouling in membrane bioreactors (MBRs) caused by membrane polymer materials. Journal of Membrane Science, 280(1−2): 911–919
CrossRef Google scholar
[137]
Yang F, Lu W H, Luo L K, Li T. (2012). Margin optimization based pruning for random forest. Neurocomputing, 94: 54–63
CrossRef Google scholar
[138]
YangX S (2009). Firefly algorithms for multimodal optimization. Watanabe O and Zeugmann T, eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 169–178
[139]
YangX S (2010). Nature inspired cooperative strategies for optimization (NICSO 2010). González J R, Pelta D A, Cruz C, Terrazas G, Krasnogor N, eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 65–74
[140]
Yao J Q, Wu Z Y, Liu Y, Zheng X Y, Zhang H B, Dong R J, Qiao W. (2022). Predicting membrane fouling in a high solid AnMBR treating OFMSW leachate through a genetic algorithm and the optimization of a BP neural network model. Journal of Environmental Management, 307: 114585
CrossRef Google scholar
[141]
Yaqub M, Asif H, Kim S, Lee W. (2020). Modeling of a full-scale sewage treatment plant to predict the nutrient removal efficiency using a long short-term memory (LSTM) neural network. Journal of Water Process Engineering, 37: 101388
CrossRef Google scholar
[142]
YasminN S A, Wahab N A, YusufZ (2017). Modeling of membrane bioreactor of wastewater treatment using support vector machine. Melaka, MALAYSIA: Springer, Singapore, 485–495
[143]
YoonS H (2015). Membrane Bioreactor Processes: Principles and applications. Boca Raton: Membrane Bioreactor Processes: Principles and Applications
[144]
Yu Z, Ruan J, Xing D (2023). Explainable reinforcement learning via a causal world model. arXiv
[145]
ZadehL A (2023). Granular, Fuzzy, And Soft Computing. Lin T Y, Liau C J, Kacprzyk J, eds. New York: Springer US, 19–49
[146]
ZagoruykoS, Komodakis N J A (2016). Wide residual networks. ArXiv,
[147]
Zeiler M D, Fergus R. (2014). Visualizing and understanding convolutional networks. Fleet D, Pajdla T, Schiele B, Tuytelaars T, eds. Computer Vision – ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, Cham: Springer, 8689: 818–833
[148]
Zhang B, Mao X, Tang X M, Tang H L, Zhang B, Shen Y, Shi W X. (2022). Effect of modified microbial flocculant on membrane fouling alleviation in a hybrid aerobic granular sludge membrane system for wastewater reuse. Separation and Purification Technology, 290: 120819
CrossRef Google scholar
[149]
Zhang G J, Ji S L, Gao X, Liu Z Z. (2008). Adsorptive fouling of extracellular polymeric substances with polymeric ultrafiltration membrances. Journal of Membrane Science, 309(1−2): 28–35
[150]
ZhangH, Ma Y, JiangT, ZhangG, YangF (2012). Influence of activated sludge properties on flux behavior in osmosis membrane bioreactor (OMBR). Journal of Membrane Science, 390–391: 270–276
[151]
Zhang Q, Yang L T, Chen Z, Li P. (2018). A survey on deep learning for big data. Information Fusion, 42: 146–157
CrossRef Google scholar
[152]
Zhang Q Y, Singh S, Stuckey D C. (2017). Fouling reduction using adsorbents/flocculants in a submerged anaerobic membrane bioreactor. Bioresource Technology, 239: 226–235
CrossRef Google scholar
[153]
Zhang X, Long T, Deng S, Chen Q, Chen S, Luo M, Yu R, Zhu X. (2023). Machine learning modeling based on microbial community for prediction of natural attenuation in groundwater. Environmental Science & Technology, 57(50): 21212–21223
CrossRef Google scholar
[154]
Zhao B, Chen H, Gao D K, Xu L Z, Zhang Y Y. (2020). Cleaning decision model of MBR membrane based on Bandelet neural network optimized by improved Bat algorithm. Applied Soft Computing, 91: 106211
CrossRef Google scholar
[155]
Zhao Z, Lou Y, Chen Y, Lin H, Li R, Yu G. (2019). Prediction of interfacial interactions related with membrane fouling in a membrane bioreactor based on radial basis function artificial neural network (ANN). Bioresource Technology, 282: 262–268
CrossRef Google scholar
[156]
Zhong H, Yuan Y, Luo L, Ye J, Chen M, Zhong C. (2022). Water quality prediction of MBR based on machine learning: a novel dataset contribution analysis method. Journal of Water Process Engineering, 50: 103296
CrossRef Google scholar
[157]
Zhong S F, Guan X H. (2023). Count-based Morgan fingerprint: a more efficient and interpretable molecular representation in developing machine learning-based predictive regression models for water contaminants: activities and properties. Environmental Science & Technology, 57(46): 18193–18202
CrossRef Google scholar
[158]
Zhong S F, Zhang K, Bagheri M, Burken J G, Gu A, Li B K, Ma X M, Marrone B L, Ren Z J, Schrier J. . (2021). Machine learning: New ideas and tools in environmental science and engineering. Environmental Science & Technology, 55(19): 12741–12754
CrossRef Google scholar
[159]
Zhou M, Li Y. (2024). Spatial distribution and source identification of potentially toxic elements in Yellow River Delta soils, China: an interpretable machine-learning approach. Science of the Total Environment, 912: 169092
CrossRef Google scholar
[160]
Zhuang L P, Tang B, Bin L Y, Li P, Huang S S, Fu F L. (2021). Performance prediction of an internal-circulation membrane bioreactor based on models comparison and data features analysis. Biochemical Engineering Journal, 166: 107850
CrossRef Google scholar

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 52370059), the Beijing Natural Science Foundation (No. JQ22027), and the Fundamental Research Funds for the Central Universities (No. E2EG0502X2).

Conflict of Interests

Kang Xiao and Xia Huang are editorial board members of Frontiers of Environmental Science & Engineering. The authors declare that this research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.

Electronic Supplementary Material

Supplementary material is available in the online version of this article at https://doi.org/10.1007/s11783-025-1954-2 and is accessible for authorized users.

RIGHTS & PERMISSIONS

© 2025 Higher Education Press