ArticlePublisher preview available

Prediction of CO–NOx Emissions from a Natural Gas Power Plant Using Proper Machine Learning Models

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

Four machine learning (ML) models including a deep neural network, a long short‐term memory network, a random forest (RF), and an extreme gradient boosting are implemented to predict CO–NOx emissions from a natural gas power plant. A new feature optimization scheme (FOS) via a sequencing process of feature selection and hyperparameter optimization can intensify the ML models. Through the procedures of training, validation, and testing, reliable ML models need to take high prediction accuracy and fast training into account. After a few comparisons, it is found that 1) the FOS effectively improves the prediction accuracy by 18%–67%; 2) the FOS‐based RF model is an appropriate option to carry out the fast and accurate prediction of CO–NOx emissions by using the decision tree classifiers.
This content is subject to copyright. Terms and conditions apply.
Prediction of CONO
x
Emissions from a Natural Gas
Power Plant Using Proper Machine Learning Models
Wei Wu,* Yan-Ting Lin, Po-Hsuan Liao, Muhammad Aziz, and Po-Chih Kuo*
1. Introduction
The increase in CO
2
emissions has caused irreversible damage to
the earths ecological environment,
[1]
and CO
2
emissions from
fossil fuel and industrial processes were up to 60% of global
greenhouse gas emissions.
[2]
Recently, machine learning (ML)
technologies have been widely implemented in energy systems
in terms of power efciency and gas emissions. A comprehen-
sive review of ML applications showed that ML technology was
an effective way to reduce power losses and increase the system
efciency of the integrated power system using a combination of
smart grid and renewable energy sectors.
[3]
As internal combus-
tion engines played an essential role in power generation, ML
techniques using unsupervised learning, supervised learning,
and reinforcement learning could provide useful solutions for
modeling internal combustion engines.
[4]
The ue gas from coal-red power plants contains harmful
pollutants such as SO
x
and NO
x
. Regarding the prediction of
SO
x
NO
x
emissions from a class of power generation systems,
the support vector machine (SVM) model
was more accurate for predicting NO
x
emission than the feedforward neural
network (FNN) model,
[5]
a deep neural net-
work (DNN) via the data preprocessing and
specic feature selection could effectively
reduce the computational time for predict-
ing SO
x
NO
x
emissions,
[6]
an ensemble
DNN model has better prediction perfor-
mance for predicting NO
x
emission,
[7]
and an adaptive network-based fuzzy infer-
ence system as a kind of articial neural
network was validated to accurately predict
NO
x
emissions.
[8]
Besides, a neural net-
work time-series nonlinear autoregressive
and a Gaussian process regression (GPR)
have performed better in forecasting CO
2
emissions of power plants in some specic countries.
[9]
On
the other hand, a case study showed the enhanced long
short-term memory network (LSTMN) could most accurate
and stable prediction of NO
x
emission rates from a coal-red
power plant during transient operation.
[10]
An FNN framework
integrated with a kinetic-based process was utilized to generate
life cycle inventory data from different types of woody biomass
with hundreds of characterization data samples such that the
large variations in energy consumption and greenhouse gas
emissions across different biomass species were specied.
[11]
A ML algorithm using an autoregressive moving average model
with exogenous inputs was developed to forecast the CO
2
emission intensities in European electrical power grids, where
short-term forecasts could help electricity consumers schedule
their load to minimize CO
2
emissions.
[12]
In addition, a deep
learning-based FNN model was found to be suitable for predict-
ing the amount of CO
2
emission from the specic power sector
in Kuwait,
[13]
and the SVM and DNN models were effectively
implemented to forecast transportation-based-CO
2
emission
and energy demand in Turkey,
[14]
an extreme learning machine
for predicting carbon emission intensity of some cities,
[15]
and an
FNN-based optimization using LSTMN could outperform the
steady-state optimization and improve the thermal efciency
of coal-red boilers.
[16]
By the above statements, the FNN using a deep learning algo-
rithm could effectively predict NO
x
and CO
2
emissions, but the
prediction performance depends on raw data features. It means
that the preprocessing step for transforming raw data into fea-
tures is a core approach to improving the performance of ML
models. In this article, the appropriate and ecologically valid data
for harmful pollutants including NO
x
and CO from a natural gas
power plant are denoted as the raw data.
[17]
The descriptions of a
natural gas power plant and four ML models are addressed in
W. Wu, Y.-T. Lin, P.-H. Liao
Department of Chemical Engineering
National Cheng Kung University
Tainan 70101, Taiwan
E-mail: weiwu@gs.ncku.edu.tw
M. Aziz, P.-C. Kuo
Institute of Industrial Science
The University of Tokyo
Tokyo 153-8505, Japan
E-mail: pckuo@iis.u-tokyo.ac.jp
The ORCID identication number(s) for the author(s) of this article
can be found under https://doi.org/10.1002/ente.202300041.
DOI: 10.1002/ente.202300041
Four machine learning (ML) models including a deep neural network, a long
short-term memory network, a random forest (RF), and an extreme gradient
boosting are implemented to predict CONO
x
emissions from a natural gas
power plant. A new feature optimization scheme (FOS) via a sequencing process
of feature selection and hyperparameter optimization can intensify the ML
models. Through the procedures of training, validation, and testing, reliable ML
models need to take high prediction accuracy and fast training into account. After
a few comparisons, it is found that 1) the FOS effectively improves the prediction
accuracy by 18%67%; 2) the FOS-based RF model is an appropriate option to
carry out the fast and accurate prediction of CONO
x
emissions by using the
decision tree classiers.
RESEARCH ARTICLE
www.entechnol.de
Energy Technol. 2023,11, 2300041 2300041 (1 of 12) © 2023 Wiley-VCH GmbH
Article
Full-text available
A critical review of the existing Internal Combustion Engine (ICE) modeling, optimization, diagnosis, and control challenges and the promising state-of-the-art Machine Learning (ML) solutions for them is provided in this paper. Some of the major challenges include Real Driving Emission (RDE) modeling and control, combustion knock detection and control, combustion mode transition in multi-mode engines, combustion noise modeling and control, combustion instability and cyclic variability control, costly and time-consuming engine calibration, and fault diagnostics of some ICE components. In this paper, conventional ICE modeling approaches are discussed along with their limitations for realtime ICE optimization and control. Promising ML approaches to address ICE challenges are then classified into three main groups of unsupervised learning, supervised learning, and reinforcement learning. The working principles of each approach along with their advantages and disadvantages in addressing ICE challenges are discussed. ML-based grey-box approach is proposed as a solution that combines the benefits from physics-based and ML-based models to provide robust and high fidelity solutions for ICE modeling and control challenges. This review provides in-depth insight into the applications of ML for ICEs and provides recommendations for future directions to address ICE challenges.
Article
Full-text available
Global warming and climate change have become one of the most embarrassing and explosive problems/challenges all over the world, especially in third-world countries. It is due to a rapid increase in industrialization and urbanization process that has given the boost to the volume of greenhouse gases (GHGs) emissions. In this regard, carbon dioxide (CO2) is considered a significant driver of GHGs and is the major contributing factor for global warming. Considering the goal of mitigating environmental pollution, this research has applied multiple methods such as neural network time series nonlinear autoregressive, Gaussian Process Regression, and Holt’s methods for forecasting CO2 emission. It attempts to forecast the CO2 emission of Bahrain. These methods are evaluated for performance. The neural network model has the root mean square errors (RMSE) of merely 0.206, while the Gaussian Process Regression Rational Quadratic (GPR-RQ) Model has RMSE of 1.0171, and Holt’s method has RMSE of 1.4096. Therefore, it can be concluded that the neural network time series nonlinear autoregressive model has performed better for forecasting the CO2 emission in the case of Bahrain.
Article
Full-text available
An effective NO x prediction model is the basis for reducing pollutant emissions. In this paper, a real-time NO x prediction model based on an ensemble deep belief network (DBN) is proposed. Variable importance projection analysis is adopted to screen variables, the time delay of each variable is estimated, and the phase space of the original sample is reconstructed by analyzing the historical data. An ensemble strategy based on random subspace is presented, including the data set partition method and ensemble mode of the model. First, subspaces are constructed according to the component information extracted by partial least squares. Then, the deep belief network is used as a submodel. Finally, a back propagation neural network is developed for model combination. The ensemble deep belief network model has been used to model the NO x emission prediction of a 660 MW boiler. The simulation results show that the ensemble DBN model can fully exploit the nonlinear mapping relationship between input variables and NO x concentration by using various learning learners. Compared with the back propagation neural network and support vector machine, which are commonly used in NO x modeling, the ensemble DBN model has better prediction performance and generalization ability.
Article
The increasing fraction of intermittent renewable energy in the electrical grid is resulting in coal-fired boilers now routinely ramp up and down. The current state-of-the-art operation for such boilers is to apply steady-state, neural network-based optimization to make control decisions in real-time, and this work demonstrates the feasibility of extending this to dynamic, neural network-based optimization using a long short-term memory neural network. A simplified numerical simulation of a t-fired coal boiler and supporting equipment is used to represent a real plant subjected to both steady-state, neural network-based optimization and dynamic, neural network-based optimization. Using the same intervals and a particle swarm optimization algorithm, the dynamic optimization outperforms the steady-state optimization and realizes up to 4.58% improvement in thermal efficiency. Dynamic optimization with a long short-term memory neural network is shown to both be feasible and beneficial for operation of a coal-fired boiler under changing load.
Article
The second largest share of Greenhouse Gas (GHG) emissions is generated by electricity production. Approximately 63% of the generated electricity is from burning fossil fuels. Currently, The Ministry of Electricity and Water (MEW) owns and operates 8 power plants to secure the demand for electricity in Kuwait. Burning more fuel to generate electricity increases CO2 emissions to the air which causes air pollution and environmental issues. This study aims to calculate the amount of CO2 emission from the power sector specifically from each power plant in Kuwait in 2019 using combustion equation from United States Environmental Protection Agency (USEPA) and Intergovernmental panel on Climate Change (IPCC). According to USEPA, total CO2 emissions from the power sector in Kuwait in 2019 were found to be 38.47 MtCO2. However, IPCC equation gave total CO2 emissions of 45.57 MtCO2. The second part of the research focused on forecasting CO2 emissions for 5 years (2018–2022) using machine learning (ML) algorithms, which are mainly support vector machine (SVM), deep learning (DL), and ANN. Based on DL model results, the forecasted CO2 emissions for the 5 years were 44.2, 46, 48, 47, and 49 MtCO2, respectively. While ANN model showed the following CO2 emissions result for each year: 43, 44, 49, 51, and 50 MtCO2, respectively. Moreover, SVM algorithm found the forecasted CO2 emissions for the 5 years to be 43.8, 52 , 56, and 56 MtCO2, respectively. DL model was found to be the most appropriate one to fit the data followed by ANN and lastly SVM respectively.
Article
Combined cycle power plants, which combine gas and steam turbines, have negative impacts on surrounding populations and structures. Control of NOx emissions is an important issue for these gas-fired power plants. Accurate estimation of NOx emissions is critical for developing incinerators and reducing the environmental impact of existing plants. The objective of this study is to model ANFISGA and estimate NOx emissions from a natural gas-fired combined cycle power plant using emission monitoring system (PEMS) data. First, Adaptive Neuro Fuzzy Inference System (ANFIS) models were developed using fuzzy C-Means (FCM). Then, the parameters were optimized using a genetic algorithm (GA) to reduce the error. The proposed ANFISGA system was created, trained, and tested with PEMS datasets. The developed models were compared using several statistical performance criteria, including correlation coefficient (R²), mean squared error (MSE), error mean (EM), root mean square error (RMSE), standard deviation of error (STD), and mean absolute percentage error (MAPE). The obtained results show that the coefficient of determination varies between 0.79933 and 0.90363 for the data separated into test and training data with different rates. The minimum values of the criteria MSE, RMSE, EM, STD, and MAPE were found to be 24.8379, 4.9838, 3.4625e-05, 4.9839, and 5.1660, respectively, for the training data. The minimum values of these criteria for the test data were 26.5961, 5.1571, 0.065696, 5.157, and 5.3695, respectively. The collected results show that the proposed ANFISGA models have high potential for NOx prediction. Thus, the results show that GA has a great impact on the performance of ANFIS training and significantly improves the predictive accuracy of the model.
Article
Given the severe global warming situation, it is very important to explore the factors influencing carbon emission intensity and accurately analyze the trends in the development of carbon emission intensity to achieve the goal of reducing carbon emissions. In contrast with the existing research, this paper starts from the perspective of carbon emission efficiency, applies stochastic frontier analysis to screen the factors influencing carbon intensity, and constructs a model for predicting carbon emission intensity based on factor analysis and an extreme learning machine. The results suggest that, first, there is a high correlation between carbon emission efficiency and carbon emission intensity. Second, the level of economic development, industrial structure, urbanization level, and government intervention all promote a reduction in carbon emission intensity. The structure of energy consumption and dependence on foreign trade restrain reductions in carbon emission intensity. Finally, the proposed model accurately predicts carbon emission intensity. The research results provide theoretical support for the development of technologies to reduce carbon emissions. This idea can be applied to predict carbon emission intensity in different regions and has practical significance.
Article
Adverse impacts of the transportation sector on not only air quality but also economic growth of a country are nowadays well-noticed, particularly by developing countries. Today, the transportation sector is powered by burning the fossil-based fuels at more than 99% and approximately 6.5 million deaths annually occur due to air-pollution-related diseases worldwide. Therefore, knowledge of both energy demand and CO2 emission of a country is a very significant issue in order to revise its future energy investments and policies. In this framework, three machine learning algorithms (deep learning (DL), support vector machine (SVM), and artificial neural network (ANN)) are used to forecast the transportation-based-CO2 emission and energy demand in Turkey. The gross domestic product per capita (GDP), population, vehicle kilometer, and year are used as input parameters in the study. It is noticed that there is a very high correlation among year, economic indicators, population, vehicle kilometer, transportation-based energy demand, and CO2 emissions. To present a better comparison, the results of these algorithms are discussed with six frequently used statistical metrics (R², RMSE, MAPE, MBE, rRMSE, and MABE). For all machine learning algorithms, R² values are varying between 0.8639 and 0.9235, and RMSE is smaller than 5 × 10⁶ tons for CO2 emission and 2 Mtoe for energy demand. According to the classifications in the literature, the forecast results are generally categorized as "excellent" for rRMSE metric (<10%), and “high prediction accuracy” for MAPE metric (<10%). On the other hand, with two mathematical models, future energy demand and CO2 emission arising from the transportation sector in Turkey is forecasted by the year 2050. In the results, it is forecasted that the annual growth rate for transportation-related energy demand and CO2 emission in Turkey cumulatively rise by 3.7% and 3.65%, respectively. Both energy demand and CO2 emissions from the transportation sector in Turkey will increase nearly 3.4 times higher in the year 2050 than those of today. In conclusion, the paper clearly reports that the future energy investments of the country should be revised, and various policies, regulations, norms, restrictions, legislations, and challenges on both energy consumption and emission mitigation from the transportation sector should be established by the policy-makers.
Article
A comprehensive review about machine learning application in power system especially in smart grid, renewable energy sector etc. is summarized in this paper. In the power sector, the power consumption is increased day by day very tremendously. So, it is very important that we have to generate the more power without disturbing the environment and whatever the generated power must be utilize effectively with minimum losses and higher efficiency. This will be possible with effective way of using the modern technology like machine learning (ML), artificial intelligence etc. This paper also describes the different types of machine learning techniques with diagram which will be very useful for many researchers who want to understand the basic fundamentals of machine learnings.
Article
Ten established, data-driven dynamic algorithms are surveyed and a practical guide for understanding these methods generated. Existing Python programming packages for implementing each algorithm are acknowledged, and the model equations necessary for prediction are presented. A case study on a coal-fired power plant’s NOx emission rates is performed, directly comparing each modeling method’s performance on a mutual system. Each model is evaluated by its root mean squared error (RMSE) on out-of-sample future horizon predictions. Optimal hyperparameters are identified using either an exhaustive search or genetic algorithm. The top five model structures of each method are used to recursively predict future NOx emission rates over a 60-step time horizon. The RMSE at each future timestep is determined, and the recursive output prediction trends compared against measurements in time. The GRU neural network is identified as the best candidate for representing the system, demonstrating accurate and stable predictions across the future horizon by all considered models, while satisfactory performance was observed in several of the ARX/NARX formulations. These efforts have contributed 1) a concise resource of multiple proven dynamic machine learning methods, 2) a practical guide explaining the use of these methods, effectively lowering the “barrier-to-entry” of deploying such models in control systems, 3) a comparison study evaluating each method’s performance on a mutual system, 4) demonstration of accurate multi-timestep emissions modeling suitable for systems-level control, and 5) generalizable results demonstrating the suitability of each method for prediction over a multi-step future horizon to other complex dynamic systems.