
Prediction of CO–NOx Emissions from a Natural Gas Power Plant Using Proper Machine Learning Models

April 2023
Energy Technology 11(7)


DOI:10.1002/ente.202300041

Authors:

Wei Wu

National Cheng Kung University

Muhammad Aziz

The University of Tokyo


Four machine learning (ML) models, namely a deep neural network, a long short‐term memory network, a random forest (RF), and extreme gradient boosting, are implemented to predict CO–NOx emissions from a natural gas power plant. A new feature optimization scheme (FOS), which applies feature selection followed by hyperparameter optimization in sequence, strengthens the ML models. Across training, validation, and testing, a reliable ML model must combine high prediction accuracy with fast training. The comparisons show that 1) the FOS improves the prediction accuracy by 18%–67%, and 2) the FOS‐based RF model, which uses decision tree classifiers, is an appropriate option for fast and accurate prediction of CO–NOx emissions.
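The sequencing idea behind the FOS (feature selection first, then hyperparameter optimization on the selected inputs) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the data are synthetic, a small k-nearest-neighbour regressor stands in for the four ML models, the correlation threshold `rho_min` and the grid `k_grid` are hypothetical, and the relative-improvement formula is only one plausible way to express a percentage gain in accuracy.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    var_x = sum((a - mx) ** 2 for a in xs)
    var_y = sum((b - my) ** 2 for b in ys)
    return cov / (var_x * var_y) ** 0.5


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)


def knn_predict(train_X, train_y, query, k):
    """Stand-in model: average the targets of the k nearest training rows."""
    order = sorted(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], query)),
    )
    return sum(train_y[i] for i in order[:k]) / k


def validation_mae(X_tr, y_tr, X_va, y_va, cols, k):
    """Validation MAE of the stand-in model restricted to columns `cols`."""
    tr = [[row[c] for c in cols] for row in X_tr]
    va = [[row[c] for c in cols] for row in X_va]
    return mae(y_va, [knn_predict(tr, y_tr, q, k) for q in va])


def feature_optimization(X_tr, y_tr, X_va, y_va, rho_min=0.2, k_grid=(1, 3, 5, 9)):
    # Stage 1 (feature selection): keep inputs whose |rho| with the target
    # reaches the hypothetical threshold rho_min on the training split.
    cols = [
        c for c in range(len(X_tr[0]))
        if abs(pearson([r[c] for r in X_tr], y_tr)) >= rho_min
    ]
    # Stage 2 (hyperparameter optimization): grid search on the selected inputs.
    best_k = min(k_grid, key=lambda k: validation_mae(X_tr, y_tr, X_va, y_va, cols, k))
    return cols, best_k


# Synthetic data (assumption): 4 inputs, only columns 0 and 2 drive the target.
random.seed(0)
X = [[random.random() for _ in range(4)] for _ in range(200)]
y = [3.0 * r[0] - 2.0 * r[2] + random.gauss(0, 0.05) for r in X]
X_tr, y_tr, X_va, y_va = X[:150], y[:150], X[150:], y[150:]

cols, best_k = feature_optimization(X_tr, y_tr, X_va, y_va)
baseline = validation_mae(X_tr, y_tr, X_va, y_va, list(range(4)), 1)
tuned = validation_mae(X_tr, y_tr, X_va, y_va, cols, best_k)
# One plausible definition of a percentage improvement (assumption).
improvement = 100.0 * (baseline - tuned) / baseline
print(cols, best_k, round(baseline, 3), round(tuned, 3), round(improvement, 1))
```

Running the two stages in sequence keeps the hyperparameter search on a reduced input set, which is where a saving in training time would come from in a sketch like this one.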

Natural gas power plant with nine input variables and two output variables.

ML architecture: a) DNN, b) LSTMN, c) RF, and d) XGBoost.

Flowchart of FOS‐based ML algorithm.

Feature selection for NGPP in terms of CO emission: a) ρ versus input variable, b) MAE versus a series of input datasets (type I), and c) MAE versus different combinations of input datasets (type II).

Feature selection for NGPP in terms of NOx emission: a) ρ versus input variable, b) MAE versus a series of input datasets (type I), and c) MAE versus different combinations of input datasets (type II).
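The feature-selection captions refer to a correlation coefficient ρ per input variable and to the MAE obtained with two kinds of input datasets. The sketch below shows one way such a comparison could be set up. It rests on assumptions that the captions do not confirm: "type I" is read as a nested series of inputs added in order of decreasing |ρ|, "type II" as all combinations of a fixed size, the data are synthetic with four hypothetical inputs rather than the plant's nine, and a 3-nearest-neighbour regressor stands in for the actual ML models.

```python
from itertools import combinations
import random


def pearson(xs, ys):
    """Pearson correlation coefficient (rho) of two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    return cov / (sum((a - mx) ** 2 for a in xs) * sum((b - my) ** 2 for b in ys)) ** 0.5


def knn_mae(X_tr, y_tr, X_va, y_va, cols, k=3):
    """Validation MAE of a k-NN stand-in model using only columns `cols`."""
    tr = [[row[c] for c in cols] for row in X_tr]
    errors = []
    for row, target in zip(X_va, y_va):
        q = [row[c] for c in cols]
        nearest = sorted(
            range(len(tr)),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(tr[i], q)),
        )[:k]
        errors.append(abs(target - sum(y_tr[i] for i in nearest) / k))
    return sum(errors) / len(errors)


# Synthetic data (assumption): only columns 0 and 2 influence the target.
random.seed(1)
X = [[random.random() for _ in range(4)] for _ in range(200)]
y = [3.0 * r[0] - 2.0 * r[2] + random.gauss(0, 0.05) for r in X]
X_tr, y_tr, X_va, y_va = X[:150], y[:150], X[150:], y[150:]

# a) rho versus input variable, then rank inputs by |rho|.
rho = [pearson([r[c] for r in X_tr], y_tr) for c in range(4)]
order = sorted(range(4), key=lambda c: -abs(rho[c]))

# b) "type I" (assumed): nested series top-1, top-2, ... by rank.
type_one = [tuple(order[:m]) for m in range(1, 5)]
mae_one = {s: knn_mae(X_tr, y_tr, X_va, y_va, s) for s in type_one}

# c) "type II" (assumed): every combination of two inputs.
type_two = list(combinations(range(4), 2))
mae_two = {s: knn_mae(X_tr, y_tr, X_va, y_va, s) for s in type_two}
best_two = min(mae_two, key=mae_two.get)

print(order, best_two)
```

Plotting `mae_one` and `mae_two` against their input sets would give curves analogous in spirit to panels b) and c), under the assumptions stated above.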



Wei Wu,* Yan-Ting Lin, Po-Hsuan Liao, Muhammad Aziz, and Po-Chih Kuo*

1. Introduction

The increase in CO2 emissions has caused irreversible damage to the earth’s ecological environment,[1] and CO2 emissions from fossil fuels and industrial processes have accounted for up to 60% of global greenhouse gas emissions.[2] Recently, machine learning (ML) technologies have been widely applied to energy systems to address power efficiency and gas emissions. A comprehensive review of ML applications showed that ML is an effective way to reduce power losses and increase the efficiency of integrated power systems that combine smart grids with renewable energy sectors.[3] Because internal combustion engines play an essential role in power generation, ML techniques based on unsupervised learning, supervised learning, and reinforcement learning can provide useful solutions for modeling them.[4]

The flue gas from coal-fired power plants contains harmful pollutants such as SO2 and NOx. Regarding the prediction of SO2–NOx emissions from a class of power generation systems, the support vector machine (SVM) model was more accurate for predicting NOx emission than the feedforward neural network (FNN) model,[5] a deep neural network (DNN) with data preprocessing and specific feature selection effectively reduced the computational time for predicting SO2–NOx emissions,[6] an ensemble DNN model showed better prediction performance for NOx emission,[7] and an adaptive network-based fuzzy inference system, a kind of artificial neural network, was validated to accurately predict NOx emissions.[8] In addition, a neural network time-series nonlinear autoregressive model and Gaussian process regression (GPR) have performed well in forecasting the CO2 emissions of power plants in some specific countries.[9] On the other hand, a case study showed that the enhanced long short-term memory network (LSTMN) could provide the most accurate and stable prediction of NOx emission rates from a coal-fired power plant during transient operation.[10]

An FNN framework integrated with a kinetic-based process was used to generate life cycle inventory data for different types of woody biomass from hundreds of characterization data samples, so that the large variations in energy consumption and greenhouse gas emissions across biomass species could be specified.[11] An ML algorithm using an autoregressive moving average model with exogenous inputs was developed to forecast CO2 emission intensities in European electrical power grids, where short-term forecasts could help electricity consumers schedule their load to minimize CO2 emissions.[12] In addition, a deep learning-based FNN model was found to be suitable for predicting the amount of CO2 emission from the power sector in Kuwait,[13] the SVM and DNN models were effectively implemented to forecast transportation-based CO2 emission and energy demand in Turkey,[14] an extreme learning machine was used to predict the carbon emission intensity of some cities,[15] and an FNN-based optimization using LSTMN outperformed the steady-state optimization and improved the thermal efficiency of coal-fired boilers.[16]

Based on the above studies, an FNN using a deep learning algorithm could effectively predict NOx and CO2 emissions, but the prediction performance depends on the features of the raw data. This means that the preprocessing step for transforming raw data into features is central to improving the performance of ML models. In this article, the appropriate and ecologically valid data for harmful pollutants, including NOx and CO, from a natural gas power plant are taken as the raw data.[17] The descriptions of a natural gas power plant and four ML models are addressed in

W. Wu, Y.-T. Lin, P.-H. Liao

Department of Chemical Engineering

National Cheng Kung University

Tainan 70101, Taiwan

E-mail: weiwu@gs.ncku.edu.tw

M. Aziz, P.-C. Kuo

Institute of Industrial Science

The University of Tokyo

Tokyo 153-8505, Japan

E-mail: pckuo@iis.u-tokyo.ac.jp

The ORCID identification number(s) for the author(s) of this article can be found under https://doi.org/10.1002/ente.202300041.

DOI: 10.1002/ente.202300041


An emission predictive system for CO and NOx from gas turbine based on ensemble machine learning approach

Article

Jun 2024
FUEL

Nikhil Pachauri

Modeling, diagnostics, optimization, and control of internal combustion engines via modern machine learning techniques: A review and future directions

Article

Full-text available

Oct 2021
PROG ENERG COMBUST

A critical review of the existing Internal Combustion Engine (ICE) modeling, optimization, diagnosis, and control challenges and the promising state-of-the-art Machine Learning (ML) solutions for them is provided in this paper. Some of the major challenges include Real Driving Emission (RDE) modeling and control, combustion knock detection and control, combustion mode transition in multi-mode engines, combustion noise modeling and control, combustion instability and cyclic variability control, costly and time-consuming engine calibration, and fault diagnostics of some ICE components. In this paper, conventional ICE modeling approaches are discussed along with their limitations for realtime ICE optimization and control. Promising ML approaches to address ICE challenges are then classified into three main groups of unsupervised learning, supervised learning, and reinforcement learning. The working principles of each approach along with their advantages and disadvantages in addressing ICE challenges are discussed. ML-based grey-box approach is proposed as a solution that combines the benefits from physics-based and ML-based models to provide robust and high fidelity solutions for ICE modeling and control challenges. This review provides in-depth insight into the applications of ML for ICEs and provides recommendations for future directions to address ICE challenges.

Forecasting carbon emissions due to electricity power generation in Bahrain

Article

Full-text available

Mar 2022
ENVIRON SCI POLLUT R

Global warming and climate change have become one of the most embarrassing and explosive problems/challenges all over the world, especially in third-world countries. It is due to a rapid increase in industrialization and urbanization process that has given the boost to the volume of greenhouse gases (GHGs) emissions. In this regard, carbon dioxide (CO2) is considered a significant driver of GHGs and is the major contributing factor for global warming. Considering the goal of mitigating environmental pollution, this research has applied multiple methods such as neural network time series nonlinear autoregressive, Gaussian Process Regression, and Holt’s methods for forecasting CO2 emission. It attempts to forecast the CO2 emission of Bahrain. These methods are evaluated for performance. The neural network model has the root mean square errors (RMSE) of merely 0.206, while the Gaussian Process Regression Rational Quadratic (GPR-RQ) Model has RMSE of 1.0171, and Holt’s method has RMSE of 1.4096. Therefore, it can be concluded that the neural network time series nonlinear autoregressive model has performed better for forecasting the CO2 emission in the case of Bahrain.

An Ensemble Deep Belief Network Model Based on Random Subspace for NO x Concentration Prediction

Article

Full-text available

Mar 2021

An effective NO x prediction model is the basis for reducing pollutant emissions. In this paper, a real-time NO x prediction model based on an ensemble deep belief network (DBN) is proposed. Variable importance projection analysis is adopted to screen variables, the time delay of each variable is estimated, and the phase space of the original sample is reconstructed by analyzing the historical data. An ensemble strategy based on random subspace is presented, including the data set partition method and ensemble mode of the model. First, subspaces are constructed according to the component information extracted by partial least squares. Then, the deep belief network is used as a submodel. Finally, a back propagation neural network is developed for model combination. The ensemble deep belief network model has been used to model the NO x emission prediction of a 660 MW boiler. The simulation results show that the ensemble DBN model can fully exploit the nonlinear mapping relationship between input variables and NO x concentration by using various learning learners. Compared with the back propagation neural network and support vector machine, which are commonly used in NO x modeling, the ensemble DBN model has better prediction performance and generalization ability.

Development of Novel Dynamic Machine Learning-based Optimization of a Coal-fired Power Plant

Article

May 2022
COMPUT CHEM ENG

The increasing fraction of intermittent renewable energy in the electrical grid is resulting in coal-fired boilers now routinely ramp up and down. The current state-of-the-art operation for such boilers is to apply steady-state, neural network-based optimization to make control decisions in real-time, and this work demonstrates the feasibility of extending this to dynamic, neural network-based optimization using a long short-term memory neural network. A simplified numerical simulation of a t-fired coal boiler and supporting equipment is used to represent a real plant subjected to both steady-state, neural network-based optimization and dynamic, neural network-based optimization. Using the same intervals and a particle swarm optimization algorithm, the dynamic optimization outperforms the steady-state optimization and realizes up to 4.58% improvement in thermal efficiency. Dynamic optimization with a long short-term memory neural network is shown to both be feasible and beneficial for operation of a coal-fired boiler under changing load.

Forecasting of carbon dioxide emissions from power plants in Kuwait using United States Environmental Protection Agency, Intergovernmental panel on climate change, and machine learning methods

Article

Apr 2022
RENEW ENERG

The second largest share of Greenhouse Gas (GHG) emissions is generated by electricity production. Approximately 63% of the generated electricity is from burning fossil fuels. Currently, The Ministry of Electricity and Water (MEW) owns and operates 8 power plants to secure the demand for electricity in Kuwait. Burning more fuel to generate electricity increases CO2 emissions to the air which causes air pollution and environmental issues. This study aims to calculate the amount of CO2 emission from the power sector specifically from each power plant in Kuwait in 2019 using combustion equation from United States Environmental Protection Agency (USEPA) and Intergovernmental panel on Climate Change (IPCC). According to USEPA, total CO2 emissions from the power sector in Kuwait in 2019 were found to be 38.47 MtCO2. However, IPCC equation gave total CO2 emissions of 45.57 MtCO2. The second part of the research focused on forecasting CO2 emissions for 5 years (2018–2022) using machine learning (ML) algorithms, which are mainly support vector machine (SVM), deep learning (DL), and ANN. Based on DL model results, the forecasted CO2 emissions for the 5 years were 44.2, 46, 48, 47, and 49 MtCO2, respectively. While ANN model showed the following CO2 emissions result for each year: 43, 44, 49, 51, and 50 MtCO2, respectively. Moreover, SVM algorithm found the forecasted CO2 emissions for the 5 years to be 43.8, 52 , 56, and 56 MtCO2, respectively. DL model was found to be the most appropriate one to fit the data followed by ANN and lastly SVM respectively.

Prediction of NOx emissions from gas turbines of a combined cycle power plant using an ANFIS model optimized by GA

Article

Aug 2022
FUEL

Mahmut Dirik

Combined cycle power plants, which combine gas and steam turbines, have negative impacts on surrounding populations and structures. Control of NOx emissions is an important issue for these gas-fired power plants. Accurate estimation of NOx emissions is critical for developing incinerators and reducing the environmental impact of existing plants. The objective of this study is to model ANFISGA and estimate NOx emissions from a natural gas-fired combined cycle power plant using emission monitoring system (PEMS) data. First, Adaptive Neuro Fuzzy Inference System (ANFIS) models were developed using fuzzy C-Means (FCM). Then, the parameters were optimized using a genetic algorithm (GA) to reduce the error. The proposed ANFISGA system was created, trained, and tested with PEMS datasets. The developed models were compared using several statistical performance criteria, including correlation coefficient (R²), mean squared error (MSE), error mean (EM), root mean square error (RMSE), standard deviation of error (STD), and mean absolute percentage error (MAPE). The obtained results show that the coefficient of determination varies between 0.79933 and 0.90363 for the data separated into test and training data with different rates. The minimum values of the criteria MSE, RMSE, EM, STD, and MAPE were found to be 24.8379, 4.9838, 3.4625e-05, 4.9839, and 5.1660, respectively, for the training data. The minimum values of these criteria for the test data were 26.5961, 5.1571, 0.065696, 5.157, and 5.3695, respectively. The collected results show that the proposed ANFISGA models have high potential for NOx prediction. Thus, the results show that GA has a great impact on the performance of ANFIS training and significantly improves the predictive accuracy of the model.

Predictions of carbon emission intensity based on factor analysis and an improved extreme learning machine from the perspective of carbon emission efficiency

Article

Jan 2022
J CLEAN PROD

Given the severe global warming situation, it is very important to explore the factors influencing carbon emission intensity and accurately analyze the trends in the development of carbon emission intensity to achieve the goal of reducing carbon emissions. In contrast with the existing research, this paper starts from the perspective of carbon emission efficiency, applies stochastic frontier analysis to screen the factors influencing carbon intensity, and constructs a model for predicting carbon emission intensity based on factor analysis and an extreme learning machine. The results suggest that, first, there is a high correlation between carbon emission efficiency and carbon emission intensity. Second, the level of economic development, industrial structure, urbanization level, and government intervention all promote a reduction in carbon emission intensity. The structure of energy consumption and dependence on foreign trade restrain reductions in carbon emission intensity. Finally, the proposed model accurately predicts carbon emission intensity. The research results provide theoretical support for the development of technologies to reduce carbon emissions. This idea can be applied to predict carbon emission intensity in different regions and has practical significance.

Forecasting of transportation-related energy demand and CO2 emissions in Turkey with different machine learning algorithms

Article

Jan 2022

Ümit Ağbulut

Adverse impacts of the transportation sector on not only air quality but also economic growth of a country are nowadays well-noticed, particularly by developing countries. Today, the transportation sector is powered by burning the fossil-based fuels at more than 99% and approximately 6.5 million deaths annually occur due to air-pollution-related diseases worldwide. Therefore, knowledge of both energy demand and CO2 emission of a country is a very significant issue in order to revise its future energy investments and policies. In this framework, three machine learning algorithms (deep learning (DL), support vector machine (SVM), and artificial neural network (ANN)) are used to forecast the transportation-based-CO2 emission and energy demand in Turkey. The gross domestic product per capita (GDP), population, vehicle kilometer, and year are used as input parameters in the study. It is noticed that there is a very high correlation among year, economic indicators, population, vehicle kilometer, transportation-based energy demand, and CO2 emissions. To present a better comparison, the results of these algorithms are discussed with six frequently used statistical metrics (R², RMSE, MAPE, MBE, rRMSE, and MABE). For all machine learning algorithms, R² values are varying between 0.8639 and 0.9235, and RMSE is smaller than 5 × 10⁶ tons for CO2 emission and 2 Mtoe for energy demand. According to the classifications in the literature, the forecast results are generally categorized as "excellent" for rRMSE metric (<10%), and “high prediction accuracy” for MAPE metric (<10%). On the other hand, with two mathematical models, future energy demand and CO2 emission arising from the transportation sector in Turkey is forecasted by the year 2050. In the results, it is forecasted that the annual growth rate for transportation-related energy demand and CO2 emission in Turkey cumulatively rise by 3.7% and 3.65%, respectively. 
Both energy demand and CO2 emissions from the transportation sector in Turkey will increase nearly 3.4 times higher in the year 2050 than those of today. In conclusion, the paper clearly reports that the future energy investments of the country should be revised, and various policies, regulations, norms, restrictions, legislations, and challenges on both energy consumption and emission mitigation from the transportation sector should be established by the policy-makers.

A comprehensive review: Machine learning and its application in integrated power system

Article

Nov 2021

A comprehensive review about machine learning application in power system especially in smart grid, renewable energy sector etc. is summarized in this paper. In the power sector, the power consumption is increased day by day very tremendously. So, it is very important that we have to generate the more power without disturbing the environment and whatever the generated power must be utilize effectively with minimum losses and higher efficiency. This will be possible with effective way of using the modern technology like machine learning (ML), artificial intelligence etc. This paper also describes the different types of machine learning techniques with diagram which will be very useful for many researchers who want to understand the basic fundamentals of machine learnings.

A systematic comparison of machine learning methods for modeling of dynamic processes applied to combustion emission rate modeling

Article

Jun 2021
APPL ENERG

Ten established, data-driven dynamic algorithms are surveyed and a practical guide for understanding these methods generated. Existing Python programming packages for implementing each algorithm are acknowledged, and the model equations necessary for prediction are presented. A case study on a coal-fired power plant’s NOx emission rates is performed, directly comparing each modeling method’s performance on a mutual system. Each model is evaluated by its root mean squared error (RMSE) on out-of-sample future horizon predictions. Optimal hyperparameters are identified using either an exhaustive search or genetic algorithm. The top five model structures of each method are used to recursively predict future NOx emission rates over a 60-step time horizon. The RMSE at each future timestep is determined, and the recursive output prediction trends compared against measurements in time. The GRU neural network is identified as the best candidate for representing the system, demonstrating accurate and stable predictions across the future horizon by all considered models, while satisfactory performance was observed in several of the ARX/NARX formulations. These efforts have contributed 1) a concise resource of multiple proven dynamic machine learning methods, 2) a practical guide explaining the use of these methods, effectively lowering the “barrier-to-entry” of deploying such models in control systems, 3) a comparison study evaluating each method’s performance on a mutual system, 4) demonstration of accurate multi-timestep emissions modeling suitable for systems-level control, and 5) generalizable results demonstrating the suitability of each method for prediction over a multi-step future horizon to other complex dynamic systems.
