ArticlePDF Available

Estimation of CO2 Diffusivity in Brine by Use of the Genetic Algorithm and Mixed Kernels-Based Support Vector Machine Model

Authors:

Abstract and Figures

Diffusion coefficient of carbon dioxide (CO2), a significant parameter describing the mass transfer process, exerts a profound influence on the safety of CO2 storage in depleted reservoirs, saline aquifers, and marine ecosystems. However, experimental determination of diffusion coefficient in CO2-brine system is time-consuming and complex because the procedure requires sophisticated laboratory equipment and reasonable interpretation methods. To facilitate the acquisition of more accurate values, an intelligent model, termed MKSVM-GA, is developed using a hybrid technique of support vector machine (SVM), mixed kernels (MK), and genetic algorithm (GA). Confirmed by the statistical evaluation indicators, our proposed model exhibits excellent performance with high accuracy and strong robustness in a wide range of temperatures (273–473.15K), pressures (0.1–49.3MPa), and viscosities (0.139–1.950 mPa·s). Our results show that the proposed model is more applicable than the artificial neural network (ANN) model at this sample size, which is superior to four commonly-used traditional empirical correlations. The technique presented in this study can provide a fast and precise prediction of CO2 diffusivity in brine at reservoir conditions for the engineering design and the technical risk assessment during the process of CO2 injection.
Content may be subject to copyright.
Qihong Feng
School of Petroleum Engineering,
China University of Petroleum (East China),
Qingdao 266580, Shandong, China
Ronghao Cui
1
School of Petroleum Engineering,
China University of Petroleum (East China),
Qingdao 266580, Shandong, China
e-mail: ronghao.cui1993@gmail.com
Sen Wang
1
School of Petroleum Engineering,
China University of Petroleum (East China),
Qingdao 266580, Shandong, China
e-mail: fwforest@gmail.com
Jin Zhang
School of Petroleum Engineering,
China University of Petroleum (East China),
Qingdao 266580, Shandong, China
Zhe Jiang
School of Petroleum Engineering,
China University of Petroleum (East China),
Qingdao 266580, Shandong, China
Estimation of CO
2
Diffusivity
in Brine by Use of the Genetic
Algorithm and Mixed
Kernels-Based Support
Vector Machine Model
Diffusion coefficient of carbon dioxide (CO
2
), a significant parameter describing the
mass transfer process, exerts a profound influence on the safety of CO
2
storage in
depleted reservoirs, saline aquifers, and marine ecosystems. However, experimental
determination of diffusion coefficient in CO
2
-brine system is time-consuming and complex
because the procedure requires sophisticated laboratory equipment and reasonable inter-
pretation methods. To facilitate the acquisition of more accurate values, an intelligent
model, termed MKSVM-GA, is developed using a hybrid technique of support vector
machine (SVM), mixed kernels (MK), and genetic algorithm (GA). Confirmed by the sta-
tistical evaluation indicators, our proposed model exhibits excellent performance with
high accuracy and strong robustness in a wide range of temperatures (273–473.15 K),
pressures (0.1–49.3 MPa), and viscosities (0.139–1.950 mPas). Our results show that
the proposed model is more applicable than the artificial neural network (ANN) model at
this sample size, which is superior to four commonly used traditional empirical correla-
tions. The technique presented in this study can provide a fast and precise prediction of
CO
2
diffusivity in brine at reservoir conditions for the engineering design and the techni-
cal risk assessment during the process of CO
2
injection. [DOI: 10.1115/1.4041724]
Keywords: carbon dioxide, diffusion coefficient, support vector machine, multivariate
regression, artificial neural network
1 Introduction
Due to global energy crisis and environmental pollution, recent
years have witnessed the accelerating exploitation and progressive
utilization of new and clean energy. In this field, carbon dioxide
(CO
2
) exhibits extensive and valuable applications. Replacement
of methane hydrate with CO
2
is recognized as one of the most
promising approaches to recovering methane from its hydrates,
avoiding damage to the seabed environment [1,2]. It is also found
that using CO
2
as the heat transmission fluid has a stronger
capacity of mining heat from hot fractured rock than water [35].
As an ancillary benefit, injecting CO
2
into marine ecosystems and
geothermal reservoirs can reduce the CO
2
emission into the
atmosphere and be conducive for the carbon storage and seques-
tration [612]. CO
2
, regardless of its physical states when
injected, always comes into contact with water and dissolves into
brine through molecular diffusion. Diffusion coefficient, which is
commonly employed to characterize fluid diffusivity, plays an
essential role in the storage safety in the geologic strata and the
sea bed [13,14], because it can dominate the rates of interfacial
mass transfer and heterogeneous chemical reactions related to
saline solution and porous media [15,16].
Considerable investigations have been conducted on the labora-
tory measurement of CO
2
diffusion coefficients [1651]. Among
these studies, commonly used experimental methods can be clas-
sified as two types, i.e., the direct methods and the indirect meth-
ods [40]. The direct methods, such as the Taylor–Aris dispersion
method, start with precise analysis of the concentration of gas in
solvent [21,41], while indirect methods need to measure the
variance of a certain property that is related to gas diffusivity
[22,51]. Such a property includes liquid volume and shape
[42,45], gas volume and pressure [49], and interfacial tension
[51]. However, at elevated pressures, the convection can strongly
enhance the mass transfer of CO
2
into water, leading to an inaccu-
rate estimation of CO
2
diffusion coefficients [14,43]. Because of
the difficulties in overcoming the effects of convection and con-
ducting in situ measurements of gas concentrations at high pres-
sures, only a few studies reported the experimental results at
elevated pressures (up to 49.3 MPa) [19,40,41]. Also, note that
experimentally determining the diffusion coefficients of CO
2
through direct and indirect methods is both time-consuming and
complex, because it requires sophisticated laboratory equipment
and reasonable interpretation procedures. For ease of application,
a few empirical correlations (Table 1) verified by experimental
data have been developed [16,40,5254].
Pressure and temperature, as the main operating conditions dur-
ing the mass transfer process, can exert prominent impacts on the
molecular diffusion of CO
2
. Therefore, the diffusivity of CO
2
under the sedimentary condition, i.e., high pressure and high tem-
perature, is significantly different from that under normal condi-
tion. Moreover, in contrast to the pure water, groundwater in a
typical reservoir is brine with high salinity, which inevitably influ-
ences CO
2
diffusion. Because different salinity results in various
viscosities of brine [55], in this work, we account for the depend-
ence of CO
2
diffusivity on the groundwater salinity by including
the solvent viscosity in our model.
In 1955, Wilke and Chang [53] put forward a general model for
diffusion coefficients in various dilute solutions and offered an
improved diffusion-factor chart for simplification. In their model,
the CO
2
diffusivity varies as a function of temperatures and vis-
cosity. Then, Lu et al. [40] ignored the effect of pressure and pro-
posed an equation for CO
2
diffusion in pure water (268 T
473 K). Using molecular dynamics simulations, Moultos et al.
1
Corresponding authors.
Contributed by the Petroleum Division of ASME for publication in the JOURNAL
OF ENERGY RESOURCES TECHNOLOGY. Manuscript received July 13, 2018; final
manuscript received October 6, 2018; published online November 19, 2018. Assoc .
Editor: Daoyong (Tony) Yang.
Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-1Copyright V
C2019 by ASME
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
[54] extended Lu’s expression and developed a phenomenological
correlation to estimate the diffusion coefficient in CO
2
–H
2
O mix-
tures at ultrahigh temperatures and pressures. Because the viscos-
ity of saline solution should be taken into account to precisely
determine the CO
2
diffusivity in brine, the correlations developed
by Lu et al. and Moultos et al., although exhibit remarkably accu-
rate results for the solvent of pure water, cannot be directly uti-
lized for the CO
2
-brine system. On the basis of the experimental
results for CO
2
diffusion coefficients D
CO2
versus brine viscosity
at T¼298 K, Cadogan et al. [16] reported a modified
Stokes–Einstein relation; however, the validity of this correlation
at higher temperatures has not been substantiated.
In order to overcome aforementioned limitations, accurate pre-
diction of CO
2
diffusion coefficients in brine is a key issue for
urgent solution. With the extensive application of artificial intelli-
gence, machine learning exhibits excellent performance in diverse
engineering domains and provides better solutions for uncertain
problems [5661]. Here, coupling support vector machine (SVM),
mixed kernels (MK), and genetic algorithm (GA), we present a
hybrid intelligent model (termed MKSVM-GA model) to predict
the molecular diffusion of CO
2
in brine. As a machine learning
technique derived from statistics, SVM emanates an overwhelm-
ing superiority in solving problems concerning small-scale sam-
ple, nonlinearity, and high-dimensional pattern recognition
[62,63]. The comparison with laboratory data obtained from pre-
vious literature manifests that the proposed MKSVM-GA model,
representing a large range of temperatures (273–473.15 K) and
pressures (0.1–49.3 MPa), is proven to be reliable and high-
precise (R
2
>0.99). Our model can not only furnish great signifi-
cance to the engineering design and the technical risk assessment
during CO
2
injection, but also shed light on the further research
involved with mass transfer process in CO
2
-brine system.
2 Methodology
2.1 Data Collection. From previous literatures, we collected
92 reliable experimental data points of CO
2
diffusion in brine
[16,1921,24,2730,32,33,35,3841,51]. The data set is summar-
ized in the supplementary material, which is available under the
Supplemental Materials tab for this paper on the ASME Digital
Collection. We consider temperature, pressure, and viscosity as
the main correlation parameters influencing CO
2
diffusion in brine
and obtain the viscosity of pure water from National Institute of
Standards and Technology’s (NIST) Chemistry Webbook [64]. As
shown in the set, the gathered data for temperature ranges from
273 to 473.15 K, pressure from 0.1 to 49.3 MPa, viscosity from
0.139 to 1.950 mPas, and diffusion coefficient from 3.1 10
10
to 16.1 10
9
m
2
/s.
2.2 Fundamentals of Algorithm
2.2.1 Support Vector Machine. By using SVM, the potential
rules can be concluded from sampling data and then utilized to
predict unknown parameters [6568]. Particularly, in the small-
sample cases, SVM shows excellent generalization properties
[6972]. In SVM, the training samples are given as {(x
1
,y
1
), (x
2
,
y
2
), …, (x
n
,y
n
)}, where x
1
,x
2
,…, x
n
denotes the space of input
variables, i.e., pressure, temperature, and viscosity in this work;
y
1
,y
2
,…,y
n
are the experimental diffusion coefficients of CO
2
in
brine (outputs). The objective of SVM is to find an optimal hyper-
plane that can minimize the structural risk using the SVM regres-
sion function. In other words, it aims to strike a great balance
between the learning capability and the model complexity. The
regression function can be designed as follows:
fðxÞ¼wTwðxÞþb(1)
where wand bare the weight vector and the bias, respectively;
w(x) is the map function, which transforms the n-dimensional
input vector into a high-dimensional feature vector in feature
space to make the nonlinear issues turn into linear regression
problem. In SVM proposed by Vapnik [65,68], all pairs (x
i
,y
i
) are
supposed to satisfy the following requirement:
jyifðxiÞj exi;yiÞ2Re>0(2)
where eis the given error and Ris the set of inputs. In the preci-
sion of e, searching for the regression function f(x) as flat as possi-
ble is equivalent to minimizing jjwjj
2
. However, some samples out
of the margin may exert a serious impact on the fitting results.
Therefore, the slack variables need to be introduced to address the
problem, which enable some unrealistic samples to deviate from
the optimal hyperplane in a certain extent. We reach the mathe-
matical model stated by Vapnik [65,68]
min 1
2jjwjj2þcX
n
i¼1
niþf
i

subject to yiwTwxi
ðÞ
beþni
wTwxi
ðÞ
þbyieþf
i
ni;f
i0
(3)
where crepresents the penalty coefficient, which controls the
tradeoff between the flatness of the function and the minimum
cumulative value of deviations; n
i
and n
i
*are the slack variables
that depend on the e-insensitive loss function expressed as below:
jn 0ifjyifðxiÞj <e
jyifðxiÞj eotherwise
(4)
In Fig. 1, we schematically illustrate this situation. The samples
out or on the edge of the e-tube are the support vectors. The loss
function of circular dots inside the tube is zero while the square
Table 1 Empirical correlations for the diffusion coefficients of CO
2
in water
Author Reference Year Correlation Solvents
Othmer and Thakar
a
[52] 1953 DCO2¼14 109
l1:1V0:6
m
Brine
Wilke and Chang
b
[53] 1955 DCO2¼7:4108Tffiffiffiffiffiffiffi
/M
p
lV0:6
m
Brine
Lu et al.
c
[40] 2013 DCO2¼D0½T=Ts1mPure water
Cadogan et al.
d
[16] 2014 DCO2¼kBT=ðnSEplaÞBrine
Moultos et al.
e
[54] 2016 DCO2¼D0ðPÞ½T=Ts1mðPÞPure water
a
D
CO2
is the diffusion coefficient of CO
2
,m
2
/s; lis the solvent viscosity, mPas; V
m
is the molar volume of the diffusing substances, cm
3
/gmol.
b
Tis the temperature, K; /is the association parameter; Mis the molecular weight of solvent.
c
D
0
¼13.942 10
9
m
2
/s; T
s
¼227.0 K; m¼1.7094.
d
k
B
¼1.38065 10
23
J/K; n
SE
is the Stokes–Einstein number; ais the hydrodynamic radius of the solute, pm; a¼a
298
[1 þa(T298)], where
a
298
¼168 pm, a¼2.0 10
3
.
e
D
0
¼a
1
ln(P)þa
2
,m¼b
1
ln(P)þb
2
, where a
1
¼2.3097 10
9
,a
2
¼2.1064 10
8
,b
1
¼0.17812, and b
2
¼2.59406; Pis the pressure.
041001-2 / Vol. 141, APRIL 2019 Transactions of the ASME
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
dots outside the e-tube are penalized. The penalty coefficient c
ensures the strength of the punishment. It is worth mentioning that
because cis given artificially, finding the optimal cis a requisite
for the excellent performance of SVM. We will discuss the choice
of cin Sec. 2.3.
It has been confirmed that obtaining the solution of convex
optimization problem in Eq. (3) through its dual formulation is
much easier [73]. The corresponding dual optimization model for
Eq. (3) is [65,74]
max eX
n
i¼1
aiþa
i

þX
n
i¼1
yiaia
i

1
2X
n
i¼1;j¼1
aia
i

aja
j

wxi
ðÞ
Twxj
ðÞ
subject to X
n
i¼1
aia
i

¼0
0aic;0a
ic
8
>
<
>
:
(5)
where a
i
and a
i
*
are Lagrange multipliers.
2.2.2 Mixed Kernels. The value of w(x
i
)
T
w(x) can be calcu-
lated through the kernel function, i.e., K(x
i
,x)¼w(x
i
)
T
w(x). Then
simple computation in low dimensional space replaces the com-
plex computation in high dimensional space. Owing to the utiliza-
tion of kernel function, SVM avoids the curse of dimensionality
[67,75]. In fact, the kernel function is divided into two types, i.e.,
global kernels and local kernels. The polynomial kernel in Eq. (6)
and the radial basis function (RBF) kernel in Eq. (7) are typical
examples of global and local kernels, respectively [76]. Figure 2
depicts the difference between them. As shown in Fig. 2(a), data
points far from each other are able to affect kernel values effec-
tively in global kernels, while local kernels in Fig. 2(b)only allow
data points close to each other to exert an impact on kernel values
Kpolyðxi;xÞ¼ðhxi;x1Þd(6)
KRBFðxi;xÞ¼expðcjjxixjj2Þ(7)
where dis the polynomial degree and cis the kernel parameter.
Any function which satisfies Mercer’s conditions can be desig-
nated as the kernel function, including mixtures of global and
local kernels. It was found that the mixed kernel would combine
the advantages of global and local kernels, and strike a great
balance between the interpolation and extrapolation capability of
SVM [77]. The mixed kernel is defined as below:
Kmixðxi;xjÞ¼mKpoly ðxi;xjÞþð1mÞKRBFðxi;xjÞ(8)
where mis the fraction coefficient. As shown in Fig. 2(c),
mixed kernels not only receive strong response around the test
point but also guarantee that the value of response far from the
test point wouldn’t be attenuated rapidly. However, we can see
from Fig. 2that all of these free coefficients, i.e., d,c, and m
influence the performance of SVM. In Sec. 2.3, we will find
their optimal values.
Fig. 1 Schematic showing the fundamentals of SVM. The
square dots represent support vectors. The circular dots are
normal points. The full line denotes the hyperplane and the e-
tube is the region between two dotted lines.
Fig. 2 Function curves of the polynomial kernel (a), the RBF
kernel (b), and the mixed kernel (c). x50.2 is the test point in
three types of kernels. As an example of mixed kernel func-
tions, d51 and c510 are set.
Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-3
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
2.2.3 Genetic Algorithm. We use GA to obtain the optimal
values of c,d,c, and m. Introduced by imitating the evolutionary
principle of biological population, GA has been one of the most
popular global optimization methods [7880]. Generally, GA
starts with an original population in which the optimal solution
candidates may exist. The population is composed of a certain
number of individuals with gene encoding. According to the prin-
ciple of survival of the fittest, approximate optimal solutions can
be found during evolution. Depending on the evolutionary goals,
the values of fitness function are calculated. Three genetic opera-
tors, including selection, crossover, and mutation, are performed
to create a new generation in each iteration. Particularly, it is
important for GA to set appropriate initial parameters such as pop-
ulation size, genetic algebra, crossover rate, and mutation rate
[81,82].
2.3 Model Development of Genetic Algorithm and Mixed
Kernels -Based Support Vector Machine. We present the com-
bination of GA and MK based SVM in this section. Based on the
principle that both of training set and testing set cover ranges as
large as possible with no overlap, we divided the whole database
into two groups. The first group is the training set composed of 72
data points for the development of our model. The second one is
the testing set (20 data points) employed to verify our model. We
summarize the information of the two groups in Table 2.
In order to accelerate the convergence and improve the model
accuracy in the training process, both of inputs and outputs are
normalized to a range of [1, þ1]
Xn¼2XXmin
ðÞ
Xmax Xmin 1(9)
where X
n
is the normalized value; Xis the actual value; X
min
and
X
max
are the minimum and maximum values of the parameter Xin
the whole data set.
After data preprocessing, as shown in Fig. 3, the hybrid model
of GA and MK based SVM can be established through the follow-
ing steps:
(1) Design a basic SVM model that is used in the regression
analysis.
(2) Mix the polynomial kernel and the RBF kernel using a frac-
tion coefficient m.
(3) Initialize the operating parameters of GA including popula-
tion size, evolutionary generations, selection rate, crossover
rate, and mutation rate.
(4) Generate initial values of c,d,c, and mrandomly in a cer-
tain scope we preset.
(5) With given c,d,c, and m, the fitness value for training set
is calculated. If the stopping criterion is met, stop the opti-
mization process and output c,d,c, and m. Otherwise, run a
GA and optimize the values of c,d,c, and m.
(6) Repeat step 5 with new values of c,d,c, and muntil the
stopping criterion can be satisfied. The optimal c,d,c, and
mcan be output.
(7) With the optimal c,d,c, and m, the training set is used to
train SVM. Finally, diffusion coefficients of CO
2
can be
estimated by a trained SVM.
To determine the optimal c,d,c, and m, mean square error
(MSE), as the fitness value, is calculated as below:
MSE ¼1
nX
n
i¼1
Dexp
iDpre
i

2(10)
where D
i
exp
and D
i
pre
are, respectively, the diffusion coefficients
estimated from experiments and our proposed model, m
2
/s. When
the fitness value reaches its minimum value, the optimal c,d,c,
and mis determined. The stochastic property of GA makes it pos-
sible to globally search for multiple optimal solutions. But its opti-
mal solution aggregate is not fixed every time. In order to obtain
the most appropriate parameter configuration for diffusivity pre-
diction, we run the MKSVM-GA model ten times in different
population size and evolutionary generations, as shown in Fig. 4.
When population number and generation number are set as 200,
uptime (about 30 min for each run) is sacrificed for minimum
average and error margin of MSE. Therefore, in this work, for the
balanced performance between model precision and computa-
tional efficiency, we choose 50 as population number and genera-
tion number. Besides, the rates of selection, crossover, and
mutation are adjusted in a small scope, referred to values set in
the literature [71]. Table 3summarizes parameter configuration of
the MKSVM-GA model.
2.4 Model Evaluation
2.4.1 Quantitative Evaluation. To examine the capability of
our proposed model, we utilized various frequently used statistical
Table 2 Information of the training set and the testing set
Group Temperature (K) Pressure (MPa) Viscosity (mPas) Diffusion coefficient (10
9
m
2
/s) Number of data points
Training set 273.15–473.15 0.1–49.3 0.139–1.950 0.31–16.10 72
Testing set 273.00–423.00 0.1–48.6 0.186–1.807 0.67–12.33 20
Fig. 3 Computational procedure of the MKSVM-GA technique
041001-4 / Vol. 141, APRIL 2019 Transactions of the ASME
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
evaluation indicators to measure its accuracy, i.e., mean absolute
error (MAE), mean absolute relative error (MARE), root-mean-
squared error (RMSE), and coefficient of determination (R
2
).
These indicators are defined as follows:
MAE ¼1
nX
n
i¼1
yexp
iypre
i
(11)
MARE ¼100% 1
nX
n
i¼1
yexp
iypre
i
yexp
i
(12)
RMSE ¼ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1
nX
n
i¼1
yexp
iypre
i

2
s(13)
R2¼1X
n
i¼1
yexp
iypre
i

2.X
n
i¼1ðyexp
iyave
exp Þ2(14)
where y
i
exp
and y
i
pre
are, respectively, the experimentally deter-
mined value and prediction, m
2
/s; y
exp
ave
is the average value of
experimental results, m
2
/s.
In addition to evaluation indicators and scatter plots that are
visually intuitive methods measuring the prediction performances
of the regression model, there exists a more quantitative
method—hypothesis test. Two widely used hypothesis tests, T-test
and F-test, are conducted to further assess the robustness of our
model [83]. In T-test, null hypothesis that the data belongs to nor-
mal distributions can be determined to accept or not, while F-test
(also known as the homoscedasticity test) estimates the null
hypothesis that the variances of two samples from normal distri-
butions are identical. If the return value of the test is zero, the null
hypotheses cannot be rejected at 5% significant level.
2.4.2 Outlier Diagnostics. We also applied outlier diagnostics
to ensure the results had a goodness of fitting. This regression
analysis approach is commonly employed to identify single datum
or a series of data which probably differ from the vast majority of
the data points in the whole dataset [8385].
We conducted Leverage approach in the Williams plot to detect
outliers as well as doubtful data. The essential components in
Williams plot, including the standardized residual (SR), the hat
values (H), and the warning leverage (H*), need to be first calcu-
lated. Interested readers could find more details of the computa-
tional procedure for this method from Refs. [85] and [86]. As
shown in Fig. 5, the data points falling into the square region of
3<SR <3 and 0 <H<H* are valid, while those points lying
out of the range will be identified as outliers [85]. Typically, there
are two kinds of outliers: good high leverage (GHL) points and
bad high leverage (BHL) points. The points located in the ranges
of 3SR 3 and HH* are defined as GHL points that can-
not be predicted by the model. BHL points, which drop into the
domain of SR <3orSR>3 (no matter what H* value is) prob-
ably result from experimental errors [85,86].
3 Results and Discussion
3.1 Performance of the Proposed Model. Using the hybrid
technique of SVM, MK, and GA, the parameters c,d,c, and m
were determined within a wide range (0 c200, 0 d10, 0
c100, 0 m1). The optimal values of c,d,c, and mare
38.4115, 88.9051, 4, and 0.3430, respectively. Then, CO
2
diffu-
sion coefficients in brine are predicted.
The cross plot exhibited in Fig. 6(a)compares the estimations
of our model and laboratory results. Evidently, all of the points in
training and testing datasets closely scatter around the bisector of
the first quadrant, i.e., the 45 deg line, which manifests a good
agreement between our predictions and the experimental data
from previous literatures. In the light of the statistical evaluation
results presented in Table 4, MAE, MARE, RMSE, and R
2
for the
total data set are 1.311 10
10
m
2
/s, 7.91%, 1.954 10
10
m
2
/s,
and 0.9960, respectively. Therefore, it is concluded that our pre-
diction model is of high accuracy and strong generalization. In
Table 3 Basic parameters used in the MKSVM-GA model
Parameters Value
Population size 50
Evolutionary generations 50
Selection rate 1.0
Crossover rate 0.6
Mutation rate 0.03
Fig. 5 Illustration of the Williams plot
Fig. 4 Mean square error and average uptime of the MKSVM-GA model in different
population size and evolutionary generations
Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-5
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
order to highlight the performance of our model in a visually bet-
ter way, we exhibit the measured and predicted diffusion coeffi-
cients of CO
2
versus the index of data points in Fig. 6(b). The
comparison indicates that the predictions estimated from our
model are remarkably consistent with the experimental results.
Figure 7shows that the residuals follow the normal distribution
with a mean value of 4.115 10
11
m
2
/s. The return values of
T-test and F-test are both zero, which indicates that both of the
null hypotheses are accepted. Therefore, the statistical signifi-
cance of our prediction is substantiated.
In Fig. 8, Williams plot is presented to detect the outliers. Here,
the warning leverage evaluated for the whole dataset is 0.130.
Given that the majority of the predicted points lie in the square
region of 3SR 3 and 0 H0.130, the MKSVM-GA
model, establishing a credible internal relation between these vari-
ables and diffusion coefficients of CO
2
in brine, is proven to be
statistically effective. There is no GHL point and we list the three
detected BHL points in Table 5. The data point measured at 373
K, 20 MPa reported by Lu et al. [40] does not coincide with its
variation trend with the pressure reported by Cadogan et al. [41].
Therefore, the experimental value at this condition may be errone-
ous. The validity of the other outliers cannot be determined
because the diffusion coefficients of CO
2
at these conditions were
not reported by any other researchers. As reported by Lu et al.,
overestimation of diffusion coefficients at temperatures higher
than 353 K is attributed to the scarcity of data points in high tem-
peratures [40]. With more sufficient data points at high tempera-
tures, some unreasonable data could be removed.
3.2 Comparison With Other Models
3.2.1 Artificial Neural Network Model. In the field of
machine learning, artificial neural network (ANN) is another prev-
alent technique. This method can deal with complicated functional
relationships; hence, it is widely used to predict many important
chemical parameters [58,59]. Typically, an artificial neural net-
work is composed of a bunch of nodes called neurons and the con-
nection constitution. Each neuron in ANN is featured by three
elements including weight, bias, and activation function. The
weight parameter measures the connecting strength for certain
input. The bias is a nonzero constant representing the type of con-
nection weight added to the summation of inputs and correspond-
ing weights. The activation function is used to introduce a
nonlinear relation into a multilayer perception neural network
[8789].
To examine the capability of our model, we built a typical
three-layered feed forward ANN model to forecast diffusion coef-
ficients of CO
2
in brine and compared the results with the
Fig. 6 Prediction performance of the proposed MKSVM-GA model: (a) cross plot
of the estimated results versus experimental diffusion coefficients of CO
2
in brine;
(b) comparison of each experimental and predicted data points
Table 4 Statistical evaluation results of the MKSVM-GA model
Evaluation matrices Training set Testing set Total set
MAE (10
9
m
2
/s) 0.1112 0.2028 0.1311
MARE (%) 7.17 10.55 7.91
RMSE (10
9
m
2
/s) 0.1527 0.3028 0.1954
R
2
(fraction) 0.9975 0.9910 0.9960
Fig. 7 Distribution of the residuals between predicted and
experimental diffusion coefficients of CO
2
. The columns are
instances and the solid curve is the normal distribution curve.
Std Dev represents the standard deviation of residuals.
Fig. 8 Williams plot used for the detection of outliers
041001-6 / Vol. 141, APRIL 2019 Transactions of the ASME
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
predictions of our MKSVM-GA model. Figure 9illustrates the
structure of this ANN model including an input layer, a hidden
layer, and an output layer. The input layer consists of three neu-
rons denoting attributes of these samples i.e., pressure, tempera-
ture, and viscosity. Diffusion coefficients of CO
2
in brine are
obtained from the neuron of the output layer. We determined the
number of neurons in the hidden layer through the means of trial
and error. In this work, the number of neurons was raised from 4
to 20 at intervals of two neurons. At each number of neurons, the
model was tested ten times and its MSE was traced. Aimed at the
minimum MSE for the training set, we found the optimal number
of neurons for this ANN model. The minimum MSE curve with
variation of neuron number is shown in Fig. 10. The minimal one
is situated at the neuron number of 8. Therefore, the optimal ANN
architecture is determined as 3-8-1.
The MKSVM-GA model and the ANN model were both run
ten times. Figure 11 depicts performance comparison between the
MKSVM-GA model and the ANN model in the training stage.
The convergence scope of the MKSVM-GA model, shaded with
blue, is distinctly lower and narrower than results of the ANN
training model. This phenomenon is attributed to small-scale sam-
ples. ANN is based on traditional statistics, a gradual theory that
sample size is tending to be infinite. Thus, when small-scale sam-
ples are manipulated in ANN, it is doubtful to take it for granted
that statistical properties are concluded. In fact, desired perform-
ance is not achieved in ANN in this situation. On the other hand,
the statistical learning theory is fundamental to SVM, which is a
theory on the condition of small sample size. Professionally
speaking, SVM, which embodies the structural risk minimization
principle, can minimize the upper bound of the generalization
error rather than the training error from ANN, which embodies
the empirical risk minimization principle [9092].
The most optimal result (the largest R
2
) predicted by ANN is
shown in Fig. 12. The statistical evaluation results of ANN model
and our proposed model are summarized in Table 6. Lower MAE,
MARE, RMSE, and higher R
2
demonstrate that the MKSVM-GA
model is more precise than the ANN model in both the testing and
the training stages. These evaluation results, as well as the
Table 5 Outliers recognized in the Williams plot
Author Reference Temperature (K) Pressure (MPa) Viscosity (mPas) Diffusion coefficient (10
9
m
2
/s) Solvent Outlier type
Maharajh et al. [19] 0.1 273.00 1.807 1.00 Pure water BHL
Lu et al. [40] 20.0 373.15 0.287 6.43 Pure water BHL
Lu et al. [40] 20.0 393.15 0.237 8.13 Pure water BHL
Fig. 9 Illustration of the structure of a typical three-layered
feed forward ANN model
Fig. 10 The minimum MSE with variation of number of neurons
in the hidden layer
Fig. 11 Comparison of the training performance between the
MKSVM-GA model and the ANN model
Fig. 12 Comparison of the prediction performance between
the MKSVM-GA model and the ANN model
Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-7
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
absolute error curve shown in Fig. 13, substantiate the fact that
MKSVM-GA performs better than ANN on the condition of small
sample size. However, due to solution difficulty of high dimen-
sional matrices in convex quadratic programming, it is a technical
conundrum for SVM to be implemented with huge sample size,
while ANN could handle it with ease. Even so, there is no clear
dividing boundary on sample size between SVM and ANN. In
fact, most of researchers prefer to investigate the adaptability of
SVM or ANN through trial and error methods [87,93].
3.2.2 Empirical Correlations. In this section, we compared
our model with empirical correlations to examine its validity. As
mentioned above, there are five correlations that can predict the
diffusion coefficient in CO
2
-brine system, i.e., Othmer–Thakar’s
model [52], Wilke–Chang’s model [53], Lu’s model [40], Cado-
gan’s model [16], and Moultos’ model [54]. However, Moultos’
correlation cannot be used to conduct this comparison because
this formula, modified from Lu’s model, is available only in ultra-
high temperature (323.15 T1023.15 K) and pressures (200
P1000 MPa), which are beyond the scope of our training set.
Table 6 Comparison of the statistical evaluation results between the MKSVM-GA model and the ANN model
Training set Testing set Total set
Evaluation matrices MKSVM-GA ANN MKSVM-GA ANN MKSVM-GA ANN
MAE (10
9
m
2
/s) 0.1112 0.1490 0.2028 0.3172 0.1311 0.1855
MARE (%) 7.17 11.97 10.55 14.80 7.91 12.59
RMSE (10
9
m
2
/s) 0.1527 0.2275 0.3028 0.4608 0.1954 0.2944
R
2
(fraction) 0.9975 0.9944 0.9910 0.9785 0.9960 0.9908
Fig. 13 Absolute error curves of the MKSVM-GA model, the
ANN model, and four empirical correlations
Fig. 14 Comparison of the prediction performances between the MKSVM-GA model and four
empirical correlations: (a) Othmer and Thakar; (b) Wilke and Chang; (c) Lu et al.; (d) Cadogan
et al. Squares and triangles represent the values predicted using the MKSVM-GA model and
empirical correlations, respectively.
041001-8 / Vol. 141, APRIL 2019 Transactions of the ASME
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
Figure 14 compares the performance of each empirical correla-
tion with our proposed model. Unlike the data points from the
MKSVM-GA model, which distribute uniformly around the
45 deg line, the predictions calculated by using the first three
correlations—Othmer’s model, Wilke–Chang’s model, and Lu’s
model, deviated up or down obviously, particularly for higher dif-
fusion coefficients. Meanwhile, the results estimated from Cado-
gan’s model are more dispersed along the 45 deg line than our
MKSVM-GA model. Therefore, the four empirical correlations
show lower accuracy than our proposed model. In addition, the
evaluation results shown in Table 7and the absolute error curves
depicted in Fig. 13 confirm that all of the four empirical correla-
tions have weaker prediction capability. MAE, MARE, and
RMSE of empirical correlations approximate to twice or more as
much as that of the MKSVM-GA model. In comparison to the
high fitting degree of intelligent algorithms, none of the empirical
correlations has a greater coefficient of determination (R
2
) than
0.99, even though the performance of Cadogan’s model is far bet-
ter than the other formulas.
Because the models proposed by Othmer and Thakar as well as
Wilke and Chang are general equations for gas diffusivity in solu-
tions, they only provide the rough guess and estimation for CO
2
in
brine. The model of Lu et al., only available in pure water, failed
to account for the effect of solvent viscosity. These empirical cor-
relations including Cadogan’s model are established on the same
assumption that the effect of pressure is negligible. As a matter of
fact, the variation in pressure can lead to fluctuations of CO
2
dif-
fusion coefficients. The ignorance of influencing parameters is
indeed the most important reason to explain why the empirical
correlations exhibit poor performances. A considerate composi-
tion of influencing parameters unleashes the advantages of artifi-
cial intelligent algorithms. With the aid of the excellent
robustness and fault tolerance, our proposed model exhibits an
incomparable superiority to the traditional empirical correlations.
4 Conclusions
We developed a hybrid technique of support vector machine,
mixed kernels, and genetic algorithm to predict CO
2
molecular
diffusivity in brine. A total of 92 experimental points obtained
from previous literatures are employed to train and test the
MKSVM-GA model. The statistical evaluation indicators of total
data set, i.e., MAE (1.311 10
10
m
2
/s), MARE (7.91%), RMSE
(1.954 10
10
m
2
/s), and R
2
(0.9960), demonstrate that our pro-
posed model exhibits an excellent performance with high accu-
racy and strong generalization. To highlight the superiority of our
model, a typical three-layered ANN model and four commonly
used empirical correlations are employed to conduct the compari-
son with the MKSVM-GA model. The results confirm that the
proposed model has significantly better prediction capability.
Potentially, our technique can be extended to other engineering
disciplines that demand fast and convenient access to basic physi-
cal parameters.
Funding Data
National Program for Fundamental Research and Develop-
ment of China (973 Program) (Grant No. 2015CB250905).
Program for Changjiang Scholars and Innovative Research
Team in University (Grant No. IRT1294).
National Postdoctoral Program for Innovative Talents (Grant
No. BX201600153).
China Postdoctoral Science Foundation (Grant No.
2016M600571).
Qingdao Postdoctoral Applied Research Project (Grant No.
2016218).
References
[1] Ota, M., Saito, T., Aida, T., Watanabe, M., Sato, Y., Smith, R. L., and Inomata,
H., 2007, “Macro and Microscopic CH
4
-CO
2
Replacement in CH
4
Hydrate
Under Pressurized CO
2
,” AIChE J.,53(10), pp. 2715–2721.
[2] Bai, D., Zhang, X., Chen, G., and Wang, W., 2012, “Replacement Mechanism
of Methane Hydrate With Carbon Dioxide From Microsecond Mol ecular
Dynamics Simulations,” Energy Environ. Sci.,5(5), pp. 7033–7041.
[3] Pruess, K., 2006, “Enhanced Geothermal Systems (EGS) Usin g CO
2
as Work-
ing Fluid—A Novel Approach for Generating Renewable Energy With Simulta -
neous Sequestration of Carbon,” Geothermics,35(4), pp. 351–367.
[4] Zhang, L., Cui, G., Zhang, Y., Ren, B., Ren, S., and Wang, X., 2016, “Influence
of Pore Water on the Heat Mining Performance of Supercritical CO
2
Injected
for Geothermal Development,” J. CO
2
Util.,16, pp. 287–300.
[5] Cui, G., Ren, S., Rui, Z., Ezekiel, J., Zhang, L., and Wang, H., 2018, “The Influ-
ence of Complicated Fluid-Rock Interactions on the Geothermal Exploitation in
the CO
2
Plume Geothermal System,” Appl. Energy,227, pp. 49–63.
[6] Mohamed, I. M., He, J., and Nasr-El-Din, H. A., 2012, “Experimental Analysis
of CO
2
Injection on Permeability of Vuggy Carbonate Aquifers,” ASME J.
Energy Resour. Technol.,135(1), p. 013301.
[7] Cui, G., Wang, Y., Rui, Z., Chen, B., Ren, S., and Zhang, L., 2018,
“Assessing the Combined Influence of Fluid-Rock Interactions on Reservoir
Properties and Injectivity During CO
2
Storage in Saline Aquifers,” Energy,
155, pp. 281–296.
[8] Rau, G. H., and Caldeira, K., 1999, “Enhanced Carbonate Dissolution: A Means
of Sequestering Waste CO
2
as Ocean Bicarbonate,” Energy Convers. Manage.,
40(17), pp. 1803–1813.
[9] Chen, B., and Reynolds, A. C., 2018, “CO
2
Water-Alternating-Gas Injection for
Enhanced Oil Recovery: Optimal Well Controls and Half-Cycle Lengths,”
Comput. Chem. Eng.,113, pp. 44–56.
[10] Li, A., Ren, X., Fu, S., Lv, J., Li, X., Liu, Y., and Lu, Y., 2018, “The Experi-
mental Study on the Flooding Regularities of Various CO
2
Flooding Modes
Implemented on Ultralow Permeability Cores,” ASME J. Energy Resour. Tech-
nol.,140(7), p. 072902.
[11] Ren, B., Zhang, L., Huang, H., Ren, S., Chen, G., and Zhang, H., 2015,
“Performance Evaluation and Mechanisms Study of Near-Miscible CO
2
Flood-
ing in a Tight Oil Reservoir of Jilin Oilfield China,” J. Nat. Gas Sci. Eng.,27,
pp. 1796–1805.
[12] Li, S., Li, B., Zhang, Q., Li, Z., and Yang, D., 2018, “Effect of CO
2
on Heavy
Oil Recovery and Physical Properties in Huff-n-Puff Processes Under Reservoir
Conditions,” ASME J. Energy Resour. Technol.,140(7), p. 072907.
[13] Mutoru, J. W., Leahy-Dios, A., and Firoozabadi, A., 2011, “Modeling Infinite
Dilution and Fickian Diffusion Coefficients of Carbon Dioxide in Water,”
AIChE J.,57(6), pp. 1617–1627.
[14] Farajzadeh, R., Zitha, P. L., and Bruining, J., 2009, “Enhanced Mass Transfer
of CO
2
Into Water: Experiment and Modeling,” Ind. Eng. Chem. Res.,48(13),
pp. 6423–6431.
[15] Trevisan, L., Pini, R., Cihan, A., Birkholzer, J. T., Zhou, Q., and Illangasekare,
T. H., 2014, “Experimental Investigation of Supercritical CO
2
Trapping Mecha-
nisms at the Intermediate Laboratory Scale in Well-Defined Heterogeneous
Porous Media,” Energy Procedia,63, pp. 5646–5653.
[16] Cadogan, S. P., Hallett, J. P., Maitland, G. C., and Trusler, J. M., 2014,
“Diffusion Coefficients of Carbon Dioxide in Brines Measured Using
13
C
Pulsed-Field Gradient Nuclear Magnetic Resonance,” J. Chem. Eng. Data,
60(1), pp. 181–184.
[17] Vivian, J. E., and King, C. J., 1964, “Diffusivities of Slightly Soluble Gases in
Water,” AIChE J.,10(2), pp. 220–221.
[18] Mazarei, A. F., and Sandall, O. C., 1980, “Diffusion Coefficients for Helium,
Hydrogen, and Carbon Diox ide in Water at 25 C,” AIChE J.,26(1), pp.
154–157.
[19] Maharajh, D. M., and Walkley, J., 1973, “The Tempe rature Dependence of the
Diffusion Coefficients of Ar, CO
2
,CH
4
,CH
3
Cl, CH
3
Br, and CHCl
2
F in Water,”
Can. J. Chem.,51(6), pp. 944–952.
[20] Tamimi, A., Rinker, E. B., and Sandall, O. C., 1994, “Diffusion Coefficients for
Hydrogen Sulfide, Carbon Dioxide, and Nitrous Oxide in Water Over the Tem-
perature Range 293-368 K,” J. Chem. Eng. Data,39(2), pp. 330–332.
Table 7 Comparison of the statistical evaluation results between the MKSVM-GA model and correlations for the whole database
Evaluation matrices MKSVM-GA Othmer and Thakar Wilke and Chang Lu et al. Cadogan et al.
MAE (10
9
m
2
/s) 0.1311 0.3454 0.3380 0.4036 0.2627
MARE (%) 7.91 12.75 12.60 24.77 13.84
RMSE (10
9
m
2
/s) 0.1954 0.5661 0.7311 0.6333 0.3661
R
2
(fraction) 0.9960 0.9661 0.9434 0.9575 0.9858
Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-9
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
[21] Frank, M. J., Kuipers, J. A., and van Swaaij, W. P., 1996, “Diffusion Coeffi-
cients and Viscosities of CO
2
þH
2
O, CO
2
þCH
3
OH, NH
3
þH
2
O, and
NH
3
þCH
3
OH Liquid Mixtures,” J. Chem. Eng. Data,41(2), pp. 297–302.
[22] Ng, W. Y., and Walkley, J., 1969, “Diffusion of Gases in Liquids: The Constant
Size Bubble Method,” Can. J. Chem.,47(6), pp. 1075–1077.
[23] J
ahne, B., Heinz, G., and Dietrich, W., 1987, “Measurement of the Diffusion
Coefficients of Sparingly Soluble Gases in Water,” J. Geophys. Res.,92(C10),
pp. 10767–10776.
[24] Hirai, S., Okazaki, K., Yazawa, H., Ito, H., Tabe, Y., and Hijikata, K., 1997,
“Measurement of CO
2
Diffusion Coefficient and Application of LIF in Pressur-
ized Water,” Energy,22(2–3), pp. 363–367.
[25] Guzman, J., and Garrido, L., 2012, “Deter mination of Carbon Dioxide Trans-
port Coefficients in Liquids and Polymers by NMR Spectroscopy,” J. Phys.
Chem. B,116(20), pp. 6050–6058.
[26] Liger-Belair, G., Prost, E., Parmentier, M., Jeandet, P., and Nuzillard, J. M., 2003,
“Diffusion Coefficient of CO
2
MoleculesasDeterminedby
13
CNMRinVarious
Carbonated Beverages,” J. Agric. Food Chem.,51(26), pp. 7560–7563.
[27] Maharajh, D., 1973, “Solubility and Diffusion of Gases in Water,” Ph.D. thesis,
Simon Fraser University, Burnaby, BC, Canada.
[28] Versteeg, G. F., and Van Swaalj, W., 1988, “Solubility and Diffusivity of Acid
Gases (Carbon Dioxide, Nitrous Oxide) in Aqueous Alkanolamine Solutions,”
J. Chem. Eng. Data,33(1), pp. 29–34.
[29] Himmelblau, D. M., 1964, “Diffusion of Dissolved Gase s in Liquids,” Chem.
Rev.,64(5), pp. 527–550.
[30] Thomas, W. J., and Adams, M. J., 1965, “Measurement of the Diffusion Coeffi-
cients of Carbon Dioxide and Nitrous Oxide in Water and Aqueous Solutions of
Glycerol,” Trans. Faraday Soc.,61, pp. 668–673.
[31] Brignole, E. A., and Echarte, R., 1981, “Mass Transfer in Laminar Liquid Jets:
Measurement of Diffusion Coefficients,” Chem. Eng. Sci.,36(4), pp. 705–711.
[32] Nijsing, R., Hendriksz, R. H., and Krame rs, H., 1959, “Absorption of CO
2
in
Jets and Falling Films of Electrolyte Solutions, With and Without Chemical
Reaction,” Chem. Eng. Sci.,10(1–2), pp. 88–104.
[33] Tan, K. K., and Thorpe, R. B., 1992, “Gas Diffusion Into Viscous and Non-
Newtonian Liquids,” Chem. Eng. Sci.,47(13–14), pp. 3565–3572.
[34] Tham, M. J., Bhatia, K. K., and Gubbins, K. F., 1967, “Steady-State Method for
Studying Diffusion of Gases in Liquids,” Chem. Eng. Sci.,22(3), pp. 309–311.
[35] Vivian, J. E., and Peaceman, D. W., 1956, “Liquid-Side Resistance in Gas
Absorption,” AIChE J.,2(4), pp. 437–443.
[36] Pratt, K. C., Slater, D. H., and Wakeha m, W. A., 1973, “A Rapid Method for
the Determination of Diffusion Coefficients of Gases in Liquids,” Chem. Eng.
Sci.,28(10), pp. 1901–1903.
[37] Bodnar, L. H., and Himmelblau, D. M., 1962, “Continuous Measurement of
Diffusion Coefficients of Gases in Liquids Using Glass Scintillators,” Int. J.
Appl. Radiat. Isot.,13(1), pp. 1–6.
[38] Choudhari, R., and Doraiswamy, L. K., 1972, “Physical Properties in Reaction
of Ethylene and Hydrogen Chloride in Liquid Media. Diffusivities and Sol-
ubilities,” J. Chem. Eng. Data,17(4), pp. 428–432.
[39] Reddy, K. A., and Doraiswamy, L. K., 1967, “Estimating Liquid Diffusivity,”
Ind. Eng. Chem. Fundam.,6(1), pp. 77–79.
[40] Lu, W., Guo, H., Chou, I. M., Burruss, R. C., and Li, L., 2013, “Determination
of Diffusion Coefficients of Carbon Dioxide in Water Between 268 and 473 K
in a High-Pressure Capillary Optical Cell With in Situ Raman Spectroscopic
Measurements,” Geochim. Cosmochim. Acta,115, pp. 183–204.
[41] Cadogan, S. P., Maitland, G. C., and Trusler, J. M., 2014, “Diffusion Coeffi-
cients of CO
2
and N
2
in Water at Temperatures Between 298.15 K and 423.15
K at Pressures Up to 45 MPa,” J. Chem. Eng. Data ,59(2), pp. 519–525.
[42] Jang, H. W., Yang, D., and Li, H., 2018, “A Power-Law Mixing Rule for Pre-
dicting Apparent Diffusion Coefficients of Binary Gas Mixtures in Heavy Oil,”
ASME J. Energy Resour. Technol.,140(5), p. 052904.
[43] Shi, Y., Zheng, S., and Yang, D., 2017, “Determination of Individual Diffusion
Coefficients of Alkane Solvent(s)–CO
2
–Heavy Oil Systems With Consideration
of Natural Convection Induced by Swelling Effect,” Int. J. Heat Mass Transfer,
107, pp. 572–585.
[44] Zheng, S., and Yang, D., 2017, “Experimental and Theoretical Determination
of Diffusion Coefficients of CO
2
-Heavy Oil Systems by Coupling Heat and
Mass Transfer,” ASME J. Energy Resour. Technol.,139(2), p. 022901.
[45] Zheng, S., and Yang, D., 2017, “Determination of Individual Diffusion Coeffi-
cients of C
3
H
8
/n-C
4
H
10
/CO
2
/Heavy-Oil Systems at High Pressures and Ele-
vated Temperatures by Dynamic Volume Analysis,” SPE J.,22, pp. 799–816.
[46] Li, H. A., Sun, H., and Yang, D., 2017, “Effective Diffusion Coefficients of Gas
Mixture in Heavy Oil Under Constant-Pressure Conditions,” Heat Mass Trans-
fer,53(5), pp. 1527–1540.
[47] Zheng, S., Sun, H., and Yang, D., 2016, “Coupling Heat and Mass Transfer for
Determining Individual Diffusion Coefficient of a Hot C
3
H
8
–CO
2
Mixture in
Heavy Oil Under Reservoir Conditions,” Int. J. Heat Mass Transfer,102, pp.
251–263.
[48] Zheng, S., Li, H. A., Sun, H., and Yang, D., 2016, “Determination of Diffusion
Coefficient for Alkane Solvent–CO
2
Mixtures in Heavy Oil With Consideration
of Swelling Effect,” Ind. Eng. Chem. Res.,55(6), pp. 1533–1549.
[49] Li, H., and Yang, D., 2016, “Determination of Individual Diffusion Coefficients
of Solvent/CO
2
Mixture in Heavy Oil With Pressure-Decay Method,” SPE J.,
21(1), pp. 131–143.
[50] Sun, H., Li, H., and Yang, D., 2014, “Coupling Heat and Mass Transfer for a
Gas Mixture–Heavy Oil System at High Pressures and Elevated Temperatures,”
Int. J. Heat Mass Transfer,74, pp. 173–184.
[51] Yang, D., Tontiwachwuthikul, P., and Gu, Y., 2006, “Dynamic Interfacial Ten-
sion Method for Measuring Gas Diffusion Coefficient and Interface Mass
Transfer Coefficient in a Liquid,” Ind. Eng. Chem. Res.,45(14), pp.
4999–5008.
[52] Othmer, D. F., and Thakar, M. S., 1953, “Correlating Diffusion Coefficient in
Liquids,” Ind. Eng. Chem.,45(3), pp. 589–593.
[53] Wilke, C. R., and Chang, P., 1955, “Correlation of Diffusion Coefficients in
Dilute Solutions,” AIChE J.,1(2), pp. 264–270.
[54] Moultos, O. A., Tsimpanogiannis, I. N., Panagiotopoulos, A. Z., and Econo-
mou, I. G., 2016, “Self-Diffusion Coefficients of the Binary (H
2
OþCO
2
) Mix-
ture at High Temperatures and Pressures,” J. Chem. Thermodyn.,93, pp.
424–429.
[55] Cadogan, S., 2015, “Diffusion of CO
2
in Fluids Relevant to Carbon Capture,
Utilisation and Storage,” Ph.D. thesis, Imperial College London, London.
[56] Shokrollahi, A., Arabloo, M., Gharagheizi, F., and Mohammadi, A. H., 2013,
“Intelligent Model for Prediction of CO
2
–Reservoir Oil Minimum Miscibility
Pressure,” Fuel,112, pp. 375–384.
[57] Le Van, S., and Chon, B. H., 2018, “Effective Prediction and Management of a
CO
2
Flooding Process for Enhancing Oil Recovery Using Artificial Neural
Networks,” ASME J. Energy Resour. Technol.,140(3), p. 032906.
[58] Zhang, J., Feng, Q., Zhang, X., Zhang, X., Yuan, N., Wen, S., Wang, S., and
Zhang, A., 2015, “The Use of an Artificial Neural Network to Estimate Natural
Gas/Water Interfacial Tension,” Fuel,157, pp. 28–36.
[59] Khaksar Manshad, A., Rostami, H., Moein Hosseini, S., and Rezaei, H., 2016,
“Application of Artificial Neural Network–Particle Swarm Optimization Algo-
rithm for Prediction of Gas Condensate Dew Point Pressure and Comparison
With Gaussian Processes Regression–Particle Swarm Optimization Algorithm,”
ASME J. Energy Resour. Technol.,138(3), p. 032903.
[60] Paul, A., Bhowmik, S., Panua, R., and Debroy, D., 2018, “Artificial Neural
Network-Based Prediction of Performances-Exhaust Emissions of Diesohol
Piloted Dual Fuel Diesel Engine Under Varying Compressed Natural Gas Flow-
rates,” ASME J. Energy Resour. Technol.,140(11), p. 112201.
[61] Chen, B., Harp, D. R., Lin, Y., Keating, E. H., and Pawar, R. J., 2018, “Geologic
CO
2
Sequestration Monitoring Design: A Machine Learning and Uncertainty
Quantification Based Approach,” Appl. Energy,225, pp. 332–345.
[62] Kamari, A., Arabloo, M., Shokrollahi, A., Gharagheizi, F., and Mohammadi, A.
H., 2015, “Rapid Method to Estimate the Minimum Miscibility Pressure (MMP)
in Live Reservoir Oil Systems During CO
2
Flooding,” Fuel,153, pp. 310–319.
[63] Tatar, A., Barati-Harooni, A., Najafi-Marghmaleki, A., Najafi-Marghmaleki,
A., Mohebbi, A., Ghiasi, M. M., Mohammadi, A. H., and Hajinezhad, A., 2016,
“Comparison of Two Soft Computing Approaches for Predicting CO
2
Solubil-
ity in Aqueous Solution of Piperazine,” Int. J. Greenhouse Gas Control,53, pp.
85–97.
[64] Linstrom, P. J., and Mallard, W. G. E., 2016, “NIST Chemistry WebBook,”
National Institute of Standards and Technology, Gaithersburg, MD, accessed
Aug. 17, 2016, NIST Standard Reference DatabaseNo. 69, http://webbook.nist.gov
[65] Cortes, C., and Vapnik, V., 1995, “Support-Vector Networks,” Mach. Learn.,
20(3), pp. 273–297.
[66] Vapnik, V., 1995, The Nature of Statistical Learning Theory, Springer, New
York.
[67] Boser, B. E., Guyon, I. M., and Vapnik, V. N., 1992, “A Training Algorithm for
Optimal Margin Classifiers,” Fifth Annual Workshop on Computational Learn-
ing Theory, Pittsburgh, PA, July 27–29, pp. 144–152.
[68] Drucker, H., Burges, C. J., Kaufman, L., Smola, A., and Vapnik, V., 1997,
“Support Vector Regression Machines,” Advances in Neural Information Proc-
essing Systems 9, MIT Press, Cambridge, MA, pp. 155–161.
[69] Geng, Y., Chen, J., Fu, R., Bao, G., and Pahlavan, K., 2016, “Enlighten Weara-
ble Physiological Monitoring Systems: On-Body of Characteristics Based
Human Motion Classification Using a Support Vector Machine,” IEEE Trans.
Mobile Comput.,15(3), pp. 656–671.
[70] Lee, Y. J., and Mangasarian, O. L., 2001, “SSVM: A Smooth Support Vector
Machine for Classification,” Comput. Optim. Appl.,20, pp. 5–22.
[71] Bian, X. Q., Han, B., Du, Z. M., Jaubert, J. N., and Li, M. J., 2016, “Integrating
Support Vector Regression With Genetic Algorithm for CO
2
-Oil Minimum
Miscibility Pressure (MMP) in Pure and Impure CO
2
Streams,” Fuel,182, pp.
550–557.
[72] Filgueiras, P. R., Portela, N. A., Silva, S. R., Castro, E. V., Oliveira, L. M.,
Dias, J. C., Neto, A. C., Rom~
ao, W., and Poppi, R. J., 2016, “Determination of
Saturates, Aromatics, and Polars in Crude Oil by
13
C NMR and Support Vector
Regression With Variable Selection by Genetic Algorithm,” Energy Fuels,
30(3), pp. 1972–1978.
[73] Fiacco, A. V., and McCormick, G. P., 1964, “The Sequential Unconstrained
Minimization Technique for Nonlinear Programing, A Primal-Dual Method,”
Manag. Sci.,10(2), pp. 360–366.
[74] Smola, A. J., and Sch
olkopf, B., 2004, “A Tutorial on Support Vector
Regression,” Stat. Comput.,14(3), pp. 199–222.
[75] Sch
olkopf, B., and Burges, C. J., 1999, Advances in Kernel Methods: Support
Vector Learning, MIT Press, Cambridge, MA.
[76] Tehrany, M. S., Pradhan, B., and Jebur, M. N., 2014, “Flood Susceptibility
Mapping Using a Novel Ensemble Weights-of-Evidence and Support Vector
Machine Models in GIS,” J. Hydrol.,512, pp. 332–343.
[77] Smits, G. F., and Jordaan, E. M., 2002, “Improved SVM Regression Using Mix-
tures of Kernels,” International Joint Conference on Neural Networks , Vol. 3,
pp. 2785–2790.
[78] Holland, J. H., 1992, “Genetic Algorithms,” Sci. Am.,267(1), pp. 66–72.
[79] Khadse, A., Blanchette, L., Kapat, J., Vasu, S., Hossain, J., and Donaz zolo, A.,
2018, “Optimization of Supercritical CO
2
Brayton Cycle for Simple Cycle Gas
Turbines Exhaust Heat Recovery Using Genetic Algorithm,” ASME J. Energy
Resour. Technol.,140(7), p. 071601.
041001-10 / Vol. 141, APRIL 2019 Transactions of the ASME
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
[80] Salmachi, A., Sayyafzadeh, M., and Haghighi, M., 2013, “Infill Well Placement
Optimization in Coal Bed Methane Reservoirs Using Genetic Algorithm,” Fuel,
111, pp. 248–258.
[81] Velez-Langs, O., 2005, “Genetic Algorithms in Oil Industry: An Overview,”
J. Pet. Sci. Eng.,47(1–2), pp. 15–22.
[82] Davis, L., 1991, Hand book of Genetic Algorithms, Van Nostrand Reinhold,
New York.
[83] Chatt erjee, S., and Hadi, A. S., 2015, Regression Analysis by Example, Wiley,
New York.
[84] Rousseeuw, P. J., and Leroy, A. M., 2005, Robust Regression and Outlier
Detection, Wiley, New York.
[85] Mohammadi, A. H., Eslamimanesh, A., Gharagheizi, F., and Richon, D., 2012,
“A Novel Method for Evaluation of Asphaltene Precipitation Titration Data,”
Chem. Eng. Sci.,78, pp. 181–185.
[86] Mohammadi, A. H., Gharagheizi, F., Eslamimanesh, A., and Richon, D., 2012,
“Evaluation of Experimental Data for Wax and Diamondoids Solubility in Gas-
eous Systems,” Chem. Eng. Sci.,81, pp. 1–7.
[87] Feng, Q., Zhang, J., Zhang, X., and Wen, S., 2015, “Proximate Analysis Based
Prediction of Gross Calorific Value of Coals: A Comparison of Support Vector
Machine, Alternating Conditional Expectation and Artificial Neural Network,”
Fuel Process. Technol.,129, pp. 120–129.
[88] Togun, N. K., and Baysec, S., 2010, “Prediction of Torque and Specific Fuel
Consumption of a Gasoline Engine by Using Artificial Neural Networks,”
Appl. Energy,87(1), pp. 349–355.
[89] Pradhan, B., and Lee, S., 2010, “Landslide Susceptibility Assessment and Fac-
tor Effect Analysis: Backpropagation Artificial Neural Networks and Their
Comparison With Frequency Ratio and Bivariate Logistic Regression Mod-
elling,” Environ. Modell. Software,25(6), pp. 747–759.
[90] Li, Q., Meng, Q., Cai, J., Yoshino, H., and Mochida, A., 2009, “Predicting
Hourly Cooling Load in the Building: A Comparison of Support Vector
Machine and Different Artificial Neural Networks,” Energy Convers. Manage.,
50(1), pp. 90–96.
[91] Sorgun, M., Murat Ozbay oglu, A. A., and Evren Ozbayoglu, M. M., 2014,
“Support Vector Regression and Computational Fluid Dynamics Modeling of
Newtonian and Non-Newtonian Fluids in Annulus With Pipe Rotation,” ASME
J. Energy Resour. Technol.,137(3), p. 032901.
[92] Pradhan, B., 2013, “A Comparative Study on the Predictive Ability of the Deci-
sion Tree, Support Vector Machine and Neuro-Fuzzy Models in Landslide Sus-
ceptibility Mapping Using GIS,” Comput. Geosci.,51, pp. 350–365.
[93] Rostami, A., Hemmati-Sarapardeh, A., and Shamshirband, S., 2018, “Rigorous
Prognostication of Natural Gas Viscosity: Smart Modeling and Comparative
Study,” Fuel,222, pp. 766–778.
Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-11
Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use
... However, the hybrid kernel complicates the structure of SVM, which necessitates the adjustment and selection of more hyperparameters. To address this Research Article problem, numerous heuristic optimization methods such as the particle swarm optimization (PSO) [22], the genetic algorithm (GA) [23], the gravitational search algorithm (GSA) [24], the cuckoo search algorithm (CSA) [11], and the beetle antennae search algorithm (BAS) [25] have been applied to search the optimal hyperparameters of SVM. PSO and GA suffer the local minimum problem while CSA performs poorly in local searches [26,27]. ...
... Here, ω and b are the model parameters to be trained, C is the penalty coefficient, ε is the insensitive function, and ξ andξ are the slack variables [23]. The Lagrange method is used to solve Eq. (5), and then the SVM mode can be written as ...
... The most commonly used global and local kernel functions are the polynomial kernel and RBF kernel, as listed in Eqs. (7) and (8), respectively [23]. The polynomial kernel in Eq. (7) has a strong extrapolation ability, while the RBF kernel in Eq. (8) has strong interpolation ability. ...
Article
Full-text available
In this paper, the optimal hybrid kernel support vector machine is employed to propose a compensation strategy intended for the temperature drift of a fiber optical gyroscope (FOG). First, the mode of the hybrid kernel with an interpolation and extrapolation capability is constructed, which consists of the radial basis function and the polynomial kernel function. Second, the combination model of the beetle antennae search algorithm and gravitational search algorithm that has both local and global search capability is proposed to optimize the structure-related parameters of a hybrid kernel support vector machine (HKSVM). Finally, the proposed approach is trained and tested using the experimental data of temperature drift at two different rates of temperature change (10°C/min and 5°C/min). In addition, the proposed method is validated against those conventional compensation algorithms. According to the research results, the compensation error (mean squared error) of the proposed approach is reduced by 92% compared to the traditional support vector machine based on the radial basis function.
... An in-depth review of the available correlations for predicting the diffusivity coefficient of CO 2 in brine reveals the limitations of these techniques from the applicability and accuracy perspectives (Feng et al., 2019). ...
... In the context of the affecting variables, salinity affects the solubility, interfacial tension and phase equilibria, thus influencing the diffusivity. In addition, salinity of the solvents affects brine's viscosities (Cadogan, 2015;Feng et al., 2019); therefore, the salinity effect on diffusivity of CO 2 in brine is emulated by considering the brine's viscosities as an input parameter while establishing the correlations and the paradigms. The data points collected from previous experimental studies were obtained using various techniques and equipments such as Taylor dispersion, a modified version of Ringborm's apparatus, laminar jet apparatus, laminar falling film, laser-induced fluorescence (LIF), 13C pulsed-field gradient NMR, physical absorption experiments in a stirred vessel operated with a horizontal gas-liquid interface, optical capillary cell via time-dependent Raman spectroscopy, wetted sphere apparatus, Taylor Aris dispersion method and see-through windowed high-pressure cell. ...
... The comparison includes the empirical models, namely those of Othmer and Thakar (1953), Wilke and Chang (1955) and Cadogan et al. (2015). In addition to the empirical models, the implemented GEP correlation was compared with one of the most recent intelligent paradigms proposed by Feng et al. (2019) based on hybrid genetic algorithm and mixed Kernels-based support vector machine. It is worth mentioning that when performing the comparison with the pre-existing approaches, we included only the points that satisfy the applicability conditions in each correlation. ...
Article
Accurate knowledge of the diffusivity coefficient of CO2 in brine has a significant effect on the design success and monitoring of CO2 storage in saline aquifers, which is a part of carbon capture and sequestration (CCS). Frequently applied experimental approaches for determining this parameter are expensive and time-consuming, and empirical models cannot ensure accurate predictions. Therefore, there is a need to establish cutting-edge correlations for prediction of the diffusivity coefficient of CO2 in brine under various operating conditions. In this work, two white-box machine learning techniques, namely group method of data handling (GMDH) and gene expression programming (GEP) were implemented for correlating the diffusivity coefficient of CO2 in brine with pressure, temperature and the viscosity of the solvent. The obtained results demonstrated the accuracy of the proposed correlations. In addition, statistical and graphical analysis of the performances revealed that GEP correlation outperforms the GMDH correlation, decision trees (DTs), random forest (RF) and all the previous predictive models. GEP correlation exhibited an overall average absolute relative deviation (AARD) of 4.3014% and coefficient of determination (R²) of 0.9979. Finally, by performing the outliers detection, the validity of the GEP correlation was confirmed and only two experimental data points were identified as outliers.
... Generally, CO 2 diffusion coefficient in brine is known as a function of pressure, temperature, salinity, and also the composition of salt. However, several researches showed that CO 2 diffusion coefficient in liquid may be correlated to the properties of the liquid phase (such as viscosity, and density) [49,51,52]. Eq. (7.2) also shows that the effective diffusivity (D eff ;CO 2 2water ) in the presence of porous media could be reliant on the porosity ([) and tortuosity [26]. ...
Chapter
Carbonated water injection (CWI) is considered as a feasible enhanced oil recovery (EOR) method that its mechanisms, which are responsible for oil recovery, mainly initiate with the partitioning of dissolved carbon dioxide (CO2) from carbonated water to the adjacent oil phase. Alongside the EOR, CWI is considered as a promising method for geological CO2 storage beneath earth layers in depleted oil reservoirs. Countless researches have been performed on in-depth analysis of the role of CWI in improving oil recovery at the core scale. A better understanding of the primary mechanisms leading to additional oil recovery is gained through pore-scale visual investigations. The role of operational parameters such as injection rate, CO2 content, pressure, and temperature, which influence the performance of CWI, are discussed in this chapter. Challenges of CWI application are presented and discussed technically and economically. Finally, some important research findings and gaps in present understanding are highlighted.
... In [31], the effectiveness of the LSSVM in estimating the CO2 solubility is studied and the influence of salinity, pressure, and temperature is analyzed. In [32], a model is developed by the use of GA and support vector machine (SVM), and its proficiency is examined in versus of NN-based approaches. In [33][34][35][36], decision making techniques are used for modeling problems in engineering applications. ...
Article
The modeling problem is one of the important topics in engineering applications. In various applications, it is required to find a mathematical model to represent the relationship between output and the associated input variables. In this study, an approach on basis of a new deep learned type-3 (T3) fuzzy logic system (FLS) is introduced. The modeling of CO2 solubility on basis of temperature, molality of NaCl, and pressure is considered as an application. The monitoring of carbon dioxide (CO2) solubility in brine is one of the effective approaches in carbon capture and sequestration technique to reduce it in the atmosphere. A new hybrid learning method is presented to optimize the suggested model. The new adaptation laws are carry-out to tune the rule parameters and centers of membership functions (MFs). The values of horizontal slices and α-cuts are learned by the unscented Kalman filter (UKF). By the real-world experimental data sets, several statistical examinations, and comparison with conventional well-known fuzzy neural networks (NNs) and learning methods, the reliability and good performance of the suggested method are demonstrated. Also, the sensitivity of the input variables is analyzed by the use of the Sobol approach. © 2022, Budapest Tech Polytechnical Institution. All rights reserved.
... Amooie et al. conducted a comprehensive data-driven study on the interfacial tension of pure and impure gas-brine mixtures within saline aquifers, then, they developed seven machine learningbased models for predicting the IFT [19]. Feng et al. used a hybrid technique of support vector machine, mixed kernels, and genetic algorithm to realize an efficient and accurate prediction of the CO 2 diffusivity in brine at reservoir conditions [20]. Sahour et al. developed a methodology combining the statistical methods and machine learning techniques based on the available hydrogeology and hydrometeorology data, to map the groundwater salinity in the southern coastal aquifer of the Caspian Sea. ...
Article
This paper constructs a prediction model based on Multilayer Perceptron (MLP) to explore the formation mechanism of brine (seawater evaporation or freezing). Four brine sets are extracted from the published, real-world data, and the simulation test with the same six chemical substance features. After integrated comparative experiments on six evaluation metrics, the results show that this model outperforms the other baseline prediction algorithms, achieving the highest precision 0.9625 and at least 8.45% improvement. Furthermore, for predicting the real-world test set, the results confirm the existence of freezing brine for the first time in Laizhou Bay area, China. This model is also used to analyze the mixed simulation results for brine and fresh groundwater. The experimental results indicate that only two out of twenty-nine samples of various concentrations change formation mechanisms after mixing. Overall, the model can effectively distinguish the evaporation and freezing brine, and discover the seawater concentration pathway.
... By modifying the structure of the SVM, Suykens and Vandewalle proposed least-squares support vector machines [38]. LS-SVM is also categorized as a supervised learning paradigm suitable for classification as well as regression tasks [38,39]. LS-SVM interprets experimental datasets using the least-squares methodology for solving a system of linear equations [40]. ...
Article
Bubble point pressure (BPP) not only is a basic pressure-volume-temperature (PVT) parameter for calculation nearly all of crude oil characteristics, but it also determines phase type of oil reservoirs, gas to oil ratio, oil formation volume factor, inflow performance relationship, and so on. Since measurement of BPP of crude oil is an expensive and time-consuming experiment, this study develops a committee machine-ensemble (CME) paradigm for accurate estimation of this parameter from solution gas oil ratio, reservoir temperature, gas specific gravity, and stock-tank oil gravity. Our CME approach is designed using linear combination of predictions of four different expert systems. Unknown coefficients of this combination are adjusted through minimizing deviation between actual BPPs and their associated predictions using differential evolution and genetic algorithm. Our proposed CME paradigm is developed using 380 PVT datasets for crude oils from different geological regions. This novel intelligent paradigm estimates available experimental databank with an excellent accuracy i.e. absolute average relative deviation (AARD) of 6.06% and regression coefficient (R2) of 0.98777. Accurate prediction of BPP using our CME paradigm decreases the risk of producing from two-phase region of oil reservoirs.
... The capability to utilize CO2 for enhanced oil recovery in mature reservoirs as well as store it in depleted reservoirs to reduce greenhouse emissions has continued to make CO2 flooding an attractive process to several governments and energy companies around the world [1][2][3][4]. CO2 gas flooding is a popular method of enhanced oil recovery that aids microscopic fluid displacement by mechanical piston-like displacement, viscosity reduction, swelling of the oil and development of miscibility. Miscibility is one of the essential mechanisms, specifically for CO2-EOR processes that need to be carefully studied [5]. ...
Article
The minimum miscibility pressure (MMP) is one of the critical parameters needed in the successful design of a miscible gas injection for enhanced oil recovery purposes. In this study, we explore the capability of using the Gaussian Process Machine Learning (GPML) Approach, for accurate prediction of this vital property in both pure and impure CO2-injection streams. We first performed a sensitivity analysis of different kernels and then a comparative analysis with other techniques. The new GPML model, when compared to previously published predictive models, including both correlations and other machine learning/intelligent models, showed superior performance with the highest correlation coefficient and the lowest error metrics.
Article
Undersaturated oil viscosity is an important physical property for reservoir simulation, enhanced oil recovery, and optimal production. There are two distinct methods for undersaturated oil viscosity determination: the first one is experimental measurements which are usually expensive or unavailable; whereas the second one is empirical correlations which frequently have appropriate accuracy. Accordingly, searching for a high reliability undersaturated oil viscosity model is vital. This paper presents a new undersaturated crude oil viscosity model by using multi-gene genetic programming (MGGP). This model was built by using 528 experimental measurements data points that presents broad range of reservoir pressure and oil properties. Another, 276 points were used for validating and testing the new model against eleven published correlations. The results indicated that the new MGGP-based model yields a precise prediction of undersaturated oil viscosity.
Article
The dew point pressure (DPP) is a crucial thermodynamic property for gas reservoir performance evaluation, gas/condensate characterization, reservoir development and management, and downstream facility design. However, dew point pressure measurement is an expensive and time-consuming task; its estimation using the thermodynamic approaches has convergency problems, and available empirical correlations often provide high uncertainty levels. In this paper, the hybrid neuro-fuzzy connectionist paradigm is developed using 390 literature measurements. The adaptive neuro-fuzzy inference system (ANFIS) topology, including the training algorithm and cluster radius (radii), was determined by combining trial-and-error and statistical analyses. The hybrid optimization algorithm and radii=0.675 are distinguished as the best characteristics for the ANFIS model. A high value of observed R2 = 0.97948 confirms the excellent performance of the designed approach for calculating the DPP of retrograde gas condensate reservoirs. Furthermore, visual inspections and statistical indices are employed to compare the ANFIS reliability and available empirical correlations. The results showed that the ANFIS model is more accurate than the well-known empirical correlations and previous intelligent paradigms in the literature. The designed ANFIS model, the best empirical correlation, and the most accurate intelligent paradigm in the literature present the absolute average relative deviation (AARD) of 1.60%, 11.25%, 2.10%, and, respectively.
Article
Full-text available
Monitoring is a crucial aspect of geologic carbon dioxide (CO2) sequestration risk management. Effective monitoring is critical to ensure CO2 is safely and permanently stored throughout the life-cycle of a geologic CO2 sequestration project. Effective monitoring involves deciding: (i) where is the optimal location to place the monitoring well(s), and (ii) what type of data (pressure, temperature, CO2 saturation, etc.) should be measured taking into consideration the uncertainties at geologic sequestration sites. We have developed a filtering-based data assimilation procedure to design effective monitoring approaches. To reduce the computational cost of the filtering-based data assimilation process, a machine-learning algorithm: Multivariate Adaptive Regression Splines is used to derive computationally efficient reduced order models from results of full-physics numerical simulations of CO2 injection in saline aquifer and subsequent multi-phase fluid flow. We use example scenarios of CO2 leakage through legacy wellbore and demonstrate a monitoring strategy can be selected with the aim of reducing uncertainty in metrics related to CO2 leakage. We demonstrate the proposed framework with two synthetic examples: a simple validation case and a more complicated case including multiple monitoring wells. The examples demonstrate that the proposed approach can be effective in developing monitoring approaches that take into consideration uncertainties.
Article
Full-text available
For the application of waste heat recovery (WHR), supercritical CO2 (S-CO2) Brayton power cycles offer significant suitable advantages such as compactness, low capital cost, and applicability to a broad range of heat source temperatures. The current study is focused on thermodynamic modeling and optimization of recuperated (RC) and recuperated recompression (RRC) configurations of S-CO2 Brayton cycles for exhaust heat recovery from a next generation heavy duty simple cycle gas turbine using genetic algorithm (GA). This nongradient based algorithm yields a simultaneous optimization of key S-CO2 Brayton cycle decision variables such as turbine inlet temperature, pinch point temperature difference, compressor pressure ratio, and mass flow rate of CO2. The main goal of the optimization is to maximize power out of the exhaust stream which makes it single objective optimization. The optimization is based on thermodynamic analysis with suitable practical assumptions which can be varied according to the need of user. The optimal cycle design points are presented for both RC and RRC configurations and comparison of net power output is established for WHR. For the chosen exhaust gas mass flow rate, RRC cycle yields more power output than RC cycle. The main conclusion drawn from the current study is that the choice of best cycle for WHR actually depends heavily on mass flow rate of the exhaust gas. Further, the economic analysis of the more power producing RRC cycle is performed and cost comparison between the optimized RRC cycle and steam Rankine bottoming cycle is presented.
Article
The present study surveys the effects on performance and emission parameters of a partially modified single cylinder direct injection (DI) Diesel engine fueled with Diesohol blends under varying compressed natural gas (CNG) flowrates in dual fuel mode. Based on experimental data an Artificial Intelligence (AI) based specialized Artificial Neural Network (ANN) model have been developed for predicting the output parameters viz. brake thermal efficiency (Bth), brake specific energy consumption (BSEC) along with emission characteristics such as oxides of nitrogen (NOX), unburned hydrocarbon (UBHC), carbon dioxide (CO2) and carbon monoxide (CO) emissions. Engine load, ethanol share and CNG strategies have been used as input parameters for the model. Among the tested models, the Levenberg-Marquardt feed-forward back propagation with three input neurons or nodes, two hidden layers with ten neurons in each layer and six output neurons, and tansig-purelin activation function have been found to the optimal model topology for the Diesohol-CNG platforms. The statistical results acquired from the optimal network topology such as correlation coefficient (0.992-0.999), mean square error (0.0001-0.0009) and mean absolute percentage error (0.09-2.41%) along with Nash-Sutcliffe coefficient of efficiency, Kling-Gupta efficiency, mean square relative error and model uncertainty established itself as a real time robust type machine learning tool under Diesohol- CNG paradigms. The study also incorporated a special type of measure namely Pearson's Chisquare test or goodness of fit, which brings up the model validation to a higher level.
Article
Complex fluid-rock interactions can occur during the injection of carbon dioxide (CO2) into saline aquifers for sequestration, which may affect CO2 injectivity and storage capacity. In this paper, a comprehensive reactive transport model is established to analyze salt precipitation, CO2-water-rock geochemical reactions, and their effects on reservoir physical properties and injectivity. In addition, sensitivity analyses are conducted to investigate the main factors that affect fluid-rock interactions and injectivity with relevance for site selection for CO2 storage. Results show that the back flow of formation water not only affects the salt precipitation but also affects the CO2-water-rock geochemical reactions, resulting in salt and calcite precipitations mainly occurring in the dry-out zone. However, most of the mineral dissolution/precipitation caused by CO2-water-rock reaction occurs in the two-phase and aqueous-phase zones, and their effect on reservoir porosity and permeability are small. A considerable amount of sodium chloride precipitates in the dry-out zone as brine is drawn by capillary action into this zone, with significant consequences for porosity, permeability and injectivity. The injection rate, salinity, capillary pressure–saturation relationships, and reservoir permeability strongly affect the distribution of salt precipitation. Moderate injection rates, salinities, capillary pressures, and permeabilities all lead to favorable CO2 injectivity.
Article
In this paper, experimental and numerical techniques have been utilized to quantify heavy oil properties in CO2 huff-n-puff processes under reservoir conditions. Experimentally, fluid properties together with viscosity reduction of heavy oil and interfacial properties between CO2 and heavy oil have been quantified, while five cycles of CO2 huff-n-puff processes have been conducted to determine oil recovery together with component variation of produced and residual oils. Theoretically, numerical simulation has been conducted to analyze the underlying recovery mechanisms associated with the CO2 huff-n-puff processes. CO2 huff-n-puff processes are only effective in the first two cycles under the existing experimental conditions, while the effective sweep range is limited near the wellbore region, resulting in poor oil recovery in the subsequent cycles. As for produced oil, its viscosity, density, resin and asphaltene contents, and molecular weight of asphaltene are reduced, whereas, for the residual oil, they are increased. The asphaltene component in the residual oil shows weak stability compared to that of the produced oil, while the ultimate oil recovery after the fifth CO2 cycle of huff-n-huff processes is measured to be 31.56%.
Article
The application of water flooding is not successful for the development of low permeability reservoirs due to the fine pore sizes and the difficulty of water injection operation. CO2 can dissolve readily in crude oil and highly improve the mobility of crude oil. Which makes CO2 flooding an effective way to the development of the ultralow-permeability reservoirs. The regularities of various CO2 displacement methods were studied via experiments implemented on cores from Chang 8 Formation of Honghe Oilfield. The results show that CO2 miscible displacement has the minimum displacement differential pressure and the maximum oil recovery; CO2-alternating-water miscible flooding has lower oil recovery, higher drive pressure, and relatively lower gas-oil ratio; water flooding has the minimum oil recovery and the maximum driving pressure. A large amount of oil still can be produced under a high gas-oil ratio condition through CO2 displacement method. This fact proves that the increase of gas-oil ratio is caused by the production of dissolved CO2 in oil rather than the free gas breakthrough. At the initial stage of CO2 injection, CO2 does not improve the oil recovery immediately. As the injection continues, the oil recovery can be improved rapidly. This phenomenon suggests that when CO2 displacement is performed at high water cut period, the water cut does not decrease immediately and will remain high for a period of time, then a rapid decline of water cut and increase of oil production can be observed.
Article
The current study plays a major role in modeling natural gas viscosity in terms of several operating parameters including pseudo-reduced properties and molecular weight through radial basis function neural network (RBFNN), least-squares support vector machine (LSSVM), and multilayer perceptron neural network (MLPFNN). As it known, an important feature of any comprehensive modeling is the application of a large database for model development. Therefore, more than 3800 gas viscosity data points were used for modeling. For upgrading the efficiency of the abovementioned predictive tools, four optimization algorithms including levenberg-marquardt (LM), coupled simulating annealing (CSA), Bayesian regularization (BR), and scaled conjugate gradient (SCG), were integrated with them to find the optimal models’ parameters during prediction analysis. Consequently, it was understood that among the all suggested tools in this study, the MLP-LM and then MLP-BR are the most accurate models for estimating gas viscosity with root mean square error (RMSE) of 0.001 and 0.002, respectively. Comparison of the MLP-LM and MLP-BR with previously published models in literature demonstrates their higher prediction capability, with less numbers of input parameters (without needing any density data), than the existing literature models. Based on the sensitivity analysis, it is concluded that the molecular weight is the most affecting variable on the viscosity prediction. Finally, the suggested tools in this study can be of great value for effective estimation of gas viscosity in simulating both upstream and downstream natural gas processes.
Article
The ubiquitous natural sedimentary reservoirs and their high permeability have made the CO2 plume geothermal system increasingly attractive. However, the complicated fluid-rock interactions during the geothermal exploitation can cause severe reservoir damage, constraining the excellent heat mining performance of the CO2 and decreasing the possible applications of the CO2 plume geothermal system. In order to analyze and solve this energy issue affecting the geothermal exploitation, in this study, a comprehensive numerical simulation model was established, which can consider formation water evaporation, salt precipitation, CO2-water-rock geochemical reactions, and the changes in reservoir porosity and permeability in the CO2 plume geothermal (CPG) system. Using this model, the geochemical reactions and salt precipitation and their effects on the geothermal exploitation were analyzed, and some measures were proposed to reduce the influence of fluid-rock interactions on the heat mining rate. The simulation results show that the gravity and the negative gas-liquid capillary pressure gradient induced by evaporation can cause the formation water to flow toward the injector. The back flow of the formation water results in salt precipitation accumulation in the injection well region, which can cause severe reservoir damage and consequent reductions to the heat mining rate. The CO2-water-rock geochemical reactions could result in the dissolution of certain minerals and precipitation of others, but its minimal influence on the heat mining rate can be ignored. However, salt precipitation can affect the geochemical reactions by influencing the CO2 flow and distribution, which can reduce the heat mining rate up to 2/5 of the original. Sensitivity studies show that the reservoir condition can affect the salt precipitation and heat mining rate, so a sedimentary reservoir with high temperature, high porosity and permeability, and low salinity should be selected for CPG application, with an appropriately high injection-production pressure difference. The injection of low salinity water before CO2 injection and the combined injection of CO2 and water vapor can be applied to reduce the salt precipitation and increase the heat mining rate in the CPG system.
Article
A power-law mixing rule has been developed to determine apparent diffusion coefficient of a binary gas mixture on the basis of molecular diffusion coefficients for pure gases in heavy oil. Diffusion coefficient of a pure gas under different pressures and different temperatures is predicted on the basis of the Hayduk and Cheng's equation incorporating the principle of corresponding states for one-dimensional gas diffusion in heavy oil such as the diffusion in a pressure-volume-temperature (PVT) cell. Meanwhile, a specific surface area term is added to the generated equation for three-dimensional gas diffusion in heavy oil such as the diffusion in a pendant drop. In this study, the newly developed correlations are used to reproduce the measured diffusion coefficients for pure gases diffusing in three different heavy oils, i.e., two Lloydminster heavy oils and a Cactus Lake heavy oil. Then, such predicted pure gas diffusion coefficients are adjusted based on reduced pressure, reduced temperature, and equilibrium ratio to determine apparent diffusion coefficient for a gas mixture in heavy oil, where the equilibrium ratios for hydrocarbon gases and CO2 are determined by using the equilibrium ratio charts and Standing's equations, respectively. It has been found for various gas mixtures in two different Lloydminster heavy oils that the newly developed empirical mixing rule is able to reproduce the apparent diffusion coefficient for binary gas mixtures in heavy oil with a good accuracy. For the pure gas diffusion in heavy oil, the absolute average relative deviations (AARDs) for diffusion systems with two different Lloydminster heavy oils and a Cactus Lake heavy oil are calculated to be 2.54%, 14.79%, and 6.36%, respectively. Meanwhile, for the binary gas mixture diffusion in heavy oil, the AARDs for diffusion systems with two different Lloydminster heavy oils are found to be 3.56% and 6.86%, respectively.