ArticlePDF Available

Estimation of CO2 Diffusivity in Brine by Use of the Genetic Algorithm and Mixed Kernels-Based Support Vector Machine Model

April 2019
Journal of Energy Resources Technology, Transactions of the ASME 141(4)

April 2019
141(4)

DOI:10.1115/1.4041724

Authors:

Ronghao Cui

King Abdullah University of Science and Technology

Sen Wang

University of Texas at Austin

Zhang Jin

China University of Petroleum - Beijing

Show all 5 authorsHide

Diffusion coefficient of carbon dioxide (CO2), a significant parameter describing the mass transfer process, exerts a profound influence on the safety of CO2 storage in depleted reservoirs, saline aquifers, and marine ecosystems. However, experimental determination of diffusion coefficient in CO2-brine system is time-consuming and complex because the procedure requires sophisticated laboratory equipment and reasonable interpretation methods. To facilitate the acquisition of more accurate values, an intelligent model, termed MKSVM-GA, is developed using a hybrid technique of support vector machine (SVM), mixed kernels (MK), and genetic algorithm (GA). Confirmed by the statistical evaluation indicators, our proposed model exhibits excellent performance with high accuracy and strong robustness in a wide range of temperatures (273–473.15K), pressures (0.1–49.3MPa), and viscosities (0.139–1.950 mPa·s). Our results show that the proposed model is more applicable than the artificial neural network (ANN) model at this sample size, which is superior to four commonly-used traditional empirical correlations. The technique presented in this study can provide a fast and precise prediction of CO2 diffusivity in brine at reservoir conditions for the engineering design and the technical risk assessment during the process of CO2 injection.

Schematic showing the fundamentals of SVM. The blue and green square dots represent support vectors. The red circular dots are normal points. The full line denotes the hyperplane and the ε-tube is the region between two dotted lines.

…

Empirical correlations for the diffusion coefficients of CO 2 in water

…

Function curves of the polynomial kernel (a), the RBF kernel (b), and the mixed kernel (c). x 5 0.2 is the test point in three types of kernels. As an example of mixed kernel functions, d 5 1 and c 5 10 are set.

…

. Basic parameters used in the MKSVM-GA model Parameters Value

…

. Statistical evaluation results of the MKSVM-GA model

…

Figures - uploaded by Ronghao Cui

Content may be subject to copyright.

Content uploaded by Ronghao Cui

Content may be subject to copyright.

Qihong Feng

School of Petroleum Engineering,

China University of Petroleum (East China),

Qingdao 266580, Shandong, China

Ronghao Cui

School of Petroleum Engineering,

China University of Petroleum (East China),

Qingdao 266580, Shandong, China

e-mail: ronghao.cui1993@gmail.com

Sen Wang

School of Petroleum Engineering,

China University of Petroleum (East China),

Qingdao 266580, Shandong, China

e-mail: fwforest@gmail.com

Jin Zhang

School of Petroleum Engineering,

China University of Petroleum (East China),

Qingdao 266580, Shandong, China

Zhe Jiang

School of Petroleum Engineering,

China University of Petroleum (East China),

Qingdao 266580, Shandong, China

Estimation of CO

Diffusivity

in Brine by Use of the Genetic

Algorithm and Mixed

Kernels-Based Support

Vector Machine Model

Diffusion coefﬁcient of carbon dioxide (CO

), a signiﬁcant parameter describing the

mass transfer process, exerts a profound inﬂuence on the safety of CO

storage in

depleted reservoirs, saline aquifers, and marine ecosystems. However, experimental

determination of diffusion coefﬁcient in CO

-brine system is time-consuming and complex

because the procedure requires sophisticated laboratory equipment and reasonable inter-

pretation methods. To facilitate the acquisition of more accurate values, an intelligent

model, termed MKSVM-GA, is developed using a hybrid technique of support vector

machine (SVM), mixed kernels (MK), and genetic algorithm (GA). Conﬁrmed by the sta-

tistical evaluation indicators, our proposed model exhibits excellent performance with

high accuracy and strong robustness in a wide range of temperatures (273–473.15 K),

pressures (0.1–49.3 MPa), and viscosities (0.139–1.950 mPas). Our results show that

the proposed model is more applicable than the artiﬁcial neural network (ANN) model at

this sample size, which is superior to four commonly used traditional empirical correla-

tions. The technique presented in this study can provide a fast and precise prediction of

diffusivity in brine at reservoir conditions for the engineering design and the techni-

cal risk assessment during the process of CO

injection. [DOI: 10.1115/1.4041724]

Keywords: carbon dioxide, diffusion coefﬁcient, support vector machine, multivariate

regression, artiﬁcial neural network

1 Introduction

Due to global energy crisis and environmental pollution, recent

years have witnessed the accelerating exploitation and progressive

utilization of new and clean energy. In this ﬁeld, carbon dioxide

(CO

) exhibits extensive and valuable applications. Replacement

of methane hydrate with CO

is recognized as one of the most

promising approaches to recovering methane from its hydrates,

avoiding damage to the seabed environment [1,2]. It is also found

that using CO

as the heat transmission ﬂuid has a stronger

capacity of mining heat from hot fractured rock than water [3–5].

As an ancillary beneﬁt, injecting CO

into marine ecosystems and

geothermal reservoirs can reduce the CO

emission into the

atmosphere and be conducive for the carbon storage and seques-

tration [6–12]. CO

, regardless of its physical states when

injected, always comes into contact with water and dissolves into

brine through molecular diffusion. Diffusion coefﬁcient, which is

commonly employed to characterize ﬂuid diffusivity, plays an

essential role in the storage safety in the geologic strata and the

sea bed [13,14], because it can dominate the rates of interfacial

mass transfer and heterogeneous chemical reactions related to

saline solution and porous media [15,16].

Considerable investigations have been conducted on the labora-

tory measurement of CO

diffusion coefﬁcients [16–51]. Among

these studies, commonly used experimental methods can be clas-

siﬁed as two types, i.e., the direct methods and the indirect meth-

ods [40]. The direct methods, such as the Taylor–Aris dispersion

method, start with precise analysis of the concentration of gas in

solvent [21,41], while indirect methods need to measure the

variance of a certain property that is related to gas diffusivity

[22,51]. Such a property includes liquid volume and shape

[42,45], gas volume and pressure [49], and interfacial tension

[51]. However, at elevated pressures, the convection can strongly

enhance the mass transfer of CO

into water, leading to an inaccu-

rate estimation of CO

diffusion coefﬁcients [14,43]. Because of

the difﬁculties in overcoming the effects of convection and con-

ducting in situ measurements of gas concentrations at high pres-

sures, only a few studies reported the experimental results at

elevated pressures (up to 49.3 MPa) [19,40,41]. Also, note that

experimentally determining the diffusion coefﬁcients of CO

through direct and indirect methods is both time-consuming and

complex, because it requires sophisticated laboratory equipment

and reasonable interpretation procedures. For ease of application,

a few empirical correlations (Table 1) veriﬁed by experimental

data have been developed [16,40,52–54].

Pressure and temperature, as the main operating conditions dur-

ing the mass transfer process, can exert prominent impacts on the

molecular diffusion of CO

. Therefore, the diffusivity of CO

under the sedimentary condition, i.e., high pressure and high tem-

perature, is signiﬁcantly different from that under normal condi-

tion. Moreover, in contrast to the pure water, groundwater in a

typical reservoir is brine with high salinity, which inevitably inﬂu-

ences CO

diffusion. Because different salinity results in various

viscosities of brine [55], in this work, we account for the depend-

ence of CO

diffusivity on the groundwater salinity by including

the solvent viscosity in our model.

In 1955, Wilke and Chang [53] put forward a general model for

diffusion coefﬁcients in various dilute solutions and offered an

improved diffusion-factor chart for simpliﬁcation. In their model,

the CO

diffusivity varies as a function of temperatures and vis-

cosity. Then, Lu et al. [40] ignored the effect of pressure and pro-

posed an equation for CO

diffusion in pure water (268 T

473 K). Using molecular dynamics simulations, Moultos et al.

Corresponding authors.

Contributed by the Petroleum Division of ASME for publication in the JOURNAL

OF ENERGY RESOURCES TECHNOLOGY. Manuscript received July 13, 2018; ﬁnal

manuscript received October 6, 2018; published online November 19, 2018. Assoc .

Editor: Daoyong (Tony) Yang.

Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-1Copyright V

C2019 by ASME

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

[54] extended Lu’s expression and developed a phenomenological

correlation to estimate the diffusion coefﬁcient in CO

–H

O mix-

tures at ultrahigh temperatures and pressures. Because the viscos-

ity of saline solution should be taken into account to precisely

determine the CO

diffusivity in brine, the correlations developed

by Lu et al. and Moultos et al., although exhibit remarkably accu-

rate results for the solvent of pure water, cannot be directly uti-

lized for the CO

-brine system. On the basis of the experimental

results for CO

diffusion coefﬁcients D

CO2

versus brine viscosity

at T¼298 K, Cadogan et al. [16] reported a modiﬁed

Stokes–Einstein relation; however, the validity of this correlation

at higher temperatures has not been substantiated.

In order to overcome aforementioned limitations, accurate pre-

diction of CO

diffusion coefﬁcients in brine is a key issue for

urgent solution. With the extensive application of artiﬁcial intelli-

gence, machine learning exhibits excellent performance in diverse

engineering domains and provides better solutions for uncertain

problems [56–61]. Here, coupling support vector machine (SVM),

mixed kernels (MK), and genetic algorithm (GA), we present a

hybrid intelligent model (termed MKSVM-GA model) to predict

the molecular diffusion of CO

in brine. As a machine learning

technique derived from statistics, SVM emanates an overwhelm-

ing superiority in solving problems concerning small-scale sam-

ple, nonlinearity, and high-dimensional pattern recognition

[62,63]. The comparison with laboratory data obtained from pre-

vious literature manifests that the proposed MKSVM-GA model,

representing a large range of temperatures (273–473.15 K) and

pressures (0.1–49.3 MPa), is proven to be reliable and high-

precise (R

>0.99). Our model can not only furnish great signiﬁ-

cance to the engineering design and the technical risk assessment

during CO

injection, but also shed light on the further research

involved with mass transfer process in CO

-brine system.

2 Methodology

2.1 Data Collection. From previous literatures, we collected

92 reliable experimental data points of CO

diffusion in brine

[16,19–21,24,27–30,32,33,35,38–41,51]. The data set is summar-

ized in the supplementary material, which is available under the

“Supplemental Materials” tab for this paper on the ASME Digital

Collection. We consider temperature, pressure, and viscosity as

the main correlation parameters inﬂuencing CO

diffusion in brine

and obtain the viscosity of pure water from National Institute of

Standards and Technology’s (NIST) Chemistry Webbook [64]. As

shown in the set, the gathered data for temperature ranges from

273 to 473.15 K, pressure from 0.1 to 49.3 MPa, viscosity from

0.139 to 1.950 mPas, and diffusion coefﬁcient from 3.1 10

10

to 16.1 10

9

/s.

2.2 Fundamentals of Algorithm

2.2.1 Support Vector Machine. By using SVM, the potential

rules can be concluded from sampling data and then utilized to

predict unknown parameters [65–68]. Particularly, in the small-

sample cases, SVM shows excellent generalization properties

[69–72]. In SVM, the training samples are given as {(x

), (x

), …, (x

)}, where x

,…, x

denotes the space of input

variables, i.e., pressure, temperature, and viscosity in this work;

,…,y

are the experimental diffusion coefﬁcients of CO

brine (outputs). The objective of SVM is to ﬁnd an optimal hyper-

plane that can minimize the structural risk using the SVM regres-

sion function. In other words, it aims to strike a great balance

between the learning capability and the model complexity. The

regression function can be designed as follows:

fðxÞ¼wTwðxÞþb(1)

where wand bare the weight vector and the bias, respectively;

w(x) is the map function, which transforms the n-dimensional

input vector into a high-dimensional feature vector in feature

space to make the nonlinear issues turn into linear regression

problem. In SVM proposed by Vapnik [65,68], all pairs (x

) are

supposed to satisfy the following requirement:

jyifðxiÞj  e8ðxi;yiÞ2Re>0(2)

where eis the given error and Ris the set of inputs. In the preci-

sion of e, searching for the regression function f(x) as ﬂat as possi-

ble is equivalent to minimizing jjwjj

. However, some samples out

of the margin may exert a serious impact on the ﬁtting results.

Therefore, the slack variables need to be introduced to address the

problem, which enable some unrealistic samples to deviate from

the optimal hyperplane in a certain extent. We reach the mathe-

matical model stated by Vapnik [65,68]

min 1

2jjwjj2þcX

i¼1

niþf



subject to yiwTwxi

ðÞ

beþni

wTwxi

ðÞ

þbyieþf

ni;f

i0

(3)

where crepresents the penalty coefﬁcient, which controls the

tradeoff between the ﬂatness of the function and the minimum

cumulative value of deviations; n

and n

*are the slack variables

that depend on the e-insensitive loss function expressed as below:

jnj¼ 0ifjyifðxiÞj <e

jyifðxiÞj  eotherwise

(4)

In Fig. 1, we schematically illustrate this situation. The samples

out or on the edge of the e-tube are the support vectors. The loss

function of circular dots inside the tube is zero while the square

Table 1 Empirical correlations for the diffusion coefﬁcients of CO

in water

Author Reference Year Correlation Solvents

Othmer and Thakar

[52] 1953 DCO2¼14 109

l1:1V0:6

Brine

Wilke and Chang

[53] 1955 DCO2¼7:4108Tﬃﬃﬃﬃﬃﬃﬃﬃ

lV0:6

Brine

Lu et al.

[40] 2013 DCO2¼D0½T=Ts1mPure water

Cadogan et al.

[16] 2014 DCO2¼kBT=ðnSEplaÞBrine

Moultos et al.

[54] 2016 DCO2¼D0ðPÞ½T=Ts1mðPÞPure water

CO2

is the diffusion coefﬁcient of CO

/s; lis the solvent viscosity, mPas; V

is the molar volume of the diffusing substances, cm

/gmol.

Tis the temperature, K; /is the association parameter; Mis the molecular weight of solvent.

¼13.942 10

9

/s; T

¼227.0 K; m¼1.7094.

¼1.38065 10

23

J/K; n

is the Stokes–Einstein number; ais the hydrodynamic radius of the solute, pm; a¼a

298

[1 þa(T298)], where

298

¼168 pm, a¼2.0 10

3

¼a

ln(P)þa

,m¼b

ln(P)þb

, where a

¼2.3097 10

9

¼2.1064 10

8

¼0.17812, and b

¼2.59406; Pis the pressure.

041001-2 / Vol. 141, APRIL 2019 Transactions of the ASME

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

dots outside the e-tube are penalized. The penalty coefﬁcient c

ensures the strength of the punishment. It is worth mentioning that

because cis given artiﬁcially, ﬁnding the optimal cis a requisite

for the excellent performance of SVM. We will discuss the choice

of cin Sec. 2.3.

It has been conﬁrmed that obtaining the solution of convex

optimization problem in Eq. (3) through its dual formulation is

much easier [73]. The corresponding dual optimization model for

Eq. (3) is [65,74]

max eX

i¼1

aiþa



þX

i¼1

yiaia



1

i¼1;j¼1

aia



aja



wxi

ðÞ

Twxj

ðÞ

subject to X

i¼1

aia



¼0

0aic;0a

ic

(5)

where a

and a

are Lagrange multipliers.

2.2.2 Mixed Kernels. The value of w(x

)

w(x) can be calcu-

lated through the kernel function, i.e., K(x

,x)¼w(x

)

w(x). Then

simple computation in low dimensional space replaces the com-

plex computation in high dimensional space. Owing to the utiliza-

tion of kernel function, SVM avoids the curse of dimensionality

[67,75]. In fact, the kernel function is divided into two types, i.e.,

global kernels and local kernels. The polynomial kernel in Eq. (6)

and the radial basis function (RBF) kernel in Eq. (7) are typical

examples of global and local kernels, respectively [76]. Figure 2

depicts the difference between them. As shown in Fig. 2(a), data

points far from each other are able to affect kernel values effec-

tively in global kernels, while local kernels in Fig. 2(b)only allow

data points close to each other to exert an impact on kernel values

Kpolyðxi;xÞ¼ðhxi;xiþ1Þd(6)

KRBFðxi;xÞ¼expðcjjxixjj2Þ(7)

where dis the polynomial degree and cis the kernel parameter.

Any function which satisﬁes Mercer’s conditions can be desig-

nated as the kernel function, including mixtures of global and

local kernels. It was found that the mixed kernel would combine

the advantages of global and local kernels, and strike a great

balance between the interpolation and extrapolation capability of

SVM [77]. The mixed kernel is deﬁned as below:

Kmixðxi;xjÞ¼mKpoly ðxi;xjÞþð1mÞKRBFðxi;xjÞ(8)

where mis the fraction coefﬁcient. As shown in Fig. 2(c),

mixed kernels not only receive strong response around the test

point but also guarantee that the value of response far from the

test point wouldn’t be attenuated rapidly. However, we can see

from Fig. 2that all of these free coefﬁcients, i.e., d,c, and m

inﬂuence the performance of SVM. In Sec. 2.3, we will ﬁnd

their optimal values.

Fig. 1 Schematic showing the fundamentals of SVM. The

square dots represent support vectors. The circular dots are

normal points. The full line denotes the hyperplane and the e-

tube is the region between two dotted lines.

Fig. 2 Function curves of the polynomial kernel (a), the RBF

kernel (b), and the mixed kernel (c). x50.2 is the test point in

three types of kernels. As an example of mixed kernel func-

tions, d51 and c510 are set.

Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-3

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

2.2.3 Genetic Algorithm. We use GA to obtain the optimal

values of c,d,c, and m. Introduced by imitating the evolutionary

principle of biological population, GA has been one of the most

popular global optimization methods [78–80]. Generally, GA

starts with an original population in which the optimal solution

candidates may exist. The population is composed of a certain

number of individuals with gene encoding. According to the prin-

ciple of survival of the ﬁttest, approximate optimal solutions can

be found during evolution. Depending on the evolutionary goals,

the values of ﬁtness function are calculated. Three genetic opera-

tors, including selection, crossover, and mutation, are performed

to create a new generation in each iteration. Particularly, it is

important for GA to set appropriate initial parameters such as pop-

ulation size, genetic algebra, crossover rate, and mutation rate

[81,82].

2.3 Model Development of Genetic Algorithm and Mixed

Kernels -Based Support Vector Machine. We present the com-

bination of GA and MK based SVM in this section. Based on the

principle that both of training set and testing set cover ranges as

large as possible with no overlap, we divided the whole database

into two groups. The ﬁrst group is the training set composed of 72

data points for the development of our model. The second one is

the testing set (20 data points) employed to verify our model. We

summarize the information of the two groups in Table 2.

In order to accelerate the convergence and improve the model

accuracy in the training process, both of inputs and outputs are

normalized to a range of [1, þ1]

Xn¼2XXmin

ðÞ

Xmax Xmin 1(9)

where X

is the normalized value; Xis the actual value; X

min

and

max

are the minimum and maximum values of the parameter Xin

the whole data set.

After data preprocessing, as shown in Fig. 3, the hybrid model

of GA and MK based SVM can be established through the follow-

ing steps:

(1) Design a basic SVM model that is used in the regression

analysis.

(2) Mix the polynomial kernel and the RBF kernel using a frac-

tion coefﬁcient m.

(3) Initialize the operating parameters of GA including popula-

tion size, evolutionary generations, selection rate, crossover

rate, and mutation rate.

(4) Generate initial values of c,d,c, and mrandomly in a cer-

tain scope we preset.

(5) With given c,d,c, and m, the ﬁtness value for training set

is calculated. If the stopping criterion is met, stop the opti-

mization process and output c,d,c, and m. Otherwise, run a

GA and optimize the values of c,d,c, and m.

(6) Repeat step 5 with new values of c,d,c, and muntil the

stopping criterion can be satisﬁed. The optimal c,d,c, and

mcan be output.

(7) With the optimal c,d,c, and m, the training set is used to

train SVM. Finally, diffusion coefﬁcients of CO

can be

estimated by a trained SVM.

To determine the optimal c,d,c, and m, mean square error

(MSE), as the ﬁtness value, is calculated as below:

MSE ¼1

i¼1

Dexp

iDpre



2(10)

where D

exp

and D

pre

are, respectively, the diffusion coefﬁcients

estimated from experiments and our proposed model, m

/s. When

the ﬁtness value reaches its minimum value, the optimal c,d,c,

and mis determined. The stochastic property of GA makes it pos-

sible to globally search for multiple optimal solutions. But its opti-

mal solution aggregate is not ﬁxed every time. In order to obtain

the most appropriate parameter conﬁguration for diffusivity pre-

diction, we run the MKSVM-GA model ten times in different

population size and evolutionary generations, as shown in Fig. 4.

When population number and generation number are set as 200,

uptime (about 30 min for each run) is sacriﬁced for minimum

average and error margin of MSE. Therefore, in this work, for the

balanced performance between model precision and computa-

tional efﬁciency, we choose 50 as population number and genera-

tion number. Besides, the rates of selection, crossover, and

mutation are adjusted in a small scope, referred to values set in

the literature [71]. Table 3summarizes parameter conﬁguration of

the MKSVM-GA model.

2.4 Model Evaluation

2.4.1 Quantitative Evaluation. To examine the capability of

our proposed model, we utilized various frequently used statistical

Table 2 Information of the training set and the testing set

Group Temperature (K) Pressure (MPa) Viscosity (mPas) Diffusion coefficient (10

9

/s) Number of data points

Training set 273.15–473.15 0.1–49.3 0.139–1.950 0.31–16.10 72

Testing set 273.00–423.00 0.1–48.6 0.186–1.807 0.67–12.33 20

Fig. 3 Computational procedure of the MKSVM-GA technique

041001-4 / Vol. 141, APRIL 2019 Transactions of the ASME

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

evaluation indicators to measure its accuracy, i.e., mean absolute

error (MAE), mean absolute relative error (MARE), root-mean-

squared error (RMSE), and coefﬁcient of determination (R

These indicators are deﬁned as follows:

MAE ¼1

i¼1

yexp

iypre

i

(11)

MARE ¼100% 1

i¼1

yexp

iypre

yexp

i

(12)

RMSE ¼ﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ

i¼1

yexp

iypre



s(13)

R2¼1X

i¼1

yexp

iypre



2.X

i¼1ðyexp

iyave

exp Þ2(14)

where y

exp

and y

pre

are, respectively, the experimentally deter-

mined value and prediction, m

/s; y

exp

ave

is the average value of

experimental results, m

/s.

In addition to evaluation indicators and scatter plots that are

visually intuitive methods measuring the prediction performances

of the regression model, there exists a more quantitative

method—hypothesis test. Two widely used hypothesis tests, T-test

and F-test, are conducted to further assess the robustness of our

model [83]. In T-test, null hypothesis that the data belongs to nor-

mal distributions can be determined to accept or not, while F-test

(also known as the homoscedasticity test) estimates the null

hypothesis that the variances of two samples from normal distri-

butions are identical. If the return value of the test is zero, the null

hypotheses cannot be rejected at 5% signiﬁcant level.

2.4.2 Outlier Diagnostics. We also applied outlier diagnostics

to ensure the results had a goodness of ﬁtting. This regression

analysis approach is commonly employed to identify single datum

or a series of data which probably differ from the vast majority of

the data points in the whole dataset [83–85].

We conducted Leverage approach in the Williams plot to detect

outliers as well as doubtful data. The essential components in

Williams plot, including the standardized residual (SR), the hat

values (H), and the warning leverage (H*), need to be ﬁrst calcu-

lated. Interested readers could ﬁnd more details of the computa-

tional procedure for this method from Refs. [85] and [86]. As

shown in Fig. 5, the data points falling into the square region of

3<SR <3 and 0 <H<H* are valid, while those points lying

out of the range will be identiﬁed as outliers [85]. Typically, there

are two kinds of outliers: good high leverage (GHL) points and

bad high leverage (BHL) points. The points located in the ranges

of 3SR 3 and HH* are deﬁned as GHL points that can-

not be predicted by the model. BHL points, which drop into the

domain of SR <3orSR>3 (no matter what H* value is) prob-

ably result from experimental errors [85,86].

3 Results and Discussion

3.1 Performance of the Proposed Model. Using the hybrid

technique of SVM, MK, and GA, the parameters c,d,c, and m

were determined within a wide range (0 c200, 0 d10, 0

c100, 0 m1). The optimal values of c,d,c, and mare

38.4115, 88.9051, 4, and 0.3430, respectively. Then, CO

diffu-

sion coefﬁcients in brine are predicted.

The cross plot exhibited in Fig. 6(a)compares the estimations

of our model and laboratory results. Evidently, all of the points in

training and testing datasets closely scatter around the bisector of

the ﬁrst quadrant, i.e., the 45 deg line, which manifests a good

agreement between our predictions and the experimental data

from previous literatures. In the light of the statistical evaluation

results presented in Table 4, MAE, MARE, RMSE, and R

for the

total data set are 1.311 10

10

/s, 7.91%, 1.954 10

10

/s,

and 0.9960, respectively. Therefore, it is concluded that our pre-

diction model is of high accuracy and strong generalization. In

Table 3 Basic parameters used in the MKSVM-GA model

Parameters Value

Population size 50

Evolutionary generations 50

Selection rate 1.0

Crossover rate 0.6

Mutation rate 0.03

Fig. 5 Illustration of the Williams plot

Fig. 4 Mean square error and average uptime of the MKSVM-GA model in different

population size and evolutionary generations

Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-5

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

order to highlight the performance of our model in a visually bet-

ter way, we exhibit the measured and predicted diffusion coefﬁ-

cients of CO

versus the index of data points in Fig. 6(b). The

comparison indicates that the predictions estimated from our

model are remarkably consistent with the experimental results.

Figure 7shows that the residuals follow the normal distribution

with a mean value of 4.115 10

11

/s. The return values of

T-test and F-test are both zero, which indicates that both of the

null hypotheses are accepted. Therefore, the statistical signiﬁ-

cance of our prediction is substantiated.

In Fig. 8, Williams plot is presented to detect the outliers. Here,

the warning leverage evaluated for the whole dataset is 0.130.

Given that the majority of the predicted points lie in the square

region of 3SR 3 and 0 H0.130, the MKSVM-GA

model, establishing a credible internal relation between these vari-

ables and diffusion coefﬁcients of CO

in brine, is proven to be

statistically effective. There is no GHL point and we list the three

detected BHL points in Table 5. The data point measured at 373

K, 20 MPa reported by Lu et al. [40] does not coincide with its

variation trend with the pressure reported by Cadogan et al. [41].

Therefore, the experimental value at this condition may be errone-

ous. The validity of the other outliers cannot be determined

because the diffusion coefﬁcients of CO

at these conditions were

not reported by any other researchers. As reported by Lu et al.,

overestimation of diffusion coefﬁcients at temperatures higher

than 353 K is attributed to the scarcity of data points in high tem-

peratures [40]. With more sufﬁcient data points at high tempera-

tures, some unreasonable data could be removed.

3.2 Comparison With Other Models

3.2.1 Artificial Neural Network Model. In the ﬁeld of

machine learning, artiﬁcial neural network (ANN) is another prev-

alent technique. This method can deal with complicated functional

relationships; hence, it is widely used to predict many important

chemical parameters [58,59]. Typically, an artiﬁcial neural net-

work is composed of a bunch of nodes called neurons and the con-

nection constitution. Each neuron in ANN is featured by three

elements including weight, bias, and activation function. The

weight parameter measures the connecting strength for certain

input. The bias is a nonzero constant representing the type of con-

nection weight added to the summation of inputs and correspond-

ing weights. The activation function is used to introduce a

nonlinear relation into a multilayer perception neural network

[87–89].

To examine the capability of our model, we built a typical

three-layered feed forward ANN model to forecast diffusion coef-

ﬁcients of CO

in brine and compared the results with the

Fig. 6 Prediction performance of the proposed MKSVM-GA model: (a) cross plot

of the estimated results versus experimental diffusion coefﬁcients of CO

in brine;

(b) comparison of each experimental and predicted data points

Table 4 Statistical evaluation results of the MKSVM-GA model

Evaluation matrices Training set Testing set Total set

MAE (10

9

/s) 0.1112 0.2028 0.1311

MARE (%) 7.17 10.55 7.91

RMSE (10

9

/s) 0.1527 0.3028 0.1954

(fraction) 0.9975 0.9910 0.9960

Fig. 7 Distribution of the residuals between predicted and

experimental diffusion coefﬁcients of CO

. The columns are

instances and the solid curve is the normal distribution curve.

Std Dev represents the standard deviation of residuals.

Fig. 8 Williams plot used for the detection of outliers

041001-6 / Vol. 141, APRIL 2019 Transactions of the ASME

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

predictions of our MKSVM-GA model. Figure 9illustrates the

structure of this ANN model including an input layer, a hidden

layer, and an output layer. The input layer consists of three neu-

rons denoting attributes of these samples i.e., pressure, tempera-

ture, and viscosity. Diffusion coefﬁcients of CO

in brine are

obtained from the neuron of the output layer. We determined the

number of neurons in the hidden layer through the means of trial

and error. In this work, the number of neurons was raised from 4

to 20 at intervals of two neurons. At each number of neurons, the

model was tested ten times and its MSE was traced. Aimed at the

minimum MSE for the training set, we found the optimal number

of neurons for this ANN model. The minimum MSE curve with

variation of neuron number is shown in Fig. 10. The minimal one

is situated at the neuron number of 8. Therefore, the optimal ANN

architecture is determined as 3-8-1.

The MKSVM-GA model and the ANN model were both run

ten times. Figure 11 depicts performance comparison between the

MKSVM-GA model and the ANN model in the training stage.

The convergence scope of the MKSVM-GA model, shaded with

blue, is distinctly lower and narrower than results of the ANN

training model. This phenomenon is attributed to small-scale sam-

ples. ANN is based on traditional statistics, a gradual theory that

sample size is tending to be inﬁnite. Thus, when small-scale sam-

ples are manipulated in ANN, it is doubtful to take it for granted

that statistical properties are concluded. In fact, desired perform-

ance is not achieved in ANN in this situation. On the other hand,

the statistical learning theory is fundamental to SVM, which is a

theory on the condition of small sample size. Professionally

speaking, SVM, which embodies the structural risk minimization

principle, can minimize the upper bound of the generalization

error rather than the training error from ANN, which embodies

the empirical risk minimization principle [90–92].

The most optimal result (the largest R

) predicted by ANN is

shown in Fig. 12. The statistical evaluation results of ANN model

and our proposed model are summarized in Table 6. Lower MAE,

MARE, RMSE, and higher R

demonstrate that the MKSVM-GA

model is more precise than the ANN model in both the testing and

the training stages. These evaluation results, as well as the

Table 5 Outliers recognized in the Williams plot

Author Reference Temperature (K) Pressure (MPa) Viscosity (mPas) Diffusion coefficient (10

9

/s) Solvent Outlier type

Maharajh et al. [19] 0.1 273.00 1.807 1.00 Pure water BHL

Lu et al. [40] 20.0 373.15 0.287 6.43 Pure water BHL

Lu et al. [40] 20.0 393.15 0.237 8.13 Pure water BHL

Fig. 9 Illustration of the structure of a typical three-layered

feed forward ANN model

Fig. 10 The minimum MSE with variation of number of neurons

in the hidden layer

Fig. 11 Comparison of the training performance between the

MKSVM-GA model and the ANN model

Fig. 12 Comparison of the prediction performance between

the MKSVM-GA model and the ANN model

Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-7

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

absolute error curve shown in Fig. 13, substantiate the fact that

MKSVM-GA performs better than ANN on the condition of small

sample size. However, due to solution difﬁculty of high dimen-

sional matrices in convex quadratic programming, it is a technical

conundrum for SVM to be implemented with huge sample size,

while ANN could handle it with ease. Even so, there is no clear

dividing boundary on sample size between SVM and ANN. In

fact, most of researchers prefer to investigate the adaptability of

SVM or ANN through trial and error methods [87,93].

3.2.2 Empirical Correlations. In this section, we compared

our model with empirical correlations to examine its validity. As

mentioned above, there are ﬁve correlations that can predict the

diffusion coefﬁcient in CO

-brine system, i.e., Othmer–Thakar’s

model [52], Wilke–Chang’s model [53], Lu’s model [40], Cado-

gan’s model [16], and Moultos’ model [54]. However, Moultos’

correlation cannot be used to conduct this comparison because

this formula, modiﬁed from Lu’s model, is available only in ultra-

high temperature (323.15 T1023.15 K) and pressures (200 

P1000 MPa), which are beyond the scope of our training set.

Table 6 Comparison of the statistical evaluation results between the MKSVM-GA model and the ANN model

Training set Testing set Total set

Evaluation matrices MKSVM-GA ANN MKSVM-GA ANN MKSVM-GA ANN

MAE (10

9

/s) 0.1112 0.1490 0.2028 0.3172 0.1311 0.1855

MARE (%) 7.17 11.97 10.55 14.80 7.91 12.59

RMSE (10

9

/s) 0.1527 0.2275 0.3028 0.4608 0.1954 0.2944

(fraction) 0.9975 0.9944 0.9910 0.9785 0.9960 0.9908

Fig. 13 Absolute error curves of the MKSVM-GA model, the

ANN model, and four empirical correlations

Fig. 14 Comparison of the prediction performances between the MKSVM-GA model and four

empirical correlations: (a) Othmer and Thakar; (b) Wilke and Chang; (c) Lu et al.; (d) Cadogan

et al. Squares and triangles represent the values predicted using the MKSVM-GA model and

empirical correlations, respectively.

041001-8 / Vol. 141, APRIL 2019 Transactions of the ASME

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

Figure 14 compares the performance of each empirical correla-

tion with our proposed model. Unlike the data points from the

MKSVM-GA model, which distribute uniformly around the

45 deg line, the predictions calculated by using the ﬁrst three

correlations—Othmer’s model, Wilke–Chang’s model, and Lu’s

model, deviated up or down obviously, particularly for higher dif-

fusion coefﬁcients. Meanwhile, the results estimated from Cado-

gan’s model are more dispersed along the 45 deg line than our

MKSVM-GA model. Therefore, the four empirical correlations

show lower accuracy than our proposed model. In addition, the

evaluation results shown in Table 7and the absolute error curves

depicted in Fig. 13 conﬁrm that all of the four empirical correla-

tions have weaker prediction capability. MAE, MARE, and

RMSE of empirical correlations approximate to twice or more as

much as that of the MKSVM-GA model. In comparison to the

high ﬁtting degree of intelligent algorithms, none of the empirical

correlations has a greater coefﬁcient of determination (R

) than

0.99, even though the performance of Cadogan’s model is far bet-

ter than the other formulas.

Because the models proposed by Othmer and Thakar as well as

Wilke and Chang are general equations for gas diffusivity in solu-

tions, they only provide the rough guess and estimation for CO

brine. The model of Lu et al., only available in pure water, failed

to account for the effect of solvent viscosity. These empirical cor-

relations including Cadogan’s model are established on the same

assumption that the effect of pressure is negligible. As a matter of

fact, the variation in pressure can lead to ﬂuctuations of CO

dif-

fusion coefﬁcients. The ignorance of inﬂuencing parameters is

indeed the most important reason to explain why the empirical

correlations exhibit poor performances. A considerate composi-

tion of inﬂuencing parameters unleashes the advantages of artiﬁ-

cial intelligent algorithms. With the aid of the excellent

robustness and fault tolerance, our proposed model exhibits an

incomparable superiority to the traditional empirical correlations.

4 Conclusions

We developed a hybrid technique of support vector machine,

mixed kernels, and genetic algorithm to predict CO

molecular

diffusivity in brine. A total of 92 experimental points obtained

from previous literatures are employed to train and test the

MKSVM-GA model. The statistical evaluation indicators of total

data set, i.e., MAE (1.311 10

10

/s), MARE (7.91%), RMSE

(1.954 10

10

/s), and R

(0.9960), demonstrate that our pro-

posed model exhibits an excellent performance with high accu-

racy and strong generalization. To highlight the superiority of our

model, a typical three-layered ANN model and four commonly

used empirical correlations are employed to conduct the compari-

son with the MKSVM-GA model. The results conﬁrm that the

proposed model has signiﬁcantly better prediction capability.

Potentially, our technique can be extended to other engineering

disciplines that demand fast and convenient access to basic physi-

cal parameters.

Funding Data

National Program for Fundamental Research and Develop-

ment of China (973 Program) (Grant No. 2015CB250905).

Program for Changjiang Scholars and Innovative Research

Team in University (Grant No. IRT1294).

National Postdoctoral Program for Innovative Talents (Grant

No. BX201600153).

China Postdoctoral Science Foundation (Grant No.

2016M600571).

Qingdao Postdoctoral Applied Research Project (Grant No.

2016218).

References

[1] Ota, M., Saito, T., Aida, T., Watanabe, M., Sato, Y., Smith, R. L., and Inomata,

H., 2007, “Macro and Microscopic CH

-CO

Replacement in CH

Hydrate

Under Pressurized CO

,” AIChE J.,53(10), pp. 2715–2721.

[2] Bai, D., Zhang, X., Chen, G., and Wang, W., 2012, “Replacement Mechanism

of Methane Hydrate With Carbon Dioxide From Microsecond Mol ecular

Dynamics Simulations,” Energy Environ. Sci.,5(5), pp. 7033–7041.

[3] Pruess, K., 2006, “Enhanced Geothermal Systems (EGS) Usin g CO

as Work-

ing Fluid—A Novel Approach for Generating Renewable Energy With Simulta -

neous Sequestration of Carbon,” Geothermics,35(4), pp. 351–367.

[4] Zhang, L., Cui, G., Zhang, Y., Ren, B., Ren, S., and Wang, X., 2016, “Inﬂuence

of Pore Water on the Heat Mining Performance of Supercritical CO

Injected

for Geothermal Development,” J. CO

Util.,16, pp. 287–300.

[5] Cui, G., Ren, S., Rui, Z., Ezekiel, J., Zhang, L., and Wang, H., 2018, “The Inﬂu-

ence of Complicated Fluid-Rock Interactions on the Geothermal Exploitation in

the CO

Plume Geothermal System,” Appl. Energy,227, pp. 49–63.

[6] Mohamed, I. M., He, J., and Nasr-El-Din, H. A., 2012, “Experimental Analysis

of CO

Injection on Permeability of Vuggy Carbonate Aquifers,” ASME J.

Energy Resour. Technol.,135(1), p. 013301.

[7] Cui, G., Wang, Y., Rui, Z., Chen, B., Ren, S., and Zhang, L., 2018,

“Assessing the Combined Inﬂuence of Fluid-Rock Interactions on Reservoir

Properties and Injectivity During CO

Storage in Saline Aquifers,” Energy,

155, pp. 281–296.

[8] Rau, G. H., and Caldeira, K., 1999, “Enhanced Carbonate Dissolution: A Means

of Sequestering Waste CO

as Ocean Bicarbonate,” Energy Convers. Manage.,

40(17), pp. 1803–1813.

[9] Chen, B., and Reynolds, A. C., 2018, “CO

Water-Alternating-Gas Injection for

Enhanced Oil Recovery: Optimal Well Controls and Half-Cycle Lengths,”

Comput. Chem. Eng.,113, pp. 44–56.

[10] Li, A., Ren, X., Fu, S., Lv, J., Li, X., Liu, Y., and Lu, Y., 2018, “The Experi-

mental Study on the Flooding Regularities of Various CO

Flooding Modes

Implemented on Ultralow Permeability Cores,” ASME J. Energy Resour. Tech-

nol.,140(7), p. 072902.

[11] Ren, B., Zhang, L., Huang, H., Ren, S., Chen, G., and Zhang, H., 2015,

“Performance Evaluation and Mechanisms Study of Near-Miscible CO

Flood-

ing in a Tight Oil Reservoir of Jilin Oilﬁeld China,” J. Nat. Gas Sci. Eng.,27,

pp. 1796–1805.

[12] Li, S., Li, B., Zhang, Q., Li, Z., and Yang, D., 2018, “Effect of CO

on Heavy

Oil Recovery and Physical Properties in Huff-n-Puff Processes Under Reservoir

Conditions,” ASME J. Energy Resour. Technol.,140(7), p. 072907.

[13] Mutoru, J. W., Leahy-Dios, A., and Firoozabadi, A., 2011, “Modeling Inﬁnite

Dilution and Fickian Diffusion Coefﬁcients of Carbon Dioxide in Water,”

AIChE J.,57(6), pp. 1617–1627.

[14] Farajzadeh, R., Zitha, P. L., and Bruining, J., 2009, “Enhanced Mass Transfer

of CO

Into Water: Experiment and Modeling,” Ind. Eng. Chem. Res.,48(13),

pp. 6423–6431.

[15] Trevisan, L., Pini, R., Cihan, A., Birkholzer, J. T., Zhou, Q., and Illangasekare,

T. H., 2014, “Experimental Investigation of Supercritical CO

Trapping Mecha-

nisms at the Intermediate Laboratory Scale in Well-Deﬁned Heterogeneous

Porous Media,” Energy Procedia,63, pp. 5646–5653.

[16] Cadogan, S. P., Hallett, J. P., Maitland, G. C., and Trusler, J. M., 2014,

“Diffusion Coefﬁcients of Carbon Dioxide in Brines Measured Using

Pulsed-Field Gradient Nuclear Magnetic Resonance,” J. Chem. Eng. Data,

60(1), pp. 181–184.

[17] Vivian, J. E., and King, C. J., 1964, “Diffusivities of Slightly Soluble Gases in

Water,” AIChE J.,10(2), pp. 220–221.

[18] Mazarei, A. F., and Sandall, O. C., 1980, “Diffusion Coefﬁcients for Helium,

Hydrogen, and Carbon Diox ide in Water at 25 C,” AIChE J.,26(1), pp.

154–157.

[19] Maharajh, D. M., and Walkley, J., 1973, “The Tempe rature Dependence of the

Diffusion Coefﬁcients of Ar, CO

,CH

Cl, CH

Br, and CHCl

F in Water,”

Can. J. Chem.,51(6), pp. 944–952.

[20] Tamimi, A., Rinker, E. B., and Sandall, O. C., 1994, “Diffusion Coefﬁcients for

Hydrogen Sulﬁde, Carbon Dioxide, and Nitrous Oxide in Water Over the Tem-

perature Range 293-368 K,” J. Chem. Eng. Data,39(2), pp. 330–332.

Table 7 Comparison of the statistical evaluation results between the MKSVM-GA model and correlations for the whole database

Evaluation matrices MKSVM-GA Othmer and Thakar Wilke and Chang Lu et al. Cadogan et al.

MAE (10

9

/s) 0.1311 0.3454 0.3380 0.4036 0.2627

MARE (%) 7.91 12.75 12.60 24.77 13.84

RMSE (10

9

/s) 0.1954 0.5661 0.7311 0.6333 0.3661

(fraction) 0.9960 0.9661 0.9434 0.9575 0.9858

Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-9

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

[21] Frank, M. J., Kuipers, J. A., and van Swaaij, W. P., 1996, “Diffusion Coefﬁ-

cients and Viscosities of CO

þH

O, CO

þCH

OH, NH

þH

O, and

þCH

OH Liquid Mixtures,” J. Chem. Eng. Data,41(2), pp. 297–302.

[22] Ng, W. Y., and Walkley, J., 1969, “Diffusion of Gases in Liquids: The Constant

Size Bubble Method,” Can. J. Chem.,47(6), pp. 1075–1077.

[23] J€

ahne, B., Heinz, G., and Dietrich, W., 1987, “Measurement of the Diffusion

Coefﬁcients of Sparingly Soluble Gases in Water,” J. Geophys. Res.,92(C10),

pp. 10767–10776.

[24] Hirai, S., Okazaki, K., Yazawa, H., Ito, H., Tabe, Y., and Hijikata, K., 1997,

“Measurement of CO

Diffusion Coefﬁcient and Application of LIF in Pressur-

ized Water,” Energy,22(2–3), pp. 363–367.

[25] Guzman, J., and Garrido, L., 2012, “Deter mination of Carbon Dioxide Trans-

port Coefﬁcients in Liquids and Polymers by NMR Spectroscopy,” J. Phys.

Chem. B,116(20), pp. 6050–6058.

[26] Liger-Belair, G., Prost, E., Parmentier, M., Jeandet, P., and Nuzillard, J. M., 2003,

“Diffusion Coefﬁcient of CO

MoleculesasDeterminedby

CNMRinVarious

Carbonated Beverages,” J. Agric. Food Chem.,51(26), pp. 7560–7563.

[27] Maharajh, D., 1973, “Solubility and Diffusion of Gases in Water,” Ph.D. thesis,

Simon Fraser University, Burnaby, BC, Canada.

[28] Versteeg, G. F., and Van Swaalj, W., 1988, “Solubility and Diffusivity of Acid

Gases (Carbon Dioxide, Nitrous Oxide) in Aqueous Alkanolamine Solutions,”

J. Chem. Eng. Data,33(1), pp. 29–34.

[29] Himmelblau, D. M., 1964, “Diffusion of Dissolved Gase s in Liquids,” Chem.

Rev.,64(5), pp. 527–550.

[30] Thomas, W. J., and Adams, M. J., 1965, “Measurement of the Diffusion Coefﬁ-

cients of Carbon Dioxide and Nitrous Oxide in Water and Aqueous Solutions of

Glycerol,” Trans. Faraday Soc.,61, pp. 668–673.

[31] Brignole, E. A., and Echarte, R., 1981, “Mass Transfer in Laminar Liquid Jets:

Measurement of Diffusion Coefﬁcients,” Chem. Eng. Sci.,36(4), pp. 705–711.

[32] Nijsing, R., Hendriksz, R. H., and Krame rs, H., 1959, “Absorption of CO

Jets and Falling Films of Electrolyte Solutions, With and Without Chemical

Reaction,” Chem. Eng. Sci.,10(1–2), pp. 88–104.

[33] Tan, K. K., and Thorpe, R. B., 1992, “Gas Diffusion Into Viscous and Non-

Newtonian Liquids,” Chem. Eng. Sci.,47(13–14), pp. 3565–3572.

[34] Tham, M. J., Bhatia, K. K., and Gubbins, K. F., 1967, “Steady-State Method for

Studying Diffusion of Gases in Liquids,” Chem. Eng. Sci.,22(3), pp. 309–311.

[35] Vivian, J. E., and Peaceman, D. W., 1956, “Liquid-Side Resistance in Gas

Absorption,” AIChE J.,2(4), pp. 437–443.

[36] Pratt, K. C., Slater, D. H., and Wakeha m, W. A., 1973, “A Rapid Method for

the Determination of Diffusion Coefﬁcients of Gases in Liquids,” Chem. Eng.

Sci.,28(10), pp. 1901–1903.

[37] Bodnar, L. H., and Himmelblau, D. M., 1962, “Continuous Measurement of

Diffusion Coefﬁcients of Gases in Liquids Using Glass Scintillators,” Int. J.

Appl. Radiat. Isot.,13(1), pp. 1–6.

[38] Choudhari, R., and Doraiswamy, L. K., 1972, “Physical Properties in Reaction

of Ethylene and Hydrogen Chloride in Liquid Media. Diffusivities and Sol-

ubilities,” J. Chem. Eng. Data,17(4), pp. 428–432.

[39] Reddy, K. A., and Doraiswamy, L. K., 1967, “Estimating Liquid Diffusivity,”

Ind. Eng. Chem. Fundam.,6(1), pp. 77–79.

[40] Lu, W., Guo, H., Chou, I. M., Burruss, R. C., and Li, L., 2013, “Determination

of Diffusion Coefﬁcients of Carbon Dioxide in Water Between 268 and 473 K

in a High-Pressure Capillary Optical Cell With in Situ Raman Spectroscopic

Measurements,” Geochim. Cosmochim. Acta,115, pp. 183–204.

[41] Cadogan, S. P., Maitland, G. C., and Trusler, J. M., 2014, “Diffusion Coefﬁ-

cients of CO

and N

in Water at Temperatures Between 298.15 K and 423.15

K at Pressures Up to 45 MPa,” J. Chem. Eng. Data ,59(2), pp. 519–525.

[42] Jang, H. W., Yang, D., and Li, H., 2018, “A Power-Law Mixing Rule for Pre-

dicting Apparent Diffusion Coefﬁcients of Binary Gas Mixtures in Heavy Oil,”

ASME J. Energy Resour. Technol.,140(5), p. 052904.

[43] Shi, Y., Zheng, S., and Yang, D., 2017, “Determination of Individual Diffusion

Coefﬁcients of Alkane Solvent(s)–CO

–Heavy Oil Systems With Consideration

of Natural Convection Induced by Swelling Effect,” Int. J. Heat Mass Transfer,

107, pp. 572–585.

[44] Zheng, S., and Yang, D., 2017, “Experimental and Theoretical Determination

of Diffusion Coefﬁcients of CO

-Heavy Oil Systems by Coupling Heat and

Mass Transfer,” ASME J. Energy Resour. Technol.,139(2), p. 022901.

[45] Zheng, S., and Yang, D., 2017, “Determination of Individual Diffusion Coefﬁ-

cients of C

/n-C

/CO

/Heavy-Oil Systems at High Pressures and Ele-

vated Temperatures by Dynamic Volume Analysis,” SPE J.,22, pp. 799–816.

[46] Li, H. A., Sun, H., and Yang, D., 2017, “Effective Diffusion Coefﬁcients of Gas

Mixture in Heavy Oil Under Constant-Pressure Conditions,” Heat Mass Trans-

fer,53(5), pp. 1527–1540.

[47] Zheng, S., Sun, H., and Yang, D., 2016, “Coupling Heat and Mass Transfer for

Determining Individual Diffusion Coefﬁcient of a Hot C

–CO

Mixture in

Heavy Oil Under Reservoir Conditions,” Int. J. Heat Mass Transfer,102, pp.

251–263.

[48] Zheng, S., Li, H. A., Sun, H., and Yang, D., 2016, “Determination of Diffusion

Coefﬁcient for Alkane Solvent–CO

Mixtures in Heavy Oil With Consideration

of Swelling Effect,” Ind. Eng. Chem. Res.,55(6), pp. 1533–1549.

[49] Li, H., and Yang, D., 2016, “Determination of Individual Diffusion Coefﬁcients

of Solvent/CO

Mixture in Heavy Oil With Pressure-Decay Method,” SPE J.,

21(1), pp. 131–143.

[50] Sun, H., Li, H., and Yang, D., 2014, “Coupling Heat and Mass Transfer for a

Gas Mixture–Heavy Oil System at High Pressures and Elevated Temperatures,”

Int. J. Heat Mass Transfer,74, pp. 173–184.

[51] Yang, D., Tontiwachwuthikul, P., and Gu, Y., 2006, “Dynamic Interfacial Ten-

sion Method for Measuring Gas Diffusion Coefﬁcient and Interface Mass

Transfer Coefﬁcient in a Liquid,” Ind. Eng. Chem. Res.,45(14), pp.

4999–5008.

[52] Othmer, D. F., and Thakar, M. S., 1953, “Correlating Diffusion Coefﬁcient in

Liquids,” Ind. Eng. Chem.,45(3), pp. 589–593.

[53] Wilke, C. R., and Chang, P., 1955, “Correlation of Diffusion Coefﬁcients in

Dilute Solutions,” AIChE J.,1(2), pp. 264–270.

[54] Moultos, O. A., Tsimpanogiannis, I. N., Panagiotopoulos, A. Z., and Econo-

mou, I. G., 2016, “Self-Diffusion Coefﬁcients of the Binary (H

OþCO

) Mix-

ture at High Temperatures and Pressures,” J. Chem. Thermodyn.,93, pp.

424–429.

[55] Cadogan, S., 2015, “Diffusion of CO

in Fluids Relevant to Carbon Capture,

Utilisation and Storage,” Ph.D. thesis, Imperial College London, London.

[56] Shokrollahi, A., Arabloo, M., Gharagheizi, F., and Mohammadi, A. H., 2013,

“Intelligent Model for Prediction of CO

–Reservoir Oil Minimum Miscibility

Pressure,” Fuel,112, pp. 375–384.

[57] Le Van, S., and Chon, B. H., 2018, “Effective Prediction and Management of a

Flooding Process for Enhancing Oil Recovery Using Artiﬁcial Neural

Networks,” ASME J. Energy Resour. Technol.,140(3), p. 032906.

[58] Zhang, J., Feng, Q., Zhang, X., Zhang, X., Yuan, N., Wen, S., Wang, S., and

Zhang, A., 2015, “The Use of an Artiﬁcial Neural Network to Estimate Natural

Gas/Water Interfacial Tension,” Fuel,157, pp. 28–36.

[59] Khaksar Manshad, A., Rostami, H., Moein Hosseini, S., and Rezaei, H., 2016,

“Application of Artiﬁcial Neural Network–Particle Swarm Optimization Algo-

rithm for Prediction of Gas Condensate Dew Point Pressure and Comparison

With Gaussian Processes Regression–Particle Swarm Optimization Algorithm,”

ASME J. Energy Resour. Technol.,138(3), p. 032903.

[60] Paul, A., Bhowmik, S., Panua, R., and Debroy, D., 2018, “Artiﬁcial Neural

Network-Based Prediction of Performances-Exhaust Emissions of Diesohol

Piloted Dual Fuel Diesel Engine Under Varying Compressed Natural Gas Flow-

rates,” ASME J. Energy Resour. Technol.,140(11), p. 112201.

[61] Chen, B., Harp, D. R., Lin, Y., Keating, E. H., and Pawar, R. J., 2018, “Geologic

Sequestration Monitoring Design: A Machine Learning and Uncertainty

Quantiﬁcation Based Approach,” Appl. Energy,225, pp. 332–345.

[62] Kamari, A., Arabloo, M., Shokrollahi, A., Gharagheizi, F., and Mohammadi, A.

H., 2015, “Rapid Method to Estimate the Minimum Miscibility Pressure (MMP)

in Live Reservoir Oil Systems During CO

Flooding,” Fuel,153, pp. 310–319.

[63] Tatar, A., Barati-Harooni, A., Najaﬁ-Marghmaleki, A., Najaﬁ-Marghmaleki,

A., Mohebbi, A., Ghiasi, M. M., Mohammadi, A. H., and Hajinezhad, A., 2016,

“Comparison of Two Soft Computing Approaches for Predicting CO

Solubil-

ity in Aqueous Solution of Piperazine,” Int. J. Greenhouse Gas Control,53, pp.

85–97.

[64] Linstrom, P. J., and Mallard, W. G. E., 2016, “NIST Chemistry WebBook,”

National Institute of Standards and Technology, Gaithersburg, MD, accessed

Aug. 17, 2016, NIST Standard Reference DatabaseNo. 69, http://webbook.nist.gov

[65] Cortes, C., and Vapnik, V., 1995, “Support-Vector Networks,” Mach. Learn.,

20(3), pp. 273–297.

[66] Vapnik, V., 1995, The Nature of Statistical Learning Theory, Springer, New

York.

[67] Boser, B. E., Guyon, I. M., and Vapnik, V. N., 1992, “A Training Algorithm for

Optimal Margin Classiﬁers,” Fifth Annual Workshop on Computational Learn-

ing Theory, Pittsburgh, PA, July 27–29, pp. 144–152.

[68] Drucker, H., Burges, C. J., Kaufman, L., Smola, A., and Vapnik, V., 1997,

“Support Vector Regression Machines,” Advances in Neural Information Proc-

essing Systems 9, MIT Press, Cambridge, MA, pp. 155–161.

[69] Geng, Y., Chen, J., Fu, R., Bao, G., and Pahlavan, K., 2016, “Enlighten Weara-

ble Physiological Monitoring Systems: On-Body of Characteristics Based

Human Motion Classiﬁcation Using a Support Vector Machine,” IEEE Trans.

Mobile Comput.,15(3), pp. 656–671.

[70] Lee, Y. J., and Mangasarian, O. L., 2001, “SSVM: A Smooth Support Vector

Machine for Classiﬁcation,” Comput. Optim. Appl.,20, pp. 5–22.

[71] Bian, X. Q., Han, B., Du, Z. M., Jaubert, J. N., and Li, M. J., 2016, “Integrating

Support Vector Regression With Genetic Algorithm for CO

-Oil Minimum

Miscibility Pressure (MMP) in Pure and Impure CO

Streams,” Fuel,182, pp.

550–557.

[72] Filgueiras, P. R., Portela, N. A., Silva, S. R., Castro, E. V., Oliveira, L. M.,

Dias, J. C., Neto, A. C., Rom~

ao, W., and Poppi, R. J., 2016, “Determination of

Saturates, Aromatics, and Polars in Crude Oil by

C NMR and Support Vector

Regression With Variable Selection by Genetic Algorithm,” Energy Fuels,

30(3), pp. 1972–1978.

[73] Fiacco, A. V., and McCormick, G. P., 1964, “The Sequential Unconstrained

Minimization Technique for Nonlinear Programing, A Primal-Dual Method,”

Manag. Sci.,10(2), pp. 360–366.

[74] Smola, A. J., and Sch€

olkopf, B., 2004, “A Tutorial on Support Vector

Regression,” Stat. Comput.,14(3), pp. 199–222.

[75] Sch€

olkopf, B., and Burges, C. J., 1999, Advances in Kernel Methods: Support

Vector Learning, MIT Press, Cambridge, MA.

[76] Tehrany, M. S., Pradhan, B., and Jebur, M. N., 2014, “Flood Susceptibility

Mapping Using a Novel Ensemble Weights-of-Evidence and Support Vector

Machine Models in GIS,” J. Hydrol.,512, pp. 332–343.

[77] Smits, G. F., and Jordaan, E. M., 2002, “Improved SVM Regression Using Mix-

tures of Kernels,” International Joint Conference on Neural Networks , Vol. 3,

pp. 2785–2790.

[78] Holland, J. H., 1992, “Genetic Algorithms,” Sci. Am.,267(1), pp. 66–72.

[79] Khadse, A., Blanchette, L., Kapat, J., Vasu, S., Hossain, J., and Donaz zolo, A.,

2018, “Optimization of Supercritical CO

Brayton Cycle for Simple Cycle Gas

Turbines Exhaust Heat Recovery Using Genetic Algorithm,” ASME J. Energy

Resour. Technol.,140(7), p. 071601.

041001-10 / Vol. 141, APRIL 2019 Transactions of the ASME

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

[80] Salmachi, A., Sayyafzadeh, M., and Haghighi, M., 2013, “Inﬁll Well Placement

Optimization in Coal Bed Methane Reservoirs Using Genetic Algorithm,” Fuel,

111, pp. 248–258.

[81] Velez-Langs, O., 2005, “Genetic Algorithms in Oil Industry: An Overview,”

J. Pet. Sci. Eng.,47(1–2), pp. 15–22.

[82] Davis, L., 1991, Hand book of Genetic Algorithms, Van Nostrand Reinhold,

New York.

[83] Chatt erjee, S., and Hadi, A. S., 2015, Regression Analysis by Example, Wiley,

New York.

[84] Rousseeuw, P. J., and Leroy, A. M., 2005, Robust Regression and Outlier

Detection, Wiley, New York.

[85] Mohammadi, A. H., Eslamimanesh, A., Gharagheizi, F., and Richon, D., 2012,

“A Novel Method for Evaluation of Asphaltene Precipitation Titration Data,”

Chem. Eng. Sci.,78, pp. 181–185.

[86] Mohammadi, A. H., Gharagheizi, F., Eslamimanesh, A., and Richon, D., 2012,

“Evaluation of Experimental Data for Wax and Diamondoids Solubility in Gas-

eous Systems,” Chem. Eng. Sci.,81, pp. 1–7.

[87] Feng, Q., Zhang, J., Zhang, X., and Wen, S., 2015, “Proximate Analysis Based

Prediction of Gross Caloriﬁc Value of Coals: A Comparison of Support Vector

Machine, Alternating Conditional Expectation and Artiﬁcial Neural Network,”

Fuel Process. Technol.,129, pp. 120–129.

[88] Togun, N. K., and Baysec, S., 2010, “Prediction of Torque and Speciﬁc Fuel

Consumption of a Gasoline Engine by Using Artiﬁcial Neural Networks,”

Appl. Energy,87(1), pp. 349–355.

[89] Pradhan, B., and Lee, S., 2010, “Landslide Susceptibility Assessment and Fac-

tor Effect Analysis: Backpropagation Artiﬁcial Neural Networks and Their

Comparison With Frequency Ratio and Bivariate Logistic Regression Mod-

elling,” Environ. Modell. Software,25(6), pp. 747–759.

[90] Li, Q., Meng, Q., Cai, J., Yoshino, H., and Mochida, A., 2009, “Predicting

Hourly Cooling Load in the Building: A Comparison of Support Vector

Machine and Different Artiﬁcial Neural Networks,” Energy Convers. Manage.,

50(1), pp. 90–96.

[91] Sorgun, M., Murat Ozbay oglu, A. A., and Evren Ozbayoglu, M. M., 2014,

“Support Vector Regression and Computational Fluid Dynamics Modeling of

Newtonian and Non-Newtonian Fluids in Annulus With Pipe Rotation,” ASME

J. Energy Resour. Technol.,137(3), p. 032901.

[92] Pradhan, B., 2013, “A Comparative Study on the Predictive Ability of the Deci-

sion Tree, Support Vector Machine and Neuro-Fuzzy Models in Landslide Sus-

ceptibility Mapping Using GIS,” Comput. Geosci.,51, pp. 350–365.

[93] Rostami, A., Hemmati-Sarapardeh, A., and Shamshirband, S., 2018, “Rigorous

Prognostication of Natural Gas Viscosity: Smart Modeling and Comparative

Study,” Fuel,222, pp. 766–778.

Journal of Energy Resources Technology APRIL 2019, Vol. 141 / 041001-11

Downloaded From: https://energyresources.asmedigitalcollection.asme.org on 11/21/2018 Terms of Use: http://www.asme.org/about-asme/terms-of-use

Temperature drift compensation of a FOG based on an HKSVM optimized by an improved hybrid BAS-GSA algorithm

Article

Full-text available

Nov 2021

In this paper, the optimal hybrid kernel support vector machine is employed to propose a compensation strategy intended for the temperature drift of a fiber optical gyroscope (FOG). First, the mode of the hybrid kernel with an interpolation and extrapolation capability is constructed, which consists of the radial basis function and the polynomial kernel function. Second, the combination model of the beetle antennae search algorithm and gravitational search algorithm that has both local and global search capability is proposed to optimize the structure-related parameters of a hybrid kernel support vector machine (HKSVM). Finally, the proposed approach is trained and tested using the experimental data of temperature drift at two different rates of temperature change (10°C/min and 5°C/min). In addition, the proposed method is validated against those conventional compensation algorithms. According to the research results, the compensation error (mean squared error) of the proposed approach is reduced by 92% compared to the traditional support vector machine based on the radial basis function.

Prediction of CO2 diffusivity in brine using white-box machine learning

Article

Feb 2020
J PETROL SCI ENG

Accurate knowledge of the diffusivity coefficient of CO2 in brine has a significant effect on the design success and monitoring of CO2 storage in saline aquifers, which is a part of carbon capture and sequestration (CCS). Frequently applied experimental approaches for determining this parameter are expensive and time-consuming, and empirical models cannot ensure accurate predictions. Therefore, there is a need to establish cutting-edge correlations for prediction of the diffusivity coefficient of CO2 in brine under various operating conditions. In this work, two white-box machine learning techniques, namely group method of data handling (GMDH) and gene expression programming (GEP) were implemented for correlating the diffusivity coefficient of CO2 in brine with pressure, temperature and the viscosity of the solvent. The obtained results demonstrated the accuracy of the proposed correlations. In addition, statistical and graphical analysis of the performances revealed that GEP correlation outperforms the GMDH correlation, decision trees (DTs), random forest (RF) and all the previous predictive models. GEP correlation exhibited an overall average absolute relative deviation (AARD) of 4.3014% and coefficient of determination (R²) of 0.9979. Finally, by performing the outliers detection, the validity of the GEP correlation was confirmed and only two experimental data points were identified as outliers.

Carbonated water injection

Chapter

Jan 2023

Carbonated water injection (CWI) is considered as a feasible enhanced oil recovery (EOR) method that its mechanisms, which are responsible for oil recovery, mainly initiate with the partitioning of dissolved carbon dioxide (CO2) from carbonated water to the adjacent oil phase. Alongside the EOR, CWI is considered as a promising method for geological CO2 storage beneath earth layers in depleted oil reservoirs. Countless researches have been performed on in-depth analysis of the role of CWI in improving oil recovery at the core scale. A better understanding of the primary mechanisms leading to additional oil recovery is gained through pore-scale visual investigations. The role of operational parameters such as injection rate, CO2 content, pressure, and temperature, which influence the performance of CWI, are discussed in this chapter. Challenges of CWI application are presented and discussed technically and economically. Finally, some important research findings and gaps in present understanding are highlighted.

A Deep-learned Type-3 Fuzzy System and Its Application in Modeling Problems

Article

Jan 2022

The modeling problem is one of the important topics in engineering applications. In various applications, it is required to find a mathematical model to represent the relationship between output and the associated input variables. In this study, an approach on basis of a new deep learned type-3 (T3) fuzzy logic system (FLS) is introduced. The modeling of CO2 solubility on basis of temperature, molality of NaCl, and pressure is considered as an application. The monitoring of carbon dioxide (CO2) solubility in brine is one of the effective approaches in carbon capture and sequestration technique to reduce it in the atmosphere. A new hybrid learning method is presented to optimize the suggested model. The new adaptation laws are carry-out to tune the rule parameters and centers of membership functions (MFs). The values of horizontal slices and α-cuts are learned by the unscented Kalman filter (UKF). By the real-world experimental data sets, several statistical examinations, and comparison with conventional well-known fuzzy neural networks (NNs) and learning methods, the reliability and good performance of the suggested method are demonstrated. Also, the sensitivity of the input variables is analyzed by the use of the Sobol approach. © 2022, Budapest Tech Polytechnical Institution. All rights reserved.

Machine learning-based seawater concentration pathway prediction

Article

Sep 2021
COMPUT ELECTR ENG

This paper constructs a prediction model based on Multilayer Perceptron (MLP) to explore the formation mechanism of brine (seawater evaporation or freezing). Four brine sets are extracted from the published, real-world data, and the simulation test with the same six chemical substance features. After integrated comparative experiments on six evaluation metrics, the results show that this model outperforms the other baseline prediction algorithms, achieving the highest precision 0.9625 and at least 8.45% improvement. Furthermore, for predicting the real-world test set, the results confirm the existence of freezing brine for the first time in Laizhou Bay area, China. This model is also used to analyze the mixed simulation results for brine and fresh groundwater. The experimental results indicate that only two out of twenty-nine samples of various concentrations change formation mechanisms after mixing. Overall, the model can effectively distinguish the evaporation and freezing brine, and discover the seawater concentration pathway.

Committee Machine-Ensemble as a General Paradigm for Accurate Prediction of Bubble Point Pressure of Crude Oil

Article

Aug 2020

Bubble point pressure (BPP) not only is a basic pressure-volume-temperature (PVT) parameter for calculation nearly all of crude oil characteristics, but it also determines phase type of oil reservoirs, gas to oil ratio, oil formation volume factor, inflow performance relationship, and so on. Since measurement of BPP of crude oil is an expensive and time-consuming experiment, this study develops a committee machine-ensemble (CME) paradigm for accurate estimation of this parameter from solution gas oil ratio, reservoir temperature, gas specific gravity, and stock-tank oil gravity. Our CME approach is designed using linear combination of predictions of four different expert systems. Unknown coefficients of this combination are adjusted through minimizing deviation between actual BPPs and their associated predictions using differential evolution and genetic algorithm. Our proposed CME paradigm is developed using 380 PVT datasets for crude oils from different geological regions. This novel intelligent paradigm estimates available experimental databank with an excellent accuracy i.e. absolute average relative deviation (AARD) of 6.06% and regression coefficient (R2) of 0.98777. Accurate prediction of BPP using our CME paradigm decreases the risk of producing from two-phase region of oil reservoirs.

Improved Method for the Estimation of Minimum Miscibility Pressure for Pure and Impure CO2 - Crude Oil Systems Using Gaussian Process Machine Learning Approach

Article

May 2020

The minimum miscibility pressure (MMP) is one of the critical parameters needed in the successful design of a miscible gas injection for enhanced oil recovery purposes. In this study, we explore the capability of using the Gaussian Process Machine Learning (GPML) Approach, for accurate prediction of this vital property in both pure and impure CO2-injection streams. We first performed a sensitivity analysis of different kernels and then a comparative analysis with other techniques. The new GPML model, when compared to previously published predictive models, including both correlations and other machine learning/intelligent models, showed superior performance with the highest correlation coefficient and the lowest error metrics.

A Review of Intelligent Decision-Making Strategy for Geological CO2 Storage: Insights from Reservoir Engineering

Article

May 2024

Undersaturated Oil Viscosity Based on Multi-Gene Genetic Programming

Article

Aug 2022

Undersaturated oil viscosity is an important physical property for reservoir simulation, enhanced oil recovery, and optimal production. There are two distinct methods for undersaturated oil viscosity determination: the first one is experimental measurements which are usually expensive or unavailable; whereas the second one is empirical correlations which frequently have appropriate accuracy. Accordingly, searching for a high reliability undersaturated oil viscosity model is vital. This paper presents a new undersaturated crude oil viscosity model by using multi-gene genetic programming (MGGP). This model was built by using 528 experimental measurements data points that presents broad range of reservoir pressure and oil properties. Another, 276 points were used for validating and testing the new model against eleven published correlations. The results indicated that the new MGGP-based model yields a precise prediction of undersaturated oil viscosity.

Retrograde Gas Condensate Reservoirs: Reliable Estimation of Dew Point Pressure by the Hybrid Neuro-Fuzzy Connectionist Paradigm

Article

Aug 2021

The dew point pressure (DPP) is a crucial thermodynamic property for gas reservoir performance evaluation, gas/condensate characterization, reservoir development and management, and downstream facility design. However, dew point pressure measurement is an expensive and time-consuming task; its estimation using the thermodynamic approaches has convergency problems, and available empirical correlations often provide high uncertainty levels. In this paper, the hybrid neuro-fuzzy connectionist paradigm is developed using 390 literature measurements. The adaptive neuro-fuzzy inference system (ANFIS) topology, including the training algorithm and cluster radius (radii), was determined by combining trial-and-error and statistical analyses. The hybrid optimization algorithm and radii=0.675 are distinguished as the best characteristics for the ANFIS model. A high value of observed R2 = 0.97948 confirms the excellent performance of the designed approach for calculating the DPP of retrograde gas condensate reservoirs. Furthermore, visual inspections and statistical indices are employed to compare the ANFIS reliability and available empirical correlations. The results showed that the ANFIS model is more accurate than the well-known empirical correlations and previous intelligent paradigms in the literature. The designed ANFIS model, the best empirical correlation, and the most accurate intelligent paradigm in the literature present the absolute average relative deviation (AARD) of 1.60%, 11.25%, 2.10%, and, respectively.

Geologic CO2 sequestration monitoring design: A machine learning and uncertainty quantification based approach

Article

Full-text available

May 2018
APPL ENERG

Monitoring is a crucial aspect of geologic carbon dioxide (CO2) sequestration risk management. Effective monitoring is critical to ensure CO2 is safely and permanently stored throughout the life-cycle of a geologic CO2 sequestration project. Effective monitoring involves deciding: (i) where is the optimal location to place the monitoring well(s), and (ii) what type of data (pressure, temperature, CO2 saturation, etc.) should be measured taking into consideration the uncertainties at geologic sequestration sites. We have developed a filtering-based data assimilation procedure to design effective monitoring approaches. To reduce the computational cost of the filtering-based data assimilation process, a machine-learning algorithm: Multivariate Adaptive Regression Splines is used to derive computationally efficient reduced order models from results of full-physics numerical simulations of CO2 injection in saline aquifer and subsequent multi-phase fluid flow. We use example scenarios of CO2 leakage through legacy wellbore and demonstrate a monitoring strategy can be selected with the aim of reducing uncertainty in metrics related to CO2 leakage. We demonstrate the proposed framework with two synthetic examples: a simple validation case and a more complicated case including multiple monitoring wells. The examples demonstrate that the proposed approach can be effective in developing monitoring approaches that take into consideration uncertainties.

Optimization of Supercritical CO 2 Brayton Cycle for Simple Cycle Gas Turbines Exhaust Heat Recovery Using Genetic Algorithm

Article

Full-text available

Mar 2018

For the application of waste heat recovery (WHR), supercritical CO2 (S-CO2) Brayton power cycles offer significant suitable advantages such as compactness, low capital cost, and applicability to a broad range of heat source temperatures. The current study is focused on thermodynamic modeling and optimization of recuperated (RC) and recuperated recompression (RRC) configurations of S-CO2 Brayton cycles for exhaust heat recovery from a next generation heavy duty simple cycle gas turbine using genetic algorithm (GA). This nongradient based algorithm yields a simultaneous optimization of key S-CO2 Brayton cycle decision variables such as turbine inlet temperature, pinch point temperature difference, compressor pressure ratio, and mass flow rate of CO2. The main goal of the optimization is to maximize power out of the exhaust stream which makes it single objective optimization. The optimization is based on thermodynamic analysis with suitable practical assumptions which can be varied according to the need of user. The optimal cycle design points are presented for both RC and RRC configurations and comparison of net power output is established for WHR. For the chosen exhaust gas mass flow rate, RRC cycle yields more power output than RC cycle. The main conclusion drawn from the current study is that the choice of best cycle for WHR actually depends heavily on mass flow rate of the exhaust gas. Further, the economic analysis of the more power producing RRC cycle is performed and cost comparison between the optimized RRC cycle and steam Rankine bottoming cycle is presented.

Artificial Neural Network-Based Prediction of Performances-Exhaust Emissions of Diesohol Piloted Dual Fuel Diesel Engine Under Varying Compressed Natural Gas Flowrates

Article

May 2018

The present study surveys the effects on performance and emission parameters of a partially modified single cylinder direct injection (DI) Diesel engine fueled with Diesohol blends under varying compressed natural gas (CNG) flowrates in dual fuel mode. Based on experimental data an Artificial Intelligence (AI) based specialized Artificial Neural Network (ANN) model have been developed for predicting the output parameters viz. brake thermal efficiency (Bth), brake specific energy consumption (BSEC) along with emission characteristics such as oxides of nitrogen (NOX), unburned hydrocarbon (UBHC), carbon dioxide (CO2) and carbon monoxide (CO) emissions. Engine load, ethanol share and CNG strategies have been used as input parameters for the model. Among the tested models, the Levenberg-Marquardt feed-forward back propagation with three input neurons or nodes, two hidden layers with ten neurons in each layer and six output neurons, and tansig-purelin activation function have been found to the optimal model topology for the Diesohol-CNG platforms. The statistical results acquired from the optimal network topology such as correlation coefficient (0.992-0.999), mean square error (0.0001-0.0009) and mean absolute percentage error (0.09-2.41%) along with Nash-Sutcliffe coefficient of efficiency, Kling-Gupta efficiency, mean square relative error and model uncertainty established itself as a real time robust type machine learning tool under Diesohol- CNG paradigms. The study also incorporated a special type of measure namely Pearson's Chisquare test or goodness of fit, which brings up the model validation to a higher level.

Assessing the combined influence of fluid-rock interactions on reservoir properties and injectivity during CO 2 storage in saline aquifers

Article

May 2018
ENERGY

Complex fluid-rock interactions can occur during the injection of carbon dioxide (CO2) into saline aquifers for sequestration, which may affect CO2 injectivity and storage capacity. In this paper, a comprehensive reactive transport model is established to analyze salt precipitation, CO2-water-rock geochemical reactions, and their effects on reservoir physical properties and injectivity. In addition, sensitivity analyses are conducted to investigate the main factors that affect fluid-rock interactions and injectivity with relevance for site selection for CO2 storage. Results show that the back flow of formation water not only affects the salt precipitation but also affects the CO2-water-rock geochemical reactions, resulting in salt and calcite precipitations mainly occurring in the dry-out zone. However, most of the mineral dissolution/precipitation caused by CO2-water-rock reaction occurs in the two-phase and aqueous-phase zones, and their effect on reservoir porosity and permeability are small. A considerable amount of sodium chloride precipitates in the dry-out zone as brine is drawn by capillary action into this zone, with significant consequences for porosity, permeability and injectivity. The injection rate, salinity, capillary pressure–saturation relationships, and reservoir permeability strongly affect the distribution of salt precipitation. Moderate injection rates, salinities, capillary pressures, and permeabilities all lead to favorable CO2 injectivity.

CO2 Water-Alternating-Gas Injection for Enhanced Oil Recovery: Optimal Well Controls and Half-Cycle Lengths

Article

Mar 2018
COMPUT CHEM ENG

Effect of CO2 on Heavy Oil Recovery and Physical Properties in Huff-n-Puff Processes Under Reservoir Conditions

Article

Feb 2018

In this paper, experimental and numerical techniques have been utilized to quantify heavy oil properties in CO2 huff-n-puff processes under reservoir conditions. Experimentally, fluid properties together with viscosity reduction of heavy oil and interfacial properties between CO2 and heavy oil have been quantified, while five cycles of CO2 huff-n-puff processes have been conducted to determine oil recovery together with component variation of produced and residual oils. Theoretically, numerical simulation has been conducted to analyze the underlying recovery mechanisms associated with the CO2 huff-n-puff processes. CO2 huff-n-puff processes are only effective in the first two cycles under the existing experimental conditions, while the effective sweep range is limited near the wellbore region, resulting in poor oil recovery in the subsequent cycles. As for produced oil, its viscosity, density, resin and asphaltene contents, and molecular weight of asphaltene are reduced, whereas, for the residual oil, they are increased. The asphaltene component in the residual oil shows weak stability compared to that of the produced oil, while the ultimate oil recovery after the fifth CO2 cycle of huff-n-huff processes is measured to be 31.56%.

The Experimental Study on the Flooding Regularities of Various CO 2 Flooding Modes Implemented on Ultralow Permeability Cores

Article

Feb 2018

The application of water flooding is not successful for the development of low permeability reservoirs due to the fine pore sizes and the difficulty of water injection operation. CO2 can dissolve readily in crude oil and highly improve the mobility of crude oil. Which makes CO2 flooding an effective way to the development of the ultralow-permeability reservoirs. The regularities of various CO2 displacement methods were studied via experiments implemented on cores from Chang 8 Formation of Honghe Oilfield. The results show that CO2 miscible displacement has the minimum displacement differential pressure and the maximum oil recovery; CO2-alternating-water miscible flooding has lower oil recovery, higher drive pressure, and relatively lower gas-oil ratio; water flooding has the minimum oil recovery and the maximum driving pressure. A large amount of oil still can be produced under a high gas-oil ratio condition through CO2 displacement method. This fact proves that the increase of gas-oil ratio is caused by the production of dissolved CO2 in oil rather than the free gas breakthrough. At the initial stage of CO2 injection, CO2 does not improve the oil recovery immediately. As the injection continues, the oil recovery can be improved rapidly. This phenomenon suggests that when CO2 displacement is performed at high water cut period, the water cut does not decrease immediately and will remain high for a period of time, then a rapid decline of water cut and increase of oil production can be observed.

Rigorous Prognostication of Natural Gas Viscosity: Smart Modeling and Comparative Study

Article

Feb 2018
FUEL

The current study plays a major role in modeling natural gas viscosity in terms of several operating parameters including pseudo-reduced properties and molecular weight through radial basis function neural network (RBFNN), least-squares support vector machine (LSSVM), and multilayer perceptron neural network (MLPFNN). As it known, an important feature of any comprehensive modeling is the application of a large database for model development. Therefore, more than 3800 gas viscosity data points were used for modeling. For upgrading the efficiency of the abovementioned predictive tools, four optimization algorithms including levenberg-marquardt (LM), coupled simulating annealing (CSA), Bayesian regularization (BR), and scaled conjugate gradient (SCG), were integrated with them to find the optimal models’ parameters during prediction analysis. Consequently, it was understood that among the all suggested tools in this study, the MLP-LM and then MLP-BR are the most accurate models for estimating gas viscosity with root mean square error (RMSE) of 0.001 and 0.002, respectively. Comparison of the MLP-LM and MLP-BR with previously published models in literature demonstrates their higher prediction capability, with less numbers of input parameters (without needing any density data), than the existing literature models. Based on the sensitivity analysis, it is concluded that the molecular weight is the most affecting variable on the viscosity prediction. Finally, the suggested tools in this study can be of great value for effective estimation of gas viscosity in simulating both upstream and downstream natural gas processes.

The influence of complicated fluid-rock interactions on the geothermal exploitation in the CO 2 plume geothermal system

Article

Dec 2017
APPL ENERG

The ubiquitous natural sedimentary reservoirs and their high permeability have made the CO2 plume geothermal system increasingly attractive. However, the complicated fluid-rock interactions during the geothermal exploitation can cause severe reservoir damage, constraining the excellent heat mining performance of the CO2 and decreasing the possible applications of the CO2 plume geothermal system. In order to analyze and solve this energy issue affecting the geothermal exploitation, in this study, a comprehensive numerical simulation model was established, which can consider formation water evaporation, salt precipitation, CO2-water-rock geochemical reactions, and the changes in reservoir porosity and permeability in the CO2 plume geothermal (CPG) system. Using this model, the geochemical reactions and salt precipitation and their effects on the geothermal exploitation were analyzed, and some measures were proposed to reduce the influence of fluid-rock interactions on the heat mining rate. The simulation results show that the gravity and the negative gas-liquid capillary pressure gradient induced by evaporation can cause the formation water to flow toward the injector. The back flow of the formation water results in salt precipitation accumulation in the injection well region, which can cause severe reservoir damage and consequent reductions to the heat mining rate. The CO2-water-rock geochemical reactions could result in the dissolution of certain minerals and precipitation of others, but its minimal influence on the heat mining rate can be ignored. However, salt precipitation can affect the geochemical reactions by influencing the CO2 flow and distribution, which can reduce the heat mining rate up to 2/5 of the original. Sensitivity studies show that the reservoir condition can affect the salt precipitation and heat mining rate, so a sedimentary reservoir with high temperature, high porosity and permeability, and low salinity should be selected for CPG application, with an appropriately high injection-production pressure difference. The injection of low salinity water before CO2 injection and the combined injection of CO2 and water vapor can be applied to reduce the salt precipitation and increase the heat mining rate in the CPG system.

A Power-Law Mixing Rule for Predicting Apparent Diffusion Coefficients of Binary Gas Mixtures in Heavy Oil

Article

Nov 2017

A power-law mixing rule has been developed to determine apparent diffusion coefficient of a binary gas mixture on the basis of molecular diffusion coefficients for pure gases in heavy oil. Diffusion coefficient of a pure gas under different pressures and different temperatures is predicted on the basis of the Hayduk and Cheng's equation incorporating the principle of corresponding states for one-dimensional gas diffusion in heavy oil such as the diffusion in a pressure-volume-temperature (PVT) cell. Meanwhile, a specific surface area term is added to the generated equation for three-dimensional gas diffusion in heavy oil such as the diffusion in a pendant drop. In this study, the newly developed correlations are used to reproduce the measured diffusion coefficients for pure gases diffusing in three different heavy oils, i.e., two Lloydminster heavy oils and a Cactus Lake heavy oil. Then, such predicted pure gas diffusion coefficients are adjusted based on reduced pressure, reduced temperature, and equilibrium ratio to determine apparent diffusion coefficient for a gas mixture in heavy oil, where the equilibrium ratios for hydrocarbon gases and CO2 are determined by using the equilibrium ratio charts and Standing's equations, respectively. It has been found for various gas mixtures in two different Lloydminster heavy oils that the newly developed empirical mixing rule is able to reproduce the apparent diffusion coefficient for binary gas mixtures in heavy oil with a good accuracy. For the pure gas diffusion in heavy oil, the absolute average relative deviations (AARDs) for diffusion systems with two different Lloydminster heavy oils and a Cactus Lake heavy oil are calculated to be 2.54%, 14.79%, and 6.36%, respectively. Meanwhile, for the binary gas mixture diffusion in heavy oil, the AARDs for diffusion systems with two different Lloydminster heavy oils are found to be 3.56% and 6.86%, respectively.

Estimation of CO2 Diffusivity in Brine by Use of the Genetic Algorithm and Mixed Kernels-Based Support Vector Machine Model

Abstract and Figures

Recommended publications

Imaging the Aquistore reservoir after 36 kilotonnes of CO 2 injection using distributed acoustic sen...

The laboratory research on selective water - Plugging technology in crude oil emulsification

Flow path development in different CO2 storage reservoir scenarios: A critical state approach

Investigation of the pH effect of a typical host rock and buffer solution on CO 2 sequestration in s...