EMBEDDING EXPLICIT SMOOTHNESS CONSTRAINTS IN
DATA-DRIVEN TURBULENCE MODELS
H. Mandler¹ and B. Weigand¹
¹Institute of Aerospace Thermodynamics, University of Stuttgart
hannes.mandler@itlr.uni-stuttgart.de
Abstract
This paper is concerned with an innovative regularization strategy for data-driven turbulence models. By enforcing explicit constraints on the Lipschitz continuity of the neural networks (NNs) which represent variable coefficients of the closure, their sensitivity with respect to input perturbations can be significantly reduced. This can be implemented efficiently by bounding the spectral norm of the NNs' weight matrices. Furthermore, the influence of different levels of Lipschitz continuity on the mean flow field prediction is illustrated for a two-dimensional channel flow with periodic hills. It is demonstrated that the Reynolds stress tensor prediction becomes smoother as the Lipschitz constants of the NNs decrease, which facilitates the stability and accuracy of the flow field solution.
1 Introduction
Due to the model form uncertainty associated with the Boussinesq hypothesis and the assumption of constant closure coefficients (Xiao and Cinnella, 2019), which are the foundation of most turbulence models of practical relevance, data-driven models have recently attracted wide attention. The vast majority of these approaches is based on a nonlinear closure equation
2b = τ_t/k + (2/3) I = Σ_{n=1}^{N≤10} g_n(q) T_n,    (1)

which expresses the non-dimensional anisotropy tensor b as a linear combination of N base tensors T_n. In Pope's (1975) original formulation, these are combinations of the mean strain and rotation rate tensors, S̃ = τ S̄ and Ω̃ = τ Ω̄, respectively, which are non-dimensionalized by the turbulent time scale τ. The turbulent kinetic energy (TKE), which is related to the trace of the Reynolds stress tensor (RST) τ_t, is denoted by k.
Recently, methods from the emerging field of machine learning have been adopted in order to systematically infer the variability of the closure coefficients g_n, which may be functions of the local flow state. The straightforward procedure to obtain these functions can be broken down into two steps (Duraisamy, 2021): Firstly, the spatial distributions of the independent variables q(x) and the optimal closure coefficients g_n^op(x) are extracted from high-fidelity simulations of representative flows. Secondly, a set of regression problems can be posed opting for functions

h_n : q(x) ↦ g_n^op(x)  ∀ n,    (2)

which are commonly referred to as hypotheses and generalize the relationship between local flow quantities and optimal closure coefficients.
A considerable number of data-driven closures relies on NNs as representation for the hypotheses h_n, e.g. those proposed by Ling et al. (2016), Geneva and Zabaras (2019), Jiang et al. (2021) as well as Mandler and Weigand (2022b). Although NNs may approximate any non-linear relationship to arbitrary accuracy (Hornik, 1991) and are suitable for deep learning applications, they are also prone to overfitting and adversarial attacks (Rosca et al., 2020; Akhtar and Mian, 2018). Therefore, even slight input perturbations may yield oscillatory or even anomalous predictions, which would propagate into the momentum balance via the divergence of the non-dimensional anisotropy tensor
∇·b = Σ_{n=1}^{N} ( g_n ∇·T_n + T_n^T · ∇q · (∂g_n/∂q)^T )    (3)

and thus decrease the stability and accuracy of its solution. This effect may even be amplified by the Jacobians of the hypotheses J_n = ∂g_n/∂q.
Data-driven turbulence models accordingly face the bias-variance trade-off between exploiting the full potential for model enhancement and the stability of the solver, which is mainly controlled by the sensitivity of the hypotheses w.r.t. their inputs and can be adjusted by regularization techniques. The input sensitivity can be expressed in terms of the Lipschitz constant K_n, which is defined by

‖h_n(q₂) − h_n(q₁)‖₂ ≤ K_n ‖q₂ − q₁‖₂  ∀ q₁, q₂,    (4)

and provides an upper bound for the maximum magnitude of the hypothesis' Jacobian, i.e. ‖J_n‖₂ ≤ K_n.
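As a small illustration (not taken from the paper), the bound of Eq. (4) can be checked numerically for a toy two-layer ReLU network: sampled difference quotients never exceed the product of the weight matrices' spectral norms, which is the conservative Lipschitz estimate used later in Eq. (5). All sizes and names here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy hypothesis h(q) = W2 @ relu(W1 @ q); shapes are arbitrary choices.
W1 = rng.normal(size=(6, 5))
W2 = rng.normal(size=(1, 6))

def h(q):
    return W2 @ np.maximum(W1 @ q, 0.0)

# Conservative Lipschitz bound: product of the largest singular values,
# valid because the ReLU slope does not exceed unity.
K_bound = np.linalg.norm(W1, 2) * np.linalg.norm(W2, 2)

# Empirical check of Eq. (4): difference quotients stay below the bound.
ratios = []
for _ in range(1000):
    q1, q2 = rng.normal(size=5), rng.normal(size=5)
    ratios.append(np.linalg.norm(h(q2) - h(q1)) / np.linalg.norm(q2 - q1))

assert max(ratios) <= K_bound
```

The gap between max(ratios) and K_bound illustrates why this estimate is conservative; tighter estimators are cited in the text.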
The most common regularization method is called weight decay (WD). It penalizes large magnitudes of the weights by adding another term to the cost function. There are plenty of alternative strategies, e.g. gathering more data, pruning, early stopping, dropout and noise injection (Goodfellow et al., 2016).
Even though these methods indeed reduce the hypotheses' input sensitivity, it is uncertain to what extent, as the methods' hyperparameters are only vaguely linked to the hypotheses' Lipschitz constants. Hence, Yoshida and Miyato (2017) as well as Usama and Chang (2019) proposed to modify the cost function such that hypotheses with low Lipschitz constants are incentivized. Because this soft constraint still does not guarantee a certain value of the Lipschitz constant, Miyato et al. (2018) suggested spectral normalization of the weight matrices in order to enforce K = 1. Gouk et al. (2021) extended the latter concept to arbitrary Lipschitz constants by means of a projection method. The aforementioned concepts rely on accurate estimates of the Lipschitz constant for a given NN. While the present work is based on a straightforward and conservative estimate (Neyshabur, 2017), more accurate methods are described by Scaman and Virmaux (2018) and Pauli et al. (2022).
This is the first attempt to embed explicit smoothness constraints into a data-driven turbulence model. In the present work, it will be demonstrated that this indeed yields spatially smoother flow field predictions compared to models regularized by WD.
This paper is organized as follows. First, the Lipschitz continuity control (LCC) method is described. After briefly reviewing the neuralSST model as an example of a data-driven closure, the training and test cases are presented. Finally, a selection of a priori and a posteriori results is thoroughly discussed.
2 Lipschitz continuity control of NNs
The Lipschitz constant of a feed-forward NN using an activation function whose slope does not exceed unity is bounded by the product of the spectral norms of all L hidden as well as the output layer's weight matrices. Since the spectral norm of the l-th weight matrix is given by its largest singular value max σ(l), an upper bound for the Lipschitz constant of the entire NN reads (Gouk et al., 2021)

K ≤ Π_{l=1}^{L+1} max σ(l).    (5)

If all hidden layers are supposed to contribute equally to the variability of the hypothesis, rearranging Eq. (5) yields an upper bound for the maximum singular value of each weight matrix (Gouk et al., 2021):

max σ(l) ≤ K^{1/(L+1)}  ∀ l ≤ L+1.    (6)

To enforce the constraint given by Eq. (6), the updated weight matrices returned by the optimizer at the end of each epoch are normalized (Gouk et al., 2021):

W(l) ← W(l) / max( 1, max σ(l) / K^{1/(L+1)} )  ∀ l ≤ L+1.    (7)
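A minimal NumPy sketch of the projection step of Eqs. (5)-(7), assuming the weight matrices are available as plain arrays; the function names and the use of power iteration (rather than a full SVD) are implementation choices, not prescribed by the paper.

```python
import numpy as np

def spectral_norm(W, n_iter=50):
    """Estimate the largest singular value of W by power iteration."""
    v = np.random.default_rng(0).normal(size=W.shape[1])
    for _ in range(n_iter):
        u = W @ v
        u /= np.linalg.norm(u)
        v = W.T @ u
        v /= np.linalg.norm(v)
    return float(u @ W @ v)

def project_weights(weights, K):
    """Rescale each of the L+1 weight matrices so that Eq. (6) holds,
    i.e. max sigma(l) <= K**(1/(L+1)), following the update rule of Eq. (7)."""
    layer_bound = K ** (1.0 / len(weights))
    projected = []
    for W in weights:
        scale = max(1.0, spectral_norm(W) / layer_bound)
        projected.append(W / scale)
    return projected
```

Applied after each optimizer epoch, this keeps the product of spectral norms, and hence the Lipschitz bound of Eq. (5), at or below the prescribed K.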
3 Data-driven turbulence model
This section provides a concise review of the neu-
ralSST model of Mandler and Weigand (2022b).
Implementation into the RANS solver
All modifications of the closure equation w.r.t. the underlying SST model (Menter et al., 2003) are grouped in a single correction term. The non-dimensional anisotropy tensor, therefore, reads

b = (ν_t^SST/k) S̄ − R( −b_ML + (ν_t^SST/k) S̄ ),    (8)

where its data-driven prediction

2 b_ML = g₁ S̃ + g₂ (S̃Ω̃ − Ω̃S̃) + g₃ (S̃² − (1/3) tr{S̃²} I)    (9)

is subject to (s.t.) a barycentric realizability correction R. By evaluating the hypotheses and limiting their predictions, the closure coefficients

g_n = max( g_n^min, min[ h_n(q), g_n^max ] )    (10)

can be obtained. The corresponding limits are set such that g₁ ∈ [0.1, 0.4], g₂ ∈ [0.0, 0.3], g₃ ∈ [−0.4, 0.0] and g₄ ∈ [−1.0, 1.5].
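The limiter of Eq. (10) with the bounds stated above amounts to an elementwise clip; a minimal sketch (array layout and names are illustrative assumptions):

```python
import numpy as np

# Coefficient limits for g1..g4 as stated in the text.
G_MIN = np.array([0.1, 0.0, -0.4, -1.0])
G_MAX = np.array([0.4, 0.3, 0.0, 1.5])

def limit_coefficients(h_pred):
    """Apply Eq. (10): clip each hypothesis prediction h_n(q) to [g_min, g_max]."""
    return np.clip(h_pred, G_MIN, G_MAX)
```

For example, raw predictions [1.0, -0.5, 0.5, 2.0] are limited to [0.4, 0.0, 0.0, 1.5].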
In order to comply with the principle of turbulent scale consistency (Ling and Templeton, 2015), the scale equations

Dk/Dt = P*_k − β* k ω + ∇·[ (ν + σ_k ν_t^SST) ∇k ]    (11)

Dω/Dt = γ P*_k / ν_t^SST − β ω² + CD + ∇·[ (ν + σ_ω ν_t^SST) ∇ω ]    (12)

are augmented by another data-driven coefficient g₄ (Schmelzer et al., 2020; Mandler and Weigand, 2022a), which can be incorporated into the TKE production term

P*_k = min( τ_t : S̄, 10 β* k ω ) + g₄ k/τ.    (13)

The constants β, β*, σ_k, σ_ω, σ_ω2 and γ as well as the cross-diffusion term CD in Eqs. (11) and (12) are adopted from Menter et al. (2003). Likewise, ν_t^SST denotes the eddy viscosity predicted by the original SST model's closure. Finally, the turbulent time scale reads (Menter et al., 2012)

τ = (1/(β* ω)) max( 1.0, 6.0 √(β* ν ω / k) ).    (14)
The optimal coefficient distributions
The optimal spatial distributions of the closure coefficients g_n^op(x) can be efficiently obtained from the high-fidelity reference solution by a series of consecutive tensor projections:

g_n^op = ( 2b − Σ_{m=1}^{n−1} g_m^op T_m ) : T_n / ‖T_n‖²  ∀ n ≤ N.    (15)

Figure 1: Geometry of the domain and boundary conditions for the simulation of the flow through a two-dimensional channel with periodic hills (Mandler and Weigand, 2022b)

In order to determine g_4^op, Eqs. (11) and (12) are solved given the high-fidelity solution for the mean flow field, the RST and the TKE. This requires a precursor RANS simulation which solves only the scale equations but neither the continuity nor the momentum equations (Schmelzer et al., 2020).
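The consecutive projections of Eq. (15) can be sketched as follows, assuming the anisotropy tensor and base tensors are stored as 3x3 arrays and ":" denotes the Frobenius inner product; this is an illustrative reading of the formula, not the authors' implementation.

```python
import numpy as np

def optimal_coefficients(b, T):
    """Consecutive tensor projections per Eq. (15): at each step the residual
    of 2b, after removing the already-fitted terms, is projected onto T_n.
    b and each T[n] are 3x3 arrays; np.tensordot with default axes=2
    computes the Frobenius inner product ':'."""
    residual = 2.0 * b
    g = []
    for Tn in T:
        gn = np.tensordot(residual, Tn) / np.tensordot(Tn, Tn)
        g.append(float(gn))
        residual = residual - gn * Tn
    return g
```

For mutually orthogonal base tensors (as in Pope's basis for 2D mean flows) the subtraction of earlier terms has no effect, but the sequential form also handles non-orthogonal bases.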
Inference of the coefficient variability
The optimal coefficients' variability is inferred by solving four regression problems defined by Eq. (2), where the hypotheses h_n are represented by NNs. These consist of six hidden layers with six neurons each. A rectified linear unit (ReLU) activation function is applied to all hidden neurons, whereas the output neuron behaves linearly. By means of the ADAM algorithm (Kingma and Ba, 2014), the mean squared error cost function is minimized over 20,000 epochs at a learning rate of 0.002. In the present work, WD with a regularization constant of 0.1 is replaced by LCC.
The raw features q are listed in Table 1. They contain the wall-distance d and the blending argument arg2, which are both provided by the underlying SST model. In order to facilitate the training process, a z-score normalization is applied to the raw features q, which ensures that the final features q have zero mean and unit variance.
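The z-score normalization mentioned above is a standard preprocessing step; a minimal sketch, assuming the raw features are stacked row-wise in an array (in practice the statistics would be computed on the training set and reused at inference):

```python
import numpy as np

def z_score(q_raw):
    """Z-score normalization: zero mean and unit variance per feature column.
    Returns the normalized features together with the statistics so the same
    transform can be reapplied to unseen data."""
    mean = q_raw.mean(axis=0)
    std = q_raw.std(axis=0)
    return (q_raw - mean) / std, mean, std
```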
4 Training and test cases
As this paper is concerned with the influence of different regularization strategies on data-driven turbulence models and not with extending their range of application, a common training and test case is considered. The two-dimensional flow over periodic hills is characterized by a repeating pattern of flow separation and reattachment. Its geometry is depicted in Fig. 1.
The size of the recirculation zone is mainly governed by the Reynolds number and the average hill inclination. While the model is only informed with the DNS data of one particular combination of these parameters, the a priori test cases cover a variation in the average hill inclination θ and the a posteriori test case features a significantly higher bulk Reynolds number Re_b^Dh (based on the hydraulic diameter), see Tab. 2.

Figure 2: RMSE of the predictions of the four hypotheses h_n on the training and test sets as a function of their Lipschitz constants. The optimal a priori and a posteriori Lipschitz constants are highlighted by squares and circles, respectively.
5 Results and discussion
Selection of an optimal Lipschitz constant
The Lipschitz constants can be regarded as hyperparameters of the hypotheses. In supervised learning problems, hyperparameters are typically chosen such that the test error is minimized. Thus, Fig. 2 depicts the root mean squared error (RMSE) ε_n for all four hypotheses h_n evaluated on the training and test sets as a function of the Lipschitz constant. While the training error continuously increases as the Lipschitz constant decreases, the test error curves for all but the second hypothesis exhibit a global minimum at an intermediate Lipschitz constant. This is a typical observation for a regularization parameter variation. This global minimum then defines the optimal a priori Lipschitz constant for each hypothesis h_n,

K_n^{LCC,prior} = arg min_K ε_n^test,    (16)

which is highlighted by a square in Fig. 2. The ideal regularization strength obviously differs between the hypotheses. While the first hypothesis requires strong regularization, the results suggest practically dispensing with regularization for the second hypothesis.
This a priori selection, however, does not take into
account how the four hypotheses interact with each
other and with the solver. It may, therefore, not lead to
Table 1: Definition of the raw features q using the squashing functions N1(x) = tanh(x/2) (Geneva and Zabaras, 2019) and N2(x) = x/(|x| + 1) (Ling and Templeton, 2015)

Raw feature q_i                | Physical interpretation
N2(τ ‖S‖)                      | Ratio of turbulent and mean strain time scale
N2(τ ‖Ω‖)                      | Ratio of turbulent and mean rotation time scale
N2(τ k^{-0.5} |∇k|)            | Ratio of turbulent length scale and TKE gradient decay length
min(0.02 k^{0.5} d ν^{-1}, 2)  | Wall-distance based Reynolds number
N1(min[arg2, 10])              | Blending function
Table 2: Relevant non-dimensional parameters for the training and test data sets

Usage                | θ [°]        | Re_b^Dh [−] | Source
training             | 27.4         | 22,803      | Xiao et al. (2020)
a priori testing     | [23.4, 32.9] | 22,803      | Xiao et al. (2020)
a posteriori testing | 27.4         | 150,664     | Rapp and Manhart (2011)
the best possible agreement with the high-fidelity flow solution. The brute-force a posteriori selection procedure, i.e. testing all possible combinations of independently regularized hypotheses in the solver, would in general be far too expensive. Hence, starting from the optimal a priori selection, the Lipschitz constants of each hypothesis have been independently reduced until the flow solution deteriorated significantly. The optimal a posteriori Lipschitz constants K_n^{LCC,post} for each hypothesis are highlighted by circles in Fig. 2.
In order to prove the validity of the two variations of the neuralSST model, which are based on the optimal a priori and a posteriori Lipschitz constants, respectively, the corresponding mean flow field predictions for the training case are illustrated in Fig. 3. In fact, both models yield almost identical predictions, and the significantly stronger regularization does not sacrifice the model's accuracy, which is measured based on the agreement with the DNS solution. In addition, there is no visible difference in the predictions of the neuralSST models which were s.t. WD and LCC.
Smoothness of the a posteriori coefficient field
As detailed above, the LCC regularization approach bounds the norm of the hypotheses' Jacobians and, by virtue of the chain rule of differentiation, yields smoother spatial distributions of the coefficients. This is verified for the predictions of the first closure coefficient g1 by the two neuralSST model variations, which are K_n^{LCC,prior}- and K_n^{LCC,post}-Lipschitz, respectively, see Fig. 4. While the model based on the optimal a priori Lipschitz constants better captures the true, physical gradients, it also suffers from spatial oscillations in the prediction, in particular above the hill crest and in the free stream where the velocity gradients almost vanish. On the contrary, the model based on the optimal a posteriori Lipschitz constants yields a very smooth prediction at the cost of not being able to resolve the steepest physical gradients. This side-effect of the strong regularization is not critical, though, because the mean flow solution for the training case is the same.
The original neuralSST model, whose hypotheses were s.t. WD, also leads to a smooth field for g1, as shown in Fig. 4. By promoting smaller absolute values of the weights, WD also reduces the spectral norm of the weight matrices and in turn the NN's Lipschitz constant. Evaluating Eq. (5) in fact yields that the WD-regularized hypothesis h1 is at most 0.1-Lipschitz. The Lipschitz constants of the remaining hypotheses which were s.t. WD are of the same order of magnitude. This indicates that the regularization due to the WD applied by Mandler and Weigand (2022b) is stronger than enforcing the optimal a posteriori Lipschitz constants. This is in contrast to the intuition gained from visual inspection of the coefficient fields shown in Fig. 4.
Smoothness of the a posteriori RST field
A previous study (Mandler and Weigand, 2022b) revealed that the WD-based neuralSST model drastically overpredicts the main normal RST component close to the upper wall. As illustrated in Fig. 5, this is not the case if the NNs are s.t. LCC. This overshoot can consequently be attributed to this particular combination of hyperparameters, including but not limited to the network size and the regularization strength. Furthermore, the baseline model predicts strong oscillations on top of the hill crest, which further propagate in the streamwise direction, but enforcing the optimal a posteriori Lipschitz constants by means of LCC seems to prevent these issues. In light of the fact that the WD-based model has the smallest upper bounds for the Lipschitz constants, these observations are surprising. However, they are consistent with the spatial smoothness of the coefficient fields depicted in Fig. 4.
Even though these spatial RST oscillations in the
prediction of the WD-based model do not manifest
Figure 3: Mean axial velocity profiles in a two-dimensional channel with periodic hills at Re_b^Dh = 22,803 predicted by the SST and variations of the neuralSST model in comparison with DNS results
(a) neuralSST prediction s.t. WD regularization
(b) neuralSST prediction s.t. K_n^{LCC,prior} ∀n
(c) neuralSST prediction s.t. K_n^{LCC,post} ∀n
Figure 4: Spatial distributions of the closure coefficient g1 for the training case predicted by variations of the neuralSST model
themselves in a significant deterioration of the mean
axial velocity profiles, they hinder convergence. The
residuals of the momentum and pressure equations are consistently one order of magnitude higher than for the model which is s.t. LCC with the optimal a posteriori Lipschitz constants.
6 Conclusion
Data-driven turbulence models which rely on NNs to predict variable closure coefficients may suffer from spatially oscillating RST and flow field predictions. In order to prevent these phenomena, regularization techniques are applied during the training process of the NNs. In the present work, LCC was proposed as an alternative to the common WD. It allows explicit upper bounds for the Lipschitz constants of the NNs to be enforced. Besides providing a theoretical guarantee for a black-box model, the practical advantages of reasonably tightening these bounds were demonstrated. As the Lipschitz constants of the NNs decrease, the predicted coefficient fields become smoother, which facilitates both the stability and accuracy of the mean flow solution. These findings suggest that Lipschitz-continuity-based regularization techniques are very well suited for NNs serving as sub-models in partial differential equations. A particularly fruitful application, which will be investigated in the future, may be the combination of an augmented turbulence model with sophisticated scalar-flux models utilizing the entire RST rather than assuming a constant turbulent Prandtl number.
Acknowledgments
The investigations were conducted as part of the
joint research programme Roboflex (AG Turbo 2019)
in the frame of AG Turbo. The work was supported by the Bundesministerium für Wirtschaft und Energie (BMWE) under grant number 03EE5013C. The authors gratefully acknowledge AG Turbo and MTU Aero Engines AG for their support and the permission to publish this paper.
References
Akhtar, N. and Mian, A. (2018), Threat of adversarial at-
tacks on deep learning in computer vision: A survey, IEEE
Access, Vol. 6, pp. 14410-14430.
Duraisamy, K. (2021), Perspectives on machine learning-augmented Reynolds-averaged and large eddy simulation models of turbulence, Phys. Rev. Fluids, Vol. 6, pp. 050504.
Geneva, N. and Zabaras, N. (2019), Quantifying model form uncertainty in Reynolds-averaged turbulence models with Bayesian deep neural networks, J. Comp. Phys., Vol. 383, pp. 125-147.
Figure 5: Main RST component profiles in a two-dimensional channel with periodic hills at Re_b^Dh = 150,664 predicted by the SST and variations of the neuralSST model in comparison with measurements
Goodfellow, I., Bengio, Y., and Courville, A. (2016), Deep
Learning, MIT Press, Cambridge, MA, USA.
Gouk, H., Frank, E., Pfahringer, B., and Cree, M. J. (2021),
Regularisation of neural networks by enforcing Lipschitz
continuity, Mach. Learn., Vol. 110(2), pp. 393-416.
Hornik, K. (1991), Approximation capabilities of multilayer
feedforward networks, Neural Networks, Vol. 4(2), pp. 251-
257.
Jiang, C., Vinuesa, R., Chen, R., Mi, J., Laima, S., and Li,
H. (2021), An interpretable framework of data-driven turbu-
lence modeling using deep neural networks, Phys. Fluids,
Vol. 33(5), pp. 055133.
Kingma, D. P. and Ba, J. (2014), Adam: A method for
stochastic optimization, arXiv:1412.6980.
Ling, J., Kurzawski, A., and Templeton, J. (2016), Reynolds
averaged turbulence modelling using deep neural networks
with embedded invariance, J. Fluid Mech., Vol. 807, pp. 155-
166.
Ling, J. and Templeton, J. (2015), Evaluation of ma-
chine learning algorithms for prediction of regions of high
Reynolds averaged Navier Stokes uncertainty, Phys. Fluids,
Vol. 27(8), pp. 085103.
Mandler, H. and Weigand, B. (2022a), On frozen-RANS ap-
proaches in data-driven turbulence modeling: Practical rele-
vance of turbulent scale consistency during closure inference
and application, Int. J. Heat Fluid Flow, Vol. 97, pp. 109017.
Mandler, H. and Weigand, B. (2022b), A realizable and
scale-consistent data-driven non-linear eddy-viscosity mod-
eling framework for arbitrary regression algorithms, Int. J.
Heat Fluid Flow, Vol. 97, pp. 109018.
Menter, F. R., Garbaruk, A. V., and Egorov, Y. (2012), Ex-
plicit algebraic Reynolds stress models for anisotropic wall-
bounded flows, In EUCASS Proc. Ser. - Adv. Aerosp. Sci.,
Vol. 3, pp. 89-104.
Menter, F. R., Kuntz, M., and Langtry, R. (2003), Ten years
of industrial experience with the SST turbulence model, In
Hanjalic, K., Nagano, Y., and Tummers, M. J. (eds.), Turb
Heat Mass Transfer, Vol. 4, pp. 625-632.
Miyato, T., Kataoka, T., Koyama, M., and Yoshida, Y.
(2018), Spectral normalization for generative adversarial
networks, arXiv:1802.05957.
Neyshabur, B. (2017), Implicit Regularization in Deep
Learning, PhD thesis, Toyota Technical Institute, Chicago,
IL, USA.
Pauli, P., Koch, A., Berberich, J., Kohler, P., and Allgöwer, F. (2022), Training robust neural networks using Lipschitz bounds, IEEE Control Syst. Lett., Vol. 6, pp. 121-126.
Pope, S. B. (1975), A more general effective-viscosity hy-
pothesis, J. Fluid Mech., Vol. 72(2), pp. 331-340.
Rapp, C. and Manhart, M. (2011), Flow over periodic hills:
an experimental study, Exp. Fluids, Vol. 51(1), pp. 247-269.
Rosca, M., Weber, T., Gretton, A., and Mohamed, S. (2020),
A case for new neural network smoothness constraints, In
Zosa Forde, J., Ruiz, F., Pradier, M. F., and Schein, A. (eds.),
Proc. ”I Can’t Believe It’s Not Better!” NeurIPS Workshop,
Vol. 137 of Proc. Mach. Learn. Res., pp. 21-32.
Scaman, K. and Virmaux, A. (2018), Lipschitz regularity of
deep neural networks: Analysis and efficient estimation, In
Proc. 32nd Int. Conf. Neural Inf. Proc. Syst., pp. 3839-3848.
Schmelzer, M., Dwight, R. P., and Cinnella, P. (2020), Dis-
covery of algebraic Reynolds-stress models using sparse
symbolic regression, Flow Turbul. Combust., Vol. 104,
pp. 579-603.
Usama, M. and Chang, D. E. (2019), Towards robust neural
networks with Lipschitz continuity, In Yoo, C. D., Shi, Y.-Q.,
Kim, H. J., Piva, A., and Kim, G. (eds.), Digital Forensics
and Watermarking, Vol. 11378 of Lecture Notes in Computer
Science, pp. 373-389. Springer International Publishing.
Xiao, H. and Cinnella, P. (2019), Quantification of model
uncertainty in RANS simulations: A review, Prog. Aerosp.
Sci., Vol. 108, pp. 1-31.
Xiao, H., Wu, J.-L., Laizet, S., and Duan, L. (2020), Flows
over periodic hills of parameterized geometries: A dataset
for data-driven turbulence modeling from direct simulations,
Comp. Fluids, Vol. 200, pp. 104431.
Yoshida, Y. and Miyato, T. (2017), Spectral norm regular-
ization for improving the generalizability of deep learning,
arXiv:1705.10941.