Nicholas Lubbers's research works | Los Alamos National Laboratory, NM (LANL) and other places

(A) A schematic representation of the proposed end-to-end...

Optimization framework. The sensor positions in the training set are a...

The comparison of training strategies is conducted through density...

(A) Boxplot diagram of the L2 norm for the flow past cylinder dataset....

Test error comparison between Voronoi-CNN [21], the fixed positions and...

Journey over destination: dynamic sensor placement enhances generalization

Article

Full-text available

Jun 2024

Reconstructing complex, high-dimensional global fields from limited data points is a challenge across various scientific and industrial domains. This is particularly important for recovering spatio-temporal fields using sensor data from, for example, laboratory-based scientific experiments, weather forecasting, or drone surveys. Given the prohibiti...

Figure 1: An illustration of the workflow used to create and analyze...

Figure 2: An illustration of the three types of tests performed on the...

Figure 3: Subfigures (a) and (b) show a comparison of methanol RDFs...

Thermodynamic Transferability in Coarse-Grained Force Fields using Graph Neural Networks

Preprint

Full-text available

Jun 2024

Coarse-graining is a molecular modeling technique in which an atomistic system is represented in a simplified fashion that retains the most significant system features that contribute to a target output, while removing the degrees of freedom that are less relevant. This reduction in model complexity allows coarse-grained molecular simulations to re...

Flow regimes for simplified samples, in reality, these effects are...

The bottom panel shows a cross-section of the nanoconfined simulation...

(a) Performance of the model in the training and testing sets for each...

Learning a general model of single phase flow in complex 3D porous media

Article

Full-text available

May 2024

Modeling effective transport properties of 3D porous media, such as permeability, at multiple scales is challenging as a result of the combined complexity of the pore structures and fluid physics—in particular, confinement effects which vary across the nanoscale to the microscale. While numerical simulation is possible, the computational cost is pr...

ANI-1xnr uncertainty for all eight O2/C2H2 ratios in graphene ring...

Biofuel additive simulation results for ANI-1xnr over the entire...

Comparison of normalized production rate of OH for biofuel additive...

Bond dissociation diagram for C-H bond in methane
Data are presented as...

Uncertainty analysis of ANI-1xnr on reactive literature...

Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential

Article

Full-text available

Mar 2024

Atomistic simulation has a broad range of applications from drug design to materials discovery. Machine learning interatomic potentials (MLIPs) have become an efficient alternative to computationally expensive ab initio simulations. For this reason, chemistry and materials science would greatly benefit from a general reactive MLIP, that is, an MLIP...

Linear Graphlet Models for Accurate and Interpretable Cheminformatics

Preprint

Feb 2024

Advances in machine learning have given rise to a plurality of data-driven methods for estimating chemical properties from molecular structure. For many decades, the cheminformatics field has relied heavily on structural fingerprinting, while in recent years much focus has shifted leveraging highly parameterized deep neural networks which usually m...

Machine Learning Potentials with the Iterative Boltzmann Inversion: Training to Experiment

Article

Feb 2024

Machine Learning Framework for Modeling Exciton Polaritons in Molecular Materials

Article

Jan 2024

ATAC-seq profiles of chromosome 9 form A549 cells. A TN5 binds to open...

Synthetic replicate generation via peak down-sampling. A An example...

Synthetic replicate bivariate plots and statistical profiles. A Scatter...

Correlation and association statistics across epigenomic experiments....

Random forest prediction of experimental relationships. A Distributions...

Improved quality metrics for association and reproducibility in chromatin accessibility data using mutual information

Article

Full-text available

Nov 2023

Background Correlation metrics are widely utilized in genomics analysis and often implemented with little regard to assumptions of normality, homoscedasticity, and independence of values. This is especially true when comparing values between replicated sequencing experiments that probe chromatin accessibility, such as assays for transposase-accessi...

Overview of sparse reconstruction using the Senseiver model
a, The...

Details of the encoder/decoder modules
This builds on the Perceiver IO...

Test error results for different sensor configurations
Left: test error...

Performance of the model varying the number of sensors and their...

Performance of the model with different amounts of training data
Left:...

Development of the Senseiver for efficient field reconstruction from sparse observations

Article

Full-text available

Nov 2023

The reconstruction of complex time-evolving fields from sensor observations is a grand challenge. Frequently, sensors have extremely sparse coverage and low-resource computing capacity for measuring highly nonlinear phenomena. While numerical simulations can model some of these phenomena using partial differential equations, the reconstruction prob...

Scale-bridging framework, with coarse- and fine-scale models, database,...

Our machine learning based scale-bridging framework for inertial...

Top row: We show the performance of the ML surrogate model for all the...

Candidate surrogates for the 8-dimensional Rosenbrock function³⁶,...

Top (a) shows a detailed workflow to include nanoconfinement effects in...

Predictive scale-bridging simulations through active learning

Article

Full-text available

Sep 2023

Throughout computational science, there is a growing need to utilize the continual improvements in raw computational horsepower to achieve greater physical fidelity through scale-bridging over brute-force increases in the number of mesh elements. For instance, quantitative predictions of transport in nanoporous media, critical to hydrocarbon extrac...

Synergy of semiempirical models and machine learning in computational chemistry

Article

Sep 2023

Catalyzed by enormous success in the industrial sector, many research programs have been exploring data-driven, machine learning approaches. Performance can be poor when the model is extrapolated to new regions of chemical space, e.g., new bonding types, new many-body interactions. Another important limitation is the spatial locality assumption in...

Representation of experimental design. Generation 0 (G0) describes the...

Plant growth trait response differences between the two factor level...

Three-factor interaction plots for response of interest (a) saturated...

Plant function response differences between the two-factor level means...

Volumetric water content (VWC) decline during the terminal drought....

Drought conditioning of rhizosphere microbiome influences maize water use traits

Article

Full-text available

Aug 2023

Background and Aims Beneficial plant–microbe interactions can improve plant performance under drought; however, we know less about how drought-induced shifts in microbial communities affect plant traits. Methods We cultivated Zea mays in fritted clay with soil microbiomes originating from contrasting environments (agriculture or forest) under two...

FIG. 1. A pair potential correction added to the ANI MLP improves...

FIG. 3. Similar corrective potentials are learned for two distinct...

FIG. 4. The corrective potential u(r) improves predictions for the...

FIG. 5. The corrective potential u(r) improves predictions for the...

Machine learning potentials with Iterative Boltzmann Inversion: training to experiment

Preprint

Full-text available

Jul 2023

Methodologies for training machine learning potentials (MLPs) to quantum-mechanical simulation data have recently seen tremendous progress. Experimental data has a very different character than simulated data, and most MLP training procedures cannot be easily adapted to incorporate both types of data into the training process. We investigate a trai...

Figure 1. A diverse collection of datasets, with varying levels of...

Figure 5. The error distribution for the CCSD(T) specialization task...

Figure 6. The a) 3BPA molecule, b) the force error versus the number of...

Figure S8: The bond dissociation energy when trained to the rMD17...

Learning Together: Towards foundational models for machine learning interatomic potentials with meta-learning

Preprint

Full-text available

Jul 2023

The development of machine learning models has led to an abundance of datasets containing quantum mechanical (QM) calculations for molecular and material systems. However, traditional training methods for machine learning models are unable to leverage the plethora of data available as they require that each dataset be generated using the same QM me...

Fig 1. Experimental design for maize. Two generations (G 0 , G 1 ) of...

Fig 2. Flowchart of the study design. Schematic LDA topic modeling for...

Fig 3. Topic abundance weighting for the treatment and species types....

Fig 4. Ternary plot of topic abundances for the outcome type (control,...

Fig 5. Distribution of ASV sequences in each learned LDA topic. Topics...

Latent Dirichlet Allocation modeling of environmental microbiomes

Article

Full-text available

Jun 2023

Interactions between stressed organisms and their microbiome environments may provide new routes for understanding and controlling biological systems. However, microbiomes are a form of high-dimensional data, with thousands of taxa present in any given sample, which makes untangling the interaction between an organism and its microbial environment...

FIG. 3. Potential energy surface (PES) scan with respect to the...

Figure S6: Potential energy surface (PES) scan with respect to the...

Machine Learning Framework for Modeling Exciton-Polaritons in Molecular Materials

Preprint

Full-text available

Jun 2023

When molecules are strongly coupled to an optical cavity, a new light-matter hybrid quasiparticle, called a polariton, is formed. Recent experiments have shown that polariton chemistry can be used to manipulate chemical reactions. Polariton chemistry is a collective phenomenon, and its effect increases with the number of molecules in a cavity. Howe...

Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces

Article

Full-text available

May 2023

Typical generative diffusion models rely on a Gaussian diffusion process for training the backward transformations, which can then be used to generate samples from Gaussian noise. However, real world data often takes place in discrete-state spaces, including many scientific applications. Here, we develop a theoretical formulation for arbitrary disc...

Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces

Preprint

Full-text available

May 2023

Typical generative diffusion models rely on a Gaussian diffusion process for training the backward transformations, which can then be used to generate samples from Gaussian noise. However, real world data often takes place in discrete-state spaces, including many scientific applications. Here, we develop a theoretical formulation for arbitrary disc...

Semi-Empirical Shadow Molecular Dynamics: A PyTorch Implementation

Article

May 2023

Extended Lagrangian Born-Oppenheimer molecular dynamics (XL-BOMD) in its most recent shadow potential energy version has been implemented in the semiempirical PyTorch-based software PySeQM. The implementation includes finite electronic temperatures, canonical density matrix perturbation theory, and an adaptive Krylov subspace approximation for the...

Lightweight and effective tensor sensitivity for atomistic neural networks

Article

May 2023

Atomistic machine learning focuses on the creation of models that obey fundamental symmetries of atomistic configurations, such as permutation, translation, and rotation invariances. In many of these schemes, translation and rotation invariance are achieved by building on scalar invariants, e.g., distances between atom pairs. There is growing inter...

Figure S4: The f1-scores, recall, and precision of the random forest...

Improved Quality Metrics for Association and Reproducibility in Chromatin Accessibility Data Using Mutual Information

Preprint

Full-text available

Apr 2023

Background Correlation metrics are widely utilized in genomics analysis and often implemented with little regard to assumptions of normality, homoscedasticity, and independence of values. This is especially true when comparing values between replicated sequencing experiments that probe chromatin accessibility, such as assays for transposase-accessi...

Figure 6. Computed absolute error of ML-inferred charges vs true...

Figure 8. Comparison of ML-predicted (blue and red) vs...

Machine Learning Models Capture Plasmon Dynamics in Ag Nanoparticles

Article

Full-text available

Apr 2023

Highly energetic electron-hole pairs (hot carriers) formed from plasmon decay in metallic nanostructures promise sustainable pathways for energy-harvesting devices. However, efficient collection before thermalization remains an obstacle for realization of their full energy generating potential. Addressing this challenge requires detailed understand...

Figure 1. Summary of the nanoreactor active learning workflow and...

Figure 4. Comparison of 3-, 4-, 5-, 6-, and 7-membered ring formation...

Figure 6. (a) Product molecule tracking plot of methane combustion...

Figure S.2. Tracking plot of major products of the biofuel simulations....

Figure S.3. Ignition delay time (IDT) for biofuel simulations based on...

Exploring the frontiers of chemistry with a general reactive machine learning potential

Preprint

Full-text available

Apr 2023

Reactive chemistry atomistic simulation has a broad range of applications from drug design to energy to materials discovery. Machine learning interatomic potentials (MLIPs) have become an efficient alternative to computationally expensive quantum chemistry simulations. In practice, developing reactive MLIPs requires prior knowledge of reaction netw...

Figure 2: LAMMPS-FitSNAP interface for calculating energies and forces...

FitSNAP: Atomistic machine learning with LAMMPS

Article

Full-text available

Apr 2023

Comparison of UDD-AL and MD-AL approaches for a glycine test case
a,...

Two-dimensional representation of the glycine conformational space...

Glycine interatomic distance distributions in the MD-AL and UDD-AL data...

Ensemble uncertainty and UDD in acetylacetone
a, Acetylacetone...

Uncertainty-driven dynamics for active learning of interatomic potentials

Article

Full-text available

Mar 2023

Machine learning (ML) models, if trained to data sets of high-fidelity quantum simulations, produce accurate and efficient interatomic potentials. Active learning (AL) is a powerful tool to iteratively generate diverse data sets. In this approach, the ML model provides an uncertainty estimate along with its prediction for each new atomic configurat...

Semi-Empirical Shadow Molecular Dynamics: A PyTorch implementation

Preprint

Full-text available

Mar 2023

Extended Lagrangian Born-Oppenheimer molecular dynamics (XL-BOMD) in its most recent shadow potential energy version has been implemented in the semiempirical PyTorch-based software PySeQM. The implementation includes finite electronic temperatures, canonical density matrix perturbation theory, and an adaptive Krylov Subspace Approximation for the...

Embedding hard physical constraints in neural network coarse-graining of three-dimensional turbulence

Article

Full-text available

Jan 2023

In recent years, deep learning approaches have shown much promise in modeling complex systems in the physical sciences. A major challenge in deep learning of partial differential equations is enforcing physical constraints and boundary conditions. In this work, we propose a general framework to directly embed the notion of an incompressible fluid i...

GLUE Code: A framework handling communication and interfaces between scales

Article

Full-text available

Dec 2022

Many scientific applications are inherently multiscale in nature. Such complex physical phenomena often require simultaneous execution and coordination of simulations spanning multiple time and length scales. This is possible by combining expensive small-scale simulations (such as molecular dynamics simulations) with larger scale simulations (such...

Figure 2. Inspection of the nanoreactor dataset. Panels a), b) c), and...

Figure 5. Tracking plot of major products (CO, CO 2 , and H 2 O) and O...

Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential

Preprint

Full-text available

Dec 2022

Reactive chemistry atomistic simulation has a broad range of applications from drug design to energy to materials discovery. Machine learning interatomic potentials (MLIPs) have become an efficient alternative to computationally expensive quantum chemistry simulations. In practice, developing reactive MLIPs requires prior knowledge of reaction netw...

Figure 1. Illustration of how HIP-NN-TS captures angular information...

Figure 2. Mean absolute error of HIP-NN-TS models trained to QM9 for...

Figure 3. ML errors for ensemble models (bars) and single models...

Figure B.1. ANI-1x Test set errors vs. COMP6 extensibility set errors,...

Lightweight and Effective Tensor Sensitivity for Atomistic Neural Networks

Preprint

Full-text available

Dec 2022

Atomistic machine learning focuses on the creation of models which obey fundamental symmetries of atomistic configurations, such as permutation, translation, and rotation invariances. In many of these schemes, translation and rotation invariance are achieved by building on scalar invariants, e.g., distances between atom pairs. There is growing inte...

Figure S.1. ANI-nr ensemble standard deviation in energy normalized by...

Exploring the frontiers of chemistry with a general reactive machine learning potential

Preprint

Full-text available

Nov 2022

Reactive chemistry atomistic simulation has a broad range of applications from drug design to energy to materials discovery. Machine learning interatomic potentials (MLIP) have become an efficient alternative to computationally expensive quantum chemistry simulations. In practice, reactive MLIPs require refitting to extensive datasets for each new...

Interlingual Automatic Differentiation: Software 2.0 between PyTorch and Julia

Conference Paper

Full-text available

Nov 2022

Julia is a state-of-the-art tool for scientific computing with good support for automatic differentiation. PyTorch is a leading framework for machine learning. We describe how to perform automatic differentiation across the language boundary and connect these two ecosystems. By using the automatic differentiation ecosystems in each language, the ti...

Publisher Correction: Extending machine learning beyond interatomic potentials for predicting molecular properties

Article

Nov 2022

Figure 4. Comparison of 3-, 4-, 5-, 6-, and 7-member ring formation for...

Exploring the frontiers of chemistry with a general reactive machine learning potential

Preprint

Full-text available

Nov 2022

Reactive chemistry atomistic simulation has a broad range of applications from drug design to energy to materials discovery. Machine learning interatomic potentials (MLIP) have become an efficient alternative to computationally expensive quantum chemistry simulations. In practice, reactive MLIPs require refitting to extensive datasets for each new...

The Senseiver: attention-based global field reconstruction from sparse observations

Conference Paper

Full-text available

Nov 2022

The reconstruction of complex time-evolving fields from a small number of sensor observations is a grand challenge in a wide range of scientific and industrial applications. Frequently, sensors have very sparse spatial coverage, and report noisy observations from highly non-linear phenomena. While numerical simulations can model some of these pheno...

Selected samples for the dataset. The samples are divided into static,...

Some examples of binary geometries after the pre-processing steps. A...

Permeability change with scale for two very simple geometries (tube and...

Depiction of the computation set-up for a sample of size Lx, Ly, Lz. In...

A Dataset of 3D Structural and Simulated Transport Properties of Complex Porous Media

Article

Full-text available

Oct 2022

Physical processes that occur within porous materials have wide-ranging applications including - but not limited to - carbon sequestration, battery technology, membranes, oil and gas, geothermal energy, nuclear waste disposal, water resource management. The equations that describe these physical processes have been studied extensively; however, app...

Fig.1 | Comparison of UDD-AL and MD-AL approaches for a glycine test...

Fig. 3 | Glycine interatomic distance distributions in MD-AL and UDD-AL...

Fig. 4 | Ensemble uncertainty and UDD in acetylacetone. a Acetylacetone...

RMSEs of the four models on the test sets

Uncertainty Driven Dynamics for Active Learning of Interatomic Potentials

Preprint

Full-text available

Sep 2022

Machine learning (ML) models, if trained to datasets of high-fidelity quantum simulations, produce accurate and efficient interatomic potentials. Active learning (AL) is a powerful tool to iteratively generate diverse datasets. In this approach, the ML model provides an uncertainty estimate along with its prediction for each new atomic configuratio...

Figure 1. Scale-bridging framework, with coarse-and fine-scale models,...

Figure 2. Our machine learning based scale-bridging framework for...

Predictive Scale-Bridging Simulations through Active Learning

Preprint

Full-text available

Sep 2022

Throughout computational science, there is a growing need to utilize the continual improvements in raw computational horsepower to achieve greater physical fidelity through scale-bridging over brute-force increases in the number of mesh elements. For instance, quantitative predictions of transport in nanoporous media, critical to hydrocarbon extrac...

Fig. 1 Schematic for the component parts needed for a scalable and...

Fig. 2 Distributions of energy and bispectrum components of the...

Fig. 3 RMSE validation errors for as a function of the number of...

Fig. 4 Distribution of RSE errors for different combination of training...

Fig. 5 Energy conservation as a function of MD timestep size for the...

Training data selection for accuracy and transferability of interatomic potentials

Article

Full-text available

Sep 2022

Advances in machine learning (ML) have enabled the development of interatomic potentials that promise the accuracy of first principles methods and the low-cost, parallel efficiency of empirical potentials. However, ML-based potentials struggle to achieve transferability, i.e., provide consistent accuracy across configurations that differ from those...

Extending machine learning beyond interatomic potentials for predicting molecular properties

Article

Aug 2022

Machine learning (ML) is becoming a method of choice for modelling complex chemical processes and materials. ML provides a surrogate model trained on a reference dataset that can be used to establish a relationship between a molecular structure and its chemical properties. This Review highlights developments in the use of ML to evaluate chemical pr...

Deep learning of dynamically responsive chemical Hamiltonians with semiempirical quantum mechanics

Article

Full-text available

Jul 2022

Conventional machine-learning (ML) models in computational chemistry learn to directly predict molecular properties using quantum chemistry only for reference data. While these heuristic ML methods show quantum-level accuracy with speeds several orders of magnitude faster than traditional quantum chemistry methods, they suffer from poor extensibili...

Machine learning of consistent thermodynamic models using automatic differentiation

Article

Apr 2022

We propose a data-driven method to describe consistent equations of state (EOS) for arbitrary systems. Complex EOS are traditionally obtained by fitting suitable analytical expressions to thermophysical data. A key aspect of EOS is that the relationships between state variables are given by derivatives of the system free energy. In this work, we mo...

Scalable Solutions for Training Machine Learned Interatomic Potentials.

Conference Paper

Mar 2022

Scalable Solutions for Training Machine Learned Interatomic Potentials.

Conference Paper

Feb 2022

Figure 2. Distributions of energy and bispectrum components of the...

Figure 3. RMSE validation errors for as a function of the number of...

Figure 4. Distribution of RSE errors for different combination of...

Figure 6. Visual representation of the two neural network architecture...

Training Data Selection for Accuracy and Transferability of Interatomic Potentials

Preprint

Full-text available

Jan 2022

Advances in machine learning (ML) techniques have enabled the development of interatomic potentials that promise both the accuracy of first principles methods and the low-cost, linear scaling, and parallel efficiency of empirical potentials. Despite rapid progress in the last few years, ML-based potentials often struggle to achieve transferability,...

Fig. 1 Overview of our multiscale network approach. Starting from a 3D...

Fig. 5 Top: XY-plane cross-section of the velocity in Z-direction of...

Fig. 7 Velocity prediction per scale and sample loss L s for the...

Fig. 8 Cross-sectional view of simulation result, the predictions of...

Computationally Efficient Multiscale Neural Networks Applied to Fluid Flow in Complex 3D Porous Media

Article

Full-text available

Oct 2021

The permeability of complex porous materials is of interest to many engineering disciplines. This quantity can be obtained via direct flow simulation, which provides the most accurate results, but is very computationally expensive. In particular, the simulation convergence time scales poorly as the simulation domains become less porous or more hete...

Machine Learning of consistent thermodynamic models using automatic differentiation

Preprint

Full-text available

Aug 2021

We propose a method to describe consistent equations of state (EOS) for arbitrary systems. Complex EOS are traditionally obtained by fitting suitable analytical expressions to thermophysical data. A key aspect of EOS are that the relationships between state variables are given by derivatives of the system free energy. In this work, we model the fre...

Figure 1. Illustration of PADRE. For (b−d), quantities with hats · ̑...

Figure 3. Validation of PADRE σ̂ as an uncertainty metric. (a) 2D...

Figure 4. PADRE-RF MAE confidence curves for all tasks at n train =...

Figure 5. Confidence curves on the test data points in the redox...

Model Performance on Test Set with Random Train Sets a

Pairwise Difference Regression: A Machine Learning Meta-algorithm for Improved Prediction and Uncertainty Quantification in Chemical Search

Article

Full-text available

Aug 2021

Machine learning (ML) plays a growing role in the design and discovery of chemicals, aiming to reduce the need to perform expensive experiments and simulations. ML for such applications is promising but difficult, as models must generalize to vast chemical spaces from small training sets and must have reliable uncertainty quantification metrics to...

Figure 1. (A) Non-extensible fixed input size NN with the entire system...

Figure 2. (A) Correlation plot between DFT energies and HIPP-NN...

Figure 4. (A) Accuracy in predicting reaction and isomerization energy....

The Rise of Neural Networks for Materials and Chemical Dynamics

Article

Full-text available

Jul 2021

Machine learning (ML) is quickly becoming a premier tool for modeling chemical processes and materials. ML-based force fields, trained on large data sets of high-quality electron structure calculations, are particularly attractive due their unique combination of computational efficiency and physical accuracy. This Perspective summarizes some recent...

Figure 1: Sample set of the molecules used in this study. (Top panel)...

Figure 2: Parity plots of predicted versus true ΔE energy on the...

Figure 3: (a) Comparison of DFT spin density and HIP-loc localization...

Figure 5: Conformational scan over the dihedral angle around the single...

Predicting Phosphorescence Energies and Inferring Wavefunction Localization with Machine Learning

Preprint

Full-text available

Jun 2021

p>Phosphorescence is commonly utilized for applications including light-emitting diodes and photovoltaics. Machine learning (ML) approaches trained on ab-initio datasets of singlet-triplet energy gaps may expedite the discovery of phosphorescent compounds with the desired emission energies. However, we show that standard ML approaches for modeling...

Predicting Phosphorescence Energies and Inferring Wavefunction Localization with Machine Learning

Preprint

Full-text available

Jun 2021

p>Phosphorescence is commonly utilized for applications including light-emitting diodes and photovoltaics. Machine learning (ML) approaches trained on ab-initio datasets of singlet-triplet energy gaps may expedite the discovery of phosphorescent compounds with the desired emission energies. However, we show that standard ML approaches for modeling...

Sample set of the molecules used in this study. (top panel) Select...

Parity plots of predicted versus true ΔE energy on the held-out test...

(a) Comparison of DFT spin density and HIP-loc localization weights for...

Conformational scan over the dihedral angle around the single...

Predicting Phosphorescence Energies and Inferring Wavefunction Localization with Machine Learning

Article

Full-text available

Jun 2021

Phosphorescence is commonly utilized for applications including light-emitting diodes and photovoltaics. Machine learning (ML) approaches trained on ab initio datasets of singlet-triplet energy gaps may expedite the discovery of phosphorescent compounds with the desired emission energies. However, we show that standard ML approaches for modeling po...

Machine learned Hückel theory: Interfacing physics and deep neural networks

Article

Full-text available

Jun 2021

The Hückel Hamiltonian is an incredibly simple tight-binding model known for its ability to capture qualitative physics phenomena arising from electron interactions in molecules and materials. Part of its simplicity arises from using only two types of empirically fit physics-motivated parameters: the first describes the orbital energies on each ato...

A physics-informed and hierarchically regularized data-driven model for predicting fluid flow through porous media

Article

Jun 2021

This paper presents a new deep learning data-driven model for predicting structure dependent pore-fluid velocity fields in rock. The model is based on a Convolutional Auto-Encoder (CAE) artificial neural network capable of learning from image data generated by direct numerical simulations of fluid flow through pore-structures, such as by Lattice Bo...

Predicting Phosphorescence Energies and Inferring Wavefunction Localization with Machine Learning

Preprint

Apr 2021

p>Phosphorescence is commonly utilized for applications including light-emitting diodes and photovoltaics. Machine learning (ML) approaches trained on ab-initio datasets of singlet-triplet energy gaps may expedite the discovery of phosphorescent compounds with the desired emission energies. However, we show that standard ML approaches for modeling...

A multi-dimensional parametric study of variability in multi-phase flow dynamics during geologic CO 2 sequestration accelerated with machine learning

Article

Full-text available

Apr 2021

During carbon sequestration, CO 2 migration is affected by so many uncertainties. • Numerical simulations of multi-phase fluid dynamics are computational expensive. • The combined effects of capillary pressure and relative permeability are explored. • The application of Machine Learning provides a huge computational speed-up. • Capillary pressure i...

Building a Better Database to Learn From; Application to Interatomic Potentials.

Conference Paper

Mar 2021

Crystal energies relative to the ground state
Solid lines represent...

Transformational energy barriers
We compare ANI-Al and various...

Comparison of predicted vs. reference phonon spectrum
Phonon spectrum...

Molecular dynamics simulation in melt using the ANI-Al potential
a...

Predicting melt temperatures
a Melt curve as a function of pressure for...

Automated discovery of a robust interatomic potential for aluminum

Article

Full-text available

Feb 2021

Machine learning, trained on quantum mechanics (QM) calculations, is a powerful tool for modeling potential energy surfaces. A critical factor is the quality and diversity of the training dataset. Here we present a highly automated approach to dataset construction and demonstrate the method by building a potential for elemental aluminum (ANI-Al). I...

FIG. 5. Common drug molecules. Running predictions on common drug...

Bond order predictions using deep neural networks

Article

Full-text available

Feb 2021

Machine learning is an extremely powerful tool for the modern theoretical chemist since it provides a method for bypassing costly algorithms for solving the Schrödinger equation. Already, it has proven able to infer molecular and atomic properties such as charges, enthalpies, dipoles, excited state energies, and others. Most of these machine learni...

Figure 1: Overview of our multiscale network prediction. Starting from...

Figure 3: The MS-Net pipeline. Our model consists of a system of fully...

Figure 4: Original image and three subsequent scales of a fractured...

Figure 7: (top) Normalized mean velocity per scale. Coarser scales are...

Figure 8: Cross-sections of the Castlegate sandstone simulation result,...

Multi-Scale Neural Networks for Fluid Flow in 3D Porous Media

Preprint

Full-text available

Feb 2021

The permeability of complex porous materials can be obtained via direct flow simulation, which provides the most accurate results, but is very computationally expensive. In particular, the simulation convergence time scales poorly as simulation domains become tighter or more heterogeneous. Semi-analytical models that rely on averaged structural pro...

Computationally efficient multiscale neural networks applied to fluid flow in complex 3D porous media

Preprint

Full-text available

Jan 2021

The permeability of complex porous materials is of interest to many engineering disciplines. This quantity can be obtained via direct flow simulation, which provides the most accurate results, but is very computationally expensive. In particular, the simulation convergence time scales poorly as simulation domains become tighter or more heterogeneou...

A multi-dimensional study of parametric variability in geologic CO2 sequestration accelerated with machine learning

Presentation

Full-text available

Dec 2020

Successful geologic CO2 storage projects depend on numerical simulations to predict reservoir performance during site selection, injection verification, and post-injection monitoring phases of the project. These numerical simulations solve non-linear sets of coupled partial differential equations, while accounting for multi-phase fluid dynamics on...

Rapid Exploration of Optimization Strategies on Advanced Architectures using TestSNAP and LAMMPS

Preprint

Nov 2020

The exascale race is at an end with the announcement of the Aurora and Frontier machines. This next generation of supercomputers utilize diverse hardware architectures to achieve their compute performance, providing an added onus on the performance portability of applications. An expanding fragmentation of programming models would provide a compoun...

Modeling Nanoconfinement Effects Using Active Learning

Article

Oct 2020

Predicting the spatial configuration of gas in nanopores of isrelevant in applications such asfluidflow forecasting and hydrocarbonreserves estimation. For example, shale reservoirs have suffered fromcomputationally intractable multiscale problems, sincefluid properties suchas viscosity, density, and adsorption must be calculated by using expensive...

Evaluating diffusion and the thermodynamic factor for binary ionic mixtures

Article

Oct 2020

Molecular dynamics (MD) simulations are a powerful tool for the calculation of transport properties in mixtures. Not only are MD simulations capable of treating multicomponent systems, they are also applicable over a wide range of temperatures and densities. In plasma physics, this is particularly important for applications such as inertial confine...

Machine learning approaches for structural and thermodynamic properties of a Lennard-Jones fluid

Article

Sep 2020

Predicting the functional properties of many molecular systems relies on understanding how atomistic interactions give rise to macroscale observables. However, current attempts to develop predictive models for the structural and thermodynamic properties of condensed-phase systems often rely on extensive parameter fitting to empirically selected fun...

Multiscale simulation of plasma flows using active learning

Article

Full-text available

Aug 2020

Plasma flows encountered in high-energy-density experiments display features that differ from those of equilibrium systems. Nonequilibrium approaches such as kinetic theory (KT) capture many, if not all, of these phenomena. However, KT requires closure information, which can be computed from microscale simulations and communicated to KT. We present...

Our machine learning based scale-bridging framework. DNN emulators are...

Emulated and upscaled profiles for T=350K\documentclass[12pt]{minimal}...

Excess density for a variety of pore conditions. Solid lines show the...

Upscaled LBM adsorption coefficient as a function of temperature and...

Modeling and scale-bridging using machine learning: nanoconfinement effects in porous media

Article

Full-text available

Aug 2020

Fine-scale models that represent first-principles physics are challenging to represent at larger scales of interest in many application areas. In nanoporous media such as tight-shale formations, where the typical pore size is less than 50 nm, confinement effects play a significant role in how fluids behave. At these scales, fluids are under confine...

GPU-Accelerated Semi-Empirical Born Oppenheimer Molecular Dynamics using PyTorch

Article

Full-text available

Jul 2020

A new open-source high-performance implementation of Born Oppenheimer Molecular Dynamics based on semi-empirical quantum mechanics models using PyTorch called PYSEQM is presented. PYSEQM was designed to provide researchers in computational chemistry with an open-source, efficient, scalable, and stable quantum-based molecular dynamics engine. In par...

Simple and efficient algorithms for training machine learning potentials to force data

Preprint

Jun 2020

Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy. One limiting factor is the quantity of available training data, which can be expensive to obtain. A quantum simulation often provides all atomic forces, in addition to the total energy of the system. T...

Modeling nanoconfinement effects using active learning

Preprint

Full-text available

May 2020

Predicting the spatial configuration of gas molecules in nanopores of shale formations is crucial for fluid flow forecasting and hydrocarbon reserves estimation. The key challenge in these tight formations is that the majority of the pore sizes are less than 50 nm. At this scale, the fluid properties are affected by nanoconfinement effects due to t...

Ex Machina Determination of Structural Correlation Functions

Article

May 2020

Determining the structural properties of condensed phase systems is a fundamental problem in theoretical statistical mechanics. Here, we present a machine learning method that is able to predict structural correlation functions with significantly improved accuracy in comparison to traditional approaches. The usefulness of this ex machina (from the...

Fig. 1 Active learning schemes for building ANI data sets. (a) The...

Fig. 2 2D parametric t-SNE embeddings. These embeddings are for the 1st...

ANI data set energy and size distribution. (a) A histogram of the...

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

Article

Full-text available

May 2020

Maximum diversification of data is a central theme in building generalized and accurate machine learning (ML) models. In chemistry, ML has been used to develop models for predicting molecular properties, for example quantum mechanics (QM) calculated potential energy surfaces and atomic charge models. The ANI-1x and ANI-1ccx ML-based general-purpose...

Automated discovery of a robust interatomic potential for aluminum

Preprint

Mar 2020

Atomistic molecular dynamics simulation is an important tool for predicting materials properties. Accuracy depends crucially on the model for the interatomic potential. The gold standard would be quantum mechanics (QM) based force calculations, but such a first-principles approach becomes prohibitively expensive at large system sizes. Efficient mac...

Embedding Hard Physical Constraints in Neural Network Coarse-Graining of 3D Turbulence

Preprint

Full-text available

Jan 2020

In the recent years, deep learning approaches have shown much promise in modeling complex systems in the physical sciences. A major challenge in deep learning of PDEs is enforcing physical constraints and boundary conditions. In this work, we propose a general framework to directly embed the notion of an incompressible fluid into Convolutional Neur...

The ANI-1ccx and ANI-1x Data Sets, Coupled-Cluster and Density Functional Theory Properties for Molecules

Preprint

Full-text available

Oct 2019

p>Maximum diversification of data is a central theme in building generalized and accurate machine learning (ML) models. In chemistry, ML has been used to develop models for predicting molecular properties, for example quantum mechanics (QM) calculated potential energy surfaces and atomic charge models. The ANI-1x and ANI-1ccx ML-based eneral-purpos...

Machine Learned H\"uckel Theory: Interfacing Physics and Deep Neural Networks

Preprint

Full-text available

Sep 2019

The H\"uckel Hamiltonian is an incredibly simple tight-binding model famed for its ability to capture qualitative physics phenomena arising from electron interactions in molecules and materials. Part of its simplicity arises from using only two types of empirically fit physics-motivated parameters: the first describes the orbital energies on each a...

Fig. 2 Accuracy in predicting reaction and isomerization energy....

Fig. 3 Accuracy in predicting torsional energies relevant to drug...

Accuracy in predicting atomization energies. Error of the ANI-1ccx...

Diagram of the transfer learning technique evaluated in this work....

Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning

Article

Full-text available

Jul 2019

Computational modeling of chemical and biological systems at atomic resolution is a crucial tool in the chemist's toolset. The use of computer simulations requires a balance between cost and accuracy: quantum-mechanical methods provide high accuracy but are computationally expensive and scale poorly to large systems, while classical force fields ar...

Machine learning for molecular dynamics with strongly correlated electrons

Article

Apr 2019

We use machine learning to enable large-scale molecular dynamics (MD) of a correlated electron model under the Gutzwiller approximation scheme. This model exhibits a Mott transition as a function of on-site Coulomb repulsion U. The repeated solution of the Gutzwiller self-consistency equations would be prohibitively expensive for large-scale MD sim...

Earthquake Catalog-Based Machine Learning Identification of Laboratory Fault States and the Effects of Magnitude of Completeness

Article

Nov 2018

Machine learning regression can predict macroscopic fault properties such as shear stress, friction, and time to failure using continuous records of fault zone acoustic emissions. Here we show that a similar approach is successful using event catalogs derived from the continuous data. Our methods are applicable to catalogs of arbitrary scale and ma...

Machine learning for molecular dynamics with strongly correlated electrons

Preprint

Full-text available

Nov 2018

We use machine learning to enable large-scale molecular dynamics (MD) of a correlated electron model under the Gutzwiller approximation scheme. This model exhibits a Mott transition as a function of on-site Coulomb repulsion $U$. Repeated solution of the Gutzwiller self-consistency equations would be prohibitively expensive for large-scale MD simul...

Earthquake catalog-based machine learning identification of laboratory fault states and the effects of magnitude of completeness

Preprint

Oct 2018

Machine learning regression can predict macroscopic fault properties such as shear stress, friction, and time to failure using continuous records of fault zone acoustic emissions. Here we show that a similar approach is successful using event catalogs derived from the continuous data. Our methods are applicable to catalogs of arbitrary scale and ma...

Dimensionality-Reduction of Climate Data using Deep Autoencoders

Preprint

Aug 2018

We explore the use of deep neural networks for nonlinear dimensionality reduction in climate applications. We train convolutional autoencoders (CAEs) to encode two temperature field datasets from pre-industrial control runs in the CMIP5 first ensemble, obtained with the CCSM4 model and the IPSL-CM5A-LR model, respectively. With the later dataset, c...

Discovering a Transferable Charge Assignment Model Using Machine Learning

Article

Jul 2018

Partial atomic charge assignment is of immense practical value to force field parametrization, molecular docking, and cheminformatics. Machine learning has emerged as a powerful tool for modeling chemistry at unprecedented computational speeds given accurate reference data. However, certain tasks, such as charge assignment, do not have a unique sol...

Less is more: Sampling chemical space with active learning

Article

Full-text available

Jan 2018

The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this work, we present a fully automated approach for the generation of datasets with the in...

Discovering a Transferable Charge Assignment Model Using Machine Learning

Preprint

Full-text available

Jan 2018

p>Partial atomic charge assignment is of immense practical value to force field parametrization, molecular docking, and cheminformatics. Machine learning has emerged as a powerful tool for modeling chemistry at unprecedented computational speeds given ground-truth values, but for the task of charge assignment, the choice of ground-truth may not be...

Outsmarting Quantum Chemistry Through Transfer Learning

Preprint

Full-text available

Jan 2018

div>Computer simulations are foundational to theoretical chemistry. Quantum-mechanical (QM) methods provide the highest accuracy for simulating molecules but have difficulty scaling to large systems. Empirical interatomic potentials (classical force fields) are scalable, but lack transferability to new systems and are hard to systematically improve...

grl56367-sup-0001-supinfo

Data

Oct 2017

Hierarchical modeling of molecular energies using a deep neural network

Article

Full-text available

Sep 2017

We introduce the Hierarchically Interacting Particle Neural Network (HIP-NN) to model molecular properties from datasets of quantum calculations. Inspired by a many-body expansion, HIP-NN decomposes properties, such as energy, as a sum over hierarchical terms. These terms are generated from a neural network--a composition of many nonlinear transfor...

Machine Learning Predicts Laboratory Earthquakes

Article

Full-text available

Feb 2017

We apply machine learning to data sets from shear laboratory experiments, with the goal of identifying hidden signals that precede earthquakes. Here we show that by listening to the acoustic signal emitted by a laboratory fault, machine learning can predict the time remaining before it fails with great accuracy. These predictions are based solely o...

Inferring low-dimensional microstructure representations using convolutional neural networks

Article

Full-text available

Nov 2016

We apply recent advances in machine learning and computer vision to a central problem in materials informatics: The statistical representation of microstructural images. We use activations in a pre-trained convolutional neural network to provide a high-dimensional characterization of a set of synthetic microstructural images. Next, we use manifold...

The Effective Field Theory of Dark Matter Direct Detection

Article

Full-text available

Feb 2013

We extend and explore the general non-relativistic effective theory of dark matter (DM) direct detection. We describe the basic non-relativistic building blocks of operators and discuss their symmetry properties, writing down all Galilean-invariant operators up to quadratic order in momentum transfer arising from exchange of particles of spin 1 or...

Model Independent Direct Detection Analyses

Article

Nov 2012

Following the construction of the general effective theory for dark matter direct detection in 1203.3542, we perform an analysis of the experimental constraints on the full parameter space of elastically scattering dark matter. We review the prescription for calculating event rates in the general effective theory and discuss the sensitivity of vari...

Nicholas Lubbers's research while affiliated with Los Alamos National Laboratory and other places

What is this page?

Publications (93)

Citations