Flow chart of the ensemble long short‐term memory (EnLSTM).

When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain

Preprint

Jun 2024

Machine learning models offer the capability to forecast future energy production or consumption and infer essential unknown variables from existing data. However, legal and policy constraints within specific energy sectors render the data sensitive, presenting technical hurdles in utilizing data from diverse sources. Therefore, we propose adopting a Swarm Learning (SL) scheme, which replaces the centralized server with a blockchain-based distributed network to address the security and privacy issues inherent in Federated Learning (FL)'s centralized architecture. Within this distributed Collaborative Learning framework, each participating organization governs nodes for inter-organizational communication. Devices from various organizations utilize smart contracts for parameter uploading and retrieval. Consensus mechanism ensures distributed consistency throughout the learning process, guarantees the transparent trustworthiness and immutability of parameters on-chain. The efficacy of the proposed framework is substantiated across three real-world energy series modeling scenarios with superior performance compared to Local Learning approaches, simultaneously emphasizing enhanced data security and privacy over Centralized Learning and FL method. Notably, as the number of data volume and the count of local epochs increases within a threshold, there is an improvement in model performance accompanied by a reduction in the variance of performance errors. Consequently, this leads to an increased stability and reliability in the outcomes produced by the model.

A Noise-robust Multi-head Attention Mechanism for Formation Resistivity Prediction: Frequency Aware LSTM

Preprint

Jun 2024

The prediction of formation resistivity plays a crucial role in the evaluation of oil and gas reservoirs, identification and assessment of geothermal energy resources, groundwater detection and monitoring, and carbon capture and storage. However, traditional well logging techniques fail to measure accurate resistivity in cased boreholes, and the transient electromagnetic method for cased borehole resistivity logging encounters challenges of high-frequency disaster (the problem of inadequate learning by neural networks in high-frequency features) and noise interference, badly affecting accuracy. To address these challenges, frequency-aware framework and temporal anti-noise block are proposed to build frequency aware LSTM (FAL). The frequency-aware framework implements a dual-stream structure through wavelet transformation, allowing the neural network to simultaneously handle high-frequency and low-frequency flows of time-series data, thus avoiding high-frequency disaster. The temporal anti-noise block integrates multiple attention mechanisms and soft-threshold attention mechanisms, enabling the model to better distinguish noise from redundant features. Ablation experiments demonstrate that the frequency-aware framework and temporal anti-noise block contribute significantly to performance improvement. FAL achieves a 24.3% improvement in R2 over LSTM, reaching the highest value of 0.91 among all models. In robustness experiments, the impact of noise on FAL is approximately 1/8 of the baseline, confirming the noise resistance of FAL. The proposed FAL effectively reduces noise interference in predicting formation resistivity from cased transient electromagnetic well logging curves, better learns high-frequency features, and thereby enhances the prediction accuracy and noise resistance of the neural network model.

Reservoir parameters prediction based on spatially transferred long short-term memory network

Article

Full-text available

Jan 2024
PLOS ONE

Reservoir reconstruction, where parameter prediction plays a key role, constitutes an extremely important part in oil and gas reservoir exploration. With the mature development of artificial intelligence, parameter prediction methods are gradually shifting from previous petrophysical models to deep learning models, which bring about obvious improvements in terms of accuracy and efficiency. However, it is difficult to achieve large amount of data acquisition required for deep learning due to the cost of detection, technical difficulties, and the limitations of complex geological parameters. To address the data shortage problem, a transfer learning prediction model based on long short-term memory neural networks has been proposed, and the model structure has been determined by parameter search and optimization methods in this paper. The proposed approach transfers knowledge from historical data to enhance new well prediction by sharing some parameters in the neural network structure. Moreover, the practicality and effectiveness of this method was tested by comparison based on two block datasets. The results showed that this method could significantly improve the prediction accuracy of the reservoir parameters in the event of data shortage.

The Data Supplement Method of Azimuthal EM LWD Based on Deep Learning

Article

Full-text available

Jan 2024

The data of azimuthal electromagnetic (EM) Logging-While-Drilling (LWD) tool is crucial for controlling and optimizing the trajectory of the wellbore, making it a key technology in geosteering. However, the measurement of the tool involves multiple frequencies, spaces, and sectors, leading to a significant volume of measured data that can’t be uploaded in real-time. Attempting to invert formation resistivity and boundaries based solely on the limited data that transmitted to the surface may not accurately reflect the true formation model. Therefore, this paper proposes a method for supplementing the measurement curves of the tool based on deep learning. The intelligent method can predict the missing logging information according to limited data and improve the utilization efficiency of logging data. Firstly, the database of azimuthal EM LWD is generated using various synthetic formation models and numerical forward modeling techniques, and the complete logging data is artificially separated into known logging data and missing logging data. Then, three deep learning models are established based on LSTM, GRU, and UNET networks respectively, and use the above sample database for training and testing them. The results demonstrate that missing curves of the tool’s measurement can be accurately and efficiently predicted using deep learning techniques. Finally, the original logging data and the complete logging data after supplementing are used for inverting the formation information. The result shows that the latter yields higher inversion accuracy. Moreover, the difference in inversion accuracy will grow as the complexity of the formation model increases after data supplementing. Therefore, the data supplement of azimuthal EM LWD by deep learning is very important for the accurate inversion of complex formation models.

LogRegX: An Explainable Regression Network for Cross-Well Geophysical Logs Generation

Article

Jan 2023
IEEE T INSTRUM MEAS

Geophysical logging instruments continuously measure multiple geophysical properties of borehole rocks, thus providing a feasible way to fine borehole geology modelling. Since the missing problem of well logs is inevitable, it is essential to generate the missing logs by the available ones. Recently, a large body of interdisciplinary studies has demonstrated the effectiveness of applying machine learning to solve the missing logs generation problem, under which the training and testing datasets obey the independent and identical distribution (iid) assumption. This assumption, however, is not satisfied in the case of the cross-well missing logs generation task. A standard method to solve the non-iid issue is to map source and target data to a common feature space and then employ Mean Maximum Discrepancy (MMD) to measure domain differences. However, this method suffers from high computational complexity and poor feature explainability when dealing with logs generation tasks. To solve the above problems, we propose an explainable regression network for cross-well geophysical logs generation named LogRegX. LogRegX integrates single-well feature extraction, cross-well feature alignment, and missing logs prediction while maintaining the explainability of logging features. Specifically, LogRegX leverages the gating mechanism to fuse multi-scale logging features to capture the response characteristics of well logs. The learned source and target feature representations are subject to domain discrepancy constraints, measured by Random Fourier Feature transform induced MMD. Additionally, target-domain information retaining mechanism is introduced to maintain the structure of target data so that the transferred features are explainable. Experiments on real-world field data demonstrate the superiority and the explainability of LogRegX over the existing methods.

Method of Geomechanical Parameter Determination and Volumetric Fracturing Factor Simulation under Highly Stochastic Geologic Conditions

Article

Full-text available

Dec 2022

In order to accurately predict geomechanical parameters of oil-bearing reservoirs and influencing factors of volumetric fracturing, a new method of geomechanical parameter prediction combining seismic inversion, well logging interpretation and production data is proposed in this paper. Herein, we present a structure model, petrophysical model and geomechanical model. Moreover, a three-dimensional geomechanical model of a typical reservoir was established and corrected using history matching. On this basis, a typical well model was established, 11 influencing factors of volume fracturing including formation parameters and fracturing parameters were analyzed and their impact were ranked, and the oil recovery rate and the accumulated oil production before and after optimal fracturing were compared. The results show that with respect to formation parameters, reservoir thickness is the main influencing factor; interlayer thickness and stress difference are the secondary influencing factors; and formation permeability, Young’s modulus and Poisson’s ratio are the weak influencing factors. For a pilot well of a typical reservoir, the optimized fracture increased production by 7 tons/day relative to traditional fracturing. After one year of production, the method increased production by 4 tons/day relative to traditional fracturing, showing great potential in similar oil reservoirs.

TgDLF2.0: Theory-guided deep-learning for electrical load forecasting via Transformer and transfer learning

Preprint

Full-text available

Oct 2022

Electrical energy is essential in today's society. Accurate electrical load forecasting is beneficial for better scheduling of electricity generation and saving electrical energy. In this paper, we propose theory-guided deep-learning load forecasting 2.0 (TgDLF2.0) to solve this issue, which is an improved version of the theory-guided deep-learning framework for load forecasting via ensemble long short-term memory (TgDLF). TgDLF2.0 introduces the deep-learning model Transformer and transfer learning on the basis of dividing the electrical load into dimensionless trends and local fluctuations, which realizes the utilization of domain knowledge, captures the long-term dependency of the load series, and is more appropriate for realistic scenarios with scarce samples. Cross-validation experiments on different districts show that TgDLF2.0 is approximately 16% more accurate than TgDLF and saves more than half of the training time. TgDLF2.0 with 50% weather noise has the same accuracy as TgDLF without noise, which proves its robustness. We also preliminarily mine the interpretability of Transformer in TgDLF2.0, which may provide future potential for better theory guidance. Furthermore, experiments demonstrate that transfer learning can accelerate convergence of the model in half the number of training epochs and achieve better performance.

Metal Corrosion Rate Prediction of Small Samples Using an Ensemble Technique

Article

Full-text available

Aug 2022
Comput Model Eng Sci

Accurate prediction of the internal corrosion rates of oil and gas pipelines could be an effective way to prevent pipeline leaks. In this study, a proposed framework for predicting corrosion rates under a small sample of metal corrosion data in the laboratory was developed to provide a new perspective on how to solve the problem of pipeline corrosion under the condition of insufficient real samples. This approach employed the bagging algorithm to construct a strong learner by integrating several KNN learners. A total of 99 data were collected and split into training and test set with a 9:1 ratio. The training set was used to obtain the best hyperparameters by 10-fold cross-validation and grid search, and the test set was used to determine the performance of the model. The results showed that the Mean Absolute Error (MAE) of this framework is 28.06% of the traditional model and outperforms other ensemble methods. Therefore, the proposed framework is suitable for metal corrosion prediction under small sample conditions.

An expert system for insect pest population dynamics prediction

Article

Jul 2022
COMPUT ELECTRON AGR

Avocado (Persea americana) production is increasing in Kenya, with both small and largeholder farming for domestic and export markets. However, one of main challenges that limit production is infestation by insect pests, notably the oriential fruit fly Bactocera dorsalis and Ceratitis spp. fruit flies, which cause direct crop losses and are indirectly responsible for non-tariff trade barriers due to stringent export requirements. Data on weekly pest trap counts were collected between September 2017 and December 2020 within orchards in avocado plantations. Fuzzy neural network (FNN) were used to model the population dynamics of B. dorsalis and Ceratitis spp. Weekly pest counts, rainfall, average temperature, relative humidity and avocado plant physiological stages were used for predictive modeling in different orchards. The performance of the resulting models was evaluated using coefficient of determination (R²), mean absolute error (MAE), mean relative approximation error (MRAE) and root mean squared error (RMSE). FNN models achieved satisfactory results in predicting the dynamics of the pests in the orchards, with most of the models obtaining R² > 0.85. We demonstrated how FNN models can be used as predictive tools for managing and controlling fruit fly pest populations in these plantations, and how they may be suitable to predict fruit fly or other pests in similar cropping systems. Once the input variables are known, they can be loaded into the FNN models to predict field pest populations, and based on threshold values, allow for implementation of timely and adequate control measures such as the use of biopesticides.

EEG Daydreaming, A Machine Learning Approach to Detect Daydreaming Activities

Chapter

Full-text available

Jun 2022

In this paper, we propose a new method to detect noise hindrances in Electroencephalographic (EEG) signals caused by mental distractions, which we named “daydreaming signals.” Our approach is based on sliding windows and aims to detect and locate these daydreaming signals to specific points in time. We expect to get cleaner data and, therefore, higher prediction accuracy in current available EEG datasets by removing these daydreaming signals. Beyond these improvements to existing data, this approach also has the potential to improve the quality of future data collection, as researchers can discover the pattern of daydreaming signals in trial rounds and deal with these signals accordingly.KeywordsDesign methods and techniquesMachine learningSupervised learningElectroencephalography (EEG)Sliding windowsEEG signal classification

Flow chart of the ensemble long short‐term memory (EnLSTM).

Citations