Figure (available from the Journal of Ambient Intelligence and Humanized Computing): 32-bit genotype representing two activation functions

Source publication
Article
Full-text available
In the family of recurrent neural networks, the long short-term memory (LSTM) network provides promising solutions for many complex applications such as speech and voice recognition, machine translation, and time series analysis. When building these networks, many tunable hyperparameters need to be set early. Among these hyperparameters, the activation func...
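As a rough illustration of the 32-bit genotype shown in the figure (the excerpt does not specify the actual bit layout), the sketch below assumes two 16-bit segments, each indexing an activation function from an assumed candidate pool; the pool contents and the modulo mapping are purely illustrative.

```python
# Hypothetical decoding of a 32-bit genotype into two activation-function
# choices. The 16-bit split and the candidate pool are assumptions made for
# illustration; the excerpt does not state the actual encoding.
CANDIDATES = ["sigmoid", "tanh", "relu", "elu"]  # assumed candidate pool

def decode_genotype(genotype):
    """Map a 32-bit integer to two activation-function names (16 bits each)."""
    hi = (genotype >> 16) & 0xFFFF   # gene for the first activation function
    lo = genotype & 0xFFFF           # gene for the second activation function
    return CANDIDATES[hi % len(CANDIDATES)], CANDIDATES[lo % len(CANDIDATES)]

print(decode_genotype(0x00020001))  # -> ('relu', 'tanh')
```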

Similar publications

Article
Full-text available
Low-resource languages (LRLs) with complex morphology are known to be more difficult to translate automatically. Some LRLs are particularly difficult to translate due to a lack of research interest or collaboration. In this article, we experiment with a specific LRL, Quechua, that is spoken by millions of people in South Ame...

Citations

... However, for tasks such as classification, the number of publications lags behind that for prediction, leaving an opportunity niche for researchers. Most surrogate-model research applied to time series classification focuses on hyperparameter optimization [30], deep learning [31], and neuroevolution [32]. Nevertheless, as far as the state of the art has been reviewed, surrogate models have scarcely been applied to time series discretization. ...
Preprint
Full-text available
The enhanced multi-objective symbolic discretization for time series (eMODiTS) method employs a flexible discretization scheme with different value cuts for each non-equal time interval, which incurs a considerable computational cost when evaluating each objective function. Therefore, surrogate models were implemented to mitigate this disadvantage. However, each solution found by eMODiTS is a vector of different size, so the surrogate model must be able to handle data sets with this characteristic. Consequently, this work's contribution lies in analyzing the implementation of surrogate models for time series discretization, where each candidate scheme is a real-valued vector of different size. For this reason, the proposed surrogate model is k-nearest neighbors regression with Dynamic Time Warping as the distance measure. Results suggest our proposal finds a suitable approximation to the final eMODiTS solutions, with a function-evaluation reduction rate between 15% and 95%. Moreover, according to Pareto front performance measures, the proposal's Pareto front is competitive with the eMODiTS Pareto front, reaching an average Generational Distance (GD) between 0.0447 and 0.0536 and an average Hypervolume Ratio (HVR) between 0.334 and 0.3891. Finally, compared against SAX-based methods, our proposal exhibits similar behavior in classification tasks and statistical tests.
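A minimal sketch of the surrogate idea described in this abstract, k-nearest-neighbours regression with Dynamic Time Warping (DTW) so that candidate solutions of different lengths can be compared, follows below; the archive format `(vector, objective values)`, the function names, and the toy data are illustrative assumptions, not taken from eMODiTS itself.

```python
# k-NN regression with DTW distance over variable-length candidate vectors.
import numpy as np

def dtw_distance(a, b):
    """Classic O(len(a)*len(b)) DTW between two 1-D sequences."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]

def knn_dtw_predict(query, archive, k=3):
    """Predict objective values for `query` as the mean of its k DTW-nearest
    previously evaluated solutions in `archive` = [(vector, objectives), ...]."""
    nearest = sorted(archive, key=lambda item: dtw_distance(query, item[0]))[:k]
    return np.mean([obj for _, obj in nearest], axis=0)

# Toy archive: variable-length schemes, each with two objective values.
archive = [([0.1, 0.4, 0.9], [0.30, 0.70]),
           ([0.2, 0.5, 0.6, 0.8], [0.25, 0.65]),
           ([0.05, 0.95], [0.40, 0.80])]
print(knn_dtw_predict([0.1, 0.5, 0.85], archive, k=2))
```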
... Many applications train stacks of LSTM RNNs by connectionist temporal classification to find the optimal RNN weight matrix (Greff et al., 2017). In addition, an LSTM can be trained by policy gradient methods for hyperparameter optimization, or by neuroevolution to find an optimal activation function (Vijayaprabakaran and Sathiyamurthy, 2021). ...
... Research on improving LSTM with metaheuristic algorithms is currently expanding; these algorithms are used for weight optimization, parameter optimization, and deep learning network thresholds. Examples of research conducted in this field include: LSTM with Lion Swarm Optimization (LSTM-LSO) for prediction problems [16,17]; improved LSTM based on Ant Colony Optimization (ACO-LSTM) [18]; the Fireworks Algorithm (FWA) with LSTM to solve optimization problems [19]; the Artificial Fish Swarm Optimization (AFSO) algorithm with LSTM for disease diagnosis [20]; the Grasshopper Optimization Algorithm (GOA) with an LSTM network for wind speed prediction [21]; GOA with an LSTM network for detection of defective gears [22]; PSO with LSTM for wind energy prediction [23]; PSO with RNN-LSTM for detection of objects in medical images [24]; Differential Evolution (DE) with LSTM for prediction [25]; face recognition based on deep learning and Cat Swarm Optimization (CSO) [26]; disease diagnosis (cancer and heart) based on CNN-PSO [27]; and use of Improved Crossover-Based Monarch Butterfly Optimization (ICRMBO) to improve CNN [28]. ...
Article
Full-text available
An essential task in natural language processing is Multi-Label Text Classification (MLTC), whose purpose is to assign multiple labels to each document. Traditional text classification methods, such as classical machine learning, usually suffer from data scattering and fail to discover relationships between data. With the development of deep learning algorithms, many authors have used deep learning for MLTC. In this paper, a novel model for MLTC called Spotted Hyena Optimizer-Long Short-Term Memory (SHO-LSTM), based on an LSTM network and the SHO algorithm, is proposed. In the LSTM network, the Skip-gram method is used to embed words into the vector space. The new model uses the SHO algorithm to optimize the initial weights of the LSTM network. Adjusting the weight matrix in LSTM is a major challenge: the more accurate the neuron weights, the higher the accuracy of the output. The SHO algorithm is a population-based meta-heuristic that mimics the mass hunting behavior of spotted hyenas. In this algorithm, each candidate solution is encoded as a hyena; the hyenas then approach the optimal answer by following the lead hyena. Four datasets (RCV1-v2, EUR-Lex, Reuters-21578, and Bookmarks) are used to evaluate the proposed model. The assessments demonstrate that the proposed model achieves a higher accuracy rate than LSTM, Genetic Algorithm-LSTM (GA-LSTM), Particle Swarm Optimization-LSTM (PSO-LSTM), Artificial Bee Colony-LSTM (ABC-LSTM), Harmony Search Algorithm-LSTM (HAS-LSTM), and Differential Evolution-LSTM (DE-LSTM). The accuracy improvement of the SHO-LSTM model over LSTM on the four datasets is 7.52%, 7.12%, 1.92%, and 4.90%, respectively.
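The optimization loop described here can be pictured with the simplified sketch below: a population-based "follow-the-leader" update (standing in for the full SHO equations, which this summary does not reproduce) tunes a flat vector of initial LSTM weights against a fitness function. `evaluate_lstm` is a hypothetical placeholder for training and scoring the network with those weights; the update rule and constants are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def evaluate_lstm(weights):
    # Placeholder fitness: substitute the real validation loss of an LSTM
    # initialised with `weights`. Here: a dummy quadratic bowl.
    return float(np.sum(weights ** 2))

def follow_the_leader_search(dim, pop_size=20, iters=50, step=0.5):
    """Simplified population-based search: candidates drift toward the best
    solution found so far ("the lead hyena") with random perturbations."""
    pop = rng.normal(size=(pop_size, dim))
    fitness = np.array([evaluate_lstm(h) for h in pop])
    best_idx = int(np.argmin(fitness))
    best_w, best_f = pop[best_idx].copy(), fitness[best_idx]
    for _ in range(iters):
        # Each candidate ("hyena") moves toward the current leader, plus noise.
        pop = pop + step * rng.random((pop_size, 1)) * (best_w - pop) \
                  + 0.1 * rng.normal(size=pop.shape)
        fitness = np.array([evaluate_lstm(h) for h in pop])
        idx = int(np.argmin(fitness))
        if fitness[idx] < best_f:          # keep the best solution seen so far
            best_w, best_f = pop[idx].copy(), fitness[idx]
    return best_w, best_f

best_weights, best_fitness = follow_the_leader_search(dim=8)
print(best_fitness)
```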
... Through their experimental analysis, the authors demonstrate that their new activation functions outperform several standard functions on multivariate classification problems. Differential evolution is applied in [51] to evolve new activation functions for long short-term memory networks. The proposed method builds a hierarchical activation function with a predefined structure by searching for the most appropriate function elements to appear in it, which together represent a complete activation function. ...
Article
Full-text available
The choice of activation functions can significantly impact the performance of neural networks. Due to an ever-increasing number of new activation functions being proposed in the literature, selecting the appropriate activation function becomes even more difficult. Consequently, many researchers approach this problem from a different angle, in which instead of selecting an existing activation function, an appropriate activation function is evolved for the problem at hand. In this paper, we demonstrate that evolutionary algorithms can evolve new activation functions for side-channel analysis (SCA), outperforming ReLU and other activation functions commonly applied to that problem. More specifically, we use Genetic Programming to define and explore candidate activation functions (neuroevolution) in the form of mathematical expressions that are gradually improved. Experiments with the ASCAD database show that this approach is highly effective compared to results obtained with standard activation functions and that it can match the state-of-the-art results from the literature. More precisely, the obtained results for the ASCAD fixed key dataset demonstrate that the evolved activation functions can improve the current state-of-the-art by achieving a guessing entropy of 287 for the Hamming weight model and 115 for the Identity leakage model, compared to 447 and 120 obtained in the literature.
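A hedged sketch of the kind of representation Genetic Programming uses here: a candidate activation function is an expression tree over a small primitive set, evaluated element-wise on the pre-activation input. The primitive set, growth probabilities, and depth below are illustrative assumptions, not the paper's configuration.

```python
import random
import numpy as np

# Assumed primitive set for candidate activation functions.
UNARY = {"tanh": np.tanh, "relu": lambda x: np.maximum(x, 0.0), "neg": lambda x: -x}
BINARY = {"add": np.add, "mul": np.multiply}

def random_tree(depth=2):
    """Grow a random expression tree; leaves are the input symbol 'x'."""
    if depth == 0 or random.random() < 0.3:
        return "x"
    if random.random() < 0.5:
        return (random.choice(list(UNARY)), random_tree(depth - 1))
    return (random.choice(list(BINARY)), random_tree(depth - 1), random_tree(depth - 1))

def evaluate(tree, x):
    """Apply the candidate activation function encoded by `tree` to array x."""
    if tree == "x":
        return x
    if len(tree) == 2:
        return UNARY[tree[0]](evaluate(tree[1], x))
    return BINARY[tree[0]](evaluate(tree[1], x), evaluate(tree[2], x))

candidate = random_tree()
print(candidate, evaluate(candidate, np.linspace(-2, 2, 5)))
```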
Preprint
Power system management and operation rely heavily on short-term power load forecasting. Accurate forecasting results can help reduce power waste and economic losses. Existing power forecasting methods only forecast the future load from historical data; they do not sufficiently consider which factors have the greatest influence on the power load, and there are no effective methods for simultaneously mining the temporal and correlation characteristics of multidimensional time series. Therefore, we propose a new hybrid approach that combines LSTM with an attention mechanism and a genetic algorithm (GA). The GA optimizes the number of LSTM layers, the number of dense layers, the number of hidden-layer neurons, and the number of dense-layer neurons, so as to determine the optimal parameters. The proposed method is validated on a load data set containing five characteristics (dry bulb temperature, dew point temperature, wet bulb temperature, humidity, and electricity price) and is compared with RNN, LSTM, GRU, LSTM-Attention, and GRU-Attention. According to the experimental results, the proposed method noticeably reduces the prediction error and improves the goodness of fit of the model.
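The GA search over LSTM architecture hyperparameters implied above can be sketched as follows; the chromosome layout, value ranges, and operators are assumptions for illustration rather than the authors' exact setup.

```python
import random

# Assumed discrete search space for the architecture hyperparameters.
SEARCH_SPACE = {
    "lstm_layers":  [1, 2, 3],
    "dense_layers": [1, 2],
    "lstm_units":   [32, 64, 128, 256],
    "dense_units":  [16, 32, 64],
}
KEYS = list(SEARCH_SPACE)

def random_chromosome():
    return [random.choice(SEARCH_SPACE[k]) for k in KEYS]

def crossover(a, b):
    """One-point crossover between two parent chromosomes."""
    point = random.randrange(1, len(KEYS))
    return a[:point] + b[point:]

def mutate(chrom, rate=0.25):
    """Resample each gene with probability `rate`."""
    return [random.choice(SEARCH_SPACE[k]) if random.random() < rate else g
            for k, g in zip(KEYS, chrom)]

def decode(chrom):
    """Turn a chromosome into a model-configuration dict for the trainer."""
    return dict(zip(KEYS, chrom))

parent_a, parent_b = random_chromosome(), random_chromosome()
child = mutate(crossover(parent_a, parent_b))
print(decode(child))  # e.g. {'lstm_layers': 2, 'dense_layers': 1, ...}
```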
Thesis
Full-text available
Produced water on offshore platforms is one of the effluents recovered from wells together with oil and natural gas, and it is the main residue generated in that process. The Oil and Grease Content (TOG, from the Portuguese "Teor de Óleos e Graxas") is considered one of the main control parameters for the discharge of produced water into the sea, with daily and monthly limits defined by current legislation. The TOG measurement used as a reference by IBAMA is obtained by the gravimetric method, with water samples collected daily and sent to an accredited laboratory, which returns results a few days after the sampling date. The need for corrective actions when values exceed the limit has motivated the use of alternative methods that produce estimates more frequently. In this work, data-driven models are built to estimate TOG. Process variables from produced-water treatment, information on chemical products, and daily production data from an offshore platform were collected, preprocessed, and used to train, validate, and test these models. In addition, hyperparameter optimization and feature selection techniques were applied. The results show that the models based on recurrent neural networks (LSTM and CNN+LSTM) achieved superior performance compared with the existing online monitoring systems.
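The following is a minimal sketch (assuming a Keras/TensorFlow setup, which the summary does not specify) of a CNN+LSTM regressor over sliding windows of process variables, predicting a single TOG value per window; the window length, layer sizes, and dummy data are illustrative assumptions.

```python
import numpy as np
import tensorflow as tf

WINDOW, N_FEATURES = 24, 12  # assumed: 24 time steps, 12 process variables

model = tf.keras.Sequential([
    tf.keras.Input(shape=(WINDOW, N_FEATURES)),
    tf.keras.layers.Conv1D(32, kernel_size=3, activation="relu"),  # local patterns
    tf.keras.layers.MaxPooling1D(pool_size=2),
    tf.keras.layers.LSTM(64),                                      # temporal dynamics
    tf.keras.layers.Dense(1),                                      # TOG estimate
])
model.compile(optimizer="adam", loss="mse")

# Dummy data with the assumed shapes; replace with real windows and TOG labels.
X = np.random.rand(128, WINDOW, N_FEATURES).astype("float32")
y = np.random.rand(128, 1).astype("float32")
model.fit(X, y, epochs=2, batch_size=32, verbose=0)
print(model.predict(X[:1], verbose=0))
```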