Neural Networks for Pattern Recognition

Early Fast Cost Estimates of Sewerage Projects Construction Costs Based on Ensembles of Neural Networks

Article

Full-text available

Nov 2023

This paper presents research results on the development of an original cost prediction model for construction costs in sewerage projects. The focus is placed on fast cost estimates applicable in the early stages of a project, based on fundamental information available during the initial design phase of sanitary sewers prior to the detailed design. The originality and novelty of this research lie in the application of artificial neural network ensembles, which include a combination of several individual neural networks and the use of simple averaging and generalized averaging approaches. The research resulted in the development of two ensemble-based models, including five neural networks that were trained and tested using data collected from 125 sewerage projects completed in the Czech Republic between 2018 and 2022. The data included information relevant to various aspects of projects and contract costs, updated to account for changes in costs over time. The developed models present satisfactory predictive performance, especially the ensemble model based on simple averaging, which offers prediction accuracy within the range of ±30% (in terms of percentage errors) for over 90% of the training and testing samples. The developed models, based on the ensembles of neural networks, outperformed the benchmark model based on the classical approach and the use of multiple linear regression.

Machine learning in physics: A short guide

Article

Full-text available

Nov 2023
EPL-EUROPHYS LETT

Francisco A Rodrigues

Machine learning is a rapidly growing field with the potential to revolutionize many areas of science, including physics. This review provides a brief overview of machine learning in physics, covering the main concepts of supervised, unsupervised, and reinforcement learning, as well as more specialized topics such as causal inference, symbolic regression, and deep learning. We present some of the principal applications of machine learning in physics and discuss the associated challenges and perspectives.

Binary Function Clone Search in the Presence of Code Obfuscation and Optimization over Multi-CPU Architectures

Conference Paper

Full-text available

Jul 2023

Binary function clone search is an essential capability that enables multiple applications and use cases, including reverse engineering, patch security inspection, threat analysis, vulnerable function detection, etc. As such, a surge of interest has been expressed in designing and implementing techniques to address function similarity on binary executables and firmware images. Although existing approaches have merit in fingerprinting function clones, they present limitations when the target binary code has been subjected to significant code transformation resulting from obfuscation, compiler optimization, and/or cross-compilation to multiple-CPU architectures. In this regard, we design and implement a system named BinFinder, which employs a neural network to learn binary function embeddings based on a set of extracted features that are resilient to both code obfuscation and compiler optimization techniques. Our experimental evaluation indicates that BinFinder outperforms state-of-the-art approaches for multi-CPU architectures by a large margin, with 46% higher Recall against Gemini, 55% higher Recall against SAFE, and 28% higher Recall against GMN. With respect to obfuscation and compiler optimization clone search approaches, BinFinder outperforms the asm2vec (single CPU architecture approach) with 30% higher Recall and BinMatch (multi-CPU architecture approach) with 10% higher Recall. Finally, our work is the first to provide noteworthy results with respect to binary clone search over the tigress obfuscator, which is a well-established open-source obfuscator.

The Paradox of Noise: An Empirical Study of Noise-Infusion Mechanisms to Improve Generalization, Stability, and Privacy in Federated Learning

Preprint

Full-text available

Oct 2023

In a data-centric era, concerns regarding privacy and ethical data handling grow as machine learning relies more on personal information. This empirical study investigates the privacy, generalization, and stability of deep learning models in the presence of additive noise in federated learning frameworks. Our main objective is to provide strategies to measure the generalization, stability, and privacy-preserving capabilities of these models and further improve them.To this end, five noise infusion mechanisms at varying noise levels within centralized and federated learning settings are explored. As model complexity is a key component of the generalization and stability of deep learning models during training and evaluation, a comparative analysis of three Convolutional Neural Network (CNN) architectures is provided.The paper introduces Signal-to-Noise Ratio (SNR) as a quantitative measure of the trade-off between privacy and training accuracy of noise-infused models, aiming to find the noise level that yields optimal privacy and accuracy. Moreover, the Price of Stability and Price of Anarchy are defined in the context of privacy-preserving deep learning, contributing to the systematic investigation of the noise infusion strategies to enhance privacy without compromising performance. Our research sheds light on the delicate balance between these critical factors, fostering a deeper understanding of the implications of noise-based regularization in machine learning. By leveraging noise as a tool for regularization and privacy enhancement, we aim to contribute to the development of robust, privacy-aware algorithms, ensuring that AI-driven solutions prioritize both utility and privacy.

A correlation-based feature weighting filter for multi-label Naive Bayes

Article

Full-text available

Oct 2023

Multi-label classification is used to solve the problem where multiple labels are associated with single sample. Naive Bayes (NB) classifier is widely used for single label classification due to its high performance and simplicity. Therefore it is vital to extend NB for multi-label classification. In single label classification feature weighted NB gives high accuracy by solving the conditional independence assumption of NB. However, NB is not much explored for multi-label classification. This paper proposes correlation dependent feature weighted NB (MLCFWNB) for multi-label classification. The proposed MLCFWNB is tested over eight benchmark datasets. The experimental result suggest that MLCFWNB wins 60% times in case of different multi-label learning evaluation parameters.

Self-Building Neural Networks

Conference Paper

Full-text available

Jul 2023

Graph Structure Learning with Interpretable Bayesian Neural Networks

Preprint

Full-text available

Jun 2024

Graphs serve as generic tools to encode the underlying relational structure of data. Often this graph is not given, and so the task of inferring it from nodal observations becomes important. Traditional approaches formulate a convex inverse problem with a smoothness promoting objective and rely on iterative methods to obtain a solution. In supervised settings where graph labels are available, one can unroll and truncate these iterations into a deep network that is trained end-to-end. Such a network is parameter efficient and inherits inductive bias from the optimization formulation, an appealing aspect for data constrained settings in, e.g., medicine, finance, and the natural sciences. But typically such settings care equally about uncertainty over edge predictions, not just point estimates. Here we introduce novel iterations with independently interpretable parameters, i.e., parameters whose values - independent of other parameters' settings - proportionally influence characteristics of the estimated graph, such as edge sparsity. After unrolling these iterations, prior knowledge over such graph characteristics shape prior distributions over these independently interpretable network parameters to yield a Bayesian neural network (BNN) capable of graph structure learning (GSL) from smooth signal observations. Fast execution and parameter efficiency allow for high-fidelity posterior approximation via Markov Chain Monte Carlo (MCMC) and thus uncertainty quantification on edge predictions. Synthetic and real data experiments corroborate this model's ability to provide well-calibrated estimates of uncertainty, in test cases that include unveiling economic sector modular structure from S$\&$P$500$ data and recovering pairwise digit similarities from MNIST images. Overall, this framework enables GSL in modest-scale applications where uncertainty on the data structure is paramount.

Modélisation Statistique des Cartes des Dissimilarités Locales d'Images et Applications Sous la direction de M. Frédéric MORAIN-NICOLIER et M. Florent RETRAINT Soutenance le 14/12/2022 devant le jury composé de

Thesis

Full-text available

Dec 2022

Moustapha Diaw

En raison de l’augmentation considérable des images dans la vie quotidienne, de nombreuses applications nécessitent une étude sur leur similarité. La Carte des Dissimilarités Locales (CDL) est une mesure, construite autour de la distance de Hausdorff, qui est très efficace pour localiser et quantifier les différences de structures entre les images. Cette mesure a été proposée par Baudrier et al. [1]. Avant cela, aucune solution spécifiquement locale n’a été proposée par la communauté scientifique. À partir d’une CDL, il est cependant difficile d’interpréter et de prendre une décision sur la similarité entre deux images. De plus, la mesure est mise en échec sur des images contenant à la fois des structures et des textures et le comportement statistique des valeurs de la CDL n’a jamais été étudié. Tout cela limitait ses domaines d’application. Cette thèse propose d’abord une distribution statistique pour modéliser les valeurs des niveaux de gris des CDL des images structurelles. Les deux paramètres de la distribution sont pertinents pour discriminer les paires d’images en classes similaires et dissimilaires. Des modèles d’apprentissage automatique et des tests statistiques sont utilisés pour classer les paires d’images. Mais, avant d’aborder les tests, une extension de l’approche au problème de classification d’images multi-classes est proposée. Ensuite, les mesures d’informations telles que l’Information Mutuelle (IM) et l’Information Disjointe (ID) sont utilisées pour adapter la CDL sur des images avec un mélange de structures et de textures. Nous proposons, enfin, d’appliquer la mesure au problème de détection de changements sur des séries d’images. Nous savons aussi que, de nos jours, de nombreuses images numériques sont falsifiées pour de la propagande ou pour cacher des informations importantes. La détection de ces falsifications intéresse donc de nombreux acteurs majeurs de la sécurité. Dans cette thèse, nous nous intéressons uniquement à la détection de falsifications par copier-coller. Toutes nos approches sont basées uniquement sur la CDL et essentiellement sur les deux paramètres de la distribution proposée. Elles sont pertinentes et certaines méthodes sont même comparées avec des approches d’apprentissage profond de l’état de l’art.

Artificial neural network, machine learning modelling of compressive strength of recycled coarse aggregate based self-compacting concrete

Article

Full-text available

May 2024
PLOS ONE

This research study aims to understand the application of Artificial Neural Networks (ANNs) to forecast the Self-Compacting Recycled Coarse Aggregate Concrete (SCRCAC) compressive strength. From different literature, 602 available data sets from SCRCAC mix designs are collected, and the data are rearranged, reconstructed, trained and tested for the ANN model development. The models were established using seven input variables: the mass of cementitious content, water, natural coarse aggregate content, natural fine aggregate content, recycled coarse aggregate content, chemical admixture and mineral admixture used in the SCRCAC mix designs. Two normalization techniques are used for data normalization to visualize the data distribution. For each normalization technique, three transfer functions are used for modelling. In total, six different types of models were run in MATLAB and used to estimate the 28th day SCRCAC compressive strength. Normalization technique 2 performs better than 1 and TANSING is the best transfer function. The best k-fold cross-validation fold is k = 7. The coefficient of determination for predicted and actual compressive strength is 0.78 for training and 0.86 for testing. The impact of the number of neurons and layers on the model was performed. Inputs from standards are used to forecast the 28th day compressive strength. Apart from ANN, Machine Learning (ML) techniques like random forest, extra trees, extreme boosting and light gradient boosting techniques are adopted to predict the 28th day compressive strength of SCRCAC. Compared to ML, ANN prediction shows better results in terms of sensitive analysis. The study also extended to determine 28th day compressive strength from experimental work and compared it with 28th day compressive strength from ANN best model. Standard and ANN mix designs have similar fresh and hardened properties. The average compressive strength from ANN model and experimental results are 39.067 and 38.36 MPa, respectively with correlation coefficient is 1. It appears that ANN can validly predict the compressive strength of concrete.

Efficient Learning on Large-Scale 3D Point Clouds

Thesis

Full-text available

Jan 2024

Damien Robert

Over the past decade, deep learning has advanced the analysis of text, image, audio, and video. More recently, transformers and self-supervised learning have triggered a global competition to train gigantic models on Internet-scale datasets, with massive computational resources. This thesis deals with large-scale 3D point cloud analysis and adopts a different approach focused on efficiency. We introduce methods which improve several aspects of the state-of-the-art: faster training, fewer parameters, smaller compute or memory footprint, and better utilization of realistically-available data. In doing so, we strive to devise solutions towards a more frugal and accessible Artificial Intelligence (AI). We first introduce a 3D semantic segmentation model that combines the efficiency of superpoint-based methods with the expressivity of transformers. We build a hierarchical data representation which allows us to drastically accelerate the parsing of large 3D point clouds. Our network proves to match or even surpass state-of-the-art approaches on a range of sensors and acquisition environments, while boasting orders of magnitude fewer parameters, with faster training and inference. We then build on this framework to tackle panoptic segmentation of large-scale 3D point clouds. Existing instance and panoptic segmentation methods do not scale well to large scene with numerous objects because the computation of their loss function implies a costly matching step between true and predicted instances. Instead, we frame this task as a scalable graph clustering problem, which a small network is trained to address from local objectives only, without computing the actual object instances at train time. Our lightweight model can process ten-million-point scenes at once on a single GPU in a few seconds, opening the door to 3D panoptic segmentation at unprecedented scales. Finally, we propose to exploit the complementarity of image and point cloud modalities to enhance 3D scene understanding. We place ourselves in a realistic acquisition setting where multiple arbitrarily-located images observe the same scene, with potential occlusions. Unlike previous 2D-3D fusion approaches, we learn to select information from various views of the same object based on their respective observation conditions: camera-to-object distance, occlusion rate, optical distortion, etc. Our efficient implementation achieves state-of-the-art results both in indoor and outdoor settings, with minimal requirements: raw point clouds, arbitrarily-positioned images, and their cameras poses. Overall, this thesis upholds the principle that for settings with limited data availability, exploiting the structure of the problem unlocks both efficient and performant architectures.

Optimization of negative sample selection for landslide susceptibility mapping based on machine learning using K-means-KNN algorithm

Article

Full-text available

Nov 2023

Chao Liu

The quality of the sample plays a vital role in developing accurate models using machine learning. This aspect is equally important when evaluating regional landslide susceptibility using machine learning. Previous studies have mostly employed random generation methods to select samples, which often fail to select representative samples. Therefore, this study proposes the KK-sampling method, which uses K-means and KNN algorithms to analyze relevant attributes of the study area and select samples. To evaluate the effectiveness of the proposed method, this study employed MLP, RF, and XGBoost models in conjunction with the KK-sampling method, with Zhong County, Chongqing serving as a case study. The results indicate that the KK-sampling method significantly improves the stability and accuracy of the model. Additionally, this study analyzed the importance of landslide factors in Zhong County using SHAP values. The findings provide a reference for establishing a reasonable and effective landslide susceptibility model in the region.

Indistinguishable network dynamics can emerge from unalike plasticity rules

Preprint

Full-text available

Nov 2023

Synaptic plasticity is thought to be critical for building and maintaining brain circuits. Models of plasticity, or plasticity rules, are typically designed by hand, and evaluated based on their ability to elicit similar neuron or circuit properties to ground truth. While this approach has provided crucial insights into plasticity mechanisms, it is limited in its scope by human intuition and cannot identify all plasticity mechanisms that are consistent with the empirical data of interest. In other words, focusing on individual hand-crafted rules ignores the potential degeneracy of plasticity mechanisms that explain the same empirical data, and may thus lead to inaccurate experimental predictions. Here, we use an unsupervised, adversarial approach to infer plasticity rules directly from neural activity recordings. We show that even in a simple, idealised network model, many mechanistically different plasticity rules are equally compatible with empirical data. Our results suggest the need for a shift in the study of plasticity rules, considering as many degenerate plasticity mechanisms consistent with data as possible, before formulating experimental predictions.

A Survey on Causal Discovery Methods for I.I.D. and Time Series Data

Article

Full-text available

Sep 2023

The ability to understand causality from data is one of the major milestones of human-level intelligence. Causal Discovery (CD) algorithms can identify the cause-effect relationships among the variables of a system from related observational data with certain assumptions. Over the years, several methods have been developed primarily based on the statistical properties of data to uncover the underlying causal mechanism. In this study, we present an extensive discussion on the methods designed to perform causal discovery from both independent and identically distributed (I.I.D.) data and time series data. For this purpose , we first introduce the common terminologies used in causal discovery literature and then provide a comprehensive discussion of the algorithms designed to identify causal relations in different settings. We further discuss some of the benchmark datasets available for evaluating the algorithmic performance, off-the-shelf tools or software packages to perform causal discovery readily, and the common metrics used to evaluate these methods. We also evaluate some widely used causal discovery algorithms on multiple benchmark datasets and compare their performances. Finally, we conclude by discussing the research challenges and the applications of causal discovery algorithms in multiple areas of interest.

Learning interpretable predictive biomarkers from multi-omics data

Thesis

Full-text available

Aug 2023

Ellimari Paunio

Advancements in technologies that generate large-scale omics data and the develop- ment of machine learning methods to analyze this data provide new opportunities for the field of medicine, such as improved prevention, diagnosis and treatment of diseases through the application of multivariate biomarkers. Moreover, multi- variate biomarkers offer opportunities for precision medicine where treatments can be tailored to the needs of individual patients. Multivariate biomarker discovery which involves the prediction of clinical outcomes reproducibly using a small set of biomarkers, has emerged as a promising approach. However, from a machine learning perspective, the integration of multi-omics data to discover multi-omics biomarkers remains challenging. In addition, interpretability and explainability are key issues in the translation of models into clinical practice. Recently proposed group of kernel methods called sparse pre-image kernel machines has an embedded feature selection and offers improved interpretability compared to traditional kernel methods. Another benefit for learning multi-omics biomarkers is that sparse pre-image kernel machines can be extended to multi-view learning. This thesis explores the application of sparse pre-image kernel machines to multivariate biomarker discovery using a multi-omics coronavirus disease 2019 data set. To study whether the stability of feature selection can be improved, this thesis couples a method known as stability selection with sparse pre-image kernel machines. The stability of feature selection and model performance with the selected features are compared to two baseline methods, random forest and logistic regression. This thesis considers two types of feature selection pipelines for sparse pre-image kernel machines, where the first is a general grid search approach to select a level of regularization, and thus features. In the second pipeline, sparse pre-image kernel machines is combined with stability selection. Results show that stability selection improves the stability of the learned features significantly. In addition, the proposed multi-view approach learns a more balanced set of features compared to other methods in terms of learning features from both views. The findings of this thesis provide insights into the potential application of sparse pre-image kernel machines for the discovery of multi-omics biomarkers in complex diseases.

Learning Active Subspaces and Discovering Important Features with Gaussian Radial Basis Functions Neural Networks

Preprint

Jul 2023

Providing a model that achieves a strong predictive performance and at the same time is interpretable by humans is one of the most difficult challenges in machine learning research due to the conflicting nature of these two objectives. To address this challenge, we propose a modification of the Radial Basis Function Neural Network model by equipping its Gaussian kernel with a learnable precision matrix. We show that precious information is contained in the spectrum of the precision matrix that can be extracted once the training of the model is completed. In particular, the eigenvectors explain the directions of maximum sensitivity of the model revealing the active subspace and suggesting potential applications for supervised dimensionality reduction. At the same time, the eigenvectors highlight the relationship in terms of absolute variation between the input and the latent variables, thereby allowing us to extract a ranking of the input variables based on their importance to the prediction task enhancing the model interpretability. We conducted numerical experiments for regression, classification, and feature selection tasks, comparing our model against popular machine learning models and the state-of-the-art deep learning-based embedding feature selection techniques. Our results demonstrate that the proposed model does not only yield an attractive prediction performance with respect to the competitors but also provides meaningful and interpretable results that potentially could assist the decision-making process in real-world applications. A PyTorch implementation of the model is available on GitHub at the following link. https://github.com/dannyzx/GRBF-NNs

Grassland mowing event detection using combined optical, SAR, and weather time series

Article

Full-text available

Sep 2023
REMOTE SENS ENVIRON

The European Union’s Common Agricultural Policy (CAP) and the Habitats Directive aim to improve biodiversity in agricultural landscapes. Both policies require enormous monitoring, which can be facilitated by remote sensing. Use intensity, measured by mowing frequency is an important indicator of biodiversity in permanent grasslands. The frequency and timing of mowing can be determined using satellite remote sensing because photosynthetically active biomass changes rapidly in response to mowing. However, the rapid regrowth of grasses requires very dense satellite time series for reliable detection. Radar time series can complement optical time series and fill in cloud-related gaps to overcome this problem. Additional weather data can support the detection of grassland mowing events, as mowing events are associated with specific meteorological conditions. However, previous studies have not fully exploited both potentials or different machine learning approaches for mowing event detection. This study presents a new transferable two-step approach to detect grassland mowing events using combined optical and SAR data and additional weather data. First, we filled cloud-related gaps in optical time series using a supervised machine learning regression with optical and SAR data. We then classified time series sequences of optical, SAR and weather data into mown and unmown using four different machine learning algorithms. We used time series of NDVI and EVI (combined Sentinel-2 and Landsat 8), SAR backscatter, six-day interferometric coherence, backscatter radar vegetation index, backscatter cross-ratio (Sentinel-1), and temperature and precipitation sums. Our test sites are distributed across Germany and cover the entire gradient of grassland use intensities. Mowing events could be detected with F1 values of up to 89%, first cut with up to 94%. Our results show no structural advantage of infilling time series with machine learning over linearly interpolated time series. The combined Sentinel-2 and Landsat-8 time series provided dense time series with mostly median gaps less than 20 days, which proved sufficient to reliably detect mowing events. SAR data were not essential for mowing event detection in our study, but weather data improved classification results for models trained on all areas and years. However, when the model was transferred to unknown years or areas that were not used for training, SAR data improved detection accuracy, whereas weather data degrade it. Models trained on all years but not all study sites detected mowing events with an accuracy of up to F1 = 76%. Models trained with all regions but not all years detected mowing events in untrained years with F1 up to 80%.

Sparse Signal Representations for Acoustic Modeling and Speech Recognition

Thesis

Full-text available

Nov 2016

Danijel Koržinek

This thesis describes several unconventional methods of signal analysis for the purpose of modeling and recognizing speech and music. This process is commonly referred to as feature extraction and is an important step in any machine learning task. Most of the current research on this topic involves Fourier transform derived features. These are usually formulated as a set of spectral features arranged according to a perceptual scale, like the mel scale, and possibly transformed into the cepstrum domain. The basis for this thesis lies in the use of alternative signal representation techniques derived from two signal processing methodologies. One involves sparse coding mechanisms and the matching pursuit (MP) algorithm. The other is a novel wavelet derived feature set known as the scattering wavelet transform (SWT). These methods have already been applied to various signal processing tasks, involving both audio and image processing. On the other hand, they have not been utilized in many practical settings, like the modern largevocabulary continuous speech recognition (LVCSR) systems. The sparse coding mechanisms are often used in computer vision research but rarely are they applied to analyzing audio and even less so to speech recognition. The SWT is a fairly novel technique and while it has been used for solving some speech related problems it was never utilized in an actual LVCSR system. Within the thesis, sparse coding mechanisms are studied in detail in order to verify their capacity for modeling speech signals. Several coding mechanisms and dictionary adaptation methods are discussed and the technique that yields the highest quality of reconstruction is chosen. Similarly, the SWT is chosen in a configuration best fitting its intended use. Next, both of these feature sets are tested on the problem of framewise phoneme classification, representative of the issues behind the acoustic modeling used in most speech recognition systems. The SWT is additionally tested on two more problems: musical genre recognition and LVCSR. All these methods are compared to the most commonly used signal processing methods. Various topics related to the above experiments were also discussed, like the construction of LVCSR and various usability concerns related to exploiting such systems in real-life situations, with an example of a dialog system operating in a telephony environment. This dissertation postulates four main theses. It is shown that sparse coding can be effectively used to encode speech signals and that this form of representation can be used to improve the performance of speech recognition. The second thesis shows that SWT also enhances speech recognition accuracy, which is proven using the same problem that was utilized in the first thesis. In addition to that, the third thesis demonstrates that SWT derived feature set also improves the performance of LVCSR. The final thesis shows that IA has a substantial significance in voice user interface (VUI) design. The author’s contribution to this field of science is primarily in the novel application of the methods described above, in order to make them usable in practical speech recognition tasks. The author’s contribution also includes a novel approach to the conversion of sparse coding into a form which can be applied to speech recognition and an innovative concept of exploiting IA in the domain of VUIs.

Utilizing Computer Vision and Data Mining for Predicting Road Traffic Congestion

Thesis

Full-text available

May 2023

Traffic Congestion wastes time and energy, which are the two most valuable commodities of the current century. It happens when too many vehicles try to use a transportation infrastructure without having enough capacity. However, researches indicate that adding extra lane without studying the future consequences does not improve the situation. Our goal is to add another layer of information to the traffic data, find which type of vehicles are contributing to road traffic congestion, and predict future road traffic congestion and demands based on the historical data. We collected more than 400,000 images from traffic cameras installed in Autoroute 40, in the city of Montreal. The images were collected for five consecutive weeks from different locations from April 14, 2019, up until May 18, 2019. To process these images and extract useful information out of them, we created an object detection and classification model using the Faster RCNN algorithm. Our goal was to be able to detect different types of vehicles and see if we have traffic congestion in an image. In order to improve the accuracy and reduce the error rate, we provided multiple examples with different conditions to the model. By introducing blurry, rainy, and low light images to the model, we managed to build a robust model that could do the detection and classification task with excellent accuracy. Furthermore, by extracting the information from the collected images, we created a dataset of the number of vehicles in each location. After analyzing and visualizing the data, we find out the most congested areas, the behavior of the traffic flow during the day, peak hours, the contribution of each type of vehicle to the traffic, seasonality of the data, and where we can see each type of vehicle the most. Finally, we managed to predict the total number of congestion incidents for seven days based on historical data. Besides, we were able to predict the total number of different types of vehicles on the road as well. In order to do this task, we developed multiple Regression, Deep Learning, and Time Series Forecasting models and trained them with our vehicle count dataset. Based on the experimental results, we were able to get the best predictions with the Deep Learning models and succeeded in predicting future road traffic congestion with excellent accuracy

CALM: Conditional Adversarial Latent Models for Directable Virtual Characters

Preprint

May 2023

In this work, we present Conditional Adversarial Latent Models (CALM), an approach for generating diverse and directable behaviors for user-controlled interactive virtual characters. Using imitation learning, CALM learns a representation of movement that captures the complexity and diversity of human motion, and enables direct control over character movements. The approach jointly learns a control policy and a motion encoder that reconstructs key characteristics of a given motion without merely replicating it. The results show that CALM learns a semantic motion representation, enabling control over the generated motions and style-conditioning for higher-level task training. Once trained, the character can be controlled using intuitive interfaces, akin to those found in video games.

Modelling landuse dynamics of ecologically sensitive peri-urban space by incorporating an ANN cellular automata-Markov model for Siliguri urban agglomeration, India

Article

Full-text available

Apr 2023

Numerous cities throughout the world are experiencing tremendous population growth in their peripheral areas, resulting in a progressive modification of landscapes and raising serious concerns about natural environments, notably forests and agricultural area. Monitoring LULC changes can assist in understanding historical trends, while simulation-based modelling shed light on possible potential future developments. Both of these tactics are indispensable and complimentary for implementing effective land use policies to mitigate the adverse ramifications of urbanization. Present area of investigation, Siliguri town one of prime trading hub of whole north-east India surrounded by ecologically sensitives zones Himalayas. To monitor land use dynamics of peri-urban spaces in Siliguri town Landsat images of 2000, 2010 and 2020 were derived from USGS and classified using Support vector machine learning algorithms. Following the quantification of the previous trend of landuse change, an integrated Artificial Neural Network (ANN) and CA-Markov chain Model was utilized to forecast LULC for the years 2030 and 2050. Eleven pertinent geographical factors, comprising topographical, socioeconomic, and connectivity information, were generated and validated using the crammer v test. The results from LULC modeling predicts as compared to 2020, the urban area is expected to increase by 48.23%, while forest areas, other vegetation cover, and agricultural areas are predicted to shrink by 9.42%, 29.83%, and 26.60% respectively, by the year 2050. The results could provide useful information about historical and potential landuse change and as well as assist local governments in formulating management strategies for the protection of ecological resource.

Utilizing Computer Vision and Data Mining for Predicting Road Traffic Congestion

Thesis

Apr 2023

Traffic Congestion wastes time and energy, which are the two most valuable commodities of the current century. It happens when too many vehicles try to use a transportation infrastructure without having enough capacity. However, researches indicate that adding extra lane without studying the future consequences does not improve the situation. Our goal is to add another layer of information to the traffic data, find which type of vehicles are contributing to road traffic congestion, and predict future road traffic congestion and demands based on the historical data. We collected more than 400,000 images from traffic cameras installed in Autoroute40, in the city of Montreal. The images were collected for five consecutive weeks from different locations from April 14, 2019, up until May 18, 2019. To process these images and extract useful information out of them, we created an object detection and classification model using the Faster RCNN algorithm. Our goal was to be able to detect different types of vehicles and see if we have traffic congestion in an image. In order to improve the accuracy and reduce the error rate, we provided multiple examples with different conditions to the model. By introducing blurry, rainy, and low light images to the model, we managed to build a robust model that could do the detection and classification task with excellent accuracy. Furthermore, by extracting the information from the collected images, we created a dataset of the number of vehicles in each location. After analyzing and visualizing the data, we find out the most congested areas, the behavior of the traffic flow during the day, peak hours, the contribution of each type of vehicle to the traffic, seasonality of the data, and where we can see each type of vehicle the most. Finally, we managed to predict the total number of congestion incidents for seven days based on historical data. Besides, we were able to predict the total number of different types of vehicles on the road as well. In order to do this task, we developed multiple Regression, Deep Learning, and Time Series Forecasting models and trained them with our vehicle count dataset. Based on the experimental results, we were able to get the best predictions with the Deep Learning models and succeeded in predicting future road traffic congestion with excellent accuracy.

Residual stress prediction of arc welded austenitic pipes with artificial neural network ensemble using experimental data

Article

Full-text available

Apr 2023
INT J PRES VES PIP

A Survey on Causal Discovery Methods for Temporal and Non-Temporal Data

Preprint

Full-text available

Mar 2023

Causal Discovery (CD) is the process of identifying the cause-effect relationships among the variables from data. Over the years, several methods have been developed primarily based on the statistical properties of data to uncover the underlying causal mechanism. In this study we introduce the common terminologies in causal discovery, and provide a comprehensive discussion of the approaches designed to identify the causal edges in different settings. We further discuss some of the benchmark datasets available for evaluating the performance of the causal discovery algorithms, available tools to perform causal discovery readily, and the common metrics used to evaluate these methods. Finally, we conclude by presenting the common challenges involved in CD and also, discuss the applications of CD in multiple areas of interest.

Automatic Scan Plane Identification from 2D Ultrasound for Pedicle Screw Guidance

Article

May 2018

In order to reduce the total amount of radiation exposure and provide real-time guidance ultrasound has been incorporated as a potential intra-operative imaging modality into various orthopedic procedures. However, high levels of noise, various imaging artifacts, and bone boundaries appearing several millimeters in thickness hinder the success of ultrasound as an alternative imaging modality in assisting orthopedic surgery procedures. Additional difficulties are also encountered during manual operation of the ultrasound transducer during image acquisition. In this work, we proposed a combination of novel scan plane identification method, based on convolutional neural networks, and bone surface localization method. The bone surface localization approach utilizes both local phase information, a combination of three different local image phase information and signal transmission map obtained from an L1 norm based contextual regularization method. The proposed network was utilized on two different US systems and to identify five different scan planes. Validation was performed on scans obtained from 16 volunteers. The correct scan plane identification rate of over 93% has been obtained. Validation against expert segmentation achieved a mean vertebra surface localization error of 0.42 mm.

A Data Driven Approach for Target Classification Based on Histogram Representation of Radar Cross Section

Conference Paper

Jan 2023

Using Traditional Machine Learning and Deep Learning Methods for On-and Off-Target Prediction in CRISPR/Cas9: A Review

Article

Full-text available

Apr 2023

CRISPR/Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9) is a popular and effective two-component technology used for targeted genetic manipulation. It is currently the most versatile and accurate method of gene and genome editing, which benefits from a large variety of practical applications. For example, in biomedicine it has been used in research related to cancer, virus infections, pathogen detection and genetic diseases. Recent CRISPR/Cas9 research is based on data-driven models for on-and off-target prediction as a cleavage may occur at non-target sequence locations. Currently, conventional machine learning and deep learning methods are applied on a regular basis to accurately predict the sgRNA (single-guide RNA) on-target knockout efficacy and off-target profile. In this paper, we present an overview and a comparative analysis of traditional machine learning and deep learning models used in CRISPR/Cas9. We highlight the key research challenges and directions associated with target activity prediction. We discuss some recent advances in the sgRNA-DNA sequence encoding used in state-of-the-art on-and off-target prediction models. Furthermore, we present the most popular deep learning neural network architectures used in CRISPR/Cas9 prediction models. Finally, we summarize existing challenges and discuss possible future investigations in the field of on-and off-target prediction. Our paper provides valuable support for academic and industrial researchers interested in the application of machine learning methods in the field of CRISPR/Cas9 genome editing.

GVT2RPM: An Empirical Study for General Video Transformer Adaptation to Remote Physiological Measurement

Preprint

Full-text available

Jun 2024

Remote physiological measurement (RPM) is an essential tool for healthcare monitoring as it enables the measurement of physiological signs, e.g., heart rate, in a remote setting via physical wearables. Recently, with facial videos, we have seen rapid advancements in video-based RPMs. However, adopting facial videos for RPM in the clinical setting largely depends on the accuracy and robustness (work across patient populations). Fortunately, the capability of the state-of-the-art transformer architecture in general (natural) video understanding has resulted in marked improvements and has been translated to facial understanding, including RPM. However, existing RPM methods usually need RPM-specific modules, e.g., temporal difference convolution and handcrafted feature maps. Although these customized modules can increase accuracy, they are not demonstrated for their robustness across datasets. Further, due to their customization of the transformer architecture, they cannot use the advancements made in general video transformers (GVT). In this study, we interrogate the GVT architecture and empirically analyze how the training designs, i.e., data pre-processing and network configurations, affect the model performance applied to RPM. Based on the structure of video transformers, we propose to configure its spatiotemporal hierarchy to align with the dense temporal information needed in RPM for signal feature extraction. We define several practical guidelines and gradually adapt GVTs for RPM without introducing RPM-specific modules. Our experiments demonstrate favorable results to existing RPM-specific module counterparts. We conducted extensive experiments with five datasets using intra-dataset and cross-dataset settings. We highlight that the proposed guidelines GVT2RPM can be generalized to any video transformers and is robust to various datasets.

Dual automatic relevance determination for linear latent variable models and its application to calcium imaging data analysis

Preprint

Full-text available

Apr 2024

In the analysis of high-dimensional data, applying dimensionality reduction techniques is often necessary. However, setting an suitable criterion for determining the extent of dimensionality reduction often poses a challenge. The use of automatic relevance determination (ARD) in a linear latent variable model, such as Bayesian PCA, offers a way to automatically identifying the effective dimensionality of the latent space. However, conventional ARD methods often fail to extract sparse representation of latent variables in dealing with noisy, non-Gaussian, or nonlinearly observed data, such as calcium imaging data. To encourage the sparsity of the latent space, we proposed a dual ARD method in a linear latent variable model that applies ARD priors to both loading weights and latent variables. We first detailed our dual ARD method and mathematically analyzed how the dual ARD priors promote more sparsity in the latent space. We then evaluated the performance of the dual ARD methods against existing dimensionality reduction techniques using both simulated datasets and actual calcium imaging data. While conventional methods could retrieve essential signals in linear Gaussian settings, the dual ARD method outperformed the previous models in extracting low-dimensional signals from simulated calcium imaging data that contain higher levels of nonlinear noise. In applying the dual ARD method to actual two-photon calcium imaging data, we were able to identify low-dimensional latent variables that were sufficient for performing a sound localization decoding task successfully. Additionally, decoding performance across different cortical depths reflects the varied roles that specific cortical layers play in sound localization. In conclusion, the dual ARD method is well-suited for automatically reducing dimensionality of calcium imaging data while preserving essential information for further analysis.

Text Summarization and Temporal Learning Models Applied to Portuguese Fake News Detection in a Novel Brazilian Corpus Dataset

Conference Paper

Jan 2024

Streaming content advances and the appearance of online media raised the ability for massive content sharing that reaches thousands of people worldwide in a real-time fashion. Fake news spreading is nowadays the main concern of several authorities worldwide due to the negative impact and potential to induce social and political instability in our society. Therefore, fake news detection and suppression gained increased attention as an important topic in natural language processing and machine learning academic research. Regardless of the state-of-the-art methods available for fake news detection , a good corpus revealing novel language-specific counterfeit aspects is also important to exploit and distinguish between real and fake news in the context of social and political impacts for specific regions. This paper extends a previous Brazilian Portuguese corpora dataset and proposes using and comparing several deep learning and classical machine learning models to detect counterfeit content in the Portuguese language. Moreover, we propose using text summarization to achieve concise news summaries and prevent losing relevant information. This work presents an updated and balanced version of the FakeRecogna dataset for detecting fake news articles using a temporal learning approach based on efficient and well-known deep learning models.

Cooperative Iteration Matching Method for Aligning Samples from Heterogeneous Industrial Datasets面向工业异构数据匹配的联合迭代匹配方法

Article

Jul 2023

Industrial data mining usually deals with data from different sources. These heterogeneous datasets describe the same object in different views. However, samples from some of the datasets may be lost. Then the remaining samples do not correspond one-to-one correctly. Mismatched datasets caused by missing samples make the industrial data unavailable for further machine learning. In order to align the mismatched samples, this article presents a cooperative iteration matching method (CIMM) based on the modified dynamic time warping (DTW). The proposed method regards the sequentially accumulated industrial data as the time series. Mismatched samples are aligned by the DTW. In addition, dynamic constraints are applied to the warping distance of the DTW process to make the alignment more efficient. Then a series of models are trained with the cumulated samples iteratively. Several groups of numerical experiments on different missing patterns and missing locations are designed and analyzed to prove the effectiveness and the applicability of the proposed method.

An optimization approach to supervised principal component analysis

Chapter

Full-text available

Dec 2023

Supervised dimensionality reduction has become an important theme in the last two decades. Despite the plethora of models and formulations, there is a lack of a simple model that aims to project the set of patterns into a space defined by the classes (or categories). We set up a model where each class is represented as a 1D subspace of the vector space formed by the features. Assuming the set of classes does not exceed the cardinality of the features, the model results in multi-class supervised learning in which the features of each class are projected into the class subspace. Class discrimination is guaranteed via the imposition of the orthogonality of the 1D class sub-spaces. The resulting optimization problem—formulated as the minimization of a sum of quadratic functions on a Stiefel manifold—while being non-convex (due to the constraints), has a structure for which we can identify when we have reached a global minimum. After formulating a version with standard inner products, we extend the formulation to a reproducing kernel Hilbert space and similarly to the kernel version. Comparisons with the multi-class Fisher discriminants and principal component analysis showcase the relative merits toward dimensionality reduction.

Aversion of a Person Facing the Risk of Failure When Starting a Business in Mexico: An Approach Through Some Educational Factors

Chapter

Oct 2023

Gerardo Reyes

The low quality of current jobs in Mexico and their scarcity have led to the need to undertake. Consequently, people have ceased to be employees to become entrepreneurs. However, the specialized literature ensures that there are factors that may well characterize this venture. In the chapter, a first approach is made to the risk aversion that a person faces when failing to decide to be an entrepreneur in Mexico. The information integrated by the reports of the Global Entrepreneurship Monitor (GEM) served as input so that through a Multiple Linear Regression Analysis (MLRA). During the period 2011-2021, it was verified if the factors of education, experience, knowledge, skills, age, among others, directly influence a person to make the decision to start a business in Mexico.

Air Quality Monitoring in a Near-City Industrial Zone by Low-Cost Sensor Technologies: A Case Study

Conference Paper

Full-text available

Sep 2023

Deep Learning Application for Inverting Petrophysical Properties Directly from Seismic

Conference Paper

Full-text available

Oct 2023

In this study, we introduce a method to directly invert for porosity, Vclay and hydrocarbon saturation (Shc) simultaneously from pre-stack seismic data using deep learning approach. We implemented L1 norm in the loss function for Shc estimation, added noise into synthetic seismic dataset for training, and estimated uncertainties in the inversion results by training multiple network models. UNet architecture (ResNet-18 as encoder) is used due to its ability to preserve spatial resolution. The inputs for the network are the angle stacks whereas the outputs are the petrophysical properties. We implemented mean-squared error and L1 norm as the loss functions during the training process. The L1 norm is the mean absolute values of the predicted hydrocarbon saturation, which can help promotes sparsity. The network learns on synthetic dataset. We use facies-based geostatistical simulation to generate 1D synthetic petrophysical logs. Then linking the petrophysical properties to elastic properties through rock physics model (RPM), followed by computation of reflectivities using full Zoeppritz equations at five different groups of incidence angles (0°-55°). The traces in each group are convolved with the source wavelet prior to stacking the synthetic seismograms. To increase the variability of possible scenarios, we vary the spherical variogram ranges (8,10, and 12ms), use four different types of suitable RPM, apply oil and gas cases for the hydrocarbon fluid types, and convolve with nine different sets of angle dependent source wavelets. Two synthetic datasets are prepared: Dataset1 (ideal noiseless case) and Dataset 2 (noise added to the angle stacks), and a field data. The first (MLT1) and second (MLT2) machine learning are trained on a sub-dataset in Dataset 1 and 2 respectively. Based on the field dataset, the results from MLT2 show a better prediction performance than MLT1, with an average correlation coefficient of 0.68 (porosity), 0.74 (Vclay) and 0.67 (Shc) achieved. The better results from MLT2 can be related to the nature of measured seismic which contain noise that being learnt by MLT2. For uncertainty estimation, the network (ML3) is trained for 20 times on randomly selected sub-dataset in Dataset 2 using Monte Carlo dropout technique. The uncertainty is estimated by calculating the standard deviation of the solutions provided by ML3 when applying to the field data. Uncertainty estimation allows quantification on the stability of the solutions when varying training dataset.

Deep Learning Model to Improve the Stability of Damage Identification via Output-only Signal

Conference Paper

May 2023

A Versatile Machine Learning-Based Vehicle-to-Vehicle Connectivity Model

Conference Paper

Full-text available

Jun 2023

We present in this paper a versatile machine learning (ML) based vehicle-to-vehicle (V2V) connectivity model constructed from empirical measurements to enable the mutual exchange of data among vehicles. We portray the results of a campaign for measurements and relevant models for the V2V channel within the 5-GHz band in the form of attained RSSI and bit rate values in multiple V2V environments. To that end, we perform a parallel analysis to compare performance of an assortment of ML regression algorithms to assess their capabilities in predicting RSSI values and bit rates, including K-nearest-neighbors, AdaBoost, Regression Trees, Random Forest, SGD, SVM, and Artificial Neural Networks to predict connectivity patterns. Results in the form of numerical analysis illustrate that in our connectivity model RSSI and bitrate could be effectually predicted utilizing a subset of the group of considered ML ...

Using Artificial Neural Networks for Predicting Ship Fuel Consumption

Article

Full-text available

Jul 2023
POL MARIT RES

In marine vessel operations, fuel costs are major operating costs which affect the overall profitability of the maritime transport industry. The effective enhancement of using ship fuel will increase ship operation efficiency. Since ship fuel consumption depends on different factors, such as weather, cruising condition, cargo load, and engine condition, it is difficult to assess the fuel consumption pattern for various types of ships. Most traditional statistical methods do not consider these factors when predicting marine vessel fuel consumption. With technological development, different statistical models have been developed for estimating fuel consumption patterns based on ship data. Artificial Neural Networks (ANN) are some of the most effective artificial methods for modelling and validating marine vessel fuel consumption. The application of ANN in maritime transport improves the accuracy of the regression models developed for analysing interactive relationships between various factors. The present review sheds light on consolidating the works carried out in predicting ship fuel consumption using ANN, with an emphasis on topics such as ANN structure, application and prediction algorithms. Future research directions are also proposed and the present review can be a benchmark for mathematical modelling of ship fuel consumption using ANN.

CALM: Conditional Adversarial Latent Models for Directable Virtual Characters

Conference Paper

Jul 2023

Defining Standard Values for FaceReader Facial Expression Software Output

Article

Jul 2023

Background FaceReader is a validated software package that uses computer vision technology for facial expression recognition which has become increasingly popular in academic research to expedite, scale, and decrease the cost of facial emotion analysis. In this study, we compare FaceReader analysis to human evaluator interpretation in order to define standard values for the software output.Methods Randomly generated facial images produced by generative adversarial networks were analyzed using FaceReader and by survey participants (n=496). The age, facial emotion, and intensity of emotion as determined by the software and survey participants were recorded. Results were analyzed and compared.Results80 randomly generated images (20 children, 20 young adult, 20 middle aged, and 20 elderly; 38 male and 42 female) were included.Analysis of correlation between most common expression identified by FaceReader and the primary emotion detected by surveyors showed strong correlation (κ = 0.77, 95% CI = 0.64–0.91).On analyzing this correlation by age group, there was fair correlation in children (κ = 0.40, 95% CI = 0.078–0.72), perfect correlation in young adults (κ = 1.0, 95% CI = 1.0–1.0), strong correlation in middle aged adults (κ = 0.79, 95% CI = 0.53–1) and near perfect in elderly adults(κ = 0.9 , 95% CI = 0.7–1.0).Conclusions We provided the first study defining the expected average values generated by FaceReader in generally smiling images. This can be used as a standard in future studies.Level of Evidence IV This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266.

Revisiting Neuron Coverage for DNN Testing: A Layer-Wise and Distribution-Aware Criterion

Conference Paper

May 2023

Brain Tumor Detection using RCNN and MobileNet

Conference Paper

Full-text available

Jun 2023

Analysis of Subsurface Soil Radon with the Environmental Parameters and Its Relation with Seismic Events

Article

Jun 2023
J GEOL SOC INDIA

This study reports continuous measurements of subsurface soil radon as well as environmental parameters for a period of three years. The survey was carried out along the active fault area in the Indo-Myanmar subduction zone in the north-eastern part which lies in the highest seismic zone of India. The wavelet-based decomposition of the environmental parameters was done using discrete wavelet transformation technique. The denoised environmental parameters by discrete wavelet transformation technique was fed as the inputs to the MLR (multiple linear regression) and MLP (multilayer perceptron) models. Residual radon was calculated and correlated with nearby seismic events. Many events of magnitude greater than or equal to 5 have occurred in the investigation area. It was possible to successfully correlate one event with the anomalous variation in soil radon. The correlated event was the only one with the shallow epicentral depth indicating that the investigated area has undergone a shallow rock fracturing due to the stress generated before the occurrence of the seismic event.

RelaxNet: A structure-preserving neural network to approximate the Boltzmann collision operator

Article

Jun 2023
J COMPUT PHYS

L’intelligence artificielle dans les structures d’urgences : place de la formation et de la garantie humaine

Article

Jun 2023

La recherche sur l’intelligence artificielle (IA) appliquée à la médecine d’urgence et son utilisation au quotidien dans les structures d’urgences (SU) ont augmenté significativement ces dernières années. L’IA doit être considérée comme un outil d’aide à la prise en charge diagnostique et thérapeutique des patients et d’amélioration de l’organisation des SU, notamment par la prise en compte de contraintes « métiers », contextuelles, relatives aux patients et plus généralement structurelles. L’IA comporte des avantages (reproductibilité, rapidité) mais aussi des risques (erreur, perte d’esprit critique). À l’image du Règlement général sur la protection des données et notamment de santé, la Commission européenne a publié un projet de règlement nommé « AI Act » pour la conception, le développement et l’utilisation des algorithmes d’IA. Elle souhaite imposer, entre autres, une garantie humaine, autrement dit une supervision humaine pour assurer la sécurité des patients, des soignants et des institutions. La mise en place d’un collège de garantie humaine pluriprofessionnel visant à garantir la supervision des outils d’IA de la conception au développement, au déploiement et à l’utilisation quotidienne permettra ainsi d’assurer durablement la sécurité des patients.

Word Segmentation and Sandhi Resolution on Ayurveda Classical Scriptures

Conference Paper

Apr 2023

Multimodal Representation Learning for Textual Reasoning over Knowledge Graphs

Thesis

Full-text available

May 2023

Nurendra Choudhary

Knowledge graphs (KGs) store relational information in a flexible triplet schema and have become ubiquitous for information storage in domains such as web search, e-commerce, social networks, and biology. Retrieval of information from KGs is generally achieved through logical reasoning, but this process can be computationally expensive and has limited performance due to the large size and complexity of relationships within the KGs. Furthermore, to extend the usage of KGs to non-expert users, retrieval over them cannot solely rely on logical reasoning but also needs to consider text-based search. This creates a need for multi-modal representations that capture both the semantic and structural features from the KGs. The primary objective of the proposed work is to extend the accessibility of KGs to non-expert users/institutions by enabling them to utilize non-technical textual queries to search over the vast amount of information stored in KGs. To achieve this objective, the research aims to solve four limitations: (i) develop a framework for logical reasoning over KGs that can learn representations to capture hierarchical dependencies between entities, (ii) design an architecture that can effectively learn the logic flow of queries from natural language text, (iii) create a multi-modal architecture that can capture inherent semantic and structural features from the entities and KGs, respectively, and (iv) introduce a novel hyperbolic learning framework to enable the scalability of hyperbolic neural networks over large graphs using meta-learning. The proposed work is distinct from current research because it models the logical flow of textual queries in hyperbolic space and uses it to perform complex reasoning over large KGs. The models developed in this work are evaluated on both the standard research setting of logical reasoning, as well as, real-world scenarios of query matching and search, specifically, in the e-commerce domain. In summary, the proposed work aims to extend the accessibility of KGs to non-expert users by enabling them to use non-technical textual queries to search vast amounts of information stored in KGs. To achieve this objective, the work proposes the use of multi-modal representations that capture both semantic and structural features from the KGs, and a novel hyperbolic learning framework to enable scalability of hyperbolic neural networks over large graphs. The work also models the logical flow of textual queries in hyperbolic space to perform complex reasoning over large KGs. The models developed in this work are evaluated on both the standard research setting of logical reasoning and real-world scenarios in the e-commerce domain.

A Review of Trends in Corrosion-Resistant Structural Steels Research—From Theoretical Simulation to Data-Driven Directions

Article

Full-text available

Apr 2023

This paper provides a review of models commonly used over the years in the study of microscopic models of material corrosion mechanisms, data mining methods and the corrosion-resistant performance control of structural steels. The virtual process of material corrosion is combined with experimental data to reflect the microscopic mechanism of material corrosion from a nano-scale to macro-scale, respectively. Data mining methods focus on predicting and modeling the corrosion rate and corrosion life of materials. Data-driven control of the corrosion resistance of structural steels is achieved through micro-alloying and organization structure control technology. Corrosion modeling has been used to assess the effects of alloying elements, grain size and organization purity on corrosion resistance, and to determine the contents of alloying elements.

The Problems of “Artificial Intelligence” in Modern Philosophy and Science

Article

Full-text available

Apr 2023

Yuan Gao

Patriotism, an important component of the Chinese national spirit, has inspired generations of Chinese to strive for national prosperity. Promoting patriotism and implementing patriotic education is an eternal topic. If the youth is robust, the country will be strong. Because college students are the vital force of the country and the hope of the nation, it is especially important to cultivate their patriotism. China is facing new challenges, with profound changes in domestic and foreign situations, rapid technological development and increasingly frequent Internet exchanges. The patriotic education environment has also become more complex under the impact of undesirable Western culture. With the external and internal influences, further patriotic education for college students still faces many challenges. We should face up to the problems in contemporary patriotism education in higher education institutions, explore the solutions and cultivate patriotism among college students in the new era. Therefore, the study of patriotism education in higher education institutions has important theoretical and practical significance. This paper mainly collates literature through literature and historical research method, and the combination of theory and practice, analyzes the problems and causes of patriotism education in higher education institutions with contemporary society, college and family education as well as the characteristics of college students themselves, and puts forward targeted countermeasures for solutions. In the main body, this paper is divided into four parts. First of all, there is an introduction, which mainly includes the background and significance of research, the current status of domestic and international research, research methods, innovations and deficiencies. The framework of the paper was determined to be based on relevant domestic and international studies and the theory was well prepared for the article. The second part mainly elaborates the theories and necessity of patriotism education for college students, mainly including the connotation and characteristics of patriotism education. The third part presents a comprehensive analysis of the problem of patriotic education of college students and its causes from the social environment, patriotic education in higher education institutions, family education and college students themselves. The fourth part is the core of this paper, the practical effect of patriotic education in higher education institutions is ensured, by summarizing the relevant theories and proposing effective technologies and methods against the corresponding problems.

XAI-enabled neural network analysis of metabolite spatial distributions

Article

Apr 2023
ANAL BIOANAL CHEM

We used deep neural networks to process the mass spectrometry imaging (MSI) data of mouse muscle (young vs aged) and human cancer (tumor vs normal adjacent) tissues, with the aim of using explainable artificial intelligence (XAI) methods to rapidly identify biomarkers that can distinguish different classes of tissues, from several thousands of metabolite features. We also modified classic neural network architectures to construct a deep convolutional neural network that is more suitable for processing high-dimensional MSI data directly, instead of using dimension reduction techniques, and compared it to seven other machine learning analysis methods' performance in classification accuracy. After ascertaining the superiority of Channel-ResNet10, we used a novel channel selection-based XAI method to identify the key metabolite features that were responsible for its learning accuracy. These key metabolite biomarkers were then processed using MetaboAnalyst for pathway enrichment mapping. We found that Channel-ResNet10 was superior to seven other machine learning methods for MSI analysis, reaching > 98% accuracy in muscle aging and colorectal cancer datasets. We also used a novel channel selection-based XAI method to find that in young and aged muscle tissues, the differentially distributed metabolite biomarkers were especially enriched in the propanoate metabolism pathway, suggesting it as a novel target pathway for anti-aging therapy.

Use of neural networks to optimize biological treatment processes

Conference Paper

Mar 2023

Neural Networks for Pattern Recognition

Recommended publications

Learning Robust Deep Face Representation

Pattern Recognition with Quantum Neural Networks

Computationally Efficient Invariant Pattern Recognition With Higher Order Pi-Sigma Networks

Research on Roller Bearing with Fault Diagnosis Method Based on EMD and BP Neural Network