Alberto Paolo Tonda
French National Institute for Agriculture, Food, and Environment (INRAE) · Department TRANSFORM (Food, bioproducts and waste)

Ph.D.

About

Publications: 196

Reads: 66,415

Citations: 1,574

My main research interest concerns the application of evolutionary computation and stochastic optimization to real-world problems. I am currently working on semi-supervised modeling of food processes using stochastic meta-heuristics.

Skills and Expertise

Optimization

Modeling

Machine Learning

Artificial Intelligence

Computational Intelligence

Algorithms

Simulation

Mathematical Programming

Heuristics

Bioinformatics and Computational Biology

March 2014 - present

French National Institute for Agriculture, Food, and Environment (INRAE)

Department of Science and Process Engineering of Agricultural Products
Paris, France

Position

Master Class "When Nature Inspires Engineers"

Description

Class for Master's students, consisting of a survey of machine learning methods applied to the agri-food chain.

May 2011 - December 2013

French National Institute for Agriculture, Food, and Environment (INRAE)

France

Position

DREAM European Project

Description

EU project focused on the development of reliable models for food and agricultural processes. http://dream.aaeuropae.org/

May 2011 - July 2012

Institut des Systèmes Complexes, Paris Île-de-France

Paris, France

Position

PostDoc Position

Publications

Federated Behavioural Planes: Explaining the Evolution of Client Behaviour in Federated Learning

Preprint

Full-text available

May 2024

Federated Learning (FL), a privacy-aware approach in distributed deep learning environments, enables many clients to collaboratively train a model without sharing sensitive data, thereby reducing privacy risks. However, enabling human trust and control over FL systems requires understanding the evolving behaviour of clients, whether beneficial or d...

Development of a soft sensor for fouling prediction in pipe fittings using the example of particulate deposition from suspension flow

Article

Mar 2024

Methodology for biomarker discovery with reproducibility in microbiome data using machine learning

Article

Full-text available

Jan 2024

Background: In recent years, human microbiome studies have received increasing attention as this field is considered a potential source for clinical applications. With the advancements in omics technologies and AI, research focused on the discovery of potential biomarkers in the human microbiome using machine learning tools has produced positive ou...

A brief introduction to nature-inspired computing, optimization, and applications

Chapter

Jan 2024

Bayesian Optimization for the Inverse Problem in Electrocardiography

Conference Paper

Dec 2023

Methodology for Biomarker Discovery with Reproducibility in Microbiome Data using Machine Learning

Preprint

Full-text available

Dec 2023

Background: In recent years, human microbiome studies have received increasing attention as this field is considered a potential source for clinical applications. With the advancements in omics technologies and AI, research focused on the discovery of potential biomarkers in the human microbiome using machine learning tools has produced positive outcom...

Optimization models for sustainable insect production chains

Article

Full-text available

Nov 2023

Insect value chains are a complex system with non-linear links between many economic, environmental, and social variables. Multi-objective optimization (MOO) algorithms for finding optimal options for complex system functioning can provide valuable insight into the development of sustainable insect chains. This review proposes a framework for MOO a...

Veni, Vidi, Evolvi commentary on W. B. Langdon’s “Jaws 30”

Article

Full-text available

Nov 2023

A robust mRNA signature obtained via recursive ensemble feature selection predicts the responsiveness of omalizumab in moderate‐to‐severe asthma

Article

Full-text available

Nov 2023

Background Not being well controlled by therapy with inhaled corticosteroids and long‐acting β2 agonist bronchodilators is a major concern for severe‐asthma patients. The current treatment option for these patients is the use of biologicals such as anti‐IgE treatment, omalizumab, as an add‐on therapy. Despite the accepted use of omalizumab, patient...

Predicting odor profile of food from its chemical composition: Towards an approach based on artificial intelligence and flavorists expertise

Article

Full-text available

Nov 2023

Odor is central to food quality. Still, a major challenge is to understand how the odorants present in a given food contribute to its specific odor profile, and how to predict this olfactory outcome from the chemical composition. In this proof-of-concept study, we seek to develop an integrative model that combines expert knowledge, fuzzy logic, and...

An Innovative AI-based primer design tool for precise and accurate detection of SARS-CoV-2 variants of concern

Article

Full-text available

Sep 2023

As the COVID-19 pandemic winds down, it leaves behind the serious concern that future, even more disruptive pandemics may eventually surface. One of the crucial steps in handling the SARS-CoV-2 pandemic was being able to detect the presence of the virus in an accurate and timely manner, to then develop policies counteracting the spread. Nevertheles...

Machine learning approaches in microbiome research: challenges and best practices

Article

Full-text available

Sep 2023

Microbiome data predictive analysis within a machine learning (ML) workflow presents numerous domain-specific challenges involving preprocessing, feature selection, predictive modeling, performance estimation, model interpretation, and the extraction of biological information from the results. To assist decision-making, we offer a set of recommenda...

Fine-Grained Cooperative Coevolution in a Single Population: Between Evolution and Swarm Intelligence

Chapter

Sep 2023

Particle Swarm Optimisation (PSO) and Evolutionary Algorithms (EAs) differ in various ways, in particular with respect to information sharing and diversity management, making their scopes of applications very diverse. Combining the advantages of both approaches is very attractive and has been successfully achieved through hybridisation. Another pos...

Environmental impact potential of insect production chains for food and feed in Europe

Article

Full-text available

Aug 2023

Towards Evolutionary Control Laws for Viability Problems

Conference Paper

Jul 2023

Biosys-LiDeOGraM: A visual analytics framework for interactive modelling of multiscale biosystems

Preprint

Full-text available

Jun 2023

In this paper, we present a test of an interactive modelling scheme in real conditions. The aim is to use this scheme to identify the physiological responses of microorganisms at different scales in a real industrial application context. The originality of the proposed tool, Biosys-LiDeOGraM, is to generate through a human-machine cooperation a con...

Direct Comparative Analysis of Nature-Inspired Optimization Algorithms on Community Detection Problem in Social Networks

Chapter

May 2023

Nature-inspired optimization algorithms (NIOAs) are nowadays a popular choice for community detection in social networks. The community detection problem in social networks is treated as an optimization problem, where the objective is to either maximize the connections within the community or minimize connections between the communities. To apply NIOAs,...

An Intercontinental Machine Learning Analysis of Factors Explaining Consumer Awareness of Food Risk

Article

May 2023

Food safety is a common concern at the household level, with important variations across different countries and cultures. Nevertheless, identifying the factors that best explain similarities and differences in consumer awareness pertaining to this topic is not straightforward. Starting from a questionnaire administered in seven countries from four...

Interpretable Neural-Symbolic Concept Reasoning

Preprint

Full-text available

Apr 2023

Deep learning methods are highly accurate, yet their opaque decision process prevents them from earning full human trust. Concept-based models aim to address this issue by learning tasks based on a set of human-understandable concepts. However, state-of-the-art concept-based models rely on high-dimensional concept embedding representations which la...

Categorical Foundations of Explainable AI: A Unifying Formalism of Structures and Semantics

Preprint

Full-text available

Apr 2023

Explainable AI (XAI) aims to answer ethical and legal questions associated with the deployment of AI models. However, a considerable number of domain-specific reviews highlight the need of a mathematical foundation for the key notions in the field, considering that even the term "explanation" still lacks a precise definition. These reviews also adv...

Multi-objective Evolutionary Discretization of Gene Expression Profiles: Application to COVID-19 Severity Prediction

Chapter

Apr 2023

MAP-Elites with Cosine-Similarity for Evolutionary Ensemble Learning

Chapter

Full-text available

Mar 2023

Evolutionary ensemble learning methods with Genetic Programming have achieved remarkable results on regression and classification tasks by employing quality-diversity optimization techniques like MAP-Elites and Neuro-MAP-Elites. The MAP-Elites algorithm uses dimensionality reduction methods, such as variational auto-encoders, to reduce the high-dim...

Handling ecosystem service trade-offs: the importance of the spatial scale at which no-loss constraints are posed

Article

Full-text available

Mar 2023

Context: Managing land use to promote an ecosystem service (ES) without reducing others is challenging. The spatial scale at which no-loss constraints are imposed is relevant. Objectives: We examined the influence of the spatial scale of no-loss constraints on ESs when one ES was optimised. Specifically, we investigated how carbon sequestration coul...

A primer on predictive techniques for food and bioresources transformation processes

Article

Full-text available

Mar 2023

To meet current societal demand for more sustainable transformation processes and bioresources, these processes must be optimized and new ones developed. The evolution of various systems (raw material, food, or process attributes) can be predicted to optimize the uses of biomass for better quality, safety, economic benefit, and sustainability. Pred...

Direct Comparative Analysis of Nature-inspired Optimization Algorithms on Community Detection Problem in Social Networks

Preprint

Full-text available

Dec 2022

Nature-inspired optimization algorithms (NIOAs) are nowadays a popular choice for community detection in social networks. The community detection problem in social networks is treated as an optimization problem, where the objective is to either maximize the connections within the community or minimize connections between the communities. To apply NIOAs, eit...

A Decision-Support System to Predict Grape Berry Quality and Wine Potential for a Chenin Vineyard

Article

Sep 2022

Grape berry ripening is a complex process, and predicting the quality of wine starting from the ripening kinetics of grape berries is a challenging task. To tackle this problem, we present a decision-support system based on coupling expert know-how with probability laws encapsulated in a probabilistic model, a dynamic Bayesian network. The proposed...

Towards multi-objective optimization of sustainable insect production chains

Conference Paper

Jul 2022

An evolutionary approach to the discretization of gene expression profiles to predict the severity of COVID-19

Conference Paper

Jul 2022

Looking for archetypes: Applying game data mining to hearthstone decks

Article

May 2022

Digital Collectible Card Games such as Hearthstone have become a very prolific test-bed for Artificial Intelligence algorithms. Most research has focused on the implementation of autonomous agents (bots) able to effectively play the game. However, this environment is also very attractive for the use of Data Mining (DM) and Machine Learning...

Consumers’ Motivations Towards Environment-Friendly Dietary Changes: An Assessment of Trends Related to the Consumption of Animal Products

Chapter

Apr 2022

A virtual sensor for backlash in robotic manipulators

Article

Full-text available

Apr 2022

Gear backlash is a serious problem in industrial robots: it causes vibrations and impairs the robot positioning accuracy. Backlash estimation allows targeted maintenance interventions, preserving robot performances and avoiding unforeseen equipment breakdowns. However, a direct measure of the backlash is hard to obtain, and dedicated auxiliar...

An intercontinental machine learning analysis of factors explaining consumer awareness about food risk

Conference Paper

Full-text available

Apr 2022

This paper investigates to what extent food safety is perceived as a concern at the household level in different countries. It aims to identify the factors that best explain food safety concern, among the various food-related questions asked through a survey. To do so, a machine learning approach is used. The results show that the most significant e...

Predictable Features Elimination: An Unsupervised Approach to Feature Selection

Chapter

Feb 2022

We propose an unsupervised, model-agnostic, wrapper method for feature selection. We assume that if a feature can be predicted using the others, it adds little information to the problem, and therefore could be removed without impairing the performance of whatever model will be eventually built. The proposed method iteratively identifies and remove...

Predictable Features Elimination: An Unsupervised Approach to Feature Selection

Book

Full-text available

Feb 2022

SARS-CoV-2 Omicron Variant AI-based Primers

Preprint

Full-text available

Jan 2022

As the COVID-19 pandemic continues to affect the world, a new variant of concern, B.1.1.529 (Omicron), has been recently identified by the World Health Organization. At the time of writing, there are still no available primer sets specific to the Omicron variant, and its identification is only possible by using multiple targets, checking for specif...

Machine learning for agri-food processes: learning from data, human knowledge, and interactions

Chapter

Jan 2022

This chapter presents three examples of data-based machine learning (ML) on time series. The common denominator of these case studies is the sparseness of data, making ML results fragile and inaccurate. We show how human expertise can be effectively mobilized for building useful systems, for instance useful decision support systems, able to better...

Discovering Hierarchical Neural Archetype Sets

Book

Jul 2021

An evolutionary framework for maximizing influence propagation in social networks

Article

Jul 2021

Social networks are one of the main sources of information transmission nowadays. However, not all nodes in social networks are equal: in fact, some nodes are more influential than others, i.e., their information tends to spread more. Finding the most influential nodes in a network, the so-called Influence Maximization problem, is an NP-hard problem wit...

Exploiting Artificial Swarms for the Virtual Measurement of Backlash in Industrial Robots

Conference Paper

Jun 2021

Design of specific primer sets for SARS-CoV-2 variants using evolutionary algorithms

Conference Paper

Full-text available

Jun 2021

Modelling Asthma Patients’ Responsiveness to Treatment Using Feature Selection and Evolutionary Computation

Chapter

Apr 2021

For several medical treatments, it is possible to observe transcriptional variations in gene expressions between responders and non-responders. Modelling the correlation between such variations and the patient’s response to drugs as a system of Ordinary Differential Equations could be invaluable to improve the efficacy of treatments and would repre...

Modelling Asthma Patients’ Responsiveness to Treatment Using Feature Selection and Evolutionary Computation

Book

Full-text available

Apr 2021

Design of Specific Primer Sets for the Detection of B.1.1.7, B.1.351 and P.1 SARS-CoV-2 Variants using Deep Learning

Preprint

Full-text available

Jan 2021

As the COVID-19 pandemic persists, new SARS-CoV-2 variants with potentially dangerous features have been identified by the scientific community. Variant B.1.1.7 lineage clade GR from Global Initiative on Sharing All Influenza Data (GISAID) was first detected in the UK, and it appears to possess an increased transmissibility. At the same time, South...

Modelling Processes and Products in the Cereal Chain

Article

Full-text available

Jan 2021

In recent years, modelling techniques have become more frequently adopted in the field of food processing, especially for cereal-based products, which are among the most consumed foods in the world. Predictive models and simulations make it possible to explore new approaches and optimize proceedings, potentially helping companies reduce costs and l...

Discovering Hierarchical Neural Archetype Sets

Chapter

Jan 2021

In the field of machine learning, coresets are defined as subsets of the training set that can be used to obtain a good approximation of the behavior that a given algorithm would have on the whole training set. Advantages of using coresets instead of the training set include improving training speed and allowing for a better human understanding of...

Classification and specific primer design for accurate detection of SARS-CoV-2 using deep learning

Article

Jan 2021

In this paper, deep learning is coupled with explainable artificial intelligence techniques for the discovery of representative genomic sequences in SARS-CoV-2. A convolutional neural network classifier is first trained on 553 sequences from the National Genomics Data Center repository, separating the genome of different virus strains from the Coro...

Design of Specific Primer Set for Detection of B.1.1.7 SARS-CoV-2 Variant using Deep Learning

Preprint

Full-text available

Dec 2020

The SARS-CoV-2 variant B.1.1.7 lineage, also known as clade GR from Global Initiative on Sharing All Influenza Data (GISAID), Nextstrain clade 20B, or Variant Under Investigation in December 2020 (VUI - 202012/01), appears to have an increased transmissibility in comparison to other variants. Thus, to contain and study this variant of the SARS-CoV-...

Modelling approaches for sustainable insect production chains

Conference Paper

Full-text available

Dec 2020

Insect value chains in Europe are evolving to large-scale industrial systems overcoming economic and environmental challenges. SUSINCHAIN, a H2020 EU-funded project, aims to define the leverages and solutions for sustainable insect value chains from multiple perspectives: economic, environmental, safety, nutritional, etc. Such perspectives have dif...

Understanding Cancer Phenomenon at Gene Expression Level by using a Shallow Neural Network Chain

Book

Sep 2020

Environmental modelling in the food supply chain - Future perspectives

Conference Paper

Full-text available

Sep 2020

The interaction of food systems and the environment has been a research focus for many years. In order to explain this interaction, scholars have developed and used various approaches for modelling and understanding this phenomenon. This paper gives an overview of three main perspectives in analyzing this issue and provides some future perspectives asso...

Evolutionary algorithms and machine learning: Synergies, Challenges and Opportunities

Conference Paper

Jul 2020

Batch correction of genomic data in chronic fatigue syndrome using CMA-ES

Conference Paper

Jul 2020

Machine Learning-Based Ensemble Recursive Feature Selection of Circulating miRNAs for Cancer Tumor Classification

Article

Full-text available

Jul 2020

Circulating microRNAs (miRNA) are small noncoding RNA molecules that can be detected in bodily fluids without the need for major invasive procedures on patients. miRNAs have shown great promise as biomarkers for tumors to both assess their presence and to predict their type and subtype. Recently, thanks to the availability of miRNAs datasets, machi...

Modeling Generalization in Machine Learning: A Methodological and Computational Study

Preprint

Jun 2020

As machine learning becomes more and more available to the general public, theoretical questions are turning into pressing practical issues. Possibly, one of the most relevant concerns is the assessment of our confidence in trusting machine learning predictions. In many real-world cases, it is of utmost importance to estimate the capabilities of a...

A Novel Outlook on Feature Selection as a Multi-objective Problem

Chapter

Apr 2020

Feature selection is the process of choosing, or removing, features to obtain the most informative feature subset of minimal size. Such subsets are used to improve performance of machine learning algorithms and enable human understanding of the results. Approaches to feature selection in literature exploit several optimization algorithms. Multi-obj...

Specific Primer Design for Accurate Detection of SARS-CoV-2 Using Deep Learning

Preprint

Apr 2020

A Missense Mutation in SARS-CoV-2 Potentially Differentiates Between Asymptomatic and Symptomatic Cases

Preprint

Apr 2020

Two complementary methods for the computational modeling of cleaning processes in food industry

Article

Apr 2020

Insufficient cleaning in the food industry can create serious hygienic risks. However, when attempting to avoid these risks, food-processing plants frequently tend to clean for too long, at extremely high temperatures, or with too many chemicals, resulting in high cleaning costs and severe environmental impacts. Therefore, the optimization of clean...

Vers une action collective à l'échelle des paysages

Article

Apr 2020

A Novel Outlook on Feature Selection as a Multi-objective Problem

Book

Apr 2020

Accurate Identification of SARS-CoV-2 from Viral Genome Sequences using Deep Learning

Preprint

Full-text available

Mar 2020

One of the reasons for the fast spread of SARS-CoV-2 is the lack of accuracy in detection tools in the clinical field. Molecular techniques, such as quantitative real-time RT-PCR and nucleic acid sequencing methods, are widely used to identify pathogens. For this particular virus, however, they have an overall unsatisfying detection rate, due to it...

Uncovering Coresets for Classification With Multi-Objective Evolutionary Algorithms

Preprint

Feb 2020

A coreset is a subset of the training set, using which a machine learning algorithm obtains performances similar to what it would deliver if trained over the whole original data. Coreset discovery is an active and open line of research as it allows improving training speed for the algorithms and may help humans understand the results. Building on...

Making Sense of Economics Datasets with Evolutionary Coresets

Chapter

Feb 2020

Machine learning agents learn to take decisions by extracting information from training data. When similar inferences can be obtained using a small subset of the same training set of samples, the subset is called a coreset. Coreset discovery is an active line of research as it may be used to reduce training time as well as to allow human experts t...

Generating Neural Archetypes to Instruct Fast and Interpretable Decisions

Chapter

Feb 2020

In the field of artificial intelligence, agents learn how to take decisions by fitting their parameters on a set of samples called training set. Similarly, a core set is a subset of the training samples such that, if an agent exploits this set to fit its parameters instead of the whole training set, then the quality of the inferences does not chang...

Generating Neural Archetypes to Instruct Fast and Interpretable Decisions

Book

Feb 2020

Virtual Measurement of the Backlash Gap in Industrial Manipulators

Chapter

Jan 2020

Industrial manipulators are robots used to replace humans in dangerous or repetitive tasks. Also, these devices are often used for applications where high precision and accuracy is required. The increase of backlash caused by wear, that is, the increase of the amount by which teeth space exceeds the thickness of gear teeth, might be a significant p...

Virtual Measurement of the Backlash Gap in Industrial Manipulators

Book

Jan 2020

Consumers' Motivations towards Environment-Friendly Dietary Changes: An Assessment of Trends Related to the Consumption of Animal Products

Book

Full-text available

Jan 2020

In the context of global warming and environmental pressure, food chains must adapt to new production conditions while satisfying the evolving consumer demand. Livestock production is known for its negative ecological footprint, bringing forward the question of a possible transition towards more plant-based diets. Citizens' demand evolves at differ...

Inspyred: Bio-inspired algorithms in Python

Article

Nov 2019

Alberto Paolo Tonda

Automatic discovery of 100-miRNA signature for cancer classification using ensemble feature selection

Article

Full-text available

Sep 2019

Background: MicroRNAs (miRNAs) are noncoding RNA molecules heavily involved in human tumors, with few of them circulating in the human body. Finding a tumor-associated signature of miRNA, that is, the minimum miRNA entities to be measured for discriminating both different types of cancer and normal tissues, is of utmost importance. Feature select...

Optimizing Hearthstone agents using an evolutionary algorithm

Article

Full-text available

Sep 2019

Digital collectible card games are not only a growing part of the video game industry, but also an interesting research area for the field of computational intelligence. This game genre allows researchers to deal with hidden information, uncertainty and planning, among other aspects. This paper proposes the use of evolutionary algorithms (EAs) to d...

Scientific challenges in performing life-cycle assessment in the food chain

Article

Full-text available

Jul 2019

This paper gives an overview of the scientific challenges that occur when performing life cycle assessment (LCA) in the food chain. In order to evaluate these risks, Failure Mode and Effect Analysis tool has been used. Challenges related to setting the goal and scope of LCA reveal four hot spots: system boundaries of LCA; functional units used; typ...

Evolutionary discovery of coresets for classification

Conference Paper

Full-text available

Jul 2019

When a machine learning algorithm is able to obtain the same performance given a complete training set, and a small subset of samples from the same training set, the subset is termed coreset. As using a coreset improves training speed and allows human experts to gain a better understanding of the data, by reducing the number of samples to be examin...

Beyond coreset discovery: evolutionary archetypes

Conference Paper

Full-text available

Jul 2019

In machine learning a coreset is defined as a subset of the training set using which an algorithm obtains performances similar to what it would deliver if trained over the whole original data. Advantages of coresets include improving training speed and easing human understanding. Coreset discovery is an open line of research as limiting the trainin...

Annotation data about Multi Criteria Assessment Methods used in the agri-food research: the French National Institute for Agricultural Research (INRA) experience

Article

Full-text available

Jul 2019

This data article contains annotation data characterizing MultiCriteria Assessment (MCA) Methods proposed in the agri-food sector by researchers from INRA, Europe's largest agricultural research institute (INRA, http://institut.inra.fr/en). MCA can be used to assess and compare agricultural and food systems, and support multi-actor decision making a...

Cross-European initial survey on the use of mathematical models in food industry

Article

Full-text available

Jun 2019

Mathematical modelling plays an important role in food engineering, with various mathematical models tailored for different food topics. However, limited information is available on the application of these models in food companies. This paper aims to discuss the extent and the conditions surrounding the usage of mathematical models in th...

A mathematical model for the prediction of the whey protein fouling mass in a pilot scale plate heat exchanger

Article

Jun 2019

A better understanding of protein fouling during the thermal treatment of whey protein concentrate (WPC) solutions is critical for better fouling control. In order to understand the impact of various parameters on the total whey protein fouling mass, a dimensional analysis was applied to the experimental data obtained from a pilot scale plate heat...

Fundamental Flowers: Evolutionary Discovery of Coresets for Classification

Poster

Full-text available

Apr 2019

In an optimization problem, a coreset can be defined as a subset of the input points, such that a good approximation to the optimization problem can be obtained by solving it directly on the coreset, instead of using the whole original input. In machine learning, coresets are exploited for applications ranging from speeding up training time, to hel...

Fundamental Flowers: Evolutionary Discovery of Coresets for Classification

Chapter

Full-text available

Mar 2019

Fundamental Flowers: Evolutionary Discovery of Coresets for Classification

Book

Mar 2019

Multi-Criteria Reverse Engineering for Food: Genesis and Ongoing Advances

Article

Full-text available

Mar 2019

Multi-criteria reverse engineering (MRE) has arisen from the cross-fertilization of advances in mathematics and shifts in social demand. MRE, thus, marks a progressive switch (a) from empirical to formal approaches able to simultaneously factor in diverse parameters, such as environment, economics, and health; (b) from mono-criterion optimization t...

Trade-offs and synergies between livestock production and other ecosystem services

Article

Jan 2019

Formaliser la Durabilité des Paysages Agricoles Comme un Problème Multi-objectif

Book

Jan 2019

Countering Android Malware: A Scalable Semi-Supervised Approach for Family-Signature Generation

Article

Full-text available

Oct 2018

Reducing the effort required by humans in countering malware is of utmost practical value. We describe a scalable, semi-supervised framework to dig into massive datasets of Android applications and identify new malware families. Up to the 2010s, the industrial standard for the detection of malicious applications has been mainly based on signatures;...

VALIS: an evolutionary classification algorithm

Article

Full-text available

Sep 2018

VALIS is an effective and robust classification algorithm with a focus on understandability. Its name stems from Vote-ALlocating Immune System, as it evolves a population of artificial antibodies that can bind to the input data, and performs classification through a voting process. At the beginning of training, VALIS generates a set of random c...

Promoting diversity in evolutionary optimization: why and how

Conference Paper

Jul 2018

Evaluating surrogate models for multi-objective influence maximization in social networks

Conference Paper

Full-text available

Jul 2018

One of the most relevant problems in social networks is influence maximization, that is the problem of finding the set of the most influential nodes in a network, for a given influence propagation model. As the problem is NP-hard, recent works have attempted to solve it by means of computational intelligence approaches, for instance Evolutionary Al...

Ensemble Feature Selection and Meta-Analysis of Cancer miRNA Biomarkers

Preprint

Full-text available

Jun 2018

The role of microRNAs (miRNAs) in cellular processes has captured the attention of many researchers, since their dysregulation is shown to affect the cancer disease landscape by sustaining proliferative signaling, evading programmed cell death, and inhibiting growth suppressors. Thus, miRNAs have been considered important diagnostic and prognostic biomark...

Understanding Cancer Phenomenon at Gene-Expression Level by using a Shallow Neural Network Chain

Chapter

Full-text available

Jun 2018

Exploiting the availability of the largest collection of Patient-Derived Xenografts from metastatic colorectal cancer annotated for response to therapies, this manuscript aims to characterize the biological phenomenon from a mathematical point of view. In particular, we design an experiment in order to investigate how genes interact with each other...

Interactive Machine Learning for Applications in Food Science

Chapter

Jun 2018

The apparent simplicity of food processes often hides complex systems, where physical, chemical and living organisms’ processes co-exist and interact to create the final product. Data can be plagued by uncertainty; heterogeneity of available information is likely; qualitative and quantitative data may also coexist in the same process, from expert p...

Evaluating the potential of Genetic Programming as an exploratory data analysis in soil science

Preprint

Full-text available

Apr 2018

Genetic Programming is a powerful optimization technique, able to deliver high-quality results in several real-world problems. One of its most successful applications is symbolic regression, where the objective is to find a suitable expression to model the underlying relationship between data points, with no a priori assumptions. In this paper,...

Improving Multi-objective Evolutionary Influence Maximization in Social Networks

Conference Paper

Full-text available

Apr 2018

In the context of social networks, maximizing influence means contacting the largest possible number of nodes starting from a set of seed nodes, and assuming a model for influence propagation. The real-world applications of influence maximization are of utmost importance, and range from social studies to marketing campaigns. Building on a previo...

Automated Playtesting in Collectible Card Games using Evolutionary Algorithms: a Case Study in HearthStone

Article

Full-text available

Apr 2018

Collectible card games have been among the most popular and profitable products of the entertainment industry since the early days of Magic: The Gathering™ in the nineties. Digital versions have also appeared, with HearthStone: Heroes of WarCraft™ being one of the most popular. In Hearthstone, every player can play as a hero, from a set of nine,...

Review on environmental models in the food chain - Current status and future perspectives

Article

Mar 2018

The diversity of food systems and their interaction with the environment has been a research topic for many years. Scientists use various models to explain environmental issues of food systems. This paper gives an overview of the main streams in analyzing this topic. A literature review was performed by analyzing published scientific papers on environmen...

Evolutionary Optimization of Convolutional Neural Networks for Cancer miRNA Biomarkers Classification

Article

Jan 2018

Cancer diagnosis is currently undergoing a paradigm shift with the incorporation of molecular biomarkers as part of the routine diagnostic panel. This breakthrough discovery directs researchers to examine the role of microRNA in cancer, since its deregulation is often associated with almost all human tumors. Such differences frequently recur in tumor-sp...

LIDeOGraM: An Interactive Evolutionary Modelling Tool

Chapter

Full-text available

Jan 2018

Workshops at PPSN 2018: 15th International Conference, Coimbra, Portugal, September 8–12, 2018, Proceedings, Part II

Chapter

Jan 2018

Interactive Machine Learning for Applications in Food Science and Technology

Book

Jan 2018

LIDeOGraM: an interactive evolutionary modelling tool

Book

Jan 2018

Building complex models from available data is a challenge in many domains, and in particular in food science. Numerical data are often not structured enough, or simply insufficient to elucidate complex structures: human choices thus have a major impact at various levels. LIDeOGraM is an interactive modelling framework adapted to cases where numerica...

Interactive machine learning for applications in food Science

Book

Jan 2018

The apparent simplicity of food processes often hides complex systems, where physical, chemical and living organisms' processes co-exist and interact to create the final product. Data can be plagued by uncertainty; heterogeneity of available information is likely; qualitative and quantitative data may also coexist in the same process, from expert p...

Questions

Machine learning methods for dynamical systems?

Question

Jun 2020

I am comparing different machine learning techniques for learning dynamical systems (e.g. a system of ordinary differential equations), and so far I've used Long Short-Term Memory (LSTM) networks and other variations of Recurrent Neural Networks, Dynamic Bayesian Networks, and Symbolic Regression.

However, I know only a part of this fascinating domain, so I wanted to ask the community: Can you suggest other state-of-the-art machine learning techniques for learning dynamical systems? Black-box or white-box, it's not important; I am more focused on getting good data fitting for my application.

Thanks in advance for any suggestion :-)

Chains/networks of models: what is the commonly accepted terminology?

Question

Mar 2016

Imagine you have a sequence of models, for example each one being an equation (or a system of equations): they are connected to each other so that the outputs of a model are used as inputs for one or more other models.

Is there a specific terminology to call this structure? I am trying to find literature on the subject, but I realized I am probably missing some keywords. I tried with "model chains", "model networks", and similar names, but I don't feel that's the right nomenclature.

I think there is a specific terminology for sub-categories of this structure: for example, Bayesian networks could be considered a network of models, each node being a probabilistic model described by a conditional probability table. But what if the model inside one of the nodes was deterministic? What would you call the structure, then?

Sorry if the question is naive; in the beginning I thought it would be easy to find an answer, but I found myself skimming through tons of literature without finding anything promising.

Thank you in advance for any help you can provide!
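To make the structure in this question concrete, here is a minimal sketch of models connected so that the outputs of one serve as inputs to others. The model names, the toy equations, and the `run_chain` helper are all hypothetical; the sketch assumes the connections form a directed acyclic graph and uses Python's standard `graphlib` module (Python 3.9+) to order the evaluations.

```python
from graphlib import TopologicalSorter

# Each entry: model name -> (function, names of upstream models it depends on).
# Every function reads the values it needs from a shared dict and returns
# a dict of newly computed values. All equations are toy placeholders.
models = {
    "model_a": (lambda v: {"y": 2 * v["x"] + 1}, []),
    "model_b": (lambda v: {"z": v["x"] ** 2}, []),
    "model_c": (lambda v: {"w": v["y"] + v["z"]}, ["model_a", "model_b"]),
}

def run_chain(models, external_inputs):
    """Evaluate every model after the models it depends on."""
    graph = {name: set(deps) for name, (_, deps) in models.items()}
    values = dict(external_inputs)
    # static_order() yields each node only after all of its predecessors.
    for name in TopologicalSorter(graph).static_order():
        func, _ = models[name]
        values.update(func(values))
    return values

result = run_chain(models, {"x": 3})
print(result["w"])
```

Nothing in this sketch requires the node functions to be deterministic or probabilistic; only the dependency graph is fixed, which is the distinction the question is probing.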

Given the bibliography of a paper, how can you evaluate its quality?

Question

Oct 2015

This is a question I stumbled upon while doing something unrelated, and I think it might be interesting for the community at large.

Given the bibliography someone gathered for a certain paper, is there a way to evaluate whether the bibliography is "good"? Or, more generally, to evaluate its "goodness"?

How would you do that? Surely, if it's missing some fundamental citations it might not be good. But citing too many papers without a good reason is also not very appealing. Is there (gasp!) a metric to assess the quality of a bibliography?

Can anyone explain the behavior of the Pepsin enzyme?

Question

Aug 2014

I recently started studying the behavior of the Pepsin enzyme. As far as I understand, when it interacts with proteins, it starts cutting the links between amino acids: generally speaking, its activity is really high at the beginning of the process, then it slows down following a pattern resembling a logarithmic function, and after a while the cutting almost stops completely.

What I would like to know (and so far I was not able to find in the literature I am studying) is how much of this behavior is due to the size of the proteins, and how much is due to the pepsin itself.

In other words: does the pepsin "slow down" because the chains of amino acids become smaller and smaller; or does it slow down because pepsin's activity simply lowers over time?

For the moment, I read some papers about the degree of hydrolysis of the pepsin when it comes into contact with proteins containing 500-700 amino acids; but what would happen if we used the pepsin with smaller proteins (e.g. 20-30 amino acids)? Would it present the same "logarithmic" behavior? Or would it just be the last part of the logarithm, so very few "cuts" from the start?

I hope this question is not too naive...thank you for your time :-)

http://en.wikipedia.org/wiki/Pepsin

Network

Sergiy Smetana
Deutsches Institut für Lebensmitteltechnik
Kalyan Deb
Michigan State University
Bertrand Thirion
National Institute for Research in Computer Science and Control
Vincent Michel
Rakuten
Gael Varoquaux
National Institute for Research in Computer Science and Control

Amy K Hoover
New Jersey Institute of Technology
Waiching Sun
Columbia University
Juan Julián Merelo Guervós
University of Granada
Antonio Mora
University of Granada
Aletta D Kraneveld
Utrecht University

French National Institute for Agriculture, Food, and Environment (INRAE)

Department TRANSFORM (Food, bioproducts and waste)
Paris, France

Current position

Researcher

Top co-authors

Anet Režek Jambrak
Faculty of Food Technology and Biotechnology University of Zagreb
Marc Barnabé
French National Institute for Agriculture, Food, and Environment (INRAE)
Nisrine Mouhrim
Institut Polytechnique Paris - Mines télécom
Eliana Giovannitti
Comau
Doina Bucur
University of Twente