TABLE 2 - uploaded by Sherry Y. Chen
Content may be subject to copyright.
Source publication
Wrapper feature selection approaches are widely used to select a small subset of relevant features from a dataset. However, Wrappers suffer from the fact that they only use a single classifier when selecting the features. The downside to this approach is that each classifier will have its own biases and will therefore select very different features...
Contexts in source publication
Context 1
... is based on the BN family, another on the DT family and the other on the KNN family. For each family, three classifiers of the same nature are used which are showed in Table 2. The reason for choosing and combining these classifiers to do the feature selection is two- fold. ...
Context 2
... reason for choosing and combining these classifiers to do the feature selection is two- fold. First, previous studies have showed that the three classifiers which belong to each family in Table 2 produce very good performance when used to classify user preference data, e.g. [16] [17]. ...
Similar publications
Class imbalance is a prevalent problem in machine learning which affects the prediction performance of classification algorithms. Software Defect Prediction (SDP) is no exception to this latent problem. Solutions such as data sampling and ensemble methods have been proposed to address the class imbalance problem in SDP. This study proposes a combin...
Developing more efficient automated methods for
interpretable machine learning (ML) is an important and long term
machine-learning goal. Recent studies show that
unintelligible “black” box models, such as Deep Learning
Neural Networks, often outperform more interpretable “grey” or
“white” box models such as Decision Trees, Bayesian networks,
Logic...
The paper deals with the problem of performance optimization of a meta-learning scheme for context-based fault detection. The context-based ensemble classifier is proposed to increase the performance of the fault diagnosis system. The most important problem to solve in this approach is to find optimal structures as well as optimal values of behavio...
Cancer has been portrayed as a heterogeneous disease comprising of a wide range of subtypes. The early diagnosis of a cancer type is very important to determine the course of medical treatment required by the patient. The significance of classifying cancerous cells into benign or malignant has driven many research studies, in the biomedical and the...
Although the negative consequences of noise during induction have been widely studied, previous work often lacks the use of validated data to measure its impact. We propose a framework based on Bayesian Networks for modeling class noise and generating synthetic data sets where the kind and amount of class noise are under control. The benefits of th...