A Two-dimensional Support Vector Machine [14].


Context in source publication

Context 1
... framework. SVM aims to find a hyperplane in an m-dimensional space (with m the number of attributes) that distinctly classifies the data points. To separate two classes of data points, there are many possible hyperplanes; the goal is to find the one with the maximum margin, i.e., the maximum distance between the data points of the two classes. Fig. 1 shows an SVM for a linearly separable binary classification problem. The w and b that solve the following optimization problem determine the ...
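The excerpt breaks off before the optimization problem itself; for reference, the standard hard-margin formulation for a linearly separable problem (a reconstruction of the usual textbook form, not necessarily the exact equation in the source) is

\min_{\mathbf{w},\,b}\ \tfrac{1}{2}\lVert\mathbf{w}\rVert^{2} \quad \text{subject to} \quad y_i\,(\mathbf{w}\cdot\mathbf{x}_i + b) \ge 1, \qquad i = 1, \dots, n,

where y_i ∈ {−1, +1} is the class label of point x_i; the resulting hyperplane w · x + b = 0 separates the two classes with margin 2/‖w‖.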

Similar publications

Chapter
Full-text available
The privatization of education extends beyond the sponsoring of school events, the production of teaching materials by private content providers, and the growing market of commercial tutoring providers. Privatization tendencies are also expressed in various education-policy debates, be it the discussions about the "Sc...

Citations

... Naïve Bayes classifier over encrypted data: To securely construct a Naïve Bayes classifier over secret shares of data, these probabilities should be computed for all possible attribute values and class labels [53]. This requires the secure computation of the following counts over the whole distributed data: the total number of records; the total number of records of a given class that have a given value for a given attribute; and the total number of records of a given class. ...
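As a plaintext illustration of the counts described above (in the cited work these would be computed securely over secret-shared data; the function and variable names below are purely illustrative, not the protocol's API):

from collections import Counter, defaultdict

def naive_bayes_counts(records, labels):
    """records: list of dicts {attribute: value}; labels: list of class labels."""
    n_total = len(records)                 # total number of records
    n_class = Counter(labels)              # number of records per class
    n_attr_class = defaultdict(int)        # records per (attribute, value, class)
    for rec, c in zip(records, labels):
        for attr, val in rec.items():
            n_attr_class[(attr, val, c)] += 1
    return n_total, n_class, n_attr_class

def conditional_prob(attr, val, c, n_class, n_attr_class):
    # P(attribute = val | class = c), estimated from the counts above
    return n_attr_class[(attr, val, c)] / n_class[c]

The prior P(class = c) is likewise n_class[c] / n_total, so these three counts suffice to train the classifier.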
... SVM over encrypted data: To construct an SVM classifier over secret shares of data, the kernel values K(x_i, x_j) must be computed for all pairs i, j, where x_i and x_j are given in protected format (secret shares), and the model parameters should optimally be computed by solving the optimization problem presented in Eq. 15 [53]. Also, we assume that the kernel function is linear, i.e., K(x_i, x_j) = x_i · x_j. ...
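For the assumed linear kernel, the quantities needed are simply the pairwise dot products, i.e., the Gram matrix; a plaintext sketch of what the protocol in [53] would evaluate on secret shares (the function name is illustrative):

import numpy as np

def linear_gram_matrix(X):
    """X: (n_samples, n_features) array of training points."""
    X = np.asarray(X, dtype=float)
    # Entry (i, j) is the dot product x_i · x_j = K(x_i, x_j) for a linear kernel.
    return X @ X.T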
... The Naïve Bayes and SVM classifiers have been trained in an encryption-based setting using 32-bit arithmetic sharing in the ABY framework. The experiments were performed on a single machine running Ubuntu 18.04 LTS with a 64-bit microprocessor and 16 GB of RAM, with an Intel Core i7-4770 at 3.40 GHz x 8 [53]. The total computation and communication costs of training the Naïve Bayes and SVM classifiers on the Adult dataset are shown in Table 5 (for four data providers). ...
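For context, arithmetic sharing represents each value as additive shares modulo 2^32; a minimal plaintext sketch of that representation for four data providers (this is not ABY's actual API, only an illustration of the sharing scheme named above):

import secrets

MOD = 2 ** 32  # 32-bit arithmetic sharing

def share(value, n_parties=4):
    # Split `value` into additive shares modulo 2^32; any n-1 shares reveal nothing.
    parts = [secrets.randbelow(MOD) for _ in range(n_parties - 1)]
    parts.append((value - sum(parts)) % MOD)
    return parts

def reconstruct(parts):
    return sum(parts) % MOD

assert reconstruct(share(12345)) == 12345  # example: share and recombine a value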
Chapter
Cyber-physical systems (CPS) are smart computer systems that control or monitor machines through computer-based algorithms and are vulnerable to both cyber and physical threats. Like a growing number of applications, CPS also employ classification algorithms as a tool for data analysis and continuous monitoring of the system. While the utility of data is significantly important for building an accurate and efficient classifier, free access to the original (raw) data is a crucial challenge due to privacy constraints. Therefore, it is tremendously important to train classifiers in a private setting in which the privacy of individuals is protected, while the data remains practically useful for building the model. In this chapter, we investigate the application of three privacy-preserving models, namely anonymization, Differential Privacy (DP), and cryptography, to privatize data, and evaluate the performance of two popular classifiers, Naïve Bayes and Support Vector Machine (SVM), over the protected data. Their performance is compared in terms of accuracy and training costs, on the same data and in the same private environment. Finally, comprehensive findings on constructing privacy-preserving classifiers are outlined. Attack models against the training data and against the private classifier models are also discussed.
... Few studies in the literature compare the performance of different classifiers in a private setting. The performance of different classifiers trained over private inputs using secure two-party computation [19], anonymization techniques (e.g., k-anonymity) [4], and differential privacy [14] has already been explored. In all of these studies, the impact of the dataset, the inherent properties of the classifiers, and the privacy requirements on the performance of private classifiers has been investigated. ...
... LDP-based Decision Tree Classifier
input : D: dataset {(X1, c1), ..., (Xn, cn)}; A: set of feature domains A = {A1, ..., Ak}; d: depth of the tree; IF: information gain algorithm; parent: start node; l: LDP mechanism; ϵ: privacy budget
output : ...
10  lab ← label of max(count_j);
11  lf ← leaf with value x*_i and label lab;
12  attach lf to parent;
    /* Create a node for every value of the feature and rerun the algorithm from that node */
13  else
14  foreach x*_i ∈ A_j do
15      no ← node with value x*_i;
16      attach no to parent;
        /* The semi-trusted party returns D_t */
17      D_t ← all data from D where A_j has value x*_i;
18      A_t ← A without A_j;
19      LDP-based Decision Tree Classifier(D_t, A_t, d−1, IF, no, l, ϵ);
...
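As a rough plaintext sketch of the recursion in the excerpt above (Laplace-perturbed counts merely stand in for the LDP mechanism l, the feature-scoring step and the semi-trusted party are omitted, and all names are illustrative):

import numpy as np
from collections import Counter

def noisy_counts(labels, epsilon):
    # Laplace-perturbed class counts as a simple stand-in for an LDP count estimate
    counts = Counter(labels)
    return {c: n + np.random.laplace(0.0, 1.0 / epsilon) for c, n in counts.items()}

def build_tree(records, labels, features, depth, epsilon):
    """records: list of dicts {feature: value}; features: dict {feature: domain}."""
    if depth == 0 or not features or not labels:
        counts = noisy_counts(labels, epsilon)
        label = max(counts, key=counts.get) if counts else None  # noisy majority label
        return {"leaf": label}
    # A real implementation would choose the feature with a privately computed
    # information-gain score (IF in the excerpt); here we simply take the first one.
    feat = next(iter(features))
    remaining = {f: dom for f, dom in features.items() if f != feat}
    node = {"feature": feat, "children": {}}
    for value in features[feat]:
        idx = [i for i, r in enumerate(records) if r.get(feat) == value]
        node["children"][value] = build_tree(
            [records[i] for i in idx], [labels[i] for i in idx],
            remaining, depth - 1, epsilon)
    return node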
Chapter
Full-text available
In recent years, Local Differential Privacy (LDP), as a strong privacy-preserving methodology, has been widely deployed in real-world applications. It allows users to perturb their data locally on their own devices before it is sent out for analysis. In particular, LDP serves as an effective solution for the construction of privacy-preserving classifiers. While several approaches in the literature have been proposed to build classifiers over distributed locally differentially private data, an understanding of the difference in the performance of these LDP-based classifiers is currently missing. In this study, we investigate the impact of using LDP on four well-known classifiers, i.e., Naïve Bayes, Decision Tree, Random Forest, and Logistic Regression. We evaluate the impact of dataset properties, LDP mechanisms, the privacy budget, and the classifiers' structure on LDP-based classifiers' performance.
... While some work in the literature compares the impact of privacy in the context of classifier training, e.g., over encrypted data [24] and under differential privacy [14], to the best of our knowledge, no prior work has provided a comparison of the performance achieved by different classifiers when trained on different datasets before and after being anonymized. ...
Chapter
Full-text available
The problem of protecting datasets from the disclosure of confidential information, while published data remains useful for analysis, has recently gained momentum. To solve this problem, anonymization techniques such as k-anonymity, ℓ-diversity, and t-closeness have been used to generate anonymized datasets for training classifiers. While these techniques provide an effective means to generate anonymized datasets, an understanding of how their application affects the performance of classifiers is currently missing. This knowledge enables the data owner and analyst to select the most appropriate classification algorithm and training parameters in order to guarantee high privacy requirements while minimizing the loss of accuracy. In this study, we perform extensive experiments to verify how classifiers' performance changes when trained on an anonymized dataset compared to the original one, and evaluate the impact of classification algorithms, dataset properties, and anonymization parameters on classifiers' performance.
... Specifically, we showed how Naïve Bayes, SVM, and Decision Tree classifiers can be constructed in an ε-DP setting and compared their performance. While some work in the literature compares the impact of privacy in the context of classifier learning, e.g., the costs of training different classifiers using Homomorphic Encryption (Sheikhalishahi and Zannone, 2020), to the best of our knowledge no prior work has focused on the comparison of classifiers' performance in a differential privacy setting. ...
Article
Feature selection has become significantly important for data analysis. It selects the most informative features describing the data to filter out the noise, complexity, and over-fitting caused by less relevant features. Accordingly, feature selection improves predictors' accuracy, enables them to be trained faster and more cost-effectively, and provides a better understanding of the underlying data. While plenty of practical solutions have been proposed in the literature to identify the most discriminating features describing a dataset, an understanding of feature selection over privacy-sensitive data in the absence of a trusted party is still missing. The design of such a framework is especially important in our modern society, where each individual, through accessing the Internet, can simultaneously play the role of a data provider and a data-analysis beneficiary. In this study, we propose a novel feature selection framework based on Local Differential Privacy (LDP), named LDP-FS, which estimates the importance of features over securely protected data while protecting the confidentiality of each individual's data before it leaves the user's device. The performance of LDP-FS in terms of scoring and ordering the features is assessed by investigating the impact of dataset properties, privacy mechanisms, privacy levels, and feature selection techniques on this framework. The accuracy of classifiers trained on the subset of features selected by LDP-FS is also presented. Our experimental results demonstrate the effectiveness and efficiency of the proposed framework.
Article
Full-text available
The application of machine learning techniques to large and distributed data archives might result in the disclosure of sensitive information about the data subjects. Data often contain sensitive identifiable information, and even if these are protected, the excessive processing capabilities of current machine learning techniques might facilitate the identification of individuals, raising privacy concerns. To this end, we propose a decision-support framework for data anonymization, which relies on a novel approach that exploits data correlations, expressed in terms of relaxed functional dependencies (rfds) to identify data anonymization strategies providing suitable trade-offs between privacy and data utility. Moreover, we investigate how to generate anonymization strategies that leverage multiple data correlations simultaneously to increase the utility of anonymized datasets. In addition, our framework provides support in the selection of the anonymization strategy to apply by enabling an understanding of the trade-offs between privacy and data utility offered by the obtained strategies. Experiments on real-life datasets show that our approach achieves promising results in terms of data utility while guaranteeing the desired privacy level, and it allows data owners to select anonymization strategies balancing their privacy and data utility requirements.