Figure 1 | Graphical representation of the restricted Boltzmann machine. The visible layer (observed variables) comprises units v and the hidden layer (latent variables) comprises units h. The layers are fully connected, and the connections W are bidirectional.
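
To make the bipartite structure in the caption concrete, here is a minimal NumPy sketch of a binary RBM; the sizes, initialization, and function names are illustrative assumptions, not code from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: 6 visible units v, 4 hidden units h.
n_visible, n_hidden = 6, 4

# Bidirectional weights W couple every visible unit to every hidden
# unit; a and b are the visible and hidden biases.
W = rng.normal(scale=0.1, size=(n_visible, n_hidden))
a = np.zeros(n_visible)
b = np.zeros(n_hidden)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def energy(v, h):
    """Joint energy E(v, h) = -a.v - b.h - v.W.h of a binary RBM."""
    return -(a @ v) - (b @ h) - (v @ W @ h)

def sample_h_given_v(v):
    """With no hidden-hidden links, p(h | v) factorizes over units."""
    p = sigmoid(b + v @ W)
    return (rng.random(n_hidden) < p).astype(float)

def sample_v_given_h(h):
    """Symmetrically, p(v | h) factorizes over the visible units."""
    p = sigmoid(a + W @ h)
    return (rng.random(n_visible) < p).astype(float)

# One step of block Gibbs sampling: v -> h -> v'.
v = rng.integers(0, 2, n_visible).astype(float)
h = sample_h_given_v(v)
v_new = sample_v_given_h(h)
print(energy(v, h), energy(v_new, h))
```

The same weight matrix W mediates both the upward pass p(h | v) and the downward pass p(v | h), which is what the caption means by the connections being bidirectional.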

Contexts in source publication

Context 1
... one associates a) the average field φ_B and effective action Π[φ_B] with the layer of visible spins v of an RBM-like structure (Fig. 1) and b) φ and bare action S[φ] with the layer of hidden spins h thereof, after suitable discretization, from (9) and (17). This model, however, suffers from a major computational issue. In general, the effective functional Π is not solvable perturbatively in φ_B owing to divergent integrals (similar ...
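
The dictionary sketched in this context can be summarized schematically. The display below is an illustrative paraphrase in standard notation, not a reproduction of the paper's equations (9) and (17).

```latex
% RBM marginal over the hidden spins, i.e. the Boltzmann weight
% effectively seen by the visible layer:
p(v) \;=\; \frac{1}{Z} \sum_{h} e^{-E(v,\,h)}
% Effective action from integrating out the fluctuating field
% (schematic; source terms and the Legendre transform suppressed):
e^{-\Pi[\phi_B]} \;\propto\; \int \mathcal{D}\phi \; e^{-S[\phi]}
% Identification: (\phi_B, \Pi[\phi_B]) <-> visible layer v,
%                 (\phi,   S[\phi])     <-> hidden layer h.
```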

Similar publications

Chapter
Full-text available
Defining and determining personal privacy is difficult; people jeopardize their personal lives and careers due to a lack of discernment. In this paper we propose an automated self-learning privacy assistant without a central entity determining the privacy and without pre-determined or labelled privacy data. We use the web as a data trove to determi...
Preprint
Full-text available
This work presents improvements in monocular hand shape estimation by building on top of recent advances in unsupervised learning. We extend momentum contrastive learning and contribute a structured collection of hand images, well suited for visual representation learning, which we call HanCo. We find that the representation learned by established...
Article
Full-text available
The rapid development of cities makes it complicated to analyse urban structure. It is difficult to study a city's ecosystem using traditional methods such as interviews and surveys. A city generates a great deal of data every day, which can reveal the city's dynamic fabric. The moving patterns of citizens can be illustrated by analysing traffic data on their...
Conference Paper
Full-text available
Real-world data is mostly unlabeled, or only a few instances are labeled. Manually labeling data is a very expensive and daunting task. This calls for unsupervised learning techniques that are powerful enough to achieve results comparable to those of semi-supervised/supervised techniques. Contrastive self-supervised learning has emerged as a powerful direction...

Citations

... Since then, the use of ML in condensed matter physics has rapidly expanded to include a variety of techniques [3][4][5]. These techniques and their applications can be broadly grouped into two categories: supervised ML (SML), in which the input data to train the machine is labeled [6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21]; and unsupervised ML (UML), in which the input data is unlabeled and the machine proposes its own classification scheme [10,[19][20][21][22][23][24][25][26][27][28][29][30][31]. Across these two categories, the identification of thermodynamic phases in various models has remained a central theme. ...
Article
Full-text available
One of the most prominent tasks of machine learning (ML) methods within the field of condensed matter physics has been to classify phases of matter. Given their many successes in performing this task, one may ask whether these methods—particularly unsupervised ones—can go beyond learning the thermodynamic behavior of a system. This question is especially intriguing when considering systems that have a “hidden order”. In this work we study two random spin systems with a hidden ferromagnetic order that can be exposed by applying a Mattis gauge transformation. We demonstrate that the principal component analysis, perhaps the simplest unsupervised ML method, can detect the hidden order, quantify the corresponding gauge variables, and map the original random models onto simpler gauge-transformed ferromagnetic ones, all without any prior knowledge of the underlying gauge transformation. Our work illustrates that ML algorithms can, in principle, identify symmetries of a system that are not manifestly obvious.
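
To make the claim concrete, here is a minimal sketch of the kind of experiment the abstract describes: draw low-temperature configurations of a Mattis-disordered ferromagnet and check that the leading principal component recovers the hidden gauge variables. The system size, noise level, and bare-bones PCA are illustrative assumptions, not the authors' code.

```python
import numpy as np

rng = np.random.default_rng(1)
n_spins, n_samples = 64, 500

# Quenched Mattis gauge variables eps_i = +/-1 (hidden from the PCA).
eps = rng.choice([-1.0, 1.0], size=n_spins)

# Low-temperature ferromagnet in the gauge-transformed frame: each
# sample is mostly aligned, with a small density of thermal spin flips.
global_sign = rng.choice([-1.0, 1.0], size=(n_samples, 1))
flips = rng.choice([1.0, -1.0], p=[0.95, 0.05], size=(n_samples, n_spins))
X = global_sign * flips * eps        # apply the hidden gauge

# PCA by hand: leading eigenvector of the spin-spin correlation matrix.
C = (X.T @ X) / n_samples
eigvals, eigvecs = np.linalg.eigh(C)
pc1 = eigvecs[:, -1]                 # eigh sorts eigenvalues ascending

# Up to a global sign, pc1 should align with the hidden gauge eps.
overlap = abs(np.sign(pc1) @ eps) / n_spins
print(f"overlap with hidden gauge: {overlap:.2f}")
```

Because the off-diagonal correlations are approximately C_ij ≈ m² ε_i ε_j, the leading eigenvector is proportional to ε, which is why PCA can quantify the gauge variables without knowing the transformation.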
... This is where ML can be of immense assistance [7][8][9][10][11]. On the other hand, physicists have also been contributing to ML, drawing inspiration from the theories and techniques developed for computational physics, as well as providing insights into the foundation of AI and ML [12][13][14][15]. One such insight manifests as we study deep learning side by side with the renormalization group (Fig. 1). ...
Preprint
Full-text available
Physicists have had a keen interest in the areas of Artificial Intelligence (AI) and Machine Learning (ML) for some time now, with a special inclination toward unraveling the mechanism at the core of the process of learning. In particular, exploring the underlying mathematical structure of a neural net (NN) is expected not only to help us understand the epistemological meaning of `Learning' but also has the potential to unravel the secrets behind the workings of the brain. Here, it is worthwhile to establish correspondences and draw parallels between methods developed in core areas of Physics and the techniques developed at the forefront of AI and ML. Although recent explorations indicating a mapping between the Renormalisation Group (RG) and Deep Learning (DL) have shown valuable insights, we intend to investigate the relationship between RG and Autoencoders (AE) in particular. We will use Transfer Learning (TL) to embed the procedure of coarse-graining in a NN and compare it with the underlying mechanism of encoding-decoding through a series of tests.
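
As a hedged illustration of what embedding coarse-graining in a NN could mean, the block-spin majority rule below is the kind of fixed map one might freeze into an encoder layer before transfer learning. The block size and the rule itself are illustrative choices, not the preprint's actual architecture.

```python
import numpy as np

def block_majority(spins, b=2):
    """One real-space RG step: majority rule on b-by-b blocks of an
    Ising configuration, with ties broken toward +1. Frozen inside a
    network, this plays the role of a non-trainable encoder layer."""
    L = spins.shape[0]
    block_sums = spins.reshape(L // b, b, L // b, b).sum(axis=(1, 3))
    return np.where(block_sums >= 0, 1, -1)

rng = np.random.default_rng(2)
config = rng.choice([-1, 1], size=(8, 8))   # toy 8x8 Ising configuration
coarse = block_majority(config)             # coarse-grained to 4x4
print(coarse)
```

Comparing the representation such a frozen coarse-graining layer produces with the bottleneck code of a trained autoencoder is one concrete way to run the RG-versus-AE tests the abstract proposes.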
... Post-hoc local explanations and feature relevance techniques are increasingly the most adopted methods for explaining DNNs [1]. Despite having some success, these techniques suffer from two major limitations: (1) lack of consensus on an objective definition of interpretability for deep learning models and (2) a trade-off between interpretability and performance of these models [1]. A resolution to both of these problems can be achieved if the interpretation of such models can be provided by a formal logic system (as used by symbolic AI approaches) which is directly obtained from the underlying model. ...
Preprint
Full-text available
Deep learning models are widely used for various industrial and scientific applications. Even though these models have achieved considerable success in recent years, the machine learning community still lacks an understanding of the rationale behind decisions made by such systems. This problem of interpretability is further aggravated by the increasing complexity of such models. This paper utilizes concepts from machine learning, quantum computation and quantum field theory to demonstrate how a many-valued quantum logic system naturally arises in a specific class of generative deep learning models called Convolutional Deep Belief Networks. It provides a robust theoretical framework for constructing deep learning models equipped with the interpretability of many-valued quantum logic systems without compromising their computing efficiency.
... Among these several learning algorithms, one of the most popular is K-Means, another clustering methodology. In clustering, the given datasets are divided into several parts or clusters based on their similarities and properties [8][12]. In the above-mentioned figure, in the unsupervised section, an image dataset is given that includes several images of different fruits. ...
... 12: AE b (b) cluster obtained at epoch 3. The colors red, blue, green, cyan, magenta, lawn green, orange, and grey represent 8 different classes. Due to the higher number of data points in the extended dataset, all the classes are clearly visible in Figure 4.12 above. This time the clusters are visibly separable. ...
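
For concreteness, a minimal sketch of the K-Means procedure described in the first excerpt above; the two Gaussian blobs stand in for feature vectors of the fruit images, and all names and sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)

def kmeans(X, k, n_iter=100):
    """Plain K-Means: alternate between assigning each point to its
    nearest centroid and moving each centroid to its cluster mean."""
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Assignment step: index of the closest centroid per point.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update step: cluster means (keep the old centroid if empty).
        new = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                        else centroids[j] for j in range(k)])
        if np.allclose(new, centroids):
            break
        centroids = new
    return labels, centroids

# Two illustrative 2-D blobs standing in for image feature vectors.
X = np.vstack([rng.normal(0.0, 0.5, (50, 2)),
               rng.normal(4.0, 0.5, (50, 2))])
labels, centroids = kmeans(X, k=2)
print(centroids)
```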
Thesis
Full-text available
In this thesis, self-supervised learning is used to enhance process data monitoring with the help of ML. Industrial process datasets are not easy to acquire, and often the dataset might not be large enough to train the system as desired. Therefore, a Self-Supervised Learning (SSL) method is used to build a model in order to overcome this small-dataset problem. In the first phase, the proposed model learns about the given process dataset and classifies it based on normal and fault-case operation properties. In the next phase, the model tries to generate data points which mimic the original data points in the original dataset. This generated dataset is later combined with the original dataset, enabling the system to learn about all the fault cases and non-fault operation. The extended dataset, which is a combination of the original and generated datasets, can be used for predictive maintenance, process monitoring, and optimization.
... Since then, this field has exploded with a variety of ML applications [3][4][5]. These techniques and applications can be broadly grouped into two categories: supervised ML (SML), in which the input data is labelled to train the machine [6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21]; and unsupervised ML (UML), in which the input data is unlabelled and the machine proposes its own classification scheme [19][20][21][22][23][24][25][26][27][28][29][30][31][32]. The classification of phases in various models, a major task of the condensed matter physicist, has remained central among these applications. ...
Preprint
Full-text available
In condensed matter physics, one of the goals of machine learning is the classification of phases of matter. The consideration of a system's symmetries can significantly assist the machine in this goal. We demonstrate the ability of an unsupervised machine learning protocol, the Principal Component Analysis method, to detect hidden quenched gauge symmetries introduced via the so-called Mattis gauge transformation. Our work reveals that unsupervised machine learning can identify hidden properties of a model and may therefore provide new insights into the models themselves.
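
For reference, the Mattis gauge transformation invoked in this abstract (and in the related article above) takes the standard textbook form:

```latex
s_i \;\to\; \epsilon_i s_i,
\qquad
J_{ij} \;\to\; \epsilon_i J_{ij}\, \epsilon_j,
\qquad
\epsilon_i \in \{+1, -1\},
```

so that Mattis couplings J_ij = J ε_i ε_j are mapped onto a uniform ferromagnet with J_ij = J; this uniform order is the "hidden" structure the PCA is asked to expose.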
... At this point, input from theoretical physics can be proven beneficial. Among the various ideas invoked in the interface between the theoretical description of physical systems and machine learning to interpret and improve (deep) learning algorithms geometrization [16], variational approaches [17][18][19] and classical thermodynamics [20,21] have been proposed. In [22][23][24][25][26][27] ideas from the Renormalization Group flow are used to comprehend the flow of configurations triggered by generative models after training on systems from condensed matter physics. ...
Preprint
Focusing on the grand-canonical extension of the ordinary restricted Boltzmann machine, we suggest an energy-based model for feature extraction that uses a layer of hidden units of varying size. By an appropriate choice of the chemical potential, and given a sufficiently large number of hidden resources, the generative model is able to efficiently deduce the optimal number of hidden units required to learn the target data with exceedingly small generalization error. The formal simplicity of the grand-canonical ensemble, combined with a rapidly converging ansatz in mean-field theory, enables us to recycle well-established numerical algorithms during training, like contrastive divergence, with only minor changes. As a proof of principle, and to demonstrate the novel features of grand-canonical Boltzmann machines, we train our generative models on data from the Ising theory and MNIST.
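
Schematically, and as an assumption about notation rather than a quotation from the abstract, a grand-canonical RBM energy can be written with a chemical potential μ coupled to the number of active hidden units:

```latex
E_{\mu}(v, h) \;=\; -\,a^{\top} v \;-\; b^{\top} h \;-\; v^{\top} W h \;-\; \mu \sum_{j} h_j ,
```

so that tuning μ penalizes or rewards switching hidden units on, letting the model select its effective hidden-layer size during training.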
... Moreover, unsupervised learning encompasses a wide variety of methods and algorithms. From contrastive learning for sentence embeddings to variational autoencoders for representation learning, unsupervised techniques continue to evolve and find applications in diverse areas such as image fusion, anomaly detection, and fault diagnosis (Shah, 2019). By exploiting the natural structure of the data to learn meaningful representations, these methods enable tasks such as denoising, feature extraction, and transfer learning without explicit labels (Ferzo, 2024; Zhao & Wang, 2022). ...
Article
Physicists have had a keen interest in the areas of Artificial Intelligence (AI) and Machine Learning (ML) for a while, with a special inclination toward unraveling the fundamental mechanism behind the process of learning. In particular, exploring the underlying mathematical structure of a neural net (NN) is expected to not only help us understand the epistemological meaning of ‘Learning’ but also has the potential to unravel the secrets behind the workings of the brain. Here, it is worthwhile to establish correspondences and draw parallels between methods developed in core areas of Physics and the techniques developed at the forefront of AI and ML. Although recent explorations indicating a mapping between the Renormalisation Group (RG) and Deep Learning (DL) have shown valuable insights, we intend to investigate the relationship between RG and Autoencoders (AE) in particular. We will use Transfer Learning (TL) to embed the coarse-graining procedure in a NN and compare it with the underlying mechanism of encoding-decoding through a series of tests.