Radu Tudoran
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires | IRISA

Reads: 19,765

Citations: 569

Publications

SPICE: Streaming PCA Fault Identification and Classification Engine in Predictive Maintenance

Chapter

Mar 2020

Data-driven predictive maintenance needs to understand high-dimensional “in-motion” data, for which fundamental machine learning tools, such as Principal Component Analysis (PCA), require computation-efficient algorithms that operate near-real-time. Despite the different streaming PCA flavors, there is no algorithm that precisely recovers the princ...

An Online Incremental Clustering Framework for Real-Time Stream Analytics

Conference Paper

Dec 2019

Dimensionality Reduction for Low-Latency High-Throughput Fraud Detection on Datastreams

Conference Paper

Dec 2019

NARPCA: Neural Accumulate-Retract PCA for Low-Latency High-Throughput Processing on Datastreams

Chapter

Sep 2019

The increasingly interconnected and instrumented world provides a deluge of data generated by multiple sensors in the form of continuous streams. Efficient stream processing needs control over the number of useful variables. This is because maintaining data structure in reduced sub-spaces, given that data is generated at high frequencies and is ty...

STARLORD: Sliding Window Temporal Accumulate-Retract Learning for Online Reasoning on Datastreams

Conference Paper

Dec 2018

Moira: A Goal-Oriented Incremental Machine Learning Approach to Dynamic Resource Cost Estimation in Distributed Stream Processing Systems

Conference Paper

Aug 2018

The need for real-time analysis is still spreading and the number of available streaming sources is increasing. The recent literature has plenty of works on Data Stream Processing (DSP). In a streaming environment, the data incoming rate varies over time. The challenge is how to efficiently deploy these applications in a cluster. Several works have...

KerA: Scalable Data Ingestion for Stream Processing

Conference Paper

Full-text available

Jul 2018

Exploring Shared State in Key-Value Store for Window-Based Multi-pattern Streaming Analytics

Conference Paper

May 2017

Data Multiverse: The Uncertainty Challenge of Future Big Data Analytics

Conference Paper

Full-text available

Feb 2017

With the explosion of data sizes, extracting valuable insight out of big data becomes increasingly difficult. New challenges begin to emerge that complement traditional, long-standing challenges related to building scalable infrastructure and runtime systems that can deliver the desired level of performance and resource efficiency. This vision pape...

OverFlow: Multi-Site Aware Big Data Management for Scientific Workflows on Clouds

Article

Jan 2015

The global deployment of cloud datacenters is enabling large scale scientific workflows to improve performance and deliver fast responses. This unprecedented geographical distribution of the computation is doubled by an increase in the scale of the data handled by such applications, bringing new challenges related to the efficient data management a...

Big Data Storage and Processing on Azure Clouds: Experiments at Scale and Lessons Learned

Chapter

Nov 2014

Data-intensive computing is now starting to be considered as the basis for a new, fourth paradigm for science. Two factors are encouraging this trend. First, vast amounts of data are becoming available in more and more application areas. Second, the infrastructures allowing to persistently store these data for sharing and processing are becoming a...

Transfer as a Service: Towards a Cost-Effective Model for Multi-Site Cloud Data Management

Article

Oct 2014

The global deployment of cloud datacenters is enabling large web services to deliver fast response to users worldwide. This unprecedented geographical distribution of the computation also brings new challenges related to the efficient data management across sites. High throughput, low latencies, cost- or energy-related trade-offs are just a few con...

JetStream: Enabling High Performance Event Streaming across Cloud Data-Centers

Article

May 2014

The easily-accessible computation power offered by cloud infrastructures coupled with the revolution of Big Data are expanding the scale and speed at which data analysis is performed. In their quest for finding the Value in the 3 Vs of Big Data, applications process larger data sets, within and across clouds. Enabling fast data transfers across geo...

Achieving high throughput for large scale event streaming across geographically distributed data-centers with JetStream

Article

May 2014

The increasing scale at which data processing is being performed nowadays calls for data management systems that enable high-performance data exchanges among geographically remote instances of large web services. In this demonstration we show how JetStream can increase the transfer rate of events which are streamed between geographically remote clo...

Bridging Data in the Clouds: An Environment-Aware System for Geographically Distributed Data Transfers

Conference Paper

May 2014

Today's continuously growing cloud infrastructures provide support for processing ever increasing amounts of scientific data. Cloud resources for computation and storage are spread among globally distributed datacenters. Thus, to leverage the full computation power of the clouds, global data processing across multiple sites has to be fully enabled....

Evaluating Streaming Strategies for Event Processing Across Infrastructure Clouds

Conference Paper

May 2014

Infrastructure clouds revolutionized the way in which we approach resource procurement by providing an easy way to lease compute and storage resources on short notice, for a short amount of time, and on a pay-as-you-go basis. This new opportunity, however, introduces new performance trade-offs. Making the right choices in leveraging different types...

[Figure previews: "Top: Representation of the computational framework: given the data, a..."; "Overview of the multi site deployment of a hierarchical Tomus-MapReduce..."; "Configuration used for the experiment. (Lines 1–3): Covariates, 10,000..."; "Results of the real data analysis procedure. (Left) predictive accuracy..."]

Machine Learning Patterns for Neuroimaging-Genetic Studies in the Cloud

Article

Full-text available

Apr 2014

Brain imaging is a natural intermediate phenotype to understand the link between genetic information and behavior or brain pathologies risk factors. Massive efforts have been made in the last few years to acquire high-dimensional neuroimaging and genetic data on large cohorts of subjects. The statistical analysis of such data is carried out with in...

Adaptive File Management for Scientific Workflows on the Azure Cloud

Conference Paper

Full-text available

Oct 2013

Scientific workflows typically communicate data between tasks using files. Currently, on public clouds, this is achieved by using the cloud storage services, which are unable to exploit the workflow semantics and are subject to low throughput and high latencies. To overcome these limitations, we propose an alternative leveraging data locality throu...

DataSteward: Using Dedicated Compute Nodes for Scalable Data Management on Public Clouds

Conference Paper

Jul 2013

A large spectrum of scientific applications, some generating data volumes exceeding petabytes, are currently being ported on clouds to build on their inherent elasticity and scalability. One of the critical needs in order to deal with this "data deluge" is an efficient, scalable and reliable storage. However, the storage services proposed by cloud...

SAGE: Geo-Distributed Streaming Data Analysis in Clouds

Conference Paper

May 2013

The continuous growth of sensor networks, stock exchanges, climate monitoring or scientific applications produces new streaming data at increasing rates. Managing and processing such data, sometimes generated from multiple geographical locations, raises important challenges as it requires real-time processing or data aggregation. Conventional solut...

TomusBlobs: Scalable Data-intensive Processing on Azure Clouds

Article

May 2013

The emergence of cloud computing has brought the opportunity to use large-scale compute infrastructures for a broader and broader spectrum of applications and users. As the cloud paradigm gets attractive for the ‘elasticity’ in resource usage and associated costs (the users only pay for resources actually used), cloud applications still suffer from...

MapIterativeReduce: A Framework for Reduction-Intensive Data Processing on Azure Clouds

Article

Jun 2012

With the emergence of cloud computing as an alternative to supercomputers to support data intensive applications, MapReduce has arisen as a major programming model for data analysis on clouds. In this context, reduce-intensive algorithms are becoming increasingly useful in applications such as data clustering, classification and mining. However, pl...

A-Brain: Using the Cloud to Understand the Impact of Genetic Variability on the Brain

Conference Paper

Full-text available

May 2012

Joint genetic and neuroimaging data analysis on large cohorts of subjects is a new approach used to assess and understand the variability that exists between individuals. This approach has remained poorly understood so far and brings forward very significant challenges, as progress in this field can open pioneering directions in biology and medicin...

TomusBlobs: Towards Communication-Efficient Storage for MapReduce Applications in Azure

Article

May 2012

The emergence of cloud computing brought the opportunity to use large-scale compute infrastructures for a broad spectrum of applications and users. As the cloud paradigm gets attractive for the "elasticity" in resource usage and associated costs (the users only pay for resources actually used), cloud applications still suffer from the high latenc...

A Performance Evaluation of Azure and Nimbus Clouds for Scientific Applications

Article

Full-text available

Apr 2012

The emergence of cloud computing brought the opportunity to use large-scale computational infrastructures for a broad spectrum of scientific applications. As more and more cloud providers and technologies appear, scientists are faced with an increasingly difficult problem of evaluating various offerings, like public and private clouds, and deciding...

FPGA-based Monte-Carlo computation of the electric potential in homogeneous and non-homogeneous spaces

Article

Sep 2011

This paper presents a general solution for computing the electric potential in homogeneous and non-homogeneous media using a Monte Carlo-based method. The implementation relies on an original framework that uses FPGAs to improve the computational speed. The calculation process relies on a series of both geometric and electric parameters describing...

FPGA-based computation of the inductance of coils used for the magnetic stimulation of the nervous system

Article

May 2011

In recent years, interest in magnetic stimulation of human nervous tissue has increased considerably, because this technique has proved its utility and applicability both as a diagnostic and as a treatment instrument. Research in this domain is aimed at removing some of the disadvantages of the technique: the lack of focalization of the s...

Multipliers for Floating-Point Double Precision and Beyond on FPGAs

Article

Full-text available

Jan 2011

The implementation of high-precision floating-point applications on reconfigurable hardware requires large multipliers. Full multipliers are the core of floating-point multipliers. Truncated multipliers, trading resources for a well-controlled accuracy degradation, are useful building blocks in situations where a full multiplier is not needed. This...

Exploiting Crosstalk Effects in FPGAs for Generating True Random Numbers

Conference Paper

Jan 2011

This paper presents a new method for implementing TRNGs in FPGA devices, which relies on filling a region or the whole FPGA chip close to its maximal capacity and exploiting the interconnection network as intensely as possible. This way, there are strong chances for the design to exhibit a nondeterministic behavior. Our first design is a computatio...

An FPGA-specific approach to floating-point accumulation and sum-of-products

Conference Paper

Jan 2009

This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators which are more efficient and more accurate than those offered by processors and GPUs. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumulator is propose...

Implementing True Random Number Generators in FPGAs by Chip Filling.

Conference Paper

Jan 2009

This paper presents a new method for implementing TRNGs in FPGA devices. The design is based on filling the chip close to its maximal capacity and exploiting the interconnection network as intensely as possible. This way, there are strong chances for the design to exhibit a nondeterministic behavior. Our design is a computationally intensive core t...

Implementing true random number generators by generating crosstalk effects in FPGA chips

Conference Paper

Sep 2009

This paper presents an original method for creating TRNGs in Xilinx FPGAs. The design is based on agglomerating active logic in a given region of the FPGA chip, either globally or locally. No timing constraints were used in this design. A series of experiments conducted on different architectural variants led to the conclusion that mapping logic b...

An FPGA-specific Approach to Floating-Point Accumulation and Sum-of-Products

Article

Full-text available

Dec 2008

Floating-point operators on FPGAs do not have to be identical to the ones available in processors. This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumula...

Software Random Number Generation Based on Race Conditions

Conference Paper

Full-text available

Oct 2008

The paper presents a new software strategy for generating true random numbers, by creating several threads and letting them compete unsynchronized for a shared variable, whose value is read-modified-updated by each thread repeatedly. The generated sequence of random numbers consists of the final values of the shared variable. Our strategy is based...

When FPGAs are better at floating-point than microprocessors

Article

Jul 2008

It has been shown that FPGAs could outperform high-end microprocessors on floating-point computations thanks to massive parallelism. However, most previous studies re-implement in the FPGA the operators present in a processor. This is a safe and relatively straightforward approach, but it doesn't exploit the greater flexibility of the FPGA. This ar...

FPGA-Based Acceleration of the Computations Involved in Transcranial Magnetic Stimulation

Conference Paper

Apr 2008

In recent years, interest in magnetic stimulation of human nervous tissue has increased, because this technique has proved its utility and applicability both as a diagnostic and as a treatment instrument. Research in this domain is aimed at eliminating some disadvantages of the technique: the lack of focalization of the stimulated human b...

Computing the inductance of coils used for transcranial magnetic stimulation with FPGA devices

Conference Paper

Feb 2008

When FPGAs are better at floating-point than microprocessors

Conference Paper

Full-text available

Feb 2008

It has been shown that FPGAs could outperform high-end microprocessors on floating-point computations thanks to massive parallelism. However, most previous studies re-implement in the FPGA the operators present in a processor. This conservative approach is relatively straightforward, but it doesn't exploit the greater flexibility of the FPGA. We su...

FPGA-Based Computation of the Inductance of Coils Used for the Magnetic Stimulation of the Nervous System.

Conference Paper

Full-text available

Jan 2008

An FPGA-specific Approach to Floating-Point Accumulation and Sum-of-Products (LIP Research Report RR2008-22)

Article

Full-text available

Jan 2008

This article studies two common situations where the flexibility of FPGAs allows one to design application-specific floating-point operators which are more efficient and more accurate than those offered by processors and GPUs. First, for applications involving the addition of a large number of floating-point values, an ad-hoc accumulator is propos...

When FPGAs are better at floating-point than microprocessors

Article

Jan 2008

Accelerating the computation of the physical parameters involved in transcranial magnetic stimulation using FPGA devices

Article

Full-text available

Jan 2007

In recent years, interest in magnetic stimulation of human nervous tissue has increased, because this technique has proved its utility and applicability both as a diagnostic and as a treatment instrument. Research in this domain is aimed at eliminating some disadvantages of the technique: the lack of focalization of the stimulated human...

Data Storage in Clouds

Article

Full-text available

Optimizing data storage for MapReduce applications in the Azure Clouds

Article

Full-text available

In this report we address the problem of data management in clouds for the MapReduce programming model. In order to improve the performance of data-intensive applications, we designed a distributed file system deployed on the computation nodes of public clouds. This approach exploits the data locality principle by moving the data close to the comp...

Network

Ian Foster, University of Chicago
Joel Saltz, Stony Brook University
Edouard Duchesnay, Atomic Energy and Alternative Energies Commission
Ewa Deelman, University of Southern California
Gideon Juve, University of Southern California

Martin Kumm, University of Applied Sciences Fulda
Marisa Lopez-Vallejo, Universidad Politécnica de Madrid
Gabriel Antoniu, National Institute for Research in Computer Science and Control
Claudio Fernando Resin Geyer, Universidade Federal do Rio Grande do Sul
Luca Benini, University of Bologna

IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires

Rennes, France

Top co-authors

Bertrand Thirion, National Institute for Research in Computer Science and Control
Gael Varoquaux, National Institute for Research in Computer Science and Control
Vincent Frouin, Atomic Energy and Alternative Energies Commission
Bogdan Nicolae, Argonne National Laboratory
Marcella Rietschel, Central Institute of Mental Health

All co-authors (50)
