Dinei Florencio's research works | Microsoft, Washington and other places

What is this page?

This page lists the scientific contributions of an author, who either does not have a ResearchGate profile, or has not yet added these contributions to their profile.

It was automatically created by ResearchGate to create a record of this author's body of work. We create such pages to advance our goal of creating and maintaining the most comprehensive scientific repository possible. In doing so, we process publicly available (personal) data relating to the author as a member of the scientific community.

If you're a ResearchGate member, you can follow this page to keep up with this author's work.

If you are this author, and you don't want us to display this page anymore, please let us know.

A Fusion Framework for Camouflaged Moving Foreground Detection in the Wavelet Domain

Article

Apr 2018

Detecting camouflaged moving foreground objects has been known to be difficult due to the similarity between the foreground objects and the background. Conventional methods cannot distinguish the foreground from background due to the small differences between them and thus suffer from under-detection of the camouflaged foreground objects. In this p...

Distance-Based Probability Model for Octree Coding

Article

Full-text available

Apr 2018

We present a context-driven method to encode nodes of an octree, which is typically used to encode point cloud geometry. Instead of using one bit per node of the tree, the context allows for deriving probabilities for that node based on distances of the actual voxel to voxels in a reference point cloud. Accurate probabilities of the node state allo...

Deep Learning Based Speech Beamforming

Conference Paper

Apr 2018

Deep Learning Based Speech Beamforming

Article

Full-text available

Feb 2018

Multi-channel speech enhancement with ad-hoc sensors has been a challenging task. Speech model guided beamforming algorithms are able to recover natural sounding speech, but the speech models tend to be oversimplified or the inference would otherwise be too complicated. On the other hand, deep learning based enhancement approaches are able to learn...

Experimental setup. (A) The participants were equipped with the VR...

Results from all experiments. (A) Box-plots of the auditory remapping...

Generic HRTFs May be Good Enough in Virtual Reality. Improving Source Localization through Cross-Modal Plasticity

Article

Full-text available

Feb 2018

Auditory spatial localization in humans is performed using a combination of interaural time differences, interaural level differences, as well as spectral cues provided by the geometry of the ear. To render spatialized sounds within a virtual reality (VR) headset, either individualized or generic Head Related Transfer Functions (HRTFs) are usually...

Video V1

Data

Feb 2018

The experimental procedure, setup, and conditions can be seen here: https://youtu.be/i97RvpXO0s4.

Supplementary Material 2

Data

Feb 2018

Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays

Conference Paper

Aug 2017

Speech Enhancement Using Bayesian Wavenet

Conference Paper

Full-text available

Aug 2017

Foreground Detection in Camouflaged Scenes

Conference Paper

Jul 2017

Foreground detection has been widely studied for decades due to its importance in many practical applications. Most of the existing methods assume foreground and background show visually distinct characteristics and thus the foreground can be detected once a good background model is obtained. However, there are many situations where this is not the...

Supplementary Material 1

Data

Jun 2017

Supplementary Material 2

Data

Jun 2017

Concurrent talking in immersive virtual reality: On the dominance of visual speech cues

Article

Full-text available

Jun 2017

Humans are good at selectively listening to specific target conversations, even in the presence of multiple concurrent speakers. In our research, we study how auditory-visual cues modulate this selective listening. We do so by using immersive Virtual Reality technologies with spatialized audio. Exposing 32 participants to an Information Masking Tas...

Interpolation of Head-Related Transfer Functions Using Manifold Learning

Article

Jan 2017

We propose a new Head-Related Transfer Function (HRTF) interpolation method using Isomap, a nonlinear dimensionality reduction technique. First, we construct a single manifold for all subjects across both azimuth and elevation angles through the construction of an Intersubject Graph (ISG) that includes important prior knowledge of the HRTFs such as...

A Kinect-Based Wearable Face Recognition System to Aid Visually Impaired Users

Article

Full-text available

Sep 2016

In this paper, we introduce a real-time face recognition (and announcement) system targeted at aiding the blind and low-vision people. The system uses a Microsoft Kinect sensor as a wearable device, performs face detection, and uses temporal coherence along with a simple biometric procedure to generate a sound associated with the identified person,...

Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks

Conference Paper

Full-text available

May 2016

In this paper we consider the problem of speech enhancement in real-world like conditions where multiple noises can simultaneously corrupt speech. Most of the current literature on speech enhancement focus primarily on presence of single noise in corrupted speech which is far from real-world environments. Specifically, we deal with improving speech...

A Manifold Learning Approach for Personalizing HRTFs from Anthropometric Features

Article

Mar 2016

We present a new anthropometry-based method to personalize head-related transfer functions (HRTFs) using manifold learning in both azimuth and elevation angles with a single nonlinear regression model. The core element of our approach is a domain-specific nonlinear dimensionality reduction technique, denominated Isomap, over the intraconic componen...

Maximum a posteriori estimation of room impulse responses

Conference Paper

Apr 2015

3D numerical modeling of parametric speaker using finite-difference time-domain

Conference Paper

Apr 2015

Point cloud attribute compression with graph transform

Article

Jan 2015

Compressing attributes on 3D point clouds such as colors or normal directions has been a challenging problem, since these attribute signals are unstructured. In this paper, we propose to compress such attributes with graph transform. We construct graphs on small neighborhoods of the point cloud by connecting nearby points, and treat the attributes...

Arbitrarily Shaped Motion Prediction for Depth Video Compression Using Arithmetic Edge Coding

Article

Aug 2014

Depth image compression is important for compact representation of 3D visual data in "texture-plus-depth" format, where texture and depth maps from one or more viewpoints are encoded and transmitted. A decoder can then synthesize a freely chosen virtual view via depth-imagebased rendering (DIBR) using nearby coded texture and depth maps as referenc...

Anthropometric-based customization of head-related transfer functions using Isomap in the horizontal plane

Conference Paper

Full-text available

May 2014

In this paper, we introduce a new anthropometric-based method for customizing of Head-Related Transfer Functions (HRTF) in the horizontal plane. The method uses Isomap, artificial neural networks (ANN), and a neighborhood-based reconstruction procedure. We first modify Isomap's graph construction step to emphasize the individuality of HRTFs and per...

Sparse Array-Based Room Transfer Function Estimation for Echo Cancellation

Article

Feb 2014

A number of applications in acoustics, such as echo cancellation, require learning the acoustic impulse response from each deployed loudspeaker to each microphone- the room transfer function. This has conventionally been done separately at each microphone for each loudspeaker. However, the signals arriving at the array share a common structure, whi...

Learning how to increase the chance of human-robot engagement

Conference Paper

Nov 2013

The increasing use of mobile robots in social contexts makes it important to provide them with the ability to behave in the most socially acceptable way possible. In this paper we investigate the problem of making a robot learn how to approach a person in order to increase the chance of a successful engagement. We propose the use of Gaussian Proces...

Autonomous person following for telepresence robots

Conference Paper

May 2013

We present a method for a mobile robot to follow a person autonomously where there is an interaction between the robot and human during following. The planner takes into account the predicted trajectory of the human and searches future trajectories of the robot for the path with the highest utility. Contrary to traditional motion planning, instead...