Figure: Average and standard deviation (in centimetres) of the length and height measured with and without obstacles.

Source publication
Article
Full-text available
Stairs are one of the most common structures present in human-made scenarios, but also one of the most dangerous for those with vision problems. In this work we propose a complete method to detect, locate and parametrise stairs with a wearable RGB-D camera. Our algorithm uses the depth data to determine if the horizontal planes in the scene are valid steps of a staircase...

Contexts in source publication

Context 1
... have excluded the width from the analysis as the view of the stairs may be partial and it is not as relevant as the other measurements. After computing the height and length of the staircases, in both ascending and descending perspectives and from different viewing angles, the results were compared to the real measurements, as shown in Table 2. As we can observe, the values do not deviate strongly even though the model is computed from one single frame. ...
Context 2
... presence of obstacles partially occluding the view of the staircase does not adversely affect the quality of the model, and we get similar results in terms of average measurements. Our experiments in Table 2 show a slightly better standard deviation in the presence of occluding obstacles. This unexpected result is due to the variability of the images in the set and not to the method itself. ...

Citations

... Because stairs are artificially constructed building structures with obvious geometric features, most stair detection methods rely on the extraction of some stair geometric features. For example, line-based extraction methods [1][2][3] abstract stair geometric features as a set of lines continuously distributed in an image, and potential stair lines are extracted in RGB or depth images through Canny edge detection [4], Sobel filtering, Hough transform [5], etc. Plane-based extraction methods [6][7][8] abstract stair geometric features as a set of planes continuously distributed in space, and potential stair surfaces in point cloud data are extracted through algorithms such as random sample consensus (RANSAC) [9] and supervoxel clustering [10]. ...
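As a minimal sketch of the line-based family described above (a generic Canny-plus-Hough pipeline, not any specific cited method; the image path, thresholds and angular tolerance are placeholder assumptions):

import cv2
import numpy as np

# Load a grayscale staircase image (path is a placeholder)
img = cv2.imread("stairs.png", cv2.IMREAD_GRAYSCALE)

# 1. Edge map via Canny edge detection
edges = cv2.Canny(img, threshold1=50, threshold2=150)

# 2. Candidate line segments via the probabilistic Hough transform
lines = cv2.HoughLinesP(edges, rho=1, theta=np.pi / 180,
                        threshold=80, minLineLength=60, maxLineGap=10)

# 3. Keep near-horizontal segments: in a roughly fronto-parallel view,
#    stair edges appear as a stack of approximately parallel lines
stair_lines = []
if lines is not None:
    for x1, y1, x2, y2 in lines[:, 0]:
        angle = np.degrees(np.arctan2(y2 - y1, x2 - x1))
        if abs(angle) < 20:  # angular tolerance is an arbitrary choice
            stair_lines.append((x1, y1, x2, y2))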
Article
Full-text available
Vision-based stair modeling can help autonomous mobile robots deal with the challenge of climbing stairs, especially in unfamiliar environments. To address the problem that current monocular methods struggle to model stairs accurately without depth information in scenes with fuzzy visual cues, this paper proposes a depth-aware stair modeling method for monocular vision. Specifically, we take the prediction of depth images and the extraction of stair geometric features as joint tasks in a convolutional neural network; with the designed information propagation architecture, we can achieve effective supervision of stair geometric feature learning from depth features. In addition, to complete the stair modeling, we take the convex lines, concave lines, tread surfaces and riser surfaces as stair geometric features and apply Gaussian kernels to enable StairNetV3 to predict contextual information within the stair lines. Combined with the depth information obtained by depth sensors, we propose a point cloud reconstruction method that can quickly segment point clouds of stair step surfaces. The experiments show that the proposed method has a significant improvement over the previous best monocular vision method, with an intersection over union increase of 3.4%, and the lightweight version has a fast detection speed and can meet the requirements of most real-time applications.
... The transformation ^C T_F is then computed by aligning the y axis of the new reference frame F with the ground plane normal n_f (Fig. 4). Additionally, the remaining two Manhattan World directions [43] can be recovered by considering the best alignment with the normals of the rest of the scene, following [44]. ...
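As a rough illustration of how such an alignment can be computed (an illustrative sketch, not the authors' implementation; the function name and example normal are made up), the rotation part of ^C T_F can be built from the ground normal n_f alone:

import numpy as np

def rotation_aligning_y_to_normal(n_f):
    # New y axis: the ground plane normal
    y = n_f / np.linalg.norm(n_f)
    # Any reference vector not parallel to y spans the remaining axes
    ref = np.array([1.0, 0.0, 0.0])
    if abs(np.dot(ref, y)) > 0.9:
        ref = np.array([0.0, 0.0, 1.0])
    z = np.cross(ref, y)
    z /= np.linalg.norm(z)
    x = np.cross(y, z)  # completes a right-handed frame
    # Columns are the axes of frame F expressed in camera coordinates,
    # i.e. the rotation part of ^C T_F
    return np.column_stack((x, y, z))

# Example: ground normal roughly opposite the camera's y axis
R = rotation_aligning_y_to_normal(np.array([0.05, -0.99, 0.10]))

The remaining Manhattan directions would then refine x and z against the other scene normals, as the excerpt describes.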
... Here, the most relevant ones are described: Stair detection: Stairs can be a dangerous structure for the visually impaired, being a potential source of accidents. Here, we use the method proposed in [44], [48], which uses RGB-D cameras and thus is straightforward to integrate into our system. This method recovers the pose of the stair with respect to the camera (^C T_S) and the measurements of every step, therefore providing the complete stair model (Fig. 4), useful not only to alert the subject but also for guidance. ...
... For the stairs detection in real settings, we tested several sequences provided by [44] (Fig. 11). In the first example, we can see how the system is able to inform the subject of the presence of both ascending and descending staircases from a large distance, also recovering the full pose and thus enabling the possibility of guiding the user to face the staircase straight or close to the handrails. ...
Article
Full-text available
One of the main challenges of visual prostheses is to augment the perceived information to improve the experience of their wearers. Given the limited access to implanted patients, new techniques are often evaluated via Simulated Prosthetic Vision (SPV) with sighted people in order to facilitate experimentation. In this work, we introduce a novel SPV framework and implementation that presents major advantages with respect to previous approaches. First, it is integrated into a robotics framework, which allows us to benefit from a wide range of methods and algorithms from the field (e.g. object recognition, SLAM, obstacle avoidance, autonomous navigation, deep learning). Second, we go beyond traditional image processing with 3D point cloud processing using an RGB-D camera, allowing us to robustly detect the floor, obstacles and the structure of the scene. Third, it works either with a real camera or in a virtual environment, which gives us endless possibilities for immersive experimentation through a head-mounted display. Fourth, we incorporate a validated temporal phosphene model that replicates time effects into the generation of visual stimuli. Finally, we have proposed, developed and tested several applications within this framework, such as avoiding moving obstacles, providing a general understanding of the scene, staircase detection, helping the subject to navigate an unfamiliar space and reach a destination, and object and person detection. We provide experimental results in real and virtual environments. The code will be publicly available at www.github.com/aperezyus/RASPV.
... Because stairs are artificially constructed building structures with obvious geometric features, most stair detection methods rely on extracting some stair geometric features. For example, line-based extraction methods [1,2,3] abstract the stair geometric features as a set of lines continuously distributed in an image, and potential stair lines are extracted in RGB or depth images through Canny edge detection [4], Sobel filtering, Hough transform [5], etc. Plane-based extraction methods [6,7,8] abstract the stair geometric features as a set of parallel planes continuously distributed in space, and potential stair surfaces in point cloud data are extracted through algorithms such as random sample consensus (RANSAC) [9] and supervoxel clustering [10]. ...
Preprint
Vision-based stair perception can help autonomous mobile robots deal with the challenge of climbing stairs, especially in unfamiliar environments. To address the problem that current monocular vision methods struggle to model stairs accurately without depth information, this paper proposes a depth-aware stair modeling method for monocular vision. Specifically, we take the extraction of stair geometric features and the prediction of depth images as joint tasks in a convolutional neural network (CNN); with the designed information propagation architecture, we can achieve effective supervision of stair geometric feature learning from depth information. In addition, to complete the stair modeling, we take the convex lines, concave lines, tread surfaces and riser surfaces as stair geometric features and apply Gaussian kernels to enable the network to predict contextual information within the stair lines. Combined with the depth information obtained by depth sensors, we propose a stair point cloud reconstruction method that can quickly obtain the point clouds belonging to the stair step surfaces. Experiments on our dataset show that our method achieves a significant improvement over the previous best monocular vision method, with an intersection over union (IOU) increase of 3.4%, and the lightweight version has a fast detection speed and can meet the requirements of most real-time applications. Our dataset is available at https://data.mendeley.com/datasets/6kffmjt7g2/1.
... Point cloud segmentation is a common plane extraction method, and many methods have been developed for stair feature matching. Classifying planes by their normal vectors and eliminating the planes that do not belong to the stairs is a common approach [20, 21]. Sinha et al. [22] present an algorithm for stair detection from point clouds based on a new minimal 3D map representation and the estimation of step-like features that are grouped based on adjacency so that dominant staircase structures emerge. ...
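A minimal sketch of this normal-based plane classification, using Open3D's RANSAC plane segmentation (the thresholds, file path and assumed gravity direction are illustrative choices, not taken from the cited methods):

import numpy as np
import open3d as o3d

# Load a scene point cloud (path is a placeholder)
pcd = o3d.io.read_point_cloud("scene.pcd")

up = np.array([0.0, 0.0, 1.0])  # assumed gravity direction
treads = []
rest = pcd
for _ in range(8):  # extract up to 8 dominant planes
    if len(rest.points) < 100:
        break
    model, inliers = rest.segment_plane(distance_threshold=0.02,
                                        ransac_n=3,
                                        num_iterations=500)
    n = np.asarray(model[:3])
    n /= np.linalg.norm(n)
    # Keep near-horizontal planes (candidate stair treads);
    # the 15-degree tolerance is an arbitrary choice
    if abs(np.dot(n, up)) > np.cos(np.radians(15)):
        treads.append(rest.select_by_index(inliers))
    rest = rest.select_by_index(inliers, invert=True)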
Article
Full-text available
Staircases are some of the most common building structures in urban environments. Stair detection is an important task for various applications, including the environmental perception of exoskeleton robots, humanoid robots and rescue robots, and the navigation of visually impaired people. Most existing stair detection algorithms have difficulty dealing with the diversity of stair structures and materials, extreme lighting and serious occlusion. Inspired by human perception, we propose an end-to-end method based on deep learning. Specifically, we treat the process of stair line detection as a multitask involving coarse-grained semantic segmentation and object detection. The input images are divided into cells, and a simple neural network is used to judge whether each cell contains stair lines. For cells containing stair lines, the locations of the stair lines relative to each cell are regressed. Extensive experiments on our dataset show that our method can achieve 81.49% accuracy, 81.91% recall and a 12.48 ms runtime, and our method has higher performance in terms of both speed and accuracy than previous methods. A lightweight version can even achieve 300+ frames per second at the same resolution.
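To make the per-cell formulation concrete, here is a toy PyTorch head in the spirit of what the abstract describes (the layer sizes, grid resolution and sigmoid parameterisation are our own assumptions, not the paper's architecture):

import torch
import torch.nn as nn

class StairLineHead(nn.Module):
    # For each grid cell: a confidence that the cell contains a stair
    # line, plus regressed endpoints of that line in cell coordinates
    def __init__(self, in_ch=256):
        super().__init__()
        self.cls = nn.Conv2d(in_ch, 1, kernel_size=1)  # line / no line
        self.reg = nn.Conv2d(in_ch, 4, kernel_size=1)  # (x1, y1, x2, y2)

    def forward(self, feat):
        conf = torch.sigmoid(self.cls(feat))  # (B, 1, H_cells, W_cells)
        ends = torch.sigmoid(self.reg(feat))  # normalized to [0, 1] per cell
        return conf, ends

# Example: a 16x16 grid of cells over backbone features
conf, ends = StairLineHead()(torch.randn(1, 256, 16, 16))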
... The obtained class is coded and mapped onto a braille display. In [7], the authors propose a system that uses visual odometry, region growing, Euclidean cluster extraction and depth data to determine if the horizontal planes are valid steps of a staircase. In [8], scene understanding is implemented using deep learning techniques. ...
Article
Full-text available
The aim of this work is to provide a semantic scene synthesis from a single depth image. This is used in assistive aid systems for visually impaired and blind people that allows them to understand their surroundings by the touch sense. The fact that blind people use touch to recognize objects and rely on listening to replace sight motivated us to propose this work. First, the acquired depth image is segmented and each segment is classified in the context of assistive systems using a deep learning network. Second, inspired by the Braille system and the Japanese writing system Kanji, the obtained classes are coded with semantic labels. The scene is then synthesized using these labels and the extracted geometric features. Our system is able to predict more than 17 classes only by understanding the provided illustrative labels. For the remaining objects, their geometric features are transmitted. The labels and the geometric features are mapped on a synthesis area to be sensed by the touch sense. Experiments are conducted on noisy and incomplete data including acquired depth images of indoor scenes and public datasets. The obtained results are reported and discussed.
... Segmentation on point clouds offers a more holistic recognition. Some classical point cloud segmentation works focus on designing principled algorithms for walkable area detection [WKT+17, ZLMAEB19] or stairs navigation [PGLG17, YWCZ15]. Semantic instance segmentation has the potential to enable a global scene understanding. ...
Thesis
Full-text available
Independently exploring unknown spaces or finding objects in an indoor environment is a daily but challenging task for visually impaired people. Previous assistive systems lack depth relationships between the various objects, making it difficult to obtain an accurate spatial layout and the relative positions of objects. Semantic segmentation enables a complete understanding of the surrounding environment. By combining semantic and position information, a high-level indoor scene understanding is possible. In this work, an assistive system based on semantic segmentation is proposed. The entire system consists of three hardware components and two interactive assistive modes. The first mode is designed for holistic indoor detection and avoidance. Based on voice guidance, the point cloud of the most recent state of the changing indoor environment is captured through on-site scanning performed by the user. A point cloud segmentation model is applied in this mode to generate the 3D semantic instance map. After this 3D instance segmentation, the system integrates the information above and interacts with users intuitively by acoustic feedback. The second mode is based on RGB-Depth semantic segmentation. A two-stream multi-modal segmentation framework is proposed, which uses a Feature Rectification Module (FRM) to bi-directionally enhance the features of the current modality. For the feature pairs extracted from the two branches, a Feature Fusion Module (FFM) is applied to merge them for semantic prediction. Image segmentation is much faster than point cloud segmentation, so this mode aims at real-time perception of walkable areas and obstacles. An estimated obstacle distance is calculated by combining the semantic prediction with the corresponding depth map. These two complementary modes realize high-level perception for visually impaired people. The proposed 3D instance segmentation model and 2D RGB-Depth semantic segmentation model achieve leading performance on multiple datasets. Comprehensive field tests with various tasks in a user study verify the usability and effectiveness of the system for assisting visually impaired people in indoor scene understanding.
... Point cloud segmentation is a common plane extraction method, and many methods have been developed for stair feature matching. Classifying planes by their normal vectors and eliminating the planes that do not belong to the stairs is a common approach [25, 26]. [27] presents an algorithm for stair detection from point clouds based on a new minimal 3D map representation and the estimation of step-like features that are grouped based on adjacency so that dominant staircase structures emerge. ...
Preprint
Full-text available
Staircases are some of the most common building structures in urban environments. Stair detection is an important task for various applications, including the environmental perception of exoskeleton robots, humanoid robots and rescue robots, and the navigation of visually impaired people. Most existing stair detection algorithms have difficulty dealing with the diversity of stair structures and materials, extreme lighting and serious occlusion. Inspired by human perception, we propose an end-to-end method based on deep learning. Specifically, we treat the process of stair line detection as a multitask involving coarse-grained semantic segmentation and object detection. The input images are divided into cells, and a simple neural network is used to judge whether each cell contains stair lines. For cells containing stair lines, the locations of the stair lines relative to each cell are regressed. Extensive experiments on our dataset show that our method achieves high performance in terms of both speed and accuracy. A lightweight version can even achieve 300+ frames per second at the same resolution. Our code is available on GitHub.
... It provides a detection of the locomotion terrain(s), i.e. flat ground, stairs, and/or ramp, as output, which reduces the computational load since not all points bring a significant amount of new information. Then, for each point, a normal vector is estimated from its neighboring points based on a Principal Component Analysis (PCA) [2]. After that, the point cloud data with their normals are passed to a clustering and plane-fitting step. ...
... The point cloud data are clustered into several groups using a region growing algorithm. It starts from a point with minimum curvature, called the seed, and expands the region to include neighboring points having quasi-parallel normals and similar curvature [2]. Next, a plane is fitted to each cluster using the RANdom SAmple Consensus (RANSAC) algorithm [3]. ...
... The features of the identified planes can then be rotated into the inertial frame. This approach is expected to be less computationally demanding than the more classical transformations that rotate the whole point cloud before feature extraction [2], [4], [5]. Finally, a classification tree is used for terrain recognition. ...
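A compact, self-contained rendition of the normals-then-clustering pipeline sketched in these excerpts (the parameter values and the omission of the curvature test are our simplifying assumptions, not the chapter's implementation):

import numpy as np
from scipy.spatial import cKDTree

def pca_normals(pts, k=20):
    # Per-point normal from the k nearest neighbors: the eigenvector
    # of the local covariance matrix with the smallest eigenvalue
    tree = cKDTree(pts)
    _, idx = tree.query(pts, k=k)
    normals = np.empty_like(pts)
    for i, nb in enumerate(idx):
        _, vecs = np.linalg.eigh(np.cov(pts[nb].T))  # ascending eigenvalues
        normals[i] = vecs[:, 0]
    return normals

def region_growing(pts, normals, k=20, angle_deg=10.0):
    # Grow regions from seeds over neighbors with quasi-parallel
    # normals (curvature test omitted for brevity)
    tree = cKDTree(pts)
    _, idx = tree.query(pts, k=k)
    labels = np.full(len(pts), -1)
    cos_t = np.cos(np.radians(angle_deg))
    region = 0
    for seed in range(len(pts)):
        if labels[seed] != -1:
            continue
        labels[seed] = region
        stack = [seed]
        while stack:
            p = stack.pop()
            for nb in idx[p]:
                if labels[nb] == -1 and abs(np.dot(normals[p], normals[nb])) > cos_t:
                    labels[nb] = region
                    stack.append(nb)
        region += 1
    return labels

Each resulting cluster would then go through a RANSAC plane fit (see the Open3D sketch earlier) before terrain classification.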
Chapter
Vision-based systems for terrain detection play important roles in mobile robotics, and recently such systems have emerged for locomotion assistance of disabled people. For instance, they can be used as wearable devices to assist blind people or to guide a prosthesis or exoskeleton controller to retrieve gait patterns adapted to the executed task (overground walking, stairs, slopes, etc.). In this paper, we present a computer vision-based algorithm achieving the detection of flat ground, steps, and ramps using a depth camera. Starting from point cloud data collected by the camera, it classifies the environment as a function of extracted features. We further provide a pilot validation in an indoor environment containing a rich set of different types of terrains, even with partial occlusion, and observed that the overall system accuracy is above 94%. The paper further shows that our system needs less computational resources than recently published concurrent approaches, owing to the original transformation method we developed.
... Compared to 2D segmentation-driven assistance, 3D scene parsing systems [6,58,62] fall behind, as these classical point cloud segmentation works focus on designing principled algorithms for walkable area detection [1,48,62] or stairs navigation [44,58,59]. In this work, we devise a 3D semantic instance segmentation system for helping visually impaired people perceive the entire surrounding and provide a top-view understanding, which is critical for various indoor travelling and mapping tasks [20,28,29,33]. ...
... It computes the eigenvalues and eigenvectors of a covariance matrix built from the direct neighboring points. The normal direction is estimated as the eigenvector having the smallest eigenvalue [13]. • The point cloud data is then clustered into several groups using a region growing algorithm. ...
... This algorithm outputs a set of points (a cluster) belonging to the same smooth surface. The region growing algorithm has good physical relevance since the planes are detected from a closed region depending on a single element (the seed) rather than on a set of uncorrelated points distributed in the scene [12], [13]. • Finally, a plane is fitted to each cluster using the RANdom SAmple Consensus (RANSAC) algorithm developed by [18]. ...
... This orientation can be provided, e.g., by an IMU attached to the camera. Typically, this transformation is applied to the whole point cloud just after acquisition, and thus before feature extraction [6], [7], [13]. Here, we applied the transformation to the obtained plane features (centroid and slope), since it is not necessary to rotate and translate all points of the acquired cloud. ...
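The cost argument is easy to see in code: rotating only the plane features is O(number of planes) instead of O(number of points) for the full cloud. A minimal sketch of the rotation step (translation omitted; the names and example values are illustrative):

import numpy as np

def rotate_plane_features(R_wc, centroids, normals):
    # Apply the camera-to-world rotation (e.g. from the IMU) to the
    # plane features only, instead of to every point in the cloud
    return centroids @ R_wc.T, normals @ R_wc.T

# Example: two planes and a 10-degree tilt about the x axis
a = np.radians(10.0)
R_wc = np.array([[1.0, 0.0, 0.0],
                 [0.0, np.cos(a), -np.sin(a)],
                 [0.0, np.sin(a), np.cos(a)]])
c_w, n_w = rotate_plane_features(R_wc,
                                 np.array([[0.0, 0.5, 2.0], [0.0, 0.3, 2.5]]),
                                 np.array([[0.0, 1.0, 0.0], [0.0, 1.0, 0.0]]))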