The camera locations estimated by the relocalization module (the blue dots) and the tracking module (the red dots). a Camera locations of V1. b Camera locations of V2. c Camera locations of V3 (color figure online)

Source publication

Real-time SLAM relocalization with online learning of binary feature indexing

Article

Full-text available

Nov 2017

A visual simultaneous localization and mapping (SLAM) system usually contains a relocalization module to recover the camera pose after tracking failure. The core of this module is to establish correspondences between map points and key points in the image, which is typically achieved by local image feature matching. Since recently emerged binary fe...

Accelerated DNA-SLAM for RGB-D images

Conference Paper

Full-text available

Feb 2018

In the highly active research field of Simultaneous Localization And Mapping (SLAM), RGB-D images have been a major interest to use. Real-time SLAM for RGB-D images is of great importance since dense methods using all the depth and intensity values showed superior performance in the past. Due to development of GPU and CPU technologies, the real-tim...

Feature-based visual odometry with fusion of optical flow method in weak texture environment

Preprint

Full-text available

Feb 2024

Feature-based visual odometry has difficulty in feature extraction and matching in weak texture environment, resulting in substantial inter-frame pose resolution errors. Meanwhile, the computation and matching of feature point descriptors can be time-consuming and computationally inefficient. To address these issues encountered by traditional ORB-SLAM odometry in texture-lacking regions, an enhanced method for visual odometry estimation is proposed. First, the quadtree technique is employed to extract ORB feature points with a uniform distribution and an adequate number. Subsequently, when processing non-critical frames, the optical flow method is utilized to predict the precise locations of the feature points, circumventing the need for feature matching. Following this, the random sampling consistency method is applied to eliminate mismatched points in optical flow tracking, ensuring that only high-quality internal points are retained. Afterwards, a system of nonlinear equations is solved using AP3P method to estimate the precise position of the camera. Finally, the trajectory is optimized by Dogleg algorithm to achieve accurate and stable tracking and positioning. The experimental results demonstrate that the improved algorithm outperforms mainstream ORB-SLAM3 algorithm in terms of operation efficiency and positioning accuracy across multiple experimental scenarios. This method effectively addresses the challenges of low tracking accuracy and poor real-time performance commonly encountered by traditional visual odometers operating in weak texture environments. As a result, the method combining the feature-based method and the optical flow method significantly enhances the application of visual odometry in complex environments by improving the tracking stability, motion estimation accuracy, and real-time performance.

Camera and inertial sensor fusion for the PnP problem: algorithms and experimental results

Article

Full-text available

Jul 2021
MACH VISION APPL

In this work, we face the problem of estimating the relative position and orientation of a camera and an object, when they are both equipped with inertial measurement units (IMUs), and the object exhibits a set of n landmark points with known coordinates (the so-called Pose estimation or P n P Problem). We present two algorithms that, fusing the information provided by the camera and the IMUs, solve the P n P problem with good accuracy. These algorithms only use the measurements given by IMUs’ inclinometers, as the magnetometers usually give inaccurate estimates of the Earth magnetic vector. The effectiveness of the proposed methods is assessed by numerical simulations and experimental tests. The results of the tests are compared with the most recent methods proposed in the literature.

Calibration of mobile robot odometry and camera based on hand-eye calibration

Article

Full-text available

Aug 2019
J Phys Conf

Keyframe-Based Camera Relocalization Method Using Landmark and Keypoint Matching

Article

Full-text available

Jun 2019

Camera relocalization is a challenging task, especially based on the sparse 3D map or keyframes. In this paper, we present an accurate method for RGB camera relocalization in case of a very sparse 3D map built by limited keyframes. The core of our approach is a top-to-down feature matching strategy to provide a set of accurate 2D-to-3D matches. Specifically, we firstly use the landmark-based place recognition method to generate from keyframes the images similar to the current view along with the set of pairwise matched landmarks. This step constrains the 3D model points that can be matched with the current view. Then, the points are matched within the landmark pairs and combined afterwards. This is in contrast to conventional methods of feature matching that typically match points between the entire images and the whole 3D map, which, as a result, may not be robust to large viewpoint changes, the main challenge of the relocalization based on sparse map. After feature matching, the camera pose is calculated by an efficient novel Perspective-n-Points (PnP) algorithm. We conduct experiments on challenging datasets to demonstrate that the camera poses estimated by our method based on the sparse 3D point cloud are more accurate than the classical methods using dense map or large number of training images.

Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation

Preprint

Full-text available

Jun 2019

Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can leverage dense correspondences to improve robustness, and recent work has shown how to adapt SCoRe forests between scenes, allowing their state-of-the-art performance to be leveraged online. However, because they use features hand-crafted for indoor use, they do not generalise well to harder outdoor scenes. Whilst replacing the forest with a neural network and learning suitable features for outdoor use is possible, the techniques used to adapt forests between scenes are unfortunately harder to transfer to a network context. In this paper, we address this by proposing a novel way of leveraging a network trained on one scene to predict points in another scene. Our approach replaces the appearance clustering performed by the branching structure of a regression forest with a two-step process that first uses the network to predict points in the original scene, and then uses these predicted points to look up clusters of points from the new scene. We show experimentally that our online approach achieves state-of-the-art performance on both the 7-Scenes and Cambridge Landmarks datasets, whilst running in under 300ms, making it highly effective in live scenarios.

Image-based camera localization: an overview

Article

Full-text available

Dec 2018

Abstract Virtual reality, augmented reality, robotics, and autonomous driving, have recently attracted much attention from both academic and industrial communities, in which image-based camera localization is a key task. However, there has not been a complete review on image-based camera localization. It is urgent to map this topic to enable individuals enter the field quickly. In this paper, an overview of image-based camera localization is presented. A new and complete classification of image-based camera localization approaches is provided and the related techniques are introduced. Trends for future development are also discussed. This will be useful not only to researchers, but also to engineers and other individuals interested in this field.

SPM-SLAM: Simultaneous Localization and Mapping with Squared Planar Markers

Article

Full-text available

Sep 2018
PATTERN RECOGN

SLAM is generally addressed using natural landmarks such as keypoints or texture, but it poses some limitations, such as the need for enough textured environments and high computational demands. In some cases, it is preferable sacrificing the flexibility of such methods for an increase in speed and robustness by using artificial landmarks. The recent work [1] proposes an off-line method to obtain a map of squared planar markers in large indoor environments. By freely distributing a set of markers printed on a piece of paper, the method estimates the marker poses from a set of images, given that at least two markers are visible in each image. Afterwards, camera localization can be done, in the correct scale. However, an off-line process has several limitations. First, errors can not be detected until the whole process is finished, e.g., an insufficient number of markers in the scene or markers not properly spotted in the capture stage. Second, the method is not incremental, so, in case of requiring the expansion of the map, it is necessary to repeat the whole process from start. Finally, the method can not be employed in real-time systems with limited computational resources such as mobile robots or UAVs. To solve these limitations, this work proposes a real-time solution to the problems of simultaneously localizing the camera and building a map of planar markers. This paper contributes with a number of solutions to the problems arising when solving SLAM from squared planar markers, coining the term SPM-SLAM. The experiments carried out show that our method can be more robust, precise and fast, than visual SLAM methods based on keypoints or texture.

Augmented Reality for Assistive Maintenance and Real-Time Failure Analysis in Industries

Conference Paper

Mar 2020

Sparse semantic map building and relocalization for UGV using 3D point clouds in outdoor environments

Article

Mar 2020
NEUROCOMPUTING

In this paper, we proposed a sparse semantic map building method and an outdoor relocalization strategy based on this map. Most existing semantic mapping approaches focus on improving semantic understanding of single frames and retain a large amount of environmental data. Instead, we don't want to locate the UGV precisely, but use the imprecise environmental information to determine the general position of UGV in a large-scale environment like human beings. For this purpose, we divide the environment into environment nodes according to the result of scene understanding. The semantic map of the outdoor environment is obtained by generating topological relations between the environment nodes. In the semantic map, only the information of the nodes is saved, so that the storage space can be kept at a very small level with the increasing size of environment. When the UGV receives a new local semantic map, we evaluate the similarity between local map and global map to determine the possible position of the UGV according to the categories of the left and right nodes and the distance between the current position and the nodes. In order to validate the proposed approach, experiments have been conducted in a large-scale outdoor environment with a real UGV. Depending on the semantic map, the UGV can redefine its position from different starting points.

Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation

Conference Paper

Sep 2019

The camera locations estimated by the relocalization module (the blue dots) and the tracking module (the red dots). a Camera locations of V1. b Camera locations of V2. c Camera locations of V3 (color figure online)

Similar publications

Citations