The SAFHG core block diagram.

Source publication

FPGA-based module for SURF extraction

Article

Full-text available

Apr 2014

We present a complete hardware and software solution of an FPGA-based computer vision embedded module capable of carrying out SURF image features extraction algorithm. Aside from image analysis, the module embeds a Linux distribution that allows to run programs specifically tailored for particular applications. The module is based on a Virtex-5 FXT...

Context 1

... image generation. The structure and principle of operation of this core can be seen in Figure 6. As aforementioned the result- ing integral image is sent not only to the Fast Hessian Generator, but also to the main memory for later reuse by the descriptor calculator, see Figure. 4. The SURF Accelerator -Fast-Hessian Generator IP core (SAFHG) (Fig. 7) is a key component of SURF de- tector acceleration. It calculates the Fast-Hessian re- sponses from the integral image and forms the entire scale space used by the detector. An important factor influencing the performance of the determinant calcu- lation is the optimization of memory access, which is performed by the MasterController ...

View in full-text

FPGA Hardware Implementation of Smart Home Autonomous System Based on Deep Learning

Chapter

Full-text available

Jun 2018

The use of deep learning algorithms, as a core element of artificial intelligence, has attracted increased attention from industrial and academic institutes recently. One important use of deep learning is to predict the next user action inside an intelligent home environment that is based on Internet of Things (IoT). Recent researcher discusses the...

Lightweight Linux dynamic libraries profiling technique for embedded systems

Conference Paper

Full-text available

Oct 2013

Situations when profiling is required for embedded applications are common in software engineering practice. Usually such tasks have additional limitations - no ability to recompile or relink application, necessity to minimize profiler impact on application work, which makes performance evaluation difficult problem. The paper describes lightweight...

Desarrollo de Sistemas Embebidos con Linux en Hardware Reconfigurable

Thesis

Full-text available

Jul 2013

Antonio Escobar-Molero

The goal of this project is to provide a comprehensive resource on designing embedded systems with the most flexible programmable logic device to date, the Platform FPGA. All the steps in the design cycle are covered: building the base hardware, including an operating system and cross-compiling applications to take advantage of the custom computing...

Hardware copyleft como herramienta para la enseñanza del procesamiento de señales e imágenes

Article

Full-text available

Feb 2012

Digital signal-and-image processing is an area that covers a wide range of academic and commercial applications. It is a compulsory topic in most courses at engineering colleges. Moreover, thanks to the current achievements of the semiconductor industry, it is possible to obtain specialized devices that enable the creation of commercializable produ...

Automatically Provisioned Embedded Systems in Managed Networks

Article

Full-text available

Dec 2015

The article deals with a design of a new automatically provisioned embedded system. Through the years of our active development a highly advanced platform has been created. This platform, called BEESIP, is meant for the embedded network devices, and allows them to act as telephony exchanges, secured access points, VPN concentrators, etc. As the key...

FPGA Implementation of Integral Image generator in SURF detector

Article

Full-text available

Nov 2020

Sri Chakrapani Yellamraju

Integral image generation improves the speed by reducing no. of computations like additions and multiplications. in computer vision applications such as image feature detectors, There are different algorithms for image feature detection, such as SURF, SIFT, HOG, Harris-Laplace Feature detection, FAST etc. Integral Image generation is used in SURF detector, which detects salient points from image and computes descriptors of their surroundings that are invariant to scale, rotation and illumination changes, hence it can be used in many of applications. The proposed Integral image generator in SURF detector uses Recursive addition equations for 320x240 image. Which improves the speed and reduce the Hard ware.. This Integral Image Generator is implemented in Virtex7 FPGA using verilog HDL.

FPGA Implementation of Integral Image generator in SURF detector

Conference Paper

Full-text available

Jul 2020

Ground Control Point Automatic Extraction for Spaceborne Georeferencing Based on FPGA

Article

Full-text available

Jun 2020
IEEE J-STARS

Feature points that are obtained from the combined speeded-up robust feature (SURF) detector and binary robust independent elementary features (BRIEF) descriptor have a highly robust performance. These points are previously considered the ground control points (GCPs) for building a connection between the image coordinates and the corresponding geodetic coordinates. This paper proposes a novel architecture to automatically and intelligently extract GCPs based on field programmable gate arrays (FPGAs). The parallelization SURF detector, BRIEF descriptor and BRIEF matching are implemented in a single Xilinx XC7VX980T FPGA system. Word length reduction (WLR), memory-efficient parallel architecture (MEPA), shift and subtraction strategies (SAS), a sliding window for separable convolution, and an optimized multispacer-scale are used to optimize the SURF detector. Improved parallel adder trees are used to accelerate the BRIEF matching. The proposed system achieves 380 frame per second (fps) with a 100 MHz clock frequency, which satisfies the real-time and low-power requirements of embedded devices. The results of the experiment demonstrate that the proposed architecture, when mapped onto a Xilinx Virtex-7 XC7VX980T FPGA device, can select the robust feature points.

A fully pipelined and parallel hardware architecture for real-time BRISK salient point extraction

Article

Full-text available

Oct 2019

Scale and rotation invariant salient point detection and matching algorithms are variously used in computer vision applications such as image matching, 3D localization and pose estimation. Recently, hardware implementation of image and video processing algorithms has emerged as a viable solution to handle the high computational complexity of applications like 3D pose estimation with several processing stages. The hardware implementation of various stages of theses algorithms can be executed in a pipelined manner to ensure the reality of time. In this paper, a new and fully pipelined hardware architecture is proposed for salient point detection using Binary Robust Invariant Scalable Keypoints (BRISK) algorithm. BRISK algorithm is a binary keypoint extractor that detects salient points by constructing a scale-space pyramid; therefore, its fixed-point hardware implementation in a pipelined manner is challenging because of the required synchronization for various layers in scale domain. The proposed hardware architecture was implemented using Verilog Hardware Description Language, and the functionality of the design was validated through several experiments. The proposed design was synthesized by using an ASIC digital design flow utilizing 180 nm CMOS technology as well as a Virtex-4 FPGA. The design is clocked at 90.91 MHz in ASIC implementation and achieves processing rate of 169.29 frames/s while running on input images with 800 × 600 resolution. The throughput of FPGA implementation is 180.44 frames/s with 96.89 MHz clock frequency for the same input image resolution. Experimental results confirm the efficiency of the proposed hardware architecture in comparison with software implementation.

FPGA Implementation of Integral Image generator in SURF detector

Research

Full-text available

Jun 2018

Yaragani Mamillu

Long-term autonomy of Mobile Robots in Changing Environments

Thesis

Full-text available

Mar 2018

Tomáš Krajník

This habilitation thesis presents research that aims to enable long-term deployment of mobile robots in changing environments. The presented approaches encompass methods that ensure robustness of autonomous visual navigation in outdoor environments for prolonged time periods, spatio-temporal representations that explicitly model the environment changes over time, and supporting software modules that enable robust and accurate robot localisation. The main contribution of the thesis is a novel approach that allows to incorporate the notion of time into most stationary environment models used in mobile robotics. This is achieved by representing the uncertainty of the environment states not by fixed probabilities, but by probabilistic functions of time, represented in the frequency domain. The method allows to integrate unlimited numbers of sparse and irregular observations obtained during long-term deployments of mobile robots into memory-efficient models that reflect the persistence and recurrence of environment variations. The frequency-enhanced spatio-temporal models allow to predict the future environment states, which improves the efficiency of mobile robot operation in changing environments. In this thesis, we present a series of articles, which demonstrate that the proposed approach improves mobile robot localization, path and task planning, activity recognition, human-robot interaction and allows for life-long spatio-temporal exploration of perpetually-changing environments.

Real-Time FPGA-based Detection of Speeded Up Robust Features Using Separable Convolution

Article

Full-text available

Oct 2017
IEEE T IND INFORM

In this paper, we propose a novel architecture for efficient detection of Speeded Up Robust Features (SURF) for Field-programmable gate array (FPGA). The main benefits of the proposed architecture are in real-time low-latency performance and scalability. The proposed solution provides a significant acceleration of salient points extraction which is fundamental image processing technique for vision-based methods including the simultaneous localization and mapping (SLAM). Based on the presented practical results, the proposed architecture is capable of processing streaming image data at the rate of 140 Megapixels per second which roughly scales from the 640×480@420fps up to 1920×1080@60fps video streams on a low-end, low-cost FPGA solution (Cyclone V). Moreover, the proposed feature detection utilizes only about 20% of logic elements of the FPGA which supports further parallel processing of multiple inputs.

Improving the construction of ORB through FPGA-based acceleration

Article

Full-text available

Aug 2017
MACH VISION APPL

Binary descriptors have won their place as efficient and effective visual descriptors in several vision tasks. In this context, one of the most widely used binary descriptors to date is the ORB descriptor. ORB is robust against rotation changes, and it uses a learning procedure to generate sampling pairwise tests to construct the descriptor. However, this construction involves a sequential memory access of as many steps as the binary string size. From the latter and motivated by the fact that modern computer vision tasks may require the construction of thousands, if not millions of binary descriptors, we propose to accelerate the construction process of the ORB descriptor via an FPGA-based hardware architecture. The latter is leveraged with a novel arrangement of pairwise tests, which takes advantage of a dual random access memory scheme achieving an acceleration of up to 17 times when compared against the sequential way. The empirical assessment indicates that ORB descriptors obtained from the proposed approach keep a similar performance to that of the original ORB.

On-Board Detection and Matching of Feature Points

Article

Full-text available

Jun 2017

This paper presents a FPGA-based method for on-board detection and matching of the feature points. With the proposed method, a parallel processing model and a pipeline structure are presented to ensure a high frame rate at processing speed, but with a low power consumption. To save the FPGA resources and increase the processing speed, a model which combines the modified SURF detector and a BRIEF descriptor, is presented as well. Three pairs of images with different land coverages are used to evaluate the performance of FPGA-based implementation. The experiment results demonstrate that (1) when the image pairs with artificial features (such as buildings and roads), the performance of FPGA-based implementation is better than those image pairs with natural features (such as woods); (2) the proposed FPGA-based method is capable of ensuring the processing speed at a high frame rate, such as the speed of can achieve 304 fps under a 100 MHz clock frequency. The speedup of the proposed implementation is about 27 times higher than that when using the PC-based implementation.

Low-power coprocessor for Haar-like feature extraction with pixel-based pipelined architecture

Article

Apr 2017
JPN J APPL PHYS

Intelligent analysis of image and video data requires image-feature extraction as an important processing capability for machine-vision realization. A coprocessor with pixel-based pipeline (CFEPP) architecture is developed for real-time Haar-like cell-based feature extraction. Synchronization with the image sensor's pixel frequency and immediate usage of each input pixel for the feature-construction process avoids the dependence on memory-intensive conventional strategies like integral-image construction or frame buffers. One 180 nm CMOS prototype can extract the 1680-dimensional Haar-like feature vectors, applied in the speeded up robust features (SURF) scheme, using an on-chip memory of only 96 kb (kilobit). Additionally, a low power dissipation of only 43.45 mW at 1.8 V supply voltage is achieved during VGA video procession at 120 MHz frequency with more than 325 fps. The Haar-like feature-extraction coprocessor is further evaluated by the practical application of vehicle recognition, achieving the expected high accuracy which is comparable to previous work.

The SAFHG core block diagram.

Context in source publication

Similar publications

Citations