Bypass Decoding Process

Source publication

A hardware accelerator for context-based adaptive binary arithmetic decoding in H.264/AVC

Conference Paper

Full-text available

Jun 2005

We propose a hardware accelerator for context-based adaptive binary arithmetic decoding (CABAC) in H.264/AVC. We also propose an efficient memory system for easy integration with other components such as motion compensation and IDCT. We develop an efficient finite state machine so that our design can generate one bit every 2 to 3 clock cycles. Expe...

Dependable Dynamic Partial Reconfiguration with minimal area & time overheads on Xilinx FPGAs

Conference Paper

Full-text available

Sep 2013

Thanks to their flexibility, FPGAs are nowadays widely used to implement digital systems' prototypes and, more frequently, their final releases. Reconfiguration traditionally required an external controller to upload contents in the FPGA. Dynamic Partial Reconfiguration (DPR) opens new horizons in FPGAs' applications, providing many new utilization...

A Novel Algorithm with FPGA Implementation for Action and Gesture Recognition Employing Spatiotemporal Gradient in the Transform Domain

Conference Paper

Full-text available

Oct 2014

In this paper, a novel human action/gesture recognition algorithm based on spatiotemporal gradients of moving points dealt with in 2D in the transform domain is presented. 2DPCA is used to obtain compact feature descriptor representing each action/gesture and Canonical correlation analysis is used to distinguish between testing and training descriptor. T...

Hardware Acceleration of BLOB Detection for Image Processing

Article

Full-text available

Jul 2010

This paper presents the implementation and evaluation of a computer vision task on a Field Programmable Gate Array (FPGA). As an experimental approach for an application-specific image-processing problem, it provides results about gained performance and precision compared with similar solutions on General Purpose Processor (GPP) architectures. The...

Figure 1. Example of the look-up-table (LUT) for sinθ.

Table 1. Comparison results for straight-line-detection speeds.

Figure 2. Hardware implementation for ρ and θ computation with n-fold...

Table 2. Hardware resource usage of each module in the prototype...

Figure 3. One of the parallel modules for implementing the Hough space...

Real-Time Straight-Line Detection for XGA-Size Videos by Hough Transform with Parallelized Voting Procedures

Article

Full-text available

Jan 2017

The Hough Transform (HT) is a method for extracting straight lines from an edge image. The main limitations of the HT for usage in actual applications are computation time and storage requirements. This paper reports a hardware architecture for HT implementation on a Field Programmable Gate Array (FPGA) with parallelized voting procedure. The 2-dim...

High-throughput H.264/AVC high-profile CABAC decoder for HDTV applications

Article

Sep 2009
IEEE T CIRC SYST VID

In this letter, we propose a high-throughput VLSI architecture design for H.264 high-profile context-based adaptive binary arithmetic coding (HP CABAC) decoding for HDTV applications. To speed up the inherently sequential CABAC decoding, we eliminate the bottleneck by proposing a look-ahead decision parsing technique on the grouped context table with cache registers, which reduces the cycle count by 62% on average compared with the original CABAC decoding. In addition, the proposed design supports the macroblock-adaptive frame/field coding tools in H.264 main profile coding and the 8×8 transform in H.264 high-profile coding. It achieves real-time processing for H.264 CABAC decoding up to L4.1@30 frames/s with a maximum of 60 Mbits/s when operating at 105 MHz.

Variable-bin-rate CABAC engine for H.264/AVC high definition real-time decoding

Article

Full-text available

Mar 2009
IEEE T VLSI SYST

This paper presents an efficient VLSI architecture for H.264/AVC Context-Adaptive Binary Arithmetic Coding (CABAC) decoding. We introduce several new techniques to maximize the parallelism of the decoding process, including a variable-bin-rate strategy, multiple-bin arithmetic decoding, and an efficient probability propagation scheme. The CABAC engine can ensure real-time decoding for H.264/AVC main profile HD level 4.0. Synthesis results show that the multi-bin decoder can be operated at up to 45 MHz, and the total logic area is only 42K gates when targeted at TSMC's 0.18 µm process.

A High-Performance Hardwired CABAC Decoder

Conference Paper

Full-text available

May 2007
Acoust Speech Signal Process

We present a high-performance hardwired context-based adaptive binary arithmetic decoder (CABAD) for H.264/AVC. Based on an analysis of decoding time for different types of syntax elements, we propose three parallel processing techniques. Our decoder takes 309 clock cycles to decode a typical I-type macroblock. It needs to run at only 45 MHz for a 1080HD application. Therefore, our architecture is suitable for low-power mobile applications.

High-speed H.264/AVC CABAC decoding

Article

May 2007
IEEE T CIRC SYST VID

The decoding of context-based adaptive binary arithmetic coding (CABAC) imposes a heavy performance requirement on H.264/AVC decoding systems, particularly for large-scale video sequences. Because simply raising the operating frequency is not sufficient to meet the performance requirement, this paper proposes an efficient approach to accelerate the decoding that is effective at relatively low operating frequencies. Since the CABAC decoding procedure is highly sequential and has strong data dependencies, it is difficult to exploit parallelism and pipelining schemes. The proposed approach resolves these difficulties by modifying the operation chain based on a thorough analysis, eventually enabling both parallel operations and pipelining. More specifically, 1) several context models are simultaneously loaded from memory while context selection is performed in parallel, and 2) bin-level pipelining is enabled by employing a small storage to remove structural hazards and data dependencies. Experimental results show that the proposed approach leads to the real-time decoding of HD sequences.

High-performance CABAC engine for H.264/AVC high definition Real-time decoding

Conference Paper

Full-text available

Feb 2007

This paper presents an efficient VLSI architecture for H.264/AVC CABAC decoding. We introduce several new techniques to exploit the parallelism of the decoding process to the largest extent possible, including line-bit-rate decoding, multiple-bin arithmetic decoding, and an efficient probability propagation scheme. The CABAC engine can ensure real-time decoding for H.264/AVC main profile HD level 4.0. Synthesis results show that the multi-bin decoder can run at up to 45 MHz, and the total area is only 42K gates.

A 160K gates/4.5 KB SRAM H.264 video decoder for HDTV applications

Article

Feb 2007
IEEE J SOLID-ST CIRC

In this paper, a low-cost H.264/AVC video decoder design is presented for high definition television (HDTV) applications. Through optimization from algorithmic and architectural perspectives, the proposed design can achieve real-time H.264 video decoding on HD1080 video (1920×1088 @ 30 Hz) when operating at 120 MHz with 320 mW power dissipation. Fabricated by using the TSMC one-poly six-metal 0.18 µm CMOS technology, the proposed design occupies 2.9×2.9 mm² silicon area with the hardware complexity of 160K gates and 4.5K bytes of local memory.

A 50 % power reduction in H.264/AVC HDTV video decoder LSI by dynamic voltage scaling in elastic pipeline

Article

Full-text available

Dec 2006
IEICE T FUND ELECTR

We propose an elastic pipeline that can apply dynamic voltage scaling (DVS) to hardwired logic circuits. In order to demonstrate its feasibility, a hardwired H.264/AVC HDTV decoder is designed as a real-time application. An entropy decoding process is divided into context-based adaptive binary arithmetic coding (CABAC) and syntax element decoding (SED), which has advantages of smoothing workload for CABAC and keeping efficiency of the elastic pipeline. An operating frequency and supply voltage are dynamically modulated every slot depending on workload of H.264 decoding to minimize power. We optimize the number of slots per frame to enhance power reduction. The proposed decoder achieves a power reduction of 50% in a 90-nm process technology,

System-on-Chip Design Methodology for a Statistical Coder

Conference Paper

Jul 2006

In this paper, we propose a system-on-chip software/hardware co-design methodology for a statistical coder. We use the Context Adaptive Binary Arithmetic Coder (CABAC) used in the Main profile of the H.264/AVC video coding standard as a design example. The design methodology first involves performance and complexity analyses of the existing CABAC reference software, from which the top-level CABAC software/hardware architecture can be conceptualized. The design aims to strike a balance between software modules and hardware modules based on design constraints. Verification is performed by comparing the compressed bit stream generated by the reference CABAC SW (without any HW-assisted circuitry) with that output by the top-level CABAC architecture (with HW-assisted circuitry). Standard video test sequences have been used for verification purposes. The CABAC architecture is then placed within the system-on-chip framework, where the system bus and its signals, input/output FIFO buffers, debug structures, reset circuit, etc. are designed in. Compared to existing statistical coders, this design aims for significant coding time savings by balancing timing between software modules and hardware modules, is well verified with standard video test sequences, and is reusable as an IP in a SoC environment.

A High Throughput VLSI Architecture Design for H.264 Context-Based Adaptive Binary Arithmetic Decoding with Look Ahead Parsing

Conference Paper

Full-text available

Jul 2006

In this paper, we present a high-throughput VLSI architecture design for context-based adaptive binary arithmetic decoding (CABAD) in MPEG-4 AVC/H.264. To speed up the inherently sequential operations in CABAD, we break down the processing bottleneck by proposing a look-ahead codeword parsing technique on the segmented context tables with cache registers, which reduces the cycle count by up to 53% on average. Based on a 0.18 µm CMOS technology, the proposed design outperforms the existing design by both reducing hardware cost by 40% and achieving about 1.6 times the data throughput at the same time.

RANDM: Random Access Depth Map Compression Using Range-Partitioning and Global Dictionary

Conference Paper

May 2020
