Multiply-accumulate operation: (a) conventional implementation and (b) distributed arithmetic implementation

Source publication

Fast discrete wavelet transformation using FPGAs and distributed arithmetic

Article

Jan 2003

Ali Al-Haj

The discrete wavelet transform has gained the reputation of being a very effective signal analysis tool for many practical applications. However, due to its computation-intensive nature, current implementations of the transform fall short of meeting real-time processing requirements of most applications. This paper describes a parallel implement...

Context 1

... the input samples are represented with B bits of precision, B clock cycles are required to complete an inner-product calculation. An example of a distributed arithmetic implementation of a 4-element inner product operation is shown in Figure 1 along with the conventional implementation of the same product operation. ...
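The bit-serial scheme in this excerpt can be sketched in software. The following is a minimal illustration, not the paper's hardware design: it assumes unsigned B-bit integer samples and fixed integer coefficients, and the names `build_lut` and `da_inner_product` are hypothetical. Each loop iteration stands in for one clock cycle, so B iterations complete the inner product.

```python
def build_lut(coeffs):
    """Precompute the sum of every subset of coefficients (2**K entries).

    Bit i of the table address selects whether coeffs[i] is included.
    """
    k = len(coeffs)
    lut = []
    for addr in range(1 << k):
        total = 0
        for i in range(k):
            if (addr >> i) & 1:
                total += coeffs[i]
        lut.append(total)
    return lut


def da_inner_product(coeffs, samples, bits):
    """Inner product via distributed arithmetic (assumes unsigned samples).

    For each bit position n, the n-th bit of every sample forms a table
    address; the table output is shifted left by n and accumulated.
    No multiplications are performed at run time.
    """
    lut = build_lut(coeffs)
    acc = 0
    for n in range(bits):  # one iteration per "clock cycle"
        addr = 0
        for i, x in enumerate(samples):
            addr |= ((x >> n) & 1) << i
        acc += lut[addr] << n
    return acc


# 4-element example with 4-bit samples: the table has 2**4 = 16 entries.
coeffs = [3, -1, 4, 2]
samples = [5, 7, 1, 10]
result = da_inner_product(coeffs, samples, bits=4)
direct = sum(c * x for c, x in zip(coeffs, samples))
print(result, direct)
```

Signed (two's-complement) samples would need the most significant bit's table output to be subtracted rather than added; that case is left out here to keep the sketch short.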

Context 2

... is noted from the results obtained above, and further illustrated in Figure 10, that the throughput of the distributed arithmetic implementation is higher than the throughput of the conventional arithmetic implementation. This is expected since the distributed arithmetic implementation replaced the time-consuming conventional multiply-accumulate operations with fast look-up tables and shift operations. ...

Context 3

... partial products of all multiply-accumulate operations were precomputed offline and stored in the LUTs, thus saving a great amount of real-time computation. As for Virtex slice utilization, distributed arithmetic uses fewer hardware resources than conventional arithmetic, as illustrated in Figure 11. ...
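The offline precomputation mentioned in this excerpt follows from the standard distributed-arithmetic rearrangement of an inner product. As a sketch, assuming unsigned B-bit samples x_k with bits b_{k,n} and fixed coefficients A_k (symbols introduced here for illustration, not taken from the paper):

```latex
\begin{aligned}
y &= \sum_{k=1}^{K} A_k x_k,
  \qquad x_k = \sum_{n=0}^{B-1} b_{k,n}\, 2^{n}, \quad b_{k,n} \in \{0,1\} \\
  &= \sum_{n=0}^{B-1} 2^{n}
     \underbrace{\sum_{k=1}^{K} A_k\, b_{k,n}}_{F(b_{1,n},\,\dots,\,b_{K,n})}
\end{aligned}
```

Because the A_k are fixed, F takes only 2^K distinct values, which can be tabulated ahead of time. At run time each of the B bit positions then costs one table read and one shift-and-add instead of K multiplications.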

Context 4

... [Figure 11. Comparison between the utilization of two DWT implementations; panels: DWT, Inverse DWT.] This is also expected since the conventional arithmetic multiplier requires much more logic resources than the distributed arithmetic multiplier, which requires small LUTs, simple adders and shift registers. ...

A solution to overcome some limitations of SDF based models

Conference Paper

Feb 2018

For computer-aided hardware design, models are usually used to evaluate the designed systems. But there is still a gap between models and their efficient implementations on a real architecture, like FPGAs. For example, some model characteristics may lead to a waste of resources, which can even make a design infeasible. In this paper, we focus on ho...

Efficient Scan-Based BIST Architecture for Application-Dependent FPGA Test

Conference Paper

Nov 2013

FPGAs are attractive devices due to their low development cost and short time-to-market, and are widely used not only for reconfigurable purposes but also as application-dependent embedded devices for low-volume products. This paper presents a scan-based BIST architecture for testing of application-dependent circuits configured on FPGA. In order to bu...

A compact FPGA implementation of a bit-serial SIMD cellular processor array

Conference Paper

Aug 2012

An FPGA implementation of a fine grain general-purpose SIMD processor array is presented. The processor architecture has a compact processing element which is encapsulated into two configurable logic blocks (CLBs) and is then replicated to form an array. A 32 × 32 processing element array is implemented on a low-cost Xilinx XC5VLX50 FPGA using four...

FPGA design and implementation of Digital Up-Converter using quadrature oscillator

Conference Paper

Dec 2013

In this paper we design and implement a complex Digital Up-Converter (DUC) using a Xilinx Virtex6 FPGA. All the steps necessary to build such circuits are thoroughly described and some valuable hints on how to overcome problems during the design time are presented. We introduce a new approach for oscillator circuits, which are an important part of...

Placement Algorithm for FPGA Circuits

Article

Field-Programmable Gate Arrays (FPGAs) are flexible and reusable circuits that can be easily reconfigured by the designer. One of the steps involved in the logic design with FPGA circuits is placement. In this step, the logic functions are assigned to specific cells of the circuit. In this paper we present a placement algorithm for FPGA circuits. I...

Modeling and simulation of FIR filter using distributed arithmetic algorithm on FPGA

Article

Feb 2024
MULTIMED TOOLS APPL

In many industries and telecommunication systems there is a need for digital signal processing for fast transfer of data between two points or devices with low power consumption and considerable hardware resources (circuit size and speed). Finite Impulse Response (FIR) filters play an important role in many signal processing applications and telecommunication systems. This paper proposes the design and implementation of a 4-bit FIR filter using Distributed Arithmetic (DA) algorithms, which replace multiply-and-accumulate operations with a series of Look-Up Tables (LUTs). The proposed FIR filter is implemented in high-density field programmable logic devices (FPGAs), designed using the very high-speed integrated circuit hardware description language (VHDL), and verified using the Xilinx ISE 14.7 tool and simulator. The proposed, modified and optimized DA provides multiplication- and accumulation-free calculation of the inner product data of the FIR filter, and this in turn reduces the size and power dissipation of the circuit. DA is one of the methods of implementing FIR filters that affects storage resources and calculation speed, making the memory size smaller and the operation speed faster. The simulated proposed structure required nearly 40% fewer cells, 35% fewer LUT pairs and 4% less power than the existing structure.

6G MIMO Spatio-Temporal Data Scattering for Reconfigurable Intelligent Surface (RIS) Performance

Article

May 2023

Orbital Angular Momentum (OAM), which provides a new angular or mode dimension for wireless communications, offers an intriguing approach to anti-jamming. The unprecedented demands for high-quality and seamless wireless services impose continuous challenges to existing cellular networks. Applications like enhanced mobile broadband (eMBB), ultra-reliable and low latency communications (URLLC), and massive machine type communications (mMTC) services are pushing the evolution of cellular systems towards the fifth-generation (5G). We propose to use the orthogonality of OAM modes for anti-jamming in wireless communications, in particular through a mode hopping (MH) scheme for anti-jamming within a narrow frequency band. We derive the closed-form expression of the bit error rate (BER) for the multi-user scenario with our developed MH scheme. Our developed MH scheme can achieve the same anti-jamming results within the narrow frequency band as the conventional wideband FH scheme. We explore the challenges in the design of next generation transport layer protocols (NGTP) in 6G Terahertz communication-based networks. Furthermore, we propose a mode-frequency hopping (MFH) scheme, which jointly uses our developed MH scheme and the conventional FH scheme to further decrease the BER for wireless communication. In contrast, our experiments for Reconfigurable Intelligent Surface (RIS) reveal it as economically simple and a new type of ultra-thin metamaterial inlaid with multiple sub-wavelength scatterers. We report our observations on possible favorable propagation conditions obtained by controlling the phase shifts of the reflected waves at the surface such that the received signals are directly reflected towards the receivers without any extra cost of power sources or hardware. It provides a revolutionary new approach to actively improve the link quality and coverage, which sheds light on future 6G.
Achieving high-quality channel links in cellular communications through the design and optimization of RIS construction is explored in this work as a novel RIS-based smart radio technique. Unlike traditional antenna arrays, three unique characteristics of RIS are revealed in this work. First, the built-in programmable configuration of RIS enables analog beamforming inherently without extra hardware or signal processing. Second, the incident signals can be controlled to partly reflect and partly transmit through the RIS simultaneously, adding more flexibility to signal transmission. Third, RIS has no digital processing capability to actively send signals nor any radio frequency (RF) components. One of the considerations is the use of Terahertz communications that aims to provide 1 Tbps (terabits per second) and air latency of less than 100 μs. Further, 6G networks are expected to provide for more stringent Quality of Service (QoS) and mobility requirements. As such, it is necessary to develop novel channel estimation and communication protocols, design joint digital and RIS-based analog beamforming schemes, and perform interference control via mixed reflection and transmission. The aforementioned innovative use-cases call for redefining the requirements of upcoming 6G technology. 5G technology has abundant potential but it cannot satisfy the stringent rate-reliability-latency requirements of the new applications. This work also highlights that the requirements and KPIs of 6G technology will be stricter and more diverse. For example, we discuss a scenario in which, while the 5G network already operates in the very high frequency mm-wave region, 6G could require even higher frequencies for operation. The 6G technology will focus on achieving higher peak data rate, seamless ubiquitous connectivity, non-existent latency, high reliability, and strong security and privacy for providing the ultimate user experience.
A section is devoted to a comparative study of the KPIs of both 5G and 6G.

High-Speed Modified DA Architecture for DWT Computation in Secure Image Encoding

Chapter

Sep 2020

Throughout the last 20 years, the DWT has been widely used in digital image processing applications. In this paper, a novel architecture for DWT computation, based on modified distributive arithmetic and a modified multiplexer logic-based architecture, is proposed, designed and implemented on an FPGA platform. The DWT architecture is designed for high throughput and latency. An HDL model is developed for the modified architecture and is validated on the FPGA platform for area, timing and power performance. The novel architecture proposed in this work is suitable for high-speed image coding.

Quality of Service Assessment on Some Major Mobile Network Operators in Ghana

Conference Paper

Oct 2018

In this paper, a study was conducted on four major mobile network operators (MTN, Vodafone, Tigo and Airtel) in selected cities (Accra, Tema and Kumasi) in Ghana. The KPIs (Call Drop Rate and Audio Quality) of these networks were measured, analysed and compared with the benchmarks set by the local regulator (NCA) and the international standards authority, the International Telecommunication Union (ITU). It was observed that some of the measured KPI values (Call Drop Rate and Audio Quality) were fairly close to the standards set by the local regulator (NCA) and the international regulator (ITU), indicating that customers could experience fairly good service in those locations, while other values (Traffic Channel Congestion and Call Set Up Time) were outside the standards set by the NCA and ITU, which means customers could experience poor QoS in these areas.

Implementation Method On Medical Image Compression System: A Review

Research

Oct 2017

The rapid development of medical imaging and the invention of various medicines have benefited mankind and the whole community. Medical image processing is a niche area concerned with the operations and processes of generating images of the human body for clinical purposes. Potential areas such as image acquisition, image enhancement, image compression and storage, and image-based visualization are also included in medical image processing analysis. Unfortunately, medical image compression dealing with three-dimensional (3-D) modalities is still at an immature stage. In addition, very few researchers have taken up the challenge of using hardware in their implementations. In the previous work reviewed, most of the compression methods used are lossless rather than lossy. For software implementation, MATLAB and Verilog are the popular choices among researchers. In terms of analysis, most previous works conducted objective rather than subjective tests. This paper thoroughly reviews the recent advances in medical image compression, mainly in terms of types of compression, software and hardware implementations, and performance evaluation. Furthermore, challenges and open research issues are discussed in order to provide perspectives for future research. In conclusion, the paper presents the overall picture of the image processing landscape, in which researchers have focused more on software implementations and on various combinations of software and hardware implementation.

A Code Generator for Implementing Dual Tree Complex Wavelet Transform on Reconfigurable Architectures for Mobile Applications

Article

Sep 2016

The authors aimed to develop an application for producing different architectures to implement dual tree complex wavelet transform (DTCWT) having near shift-invariance property. To obtain a low-cost and portable solution for implementing the DTCWT in multi-channel real-time applications, various embedded-system approaches are realised. For comparison, the DTCWT was implemented in C language on a personal computer and on a PIC microcontroller. However, in the former approach portability and in the latter desired speed performance properties cannot be achieved. Hence, implementation of the DTCWT on a reconfigurable platform such as field programmable gate array, which provides portable, low-cost, low-power, and high-performance computing, is considered as the most feasible solution. At first, they used the system generator DSP design tool of Xilinx for algorithm design. However, the design implemented by using such tools is not optimised in terms of area and power. To overcome all these drawbacks mentioned above, they implemented the DTCWT algorithm by using Verilog Hardware Description Language, which has its own difficulties. To overcome these difficulties, simplify the usage of proposed algorithms and the adaptation procedures, a code generator program that can produce different architectures is proposed.

FIR implementation on FPGA: Investigate the FIR order on SDA and PDA algorithms

Conference Paper

Sep 2015

Finite impulse response (FIR) digital filters are extensively used due to their key role in various digital signal processing (DSP) applications. Several attempts have been made to develop hardware realizations of FIR filters characterized by implementation complexity, precision and high speed. Field-programmable gate arrays (FPGAs) offer a reconfigurable realization of FIR filters and are on the verge of revolutionizing digital signal processing. Many front-end DSP algorithms, such as FFTs and FIR or IIR filters, are now most often realized by FPGAs. Modern FPGA families provide DSP arithmetic support with fast-carry chains that are used to implement multiply-accumulates (MACs) at high speed, with low overhead and low costs. In this paper, serial and parallel distributed arithmetic (DA) realizations of FIR filters are discussed in terms of hardware cost and resource utilization.

FPGA-based architecture of 3-D HWT using distributed arithmetic (DA)

Conference Paper

Dec 2014

This paper describes the design and implementation of a three-dimensional (3-D) Haar transform with transpose-based computation and distributed arithmetic (DA). As a result of the separability property of the multidimensional Haar wavelet transform (HWT), the proposed architecture has been implemented using a cascade of three N-point one-dimensional (1-D) Haar transforms and two transpose memories for a 3-D volume of N × N × N, suitable for 3-D medical image compression. The 3-D HWT architecture was implemented on a National Instruments SubRIO-9632 board. Experimental results and analysis of area, power consumption and maximum frequency are discussed in this paper.

FPGA Implementation of Fir Filter using Distributed Arithmetic Architecture for DWT

Article

Mar 2014

A Novel Reconfigurable Architecture of a DSP Processor for Efficient Mapping of DSP Functions using Field Programmable DSP Arrays

Article

Jun 2013
Comput Architect News

The development of modern integrated circuit technologies makes it feasible to develop cheaper, faster and smaller special-purpose signal processing function circuits. Digital signal processing functions are generally implemented either on ASICs, which are inflexible, or on FPGAs, which suffer from a relatively smaller utilization factor or lower speed compared to ASICs. The Field Programmable DSP Array (FPDA) is the proposed DSP-dedicated device, similar to an FPGA but with basic fixed common modules (CMs) (like adders, subtractors, multipliers, scaling units, shifters) instead of CLBs. This paper introduces the development of a reconfigurable system architecture with a focus on FPDA that integrates different DSP functions like DFT, FFT, DCT, FIR, IIR, and DWT. Switching between DSP functions is achieved by reconfiguring the interconnection between CMs. Validation of the proposed architecture has been achieved on a Virtex5 FPGA. The architecture provides a sufficient amount of flexibility, parallelism and scalability.
