Figure 6 - uploaded by Hannu Tenhunen
Relative simulation times for reception of 150 000 bits for floating point (FoP), overloaded FxP (FxP1), dedicated FxP m-code (FxP2m) and compiled dedicated m-code (FxP2mex-slowest!) implementations.

Source publication
Article
An object-oriented fixed-point library for Matlab has been developed. We present a design flow for DSP ASIC applications in which this library is used for floating-point to fixed-point refinement. Matlab was chosen for its strength and popularity in system design and modeling. The library allows a system designer to model e.g. a receiver architecture with...

Context in source publication

Context 1
... simulation execution times, the MR is rated better than the SR strategy since 1) the code/block diagrams will be more application specific, and 2) the code can usually be sped up with special compilation. However, in an experiment where we implemented the AUT (figure 2) as a dedicated FxP function (FxP2m) in Matlab, the simulation times decreased by only 20% compared to the overloaded diffRake (FxP1), see figure 6. Also interesting is that the compiled version (FxP2mex) resulted in the slowest simulation. ...
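The overloaded approach (FxP1) redefines the arithmetic operators so that every intermediate result is re-quantized to the chosen word length, which is why a dedicated FxP rewrite gains relatively little. A minimal Python sketch of this operator-overloading idea; the class name, default word-length split, and rounding/saturation policy are illustrative assumptions, not the actual Matlab library:

```python
# Illustrative sketch, NOT the actual Matlab FxP library: a fixed-point
# value class that re-quantizes after every overloaded arithmetic operation.
class Fxp:
    def __init__(self, value, int_bits=4, frac_bits=11):
        self.int_bits = int_bits    # integer bits (sign excluded)
        self.frac_bits = frac_bits  # fractional bits
        self.value = self._quantize(float(value))

    def _quantize(self, x):
        # Round to the nearest representable step, then saturate.
        step = 2.0 ** -self.frac_bits
        q = round(x / step) * step
        hi = 2.0 ** self.int_bits - step
        lo = -2.0 ** self.int_bits
        return min(hi, max(lo, q))

    def __add__(self, other):
        # Every single operation pays the quantization overhead; this
        # per-op cost is what keeps overloaded simulation slow.
        return Fxp(self.value + float(other), self.int_bits, self.frac_bits)

    def __mul__(self, other):
        return Fxp(self.value * float(other), self.int_bits, self.frac_bits)

    def __float__(self):
        return self.value
```

With such a class, a floating-point model can be refined by changing only how variables are constructed, at the price of the per-operation quantization overhead during simulation.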

Citations

... The type refinement technique that we propose is similar to [10], which requires two execution passes. In the first pass, the data types of all variables in the specification are recorded. ...
Article
We present a simulation-based technique to estimate the area and latency of an FPGA implementation of a Matlab specification. During simulation of the Matlab model, a trace is generated that can be used for multiple estimations. For estimation, the user provides design constraints such as the rate and bit width of data streams. In our experience, the runtime of the estimator is only about 1/10 of the simulation time, which is typically fast enough to generate dozens of estimates within a few hours and to build cost-performance trade-off curves for a particular algorithm and input data. In addition, the estimator reports on the scheduling and resource binding used for estimation. This information can be used not only to assess the estimation quality but also as a first starting point for the final implementation.
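The first pass of the two-pass type-refinement flow mentioned in the quoted snippet can be sketched as follows. The names `TypeRecorder` and `record` are hypothetical, but the mechanism matches the description: run the floating-point model once while logging each variable's observed range, then derive bit widths from that log in the second pass.

```python
import math

# Hypothetical sketch of the first pass: record the observed value range
# of each variable while the floating-point specification executes.
class TypeRecorder:
    def __init__(self):
        self.ranges = {}  # variable name -> (min seen, max seen)

    def record(self, name, value):
        lo, hi = self.ranges.get(name, (value, value))
        self.ranges[name] = (min(lo, value), max(hi, value))
        return value  # transparent: the model's dataflow is unchanged

    def int_bits(self, name):
        # Second pass: integer bits (sign included) roughly covering the
        # recorded range; a real tool would also handle exact powers of
        # two and other rounding corner cases.
        lo, hi = self.ranges[name]
        mag = max(abs(lo), abs(hi))
        return (1 + max(0, math.ceil(math.log2(mag)))) if mag >= 1 else 1

rec = TypeRecorder()
for sample in [0.5, -3.2, 7.9]:
    acc = rec.record("acc", 2.0 * sample)  # instrumented assignment
```

Because `record` returns its input unchanged, the instrumentation can wrap existing assignments without altering the simulation's numerical results.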
Article
The key to enabling widespread use of FPGAs for algorithm acceleration is to allow programmers to create efficient designs without the time-consuming hardware design process. Programmers are used to developing scientific and mathematical algorithms in high-level languages (C/C++) using floating-point data types. Although easy to implement, the dynamic range provided by floating point is not necessary in many applications; more efficient implementations can be realized using fixed-point arithmetic. While this topic has been studied previously [Han et al. 2006; Olson et al. 1999; Gaffar et al. 2004; Aamodt and Chow 1999], full automation has always been lacking. We present a novel design flow for cases where FPGAs are used to offload computations from a microprocessor. Our LLVM-based algorithm inserts value-profiling code into an unmodified C/C++ application to guide its automatic conversion to fixed point. This allows for fast and accurate design space exploration on a host microprocessor before any accelerators are mapped to the FPGA. Through experimental results, we demonstrate that fixed-point conversion can yield resource savings of up to 2x--3x. Embedded RAM usage is minimized, and 13%--22% higher Fmax than the original floating-point implementation is observed. In a case study, we show that a 17% reduction in logic and a 24% reduction in register usage can be realized by using our algorithm in conjunction with a High-Level Synthesis (HLS) tool.
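The value-profiling step described in this abstract can be illustrated with a small sketch. The function name `q_format` and the 16-bit word length are assumptions for illustration, not the cited tool's API: profile the magnitudes a variable actually takes, then split a fixed word into integer and fraction bits so the observed range fits and the remaining bits go to precision.

```python
import math

# Illustrative only: derive a fixed-point format from profiled values,
# in the spirit of the value-profiling flow described above.
def q_format(samples, total_bits=16):
    """Split total_bits into (integer_bits, fraction_bits), sign included."""
    mag = max(abs(s) for s in samples)
    # Integer bits (sign included) that roughly cover the largest observed
    # magnitude; a production tool would treat corner cases more carefully.
    int_bits = (1 + max(0, math.ceil(math.log2(mag)))) if mag >= 1 else 1
    return int_bits, total_bits - int_bits

# Values observed while profiling the floating-point run:
ib, fb = q_format([0.03, -1.7, 2.9])
```

Values that never exceed a small magnitude thus receive few integer bits and many fraction bits, which is exactly the resource-saving trade-off the abstract reports.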