Fig 4 - uploaded by Maria D. Vieira
Content may be subject to copyright.
Algorithm in C, and x86-64 instructions correspondents.

Algorithm in C, and x86-64 instructions correspondents.

Source publication
Conference Paper
Full-text available
In the last ten years, the demand for performance improvements in computing systems has not fulfilled by CPU enhancements. A solution widely applied in different computing is the use of hardware accelerators. In the industrial scenario, accelerators such as Graphics Processing Unit (GPU) are more popular because they offer a well-defined and establ...

Contexts in source publication

Context 1
... Section compares the x86-64 instruction set and our instruction set. In Figure 4, we show an algorithm implemented in C (a), and in x86-64 assembly (b). We choose compare our instruction set with the x86-64 assembly because we aim to compare the i5 with our architecture. ...
Context 2
... 3 (b) shows our instruction set which represents the operations graph mapped in Figure 1. That instructions are equivalent to the x86-64 instructions, shown in Figure 4 (b). That is, the operations of the graph in 3 (b) correspond to the algorithm in 4 (a). ...

Similar publications

Article
Full-text available
For the hard guessing random additive noise decoding Markov order (GRAND-MO) algorithm, it is crucial to develop an efficient noise error patterns (NEPs) generator to facilitate its application in bursty channels. This paper proposes a practical hardware realization by generating the NEPs in a sequential manner. Based on classification of the four...
Article
Full-text available
Modern computer systems can use different types of hardware acceleration to achieve massive performance improvements. Some accelerators like FPGA and dedicated GPU (dGPU) need optimized data structures for the best performance and often use dedicated memory. In contrast, APUs, which are a combination of a CPU and an integrated GPU (iGPU), support s...
Preprint
Full-text available
For the hard guessing random additive noise decoding Markov order (GRAND-MO) algorithm used in bursty channels, this paper presents an efficient noise error patterns (NEPs) generator. By converting the NEPs generation process into practical engineering realization, the ''1" and ''0" burst permutations are generated in a sequentially manner. Then th...
Conference Paper
Full-text available
Accelerators became a wide-reaching solution for increasing computing systems' performance. However, they bring the trade-off between programming facility versus energy efficiency. FPGAs are highly energy-efficient accelerators, but complex to program. CGRA Overlays offers a more straightforward programming interface for FPGA and can use dataflows...
Preprint
Full-text available
Experimental particle physics demands a sophisticated trigger and acquisition system capable to efficiently retain the collisions of interest for further investigation. Heterogeneous computing with the employment of FPGA cards may emerge as a trending technology for the triggering strategy of the upcoming high-luminosity program of the Large Hadron...