Figure 8 - uploaded by Ken Hawick
Content may be subject to copyright.
Pipelined memory access: reading, processing and writing each column in 3 consecutive clock cycles increases overall processing throughput. 

Pipelined memory access: reading, processing and writing each column in 3 consecutive clock cycles increases overall processing throughput. 

Source publication
Article
Full-text available
Field Programmable Gate Arrays (FPGAs) have attracted recent attention as accelerators for a range of scientific ap-plications which had previously been only practicable on conventional general purpose programming platforms. We report on the performance scalability and software engi-neering considerations when FPGAs are used to accelerate performan...

Context in source publication

Context 1
... 7: Combinational logic for next state computation. To compute the next state for a column of cells three separate operations need to be performed. A memory read operation, next state computation and a memory write operation. Because next state computation requires fairly simple logic, memory read and write operations are relatively long in comparison. It is possible to pipeline these three operations to increase the overall throughput, as shown in figure 8. If the next computation logic was quite complex, it could be beneficial to also pipeline it by dividing it up into smaller computation stages; however, in this case it would not achieve any gains. We used Verilog HDL to describe the design. Verilog description of the processing element is shown in algorithm 1. It consists of two parts: first part is the always statement that describes a synchronous three-element shift register using three non-blocking assignment <= opera- tors. These assignments are executed simultaneously in a single clock cycle, using the value of each variable on the right hand side from the previous clock cycle. The value of this shift register is assigned to the reg out port to be passed to PE above and PE below. The second part consists of computation of the number of alive neighbors by summing their states and the computation of the next state by applying the game rules. This code, along with  ...

Similar publications

Article
Full-text available
Dynamic Circuit Specialization is used to optimize the implementation of a parameterized application on an FPGA. Instead of implementing the parameters as regular inputs, in the DCS approach these inputs are implemented as constants. When the parameter values change, the design is reoptimized for the new constant values by reconfiguring the FPGA. T...
Conference Paper
Full-text available
In this article we describe a proposal for a new architecture of the Fast Walsh-Hadamard Transform (FHT) based upon a Radix-R factorization that has the interesting property of generating an identical regular pattern between inputs/outputs for all stages that allows us to simplify significantly the stride permutation in each stage. This property ma...

Citations

... Cellular Agent-Based Models can broadly be characterised as having a large number of microscopically detailed agents arranged spatially so that there are definite visibility or information propagating relationships between nearby cells. Such models include cellular automata like the well known two state Game of Life model [9], or a three-state variant -the Game of Death [17]. Other successful models include systems for simulating: predator-prey systems [18]; social segregation [13]; materials propagation in oil wells [12]; and health systems including disease propagation [14]. ...
Conference Paper
Full-text available
Agent-based models (ABMs) continue to find uses for simulating many complex systems. Spatial or cellular ABMs are particularly useful when they contain a large number of simple microscopic agents. Visualising such many-component models remains challenging however and while 3D graphical rendering technologies can help considerably, sometimes being able to examine a physical model artifact is a better aid to understanding emergent model properties. Additive manufacturing techniques and platforms are becoming commercially affordable for routine use in 3d-printing model artifacts. We describe how an ABM such as the Eden growth model can generate complex 3D structures - effectively tumorous clusters - that can be 3d-printed as physical artifacts and which can be handled and examined directly. We describe some current 3d-print technologies, and the algorithms and software needed to 3d-print realisations of such simulation cellular models. We discuss the capabilities of present 3d-printing technologies and likely future directions of development, as well as what will be needed to make cellular model printing routine.