Netlist representation: (a) schematic graphical representation, (b) BLIF textual representation, and (c) netgraph representation of the given netlist.

Source publication

High-Speed Event-Driven RTL Compiled Simulation

Conference Paper

Jul 2004

In this paper we present a new approach for generating high-speed optimized event-driven register transfer level (RTL) compiled simulators. The generation of the simulators is part of our BUILDABONG(7) framework, which aims at architecture and compiler co-generation for special purpose processors. The main focus of the paper is on the transformatio...

Context 1

... how a given netlist is transformed into such a graph representation. basic logic elements. A unidirectional net $f \in F$, which interconnects $n + m$ elements, is represented by $f = (\{v_1, \ldots, v_n\}, \{u_1, \ldots, u_m\})$, where $v_1, \ldots, v_n \in V$ are the source nodes and $u_1, \ldots, u_m \in V$ are the target nodes of net $f$. Example 1. In Fig. 1, a netlist is shown with $|V| = 10$ elements. Net $f_1$ is given by $f_1 = (\{r_4\}, \{c_2, c_3\})$. Nodes named $r_i$ denote sequential elements, whereas nodes named $c_i$ denote combinational (i.e., state-free) logic ...

Context 2

... 2. In Fig. 1, a netlist and its netgraph $G = (V, E)$ are shown. The subset of combinational elements $V_c = \{c_2, c_3, c_4, c_6\}$ is shown as circles, and the subset of registers or sequential vertices $V_r = \{r_3, r_4, r_5, r_6, r_9, r_{10}\}$ is represented by ...

Context 3

... Furthermore, all of the nets are represented as directed edges $e \in E$ of a netgraph $G$. When a net contains an $n : m$ connection, it is transformed into $n \times m$ directed edges of the netgraph (in Example 1, $f_1$ is a $1 : 2$ connection, which is transformed into the edges $(r_4, c_2)$ and $(r_4, c_3)$ of the graph $G$ in Fig. 1 (c)). Given such a netgraph, a simple procedure to determine the initial sensitivity-update-mappings could be as follows: if there exists a directed path from one register $v_{r1}$ to another register $v_{r2}$ and there are no other sequential elements on the path between these registers, then if the value of $v_{r1}$ changes, $v_{r2}$ has to ...
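The expansion of a net into directed edges described in this excerpt can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `net_to_edges` and the tuple-of-lists encoding of a net are assumptions made here for illustration. Only the net f1 = ({r4}, {c2, c3}) is taken from Example 1.

```python
from itertools import product


def net_to_edges(net):
    """Expand a net (sources, targets) into n x m directed edges.

    `net` is a pair of sequences: the source nodes and the target nodes.
    This encoding is an assumption of the sketch, not taken from the paper.
    """
    sources, targets = net
    # Every source is connected to every target by one directed edge.
    return [(v, u) for v, u in product(sources, targets)]


# Net f1 from Example 1: one source (r4) and two targets (c2, c3).
f1 = (["r4"], ["c2", "c3"])
edges = net_to_edges(f1)
print(edges)  # [('r4', 'c2'), ('r4', 'c3')]
```

A net with two sources and three targets would, under the same assumption, yield 2 × 3 = 6 edges.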

Context 4

... on the path between these registers are no other sequential elements, then if the value of $v_{r1}$ changes, $v_{r2}$ has to be updated by evaluating the path of combinational elements in between. A set of such initial sensitivity-update-mappings and evaluation paths can be obtained by a search algorithm such as depth-first search (DFS). For the example in Fig. 1 (c), the initial sensitivity-update-mappings are extracted in (a): In a technical implementation, for instance, registers $r_6$ and $r_9$ would have to be updated twice if $r_3$ and $r_4$ had changed their values compared to the previous simulation ...
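The DFS-based extraction mentioned in this excerpt might look roughly like the following Python sketch. It is a sketch under stated assumptions, not the paper's algorithm: the function name `sensitivity_update_mappings` is invented here, and the edge list is hypothetical. Only the edges (r4, c2) and (r4, c3) come from Example 1; the remaining edges are made up and do not reproduce Fig. 1 (c).

```python
def sensitivity_update_mappings(edges, registers):
    """For each register, collect the registers reachable through
    combinational nodes only (iterative depth-first search)."""
    successors = {}
    for src, dst in edges:
        successors.setdefault(src, []).append(dst)

    mappings = {}
    for reg in registers:
        reached = set()   # registers that must be updated if `reg` changes
        visited = set()   # combinational nodes already expanded
        stack = list(successors.get(reg, []))
        while stack:
            node = stack.pop()
            if node in registers:
                # A sequential element ends the path; do not search past it.
                reached.add(node)
                continue
            if node in visited:
                continue
            visited.add(node)
            stack.extend(successors.get(node, []))
        mappings[reg] = reached
    return mappings


# Hypothetical netgraph: only the first two edges are from Example 1.
edges = [
    ("r4", "c2"), ("r4", "c3"),
    ("c2", "r6"), ("c3", "c4"), ("c4", "r9"),
    ("r3", "c4"), ("r9", "c6"), ("c6", "r10"),
]
registers = {"r3", "r4", "r6", "r9", "r10"}

mappings = sensitivity_update_mappings(edges, registers)
for reg in sorted(mappings):
    print(reg, "->", sorted(mappings[reg]))
```

In this hypothetical graph, r9 is reachable from both r3 and r4, so a naive implementation that updates targets once per changed source would update r9 twice when both change. This is the kind of redundancy the excerpt refers to.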

The (a,d)-ascending subgraph decomposition

Article

Jan 2010

Let G be a graph of size q and let a, n, d be positive integers for which $\frac{n}{2}(2a + (n-1)d) \le q < \frac{n+1}{2}(2a + nd)$. Then G is said to have an (a, d)-ascending subgraph decomposition into n parts ((a, d)-ASD) if the edge set of G can be partitioned into n non-empty sets generating subgraphs $G_1, G_2, G_3, \ldots, G_n$ without isolated vertices such that each...

Computing H-Joins with Application to 2-Modular Decomposition

Article

Oct 2014

We present here a general framework to design algorithms that compute H-joins. For a given bipartite graph H, we say that a graph G admits an H-join decomposition, or simply an H-join, if the vertices of G can be partitioned into |H| parts connected as in H. This graph H is a kind of pattern that we want to discover in G. This framework allows us to pre...

Sharp Disjunctive Decomposition for Language Emptiness Checking

Conference Paper

Nov 2002

We propose a “Sharp” disjunctive decomposition approach for language emptiness checking which is specifically targeted at “Large” or “Difficult” problems. Based on the SCC (Strongly-Connected Component) quotient graph of the property automaton, our method partitions the entire state space so that each state subspace accepts a subset of the language...

On the use and effect of graph decomposition in qualitative spatial and temporal reasoning

Conference Paper

Apr 2015

We survey the use and effect of decomposition-based techniques in qualitative constraint-based reasoning, and clarify the notions of a tree decomposition, a chordal graph, and a partitioning graph, and their implication with a particular constraint property that has been extensively used in literature, namely, patchwork. As a consequence, we prove...

Network Decomposition using Evolutionary Algorithms in Power Systems

Article

Jan 2011

A power system has a highly interconnected network that requires intense computational effort and resources for centralised control. Distributed computing is a solution to this and requires the system to be partitioned optimally into clusters. Network partitioning is an optimization problem whose objective is to minimise the number of nodes in a cl...

Constructing fast and cycle-accurate simulators for configurable accelerators using C++ templates

Conference Paper

Oct 2017

To quickly prototype accelerator/compiler co-designs, fast and highly accurate architectural simulators are indispensable. They must be fast to keep design iteration times low; they must be highly accurate to make simulation results meaningful. In this paper, we describe how to construct such fast, cycle-accurate simulators from an architectural model by using C++ templates. Not only are templates fully resolved at compile time, thus offering ample opportunity for optimization, they also aptly mirror synthesis-time parameterization of accelerators. For each hardware component, we encode these architecture parameters in a C++ type and construct a class templated on this type. Hierarchically composing the component classes then yields the overall simulator. To demonstrate our constructed simulators' speedup, we construct two simulators for a lightweight VLIW processor, one with and one without templates, and measure their performance: the templated simulator is about 4.85 times faster. Their execution speed makes our simulators well-suited for compiler validation and prototyping accelerator features.

MAML - An Architecture Description Language for Modeling and Simulation of Processor Array Architectures Part I

Article

Mar 2006

A Generic Framework for Rapid Prototyping of System-on-Chip Designs.

Conference Paper

Jan 2006

The integration of different Intellectual Property (IP) cores to modern System-on-Chip (SoC) designs becomes more and more an important topic because of the benefits in the overall system performance and the design costs. In this paper we present a new generic framework consisting of a graphical user interface with an extendable highly parameterizable IP component library for convenient SoC architecture entry, as well as software tools, which provide an automatic generation of fast cycle-accurate simulators for verification purposes and synthesizable HDL code for hardware synthesis. Because the communication of the single IP cores also plays an important role, our IP core library includes an open-source bus component, which is used in a case study design. Topics: System-on-a-chip: design and methodology, Case studies, FPGA-based design

Co-Design of Massively Parallel Embedded Processor Architectures

Conference Paper

Jan 2005

Automatic and Optimized Generation of Compiled High-Speed RTL Simulators

Article

Sep 2004

In this paper we focus on the derivation of optimal code when generating high-speed event-driven compiled simulators for processor architectures described on register transfer level (RTL). The simulators' generation is part of a framework, which aims at architecture and compiler co-generation for special purpose processors. The main contribution of this paper is an efficient algorithm to generate optimal if-then-else structures in order to perform the update cycle during the event-driven simulation process. Our approach guarantees that during one simulation cycle a possible change of each register content is checked exactly once and that each register is updated at most once. Additionally, the proposed technique minimizes the code size of the generated simulator. The simulator's superior performance compared to an existing commercial simulator is shown. Finally, we demonstrate the pertinence of our approach by simulating a MIPS processor.

HW/SW Co-Optimization and Co-Protection

Chapter

Mar 2024

Khaled Salah Mohamed

Optimization is the process of finding the best input variable values from among all possibilities without explicitly evaluating each possibility.

RepCut: Superlinear Parallel RTL Simulation with Replication-Aided Partitioning

Conference Paper

Mar 2023

Tango: An Optimizing Compiler for Just-In-Time RTL Simulation

Conference Paper

Mar 2020

POSTER: Tango: An Optimizing Compiler for Just-In-Time RTL Simulation

Conference Paper

Sep 2019

Scheduling Techniques for High-Throughput Loop Accelerators

Article

Aug 2009

Frank Hannig
