Figure 2 - uploaded by Daniel Ritter
Basic integration aspects on hardware.

Source publication
Conference Paper
Full-text available
The growing number of (cloud) applications and devices massively increases the communication rate and volume, pushing integration systems to their (throughput) limits. While the usage of modern hardware like Field Programmable Gate Arrays (FPGAs) led to low latency when employed for query and event processing, application integration adds yet unexpl...

Contexts in source publication

Context 1
... Channel on Hardware. The message channels decouple sending and receiving endpoints or processors and denote the communication between them. Thereby, the sending endpoint writes data to the channel, while the receiving endpoint reads the data for further processing. Our message channel definition on hardware is depicted in Fig. 2. We use hardware signals and data lines to represent the control and data flow through a message channel. The channels carry a unique identifier as id, the message length as length, and the body as data in 8-bit chunks from the previously defined message over the data line (data(0..7)). To indicate that a message is sent over the channel, we added a message signal as message, which is set to one (i.e., high) while a message is being sent, even if there is currently no valid data on the data line. The message signal is zero (i.e., low) only between messages (i.e., the channel is ready to receive another message). For the transport of the data to the subsequent processor we define an enable signal as enable, which is high when valid data is on the data line and low when there is no valid data on the data line. The id and length are separate lines, which are constant, when the message line is ...
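The signal-level channel just described can be summarized as a port declaration. The following VHDL fragment is our own illustration, not code published with the paper; the entity name and the generic widths for id and length are assumptions (only the data line is fixed to 8 bits by the excerpt).

library ieee;
use ieee.std_logic_1164.all;

-- Illustrative interface of one channel endpoint (signal names per Fig. 2;
-- id/length widths assumed, since the excerpt leaves them open).
entity message_channel_endpoint is
  generic (
    ID_WIDTH     : natural := 8;   -- assumption
    LENGTH_WIDTH : natural := 16   -- assumption
  );
  port (
    clk       : in  std_logic;
    id        : in  std_logic_vector(ID_WIDTH - 1 downto 0);      -- constant while message is high
    length    : in  std_logic_vector(LENGTH_WIDTH - 1 downto 0);  -- constant while message is high
    data      : in  std_logic_vector(7 downto 0);                 -- body in 8-bit chunks, data(0..7)
    message   : in  std_logic;  -- high for the whole message, low only between messages
    enable    : in  std_logic;  -- high only when data carries a valid chunk
    readReady : out std_logic   -- back-pressure toward the sender (see Context 2)
  );
end entity message_channel_endpoint;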
Context 2
... the FPGA, we define flow control similar to [3], where it is exclusively used for the synchronous communication between remote endpoints. For the back-pressure between message processors (i.e., no TCP support), we cannot reject messages atomically, because the stream might already be processed partially. Therefore, we opted for an approach with small FIFO queues in each processor, which buffer message data that cannot be immediately processed by the subsequent processor and thus ensure that no message data is lost. The receiving processor signals this by setting its readReady to low (cf. Fig. 2). The FIFO queues can be represented on hardware using flip-flops (FFs), Block RAM (BRAM), or built-in FIFOs. Since FFs can only store one bit at a time and are very important for the logic of message processors, we chose BRAM. Although BRAM is a limited resource as well, it can be more easily extended by on-board DRAM to buffer larger messages. If the queue limit is exceeded and the successor processor is not ready yet (i.e., readReady low), the current processor notifies its sender by setting its readReady to ...
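A minimal sketch of the described back-pressure scheme follows. It is illustrative only: the entity name, the depth, and the almost-full threshold of four entries are our own choices, not from the paper. A synchronous FIFO buffers 8-bit chunks in a storage array intended to infer BRAM, and readReady is deasserted shortly before the queue fills so chunks already in flight still find a slot.

library ieee;
use ieee.std_logic_1164.all;
use ieee.numeric_std.all;

entity chunk_fifo is
  generic (DEPTH : natural := 512);  -- deep enough that synthesis maps the array to BRAM
  port (
    clk       : in  std_logic;
    rst       : in  std_logic;
    wr_en     : in  std_logic;                     -- predecessor's enable
    wr_data   : in  std_logic_vector(7 downto 0);  -- one 8-bit message chunk
    rd_en     : in  std_logic;                     -- successor consumes a chunk
    rd_data   : out std_logic_vector(7 downto 0);
    rd_valid  : out std_logic;
    readReady : out std_logic                      -- back-pressure toward the sender
  );
end entity;

architecture rtl of chunk_fifo is
  type ram_t is array (0 to DEPTH - 1) of std_logic_vector(7 downto 0);
  signal ram            : ram_t;
  signal wr_ptr, rd_ptr : natural range 0 to DEPTH - 1 := 0;
  signal fill           : natural range 0 to DEPTH := 0;
begin
  process (clk)
    variable do_wr, do_rd : boolean;
  begin
    if rising_edge(clk) then
      rd_valid <= '0';
      if rst = '1' then
        wr_ptr <= 0; rd_ptr <= 0; fill <= 0;
      else
        do_wr := wr_en = '1' and fill < DEPTH;
        do_rd := rd_en = '1' and fill > 0;
        if do_wr then
          ram(wr_ptr) <= wr_data;
          wr_ptr <= (wr_ptr + 1) mod DEPTH;
        end if;
        if do_rd then
          rd_data  <= ram(rd_ptr);  -- registered read, BRAM friendly
          rd_valid <= '1';
          rd_ptr   <= (rd_ptr + 1) mod DEPTH;
        end if;
        -- track the fill level for the almost-full threshold
        if do_wr and not do_rd then
          fill <= fill + 1;
        elsif do_rd and not do_wr then
          fill <= fill - 1;
        end if;
      end if;
    end if;
  end process;

  -- deassert a few entries early so in-flight chunks are not lost
  readReady <= '0' when fill >= DEPTH - 4 else '1';
end architecture;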

Similar publications

Article
Full-text available
Many advances have been made in the field of computer vision. Several recent research trends have focused on mimicking human vision by using stereo vision systems. In multi-camera systems, a calibration process is usually implemented to improve the accuracy of the results. However, these systems generate a large amount of data to be processed; therefore,...

Citations

... Integration developers and experts provide semantically meaningful pattern realizations in expressive specialised languages such as timed DB-nets [6] (1), but such languages are cumbersome for process modelers to use and make automatic identification of improvements difficult. Process modelers, on the other hand, prefer higher-level languages and notations to specify processes as a composition of integration patterns [3,4,5,11] (2). However, the composition ... (Figure 1: End-to-end perspective from integration process modeling to verifiable execution semantics and automatic, correctness-preserving improvements; current gaps or missing aspects in red.) ...
... [Table excerpt, flattened in extraction: instance scheduling, parallelization [24], ordering, materialization, arguments, algebraic [25]; added (n/a): 13; expert knowledge from business process [26], workflow surveys [10,27], data integration [28], distributed applications [29], EAI [4,5,11,30], placement [31,32], resilience [33]; removed (-1): classification only [15]; overall: 616 / 23.] ...
... Process simplification can be achieved by removing redundant patterns, e.g., via Redundant Subprocess Removal (removing one of two identical sub-flows), Combine Sibling Patterns (removing one of two identical patterns), or Unnecessary Conditional Fork (removing redundant branching). As far as we know, the only practical study of combining sibling patterns can be found in Ritter et al. [11], showing moderate throughput improvements. These simplifications require a formalization of patterns as a control graph structure, which helps to identify and deal with the structural changes. ...
Preprint
Full-text available
Enterprise Application Integration deals with the problem of connecting heterogeneous applications and is the centerpiece of current on-premise, cloud and device integration scenarios. For integration scenarios, structurally correct composition of patterns into processes and improvements of integration processes are crucial. In order to achieve this, we formalize compositions of integration patterns based on their characteristics, and describe optimization strategies that help to reduce the model complexity and improve the process execution efficiency using design-time techniques. Using the formalism of timed DB-nets, a refinement of Petri nets, we model integration logic features such as control and data flow, transactional data storage, compensation and exception handling, and time aspects that are present in recurring solutions as separate integration patterns. We then propose a realization of optimization strategies using graph rewriting, and prove that the optimizations we consider preserve both structural and functional correctness. We evaluate the improvements on a real-world catalog of pattern compositions, containing over 900 integration processes, and illustrate the correctness properties in case studies based on two of these processes.
... Such minimization is achieved by reducing the intensity of data exchanges, especially between remote transmitters and receivers [1]. For example, the data exchanges between the processor and memory during the execution of programs can account for up to 75% of the energy consumed, due to wire heating and high-current buffer switching [2]. ...
Article
The use of lossless compression in application-specific computers provides advantages such as a minimized amount of memory, increased interface bandwidth, reduced energy consumption, and improved self-testing systems. The article discusses known lossless compression algorithms with the aim of choosing the most suitable one for implementation in a hardware-software decompressor. Among them, the Lempel-Ziv-Welch (LZW) algorithm makes it possible to realize the associative memory of the decompressor dictionary in the simplest way, by sequentially reading the symbols of the decompressed word. An analysis of existing hardware decompressor implementations showed that their main design goal was to increase bandwidth at the expense of higher hardware costs and limited functionality. It is proposed to implement the LZW decompressor in a hardware module based on a microprocessor core with a specialized instruction set. For this, a processor core with a stack architecture was selected, which the authors developed for file grammar analysis tasks. An additional memory block for storing the dictionary and an input buffer, which converts the byte stream of the packed file into a sequence of unpacked codes, are added to it. The processor core instruction set is adjusted both to speed up decompression and to reduce hardware costs. The decompressor is described in the Very High Speed Integrated Circuit Hardware Description Language (VHDL) and is implemented in a field-programmable gate array (FPGA). At a clock frequency of up to two hundred megahertz, the average throughput of the decompressor is more than ten megabytes per second. As a result of the hardware-software implementation, an LZW decompressor is developed that has approximately the same hardware costs as a purely hardware decompressor but lower bandwidth, traded for the flexibility and multifunctionality provided by the processor core software. In particular, a decompressor for Graphics Interchange Format (GIF) files is implemented on the basis of this device in an FPGA for dynamic visualization of patterns on an embedded system display.
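As an illustration of the input-buffer stage mentioned in this abstract, the following VHDL sketch repacks an 8-bit byte stream into fixed-width LZW codes. It is our own minimal example, not the authors' implementation: the entity name, ports, and the fixed CODE_WIDTH are all assumptions (classic LZW actually grows its code width from 9 to 12 bits as the dictionary fills).

library ieee;
use ieee.std_logic_1164.all;
use ieee.numeric_std.all;

entity lzw_code_unpacker is
  generic (CODE_WIDTH : natural := 12);  -- fixed width assumed; must be >= 8
  port (
    clk        : in  std_logic;
    rst        : in  std_logic;
    byte_in    : in  std_logic_vector(7 downto 0);  -- packed file stream
    byte_valid : in  std_logic;
    code_out   : out std_logic_vector(CODE_WIDTH - 1 downto 0);
    code_valid : out std_logic
  );
end entity;

architecture rtl of lzw_code_unpacker is
  -- holds a partial code (< CODE_WIDTH bits) plus one incoming byte
  signal buf : unsigned(CODE_WIDTH + 7 downto 0) := (others => '0');
  signal cnt : natural range 0 to CODE_WIDTH + 8 := 0;
begin
  process (clk)
    variable nbuf : unsigned(buf'range);
    variable ncnt : natural range 0 to CODE_WIDTH + 8;
  begin
    if rising_edge(clk) then
      code_valid <= '0';
      if rst = '1' then
        buf <= (others => '0');
        cnt <= 0;
      elsif byte_valid = '1' then
        -- shift the new byte in below the bits already collected
        nbuf := shift_left(buf, 8) or resize(unsigned(byte_in), buf'length);
        ncnt := cnt + 8;
        if ncnt >= CODE_WIDTH then
          -- the CODE_WIDTH most significant collected bits form the next code
          code_out   <= std_logic_vector(resize(
                          shift_right(nbuf, ncnt - CODE_WIDTH), CODE_WIDTH));
          code_valid <= '1';
          ncnt := ncnt - CODE_WIDTH;
        end if;
        buf <= nbuf;
        cnt <= ncnt;
      end if;
    end if;
  end process;
end architecture;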
... With the advent of cloud computing, many applications that used to operate on-premises in companies have been offered as cloud services. In this scenario, software ecosystems have become even more heterogeneous, increasing the need for integration amongst them so that they work synchronously and support business processes. However, many of these applications still need to be adapted to operate in a cloud computing context, so they can keep or even improve the performance they once achieved by running locally (Harman et al., 2013; Linthicum, 2017; Ritter et al., 2017). The main user requirement on the software is performance. ...
Article
Full-text available
Companies' software ecosystems, composed of local applications and cloud computing services, are built by connecting integration platforms and applications. Run-time systems are arguably the most significant components for integration platform performance. Our literature review identified that most integration run-time systems adopt a global thread pool configuration. However, it is possible to configure local thread pools to increase the performance of run-time systems. This article compares two thread pool configurations by simulating the execution of a real integration problem. Results show that the execution performance of the local pool configuration exceeds that of the global pool in high-workload scenarios. These results were validated by rigorous statistical analysis.
... If the database system is network-attached, we choose a SmartNIC. This placement was also found to be efficient in related data processing domains like data-intensive messaging [103,104]. If the database system is only part of a larger architecture and not network-attached, we choose the near-data approach. ...
Preprint
Full-text available
Non-relational database systems (NRDS), such as graph, document, key-value, and wide-column stores, have gained much attention in various trending (business) application domains like smart logistics, social network analysis, and medical applications, due to their data model variety and scalability. The broad data variety and sheer size of datasets pose unique challenges for system design and runtime (incl. power consumption). While CPU performance scaling becomes increasingly difficult, we argue that NRDS can benefit from adding field-programmable gate arrays (FPGAs) as accelerators. However, FPGA-accelerated NRDS have not been systematically studied yet. To facilitate understanding of this emerging domain, we explore the fit of FPGA acceleration for NRDS with a focus on data model variety. We define the term NRDS class as a group of non-relational database systems supporting the same data model. This survey describes and categorizes the inherent differences and non-trivial trade-offs of relevant NRDS classes, as well as their commonalities, in the context of common design decisions when building such a system with FPGAs. For example, we found in the literature that for key-value stores the FPGA should be placed into the system as a smart network interface card (SmartNIC) to benefit from direct access of the FPGA to the network. However, more complex data models and processing of other classes (e.g., graph and document) commonly require more elaborate near-data or socket accelerator placements, where the FPGA respectively has sole or shared access to main memory. Across the different classes, FPGAs can be used as a communication layer or for the acceleration of operators and data access. We close with open research and engineering challenges to outline the future of FPGA-accelerated NRDS.
... In this section we collect and discuss EAI optimization objectives in the context of classical EAI [21,15] and emerging application integration scenarios [31]. The latter results in new EAI challenges and solutions, which are represented in this work by our studies on "data-aware" message processing solution spaces: dealing with high velocity and increasing message volume through table-centric processing [?,28] and streaming on dataflow (hardware) architectures [29], as well as new message format variety aspects in terms of multimedia integration [33]. Figure 2 gives a high-level view of the classical system architecture (based on [30]), evolved by new components for multimedia integration (from [33]). ...
... The work on vectorized integration patterns [?,28] illustrates the trade-off between immense message throughput gains when processing sets of messages and a reduced overall latency (throughput → Vectorization). Furthermore, the message throughput can be increased by processing messages in multiple parallel sub-processes, e.g., on separate hardware resources [29] ( → Parallelization). The message stream [29] and multimedia integration [33] studies showed decreasing message throughput for increasing message sizes. ...
... While message indexing reached its limits for increasing multimedia data [33], keeping message sizes smaller helped throughout the experiments ( → Data Reduction). ...
Preprint
Full-text available
The discipline of Enterprise Application Integration (EAI) is the centrepiece of current on-premise, cloud and device integration scenarios. However, the building blocks of integration scenarios, i.e., essentially compositions of Enterprise Integration Patterns (EIPs), are only informally described, and thus their composition takes place in an informal, ad-hoc manner. This leads to several issues, including a currently missing optimization of application integration scenarios. In this work, we collect and briefly explain the usage of process optimizations from the literature for integration scenario processes as a catalog.
... [Table excerpt, flattened in extraction: instance scheduling, parallelization [46], ordering, materialization, arguments, algebraic [19]; added (n/a): 8; expert knowledge from business process [45], workflow surveys [27,28], data integration [12], distributed applications [8,9], EAI [35,36]; removed (-1): classification only [44]; overall: 616 / 18.] ... integration scenarios. With our approach, we can show that 81% of the original scenarios from 2015 and still up to 52% of the current SAP CPI content from 2017 could be improved through a parallelization of scenario parts. ...
... Process simplification can be achieved by removing redundant patterns, e.g., via Redundant Subprocess Removal (removing one of two identical sub-flows), Combine Sibling Patterns (removing one of two identical patterns), or Unnecessary Conditional Fork (removing redundant branching). As far as we know, the only practical study of combining sibling patterns can be found in Ritter et al. [36], showing moderate throughput improvements. These simplifications require a formalization of patterns as a control graph structure (R1), which helps to identify and deal with the structural change representation. ...
... Previous work targeting process simplification includes Böhm et al. [11] and Habib, Anjum and Rana [22]. [Table excerpt, flattened in extraction: process simplification strategies — Redundant Sub-process Removal [11], Combine Sibling Patterns [11,22] (cf. [36]), Unnecessary Conditional Fork [11,45]; OS-2 Data Reduction — Early-Filter [11,19,22,31,45] (cf. [36]), Early-Mapping [11,19,22] (cf. [36,39]), Early-Aggregation [11,19,22] (cf. [39]), Claim Check [11,19], Early-Split [36] (cf. [36,39]); with qualitative +/- ratings per strategy.] ...
Conference Paper
Full-text available
Enterprise Application Integration is the centerpiece of current on-premise, cloud and device integration scenarios. We describe optimization strategies that help reduce the model complexity, and improve the process execution using design time techniques. In order to achieve this, we formalize compositions of Enterprise Integration Patterns based on their characteristics, and propose a realization of optimization strategies using graph rewriting. The framework is successfully evaluated on a real-world catalog of pattern compositions, containing over 900 integration scenarios.
... of message throughput for route branching, as well as the degree of distribution and the resource/energy consumption, have been discovered [5,7]. ...
... Similar to the work in related domains (e.g., database management [3], complex event processing [9]), we studied the idea of moving EAI processing to re-configurable hardware (e.g., FPGAs), embodying a dataflow architecture [1], closer to the network [5]. In this talk we summarize and discuss the resulting challenges and opportunities, e.g., ...
... request-reply [2] vs. (a)synchronous streaming [5]), non-functional aspects (e.g., security, exception handling, multi-tenancy [6]) and optimization
• Message Endpoints: the impact of hardware-accelerated EAI on "conventional" process integration endpoints (e.g., business applications).
• (Cloud) Operations: the impact on data center (blueprints), cloud architectures and operations (e.g., hardware virtualization) of a shift to re-configurable hardware dataflow architectures in the context of application integration. ...
Conference Paper
In this talk we set the emerging domain of application integration into the context of recent hardware advances.