Sequential Result on Intel XEON E5420 and Intel CORE 2 Duo E7500

Source publication

Analysis of parallel multicore performance on Sobel Edge detector

Conference Paper

Jul 2011

This paper presents the parallel multicore Sobel edge algorithm, which parallelizes the traditional sequential Sobel edge detection algorithm on a multicore platform. Recent advances in multicore architecture can be exploited by a parallel programming paradigm that focuses on thread operations. The CPUs/cores provide more proce...

Task-Based Programming for Seismic Imaging: Preliminary Results

Conference Paper

Aug 2014

The hardware complexity of current supercomputers is forcing the High Performance Computing (HPC) community to reconsider parallel programming paradigms and standards. The high level of hardware abstraction provided by task-based paradigms makes them excellent candidates for writing portable codes that can consistently deliver high perform...

SPFC: An Effective Optimization for Vertex-Centric Graph Processing Systems

Article

Dec 2017

The real-world demand for mining big data and smart data with graph structure has led to active research on distributed graph processing. Many distributed graph processing systems [19], [22], [23] adopt a vertex-centric programming paradigm. In these systems, messages are passed between vertices to propagate the latest states. The communication e...

Figure 1. The computation performance of the modules in the CAMx model....

Figure 2. Heterogeneous porting scheme of the CAMx-CUDA model.

Figure 3. The calling and computation process of GPU-HADVPPM on the...

Figure 4. An example of parallel architecture with an MPI + CUDA hybrid...

Figure 5. SO2 and O3 concentrations outputted by the CAMx model for...

GPU-HADVPPM V1.0: a high-efficiency parallel GPU design of the piecewise parabolic method (PPM) for horizontal advection in an air quality model (CAMx V6.10)

Article

Aug 2023

With semiconductor technology gradually approaching its physical and thermal limits, graphics processing units (GPUs) are becoming an attractive solution for many scientific applications due to their high performance. This paper presents an application of GPU accelerators in an air quality model. We demonstrate an approach that runs a piecewise par...

Lattice Boltzmann Method: Parallel Computing and Collision term effect.

Conference Paper

Dec 2016

A Lattice Boltzmann solver is implemented using various techniques and the performance is discussed. Both Open Multi-Processing (OpenMP) and Message Passing Interface (MPI) parallelization techniques using the same numerical algorithm and employing different collision terms were executed. We compare the numerical solution of different programming p...

Performance model for Master/Worker hybrid applications

Conference Paper

Jul 2013

Master/worker is a commonly used parallel/distributed programming paradigm, and many applications are developed following it. The paradigm can be easily implemented using message passing libraries (MPI); moreover, the multicore features of current nodes can be exploited at the node level by applying thread parallelism (OpenM...

Membrane computing and image processing: a short survey

Article

Mar 2019

Membrane computing is a well-known research area in computer science inspired by the organization and behavior of live cells and tissues. Their computational devices, called P systems, work in parallel and distributed mode and the information is encoded by multisets in a localized manner. All these features make P systems appropriate for dealing with digital images. In this paper, some of the open research lines in the area are presented, focusing on segmentation problems, skeletonization and algebraic-topological aspects of the images. An extensive bibliography about the application of membrane computing to the study of digital images is also provided.

Parallelization of Gradient-based Edge Detection Algorithm on Multicore processors

Conference Paper

Apr 2018

Current computers are multi-core, with more than one physical core in a single microprocessor chip. Many applications in digital image processing are parallel in nature, so multi-core processors can be exploited to perform such computations in parallel. In this paper, the standard OpenMP threading library is used to speed up the edge detection operation on multicore processors. Different partitioning methods of the input image are tested, and their effect on the performance of the parallel implementation of the Sobel edge detection algorithm is analyzed. It is shown that horizontal partitioning of the image leads to better performance than vertical partitioning or two-dimensional block partitioning. Various numbers of image blocks are tested. It is shown that a number of blocks equal to 0.25 times the size of the cache line, and a number of threads double the number of physical cores, give the best performance of the parallel Sobel algorithm.
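The horizontal partitioning described in this abstract can be sketched as follows. This is a minimal illustration, not the paper's OpenMP implementation: it uses Python's standard `concurrent.futures` thread pool, the function names `sobel_rows` and `sobel_horizontal_partition` are hypothetical, the gradient magnitude is approximated as |Gx| + |Gy| (an assumption; the abstract does not say which form is used), and border pixels are simply left at 0. Because of Python's global interpreter lock, the sketch shows only how the rows are divided among workers and should not be expected to reproduce the reported speedups.

```python
from concurrent.futures import ThreadPoolExecutor

# Standard 3x3 Sobel kernels for the horizontal (GX) and vertical (GY) gradients.
GX = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
GY = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def sobel_rows(image, row_start, row_end):
    """Return |Gx| + |Gy| for rows [row_start, row_end); border pixels stay 0."""
    height, width = len(image), len(image[0])
    band = []
    for y in range(row_start, row_end):
        out_row = [0] * width
        if 0 < y < height - 1:  # skip the top and bottom border rows
            for x in range(1, width - 1):
                gx = gy = 0
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        pixel = image[y + dy][x + dx]
                        gx += GX[dy + 1][dx + 1] * pixel
                        gy += GY[dy + 1][dx + 1] * pixel
                out_row[x] = abs(gx) + abs(gy)
        band.append(out_row)
    return band


def sobel_horizontal_partition(image, num_bands=4):
    """Split the image into horizontal bands and process each band in a thread."""
    height = len(image)
    num_bands = max(1, min(num_bands, height))
    # Band boundaries. Each band reads its neighbouring rows directly from the
    # shared, read-only input image, so no halo rows need to be copied.
    bounds = [height * i // num_bands for i in range(num_bands + 1)]
    with ThreadPoolExecutor(max_workers=num_bands) as pool:
        bands = pool.map(
            lambda i: sobel_rows(image, bounds[i], bounds[i + 1]),
            range(num_bands),
        )
        # pool.map preserves band order, so the rows are reassembled top to bottom.
        return [row for band in bands for row in band]


# A 6x6 test image with a vertical step edge between columns 2 and 3.
IMAGE = [[0, 0, 0, 10, 10, 10] for _ in range(6)]
SEQUENTIAL = sobel_rows(IMAGE, 0, len(IMAGE))
PARALLEL = sobel_horizontal_partition(IMAGE, num_bands=3)
```

Each band covers whole rows, so a worker walks contiguous memory in a row-major image; this is one plausible reason horizontal partitioning could outperform vertical or block partitioning, although the abstract itself does not state the cause.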

Multi-Threaded Computation of the Sobel Image Gradient on Intel Multi-Core Processors Using OpenMP Library

Article

Apr 2016

Ahmed Sherif Zekri

The performance of applications executed on multi-core processors is not boosted by simply dividing the work among a team of threads and assigning them blindly to the CPU cores. Factors such as data access patterns in memory, how threads are allocated to the physical cores, and how the data are partitioned among the threads significantly affect performance. In this paper, we target the acceleration of the Sobel image gradient computation, which is important in segmenting images for further processing in computer vision and image analysis applications. We present a multi-threaded algorithm that uses the standard OpenMP threading library to parallelize the computations on two Intel multi-core processors. The effect of the parallelization factors on the performance of the proposed algorithm is evaluated using different image resolutions to draw accurate conclusions. Our results showed a maximum attained speedup close to the number of physical cores in the CPU, which is the maximum theoretical value.

Image edge detection based on rotating kernel transformation

Article

Jan 2015

An edge detection algorithm based on the improved Rotating Kernel Transformation, the IRKT edge detection method (IRKTE), is proposed in this paper. The algorithm adopts the line detection approach RKT and defines a new model of edge detection according to the direction difference between edge and smooth regions. In addition, an accurate edge location approach based on the edge normal direction is presented to overcome the wide edge width caused by a large-scale kernel in IRKT. Furthermore, an extension of the previous IRKT with weighted edge detection (IRKTEW) is proposed to improve noise resistance. A series of experiments is carried out on picture libraries with ground truth, and the performance is analyzed with ROC curves. The experimental results show that the proposed method can effectively detect edges under strong noise, and that edge detection performance is improved with the proposed approach.

SCALING PERFORMANCE OF TASK-INTENSIVE APPLICATIONS VIA MAPREDUCE PARALLEL PROCESSING

Conference Paper

Sep 2013

Task-intensive applications are synonymous with today's computing era. A large amount of computing resources is required to process such applications, so running them on an off-the-shelf personal computer (PC) leads to long execution times. Parallel processing via MapReduce provides more computing resources by connecting multiple PCs to execute a common task. Despite this, MapReduce parallel processing is mostly used for data-intensive applications. This research explores how MapReduce parallel processing scales the performance of task-intensive applications. The travelling salesman problem (TSP), solved with a brute-force approach, is used as the case study for the experiment. The experiment is conducted using 1, 2, 3, and 4 node configurations. Parallel processing criteria such as performance improvement, speedup, and efficiency are discussed as performance benchmarks. It is found that MapReduce parallel processing allows good performance scaling for task-intensive applications.
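The speedup and efficiency criteria mentioned in this abstract are conventionally defined as S(p) = T(1) / T(p) and E(p) = S(p) / p, where T(p) is the execution time on p nodes. The sketch below applies those standard definitions; the function names and the timings are hypothetical placeholders chosen for illustration and are not results from the paper.

```python
def speedup(sequential_time, parallel_time):
    """Speedup S(p) = T(1) / T(p)."""
    if sequential_time <= 0 or parallel_time <= 0:
        raise ValueError("execution times must be positive")
    return sequential_time / parallel_time


def efficiency(sequential_time, parallel_time, workers):
    """Efficiency E(p) = S(p) / p, the fraction of ideal linear speedup achieved."""
    if workers < 1:
        raise ValueError("workers must be at least 1")
    return speedup(sequential_time, parallel_time) / workers


# HYPOTHETICAL execution times in seconds for 1, 2, 3, and 4 nodes.
# These are illustrative placeholders, not measurements from any publication.
TIMINGS = {1: 120.0, 2: 64.0, 3: 45.0, 4: 36.0}

# Map each node count to its (speedup, efficiency) pair.
REPORT = {
    nodes: (
        speedup(TIMINGS[1], elapsed),
        efficiency(TIMINGS[1], elapsed, nodes),
    )
    for nodes, elapsed in TIMINGS.items()
}
```

An efficiency close to 1 means the added nodes are almost fully used; values that fall as the node count grows usually point to communication or coordination overhead.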

A methodology for speeding up edge and line detection algorithms focusing on memory architecture utilization

Article

Apr 2013
J SUPERCOMPUT

In this paper, a new methodology for speeding up edge and line detection algorithms is presented. It achieves improved performance over the state-of-the-art software library OpenCV (speedup from 1.35 up to 2.22) and other conventional implementations, on both general-purpose and embedded processors, by reducing the number of load/store and arithmetic instructions, the number of data cache accesses and data cache misses in the memory hierarchy, and the algorithm memory size. This is achieved by fully exploiting the combination of software and hardware parameters, which are considered simultaneously as one problem rather than separately. Furthermore, the edge and line detection algorithms have been simplified for a computer vision application on a Virtex-5 Xilinx FPGA using a Microblaze soft processor (detection and measurement of flow fronts in a microfluidic device); this achieves a speedup of up to 660 times compared with conventional software implementations.

Parallel approach of sobel edge detector on multicore platform

Article

Aug 2011

Parallelization of Sobel Edge Detection Algorithm

Article

Image size is a critical factor in image processing, so large images must be processed in a short time, especially in medical applications. In this paper, we present a new design for parallelizing the Sobel edge detection algorithm in order to decrease the computation time. The parallel algorithm is implemented using the MPI library and a decentralized architecture. The new methodology relies on domain and function decomposition to improve the results. Experiments are run on one machine, and the results demonstrate that the new design gives good results.

Bitmap Image Processing and Edge Detection Using Sobel Algorithm and Multi-Threaded Parallel Techniques

Conference Paper

Oct 2021

Influential Researcher Identification in Academic Network Using Rough Set Based Selection of Time-Weighted Academic and Social Network Features

Chapter

Jan 2020

Researchers entering a new research area are interested in knowing the current research trends, popular publications, and influential (popular) researchers in that area in order to initiate their research. In this work, we attempt to determine the influential researchers for a specific topic. The active participation of researchers in both academic and social network activities signifies their influence level across time. The content and frequency of social interaction with a researcher reflects his or her influence. In our system, appropriate time-based social and academic features are selected using the entropy-based feature selection approach of rough set theory. A three-layer model comprising semantically related concepts, researchers, and social relations is developed based on the appropriate (influential) features. The researchers' topic trajectories are identified and recommended using the spreading activation algorithm. To cope with the scalable academic network, the MapReduce paradigm has been employed in the spreading activation algorithm.
