Table 6. Workload model specification for a social event calendar application.

Source publication
Article
Full-text available
We present techniques for the characterization, modeling and generation of workloads for cloud computing applications. Methods for capturing the workloads of cloud computing applications in two different models, a benchmark-application model and a workload model, are described. We give the design and implementation of a synthetic workload generator that accepts t...

Context in source publication

Context 1
... Model Specification: The workload model specifications are formulated as an XML document that is input to the GT-CSWL code generator. Table 6 shows the specification of the workload model for a social event calendar application. The workload model contains specifications of the distributions for workload model attributes such as think time, inter-session interval and session length. ...
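The paper's exact schema is not reproduced here; the following minimal sketch illustrates what such an XML specification and a parser for it could look like. Element and attribute names are assumptions for illustration, not the actual GT-CSWL schema.

```python
# Minimal sketch of a workload-model XML spec and its parsing.
# Element/attribute names are illustrative; the GT-CSWL schema may differ.
import xml.etree.ElementTree as ET

SPEC = """
<workloadmodel application="social-event-calendar">
  <attribute name="thinktime" distribution="exponential" mean="5.0"/>
  <attribute name="intersession" distribution="exponential" mean="60.0"/>
  <attribute name="sessionlength" distribution="normal" mean="12" stddev="3"/>
</workloadmodel>
"""

def parse_spec(xml_text):
    """Return {attribute name: distribution parameters} from the spec."""
    root = ET.fromstring(xml_text)
    model = {}
    for attr in root.findall("attribute"):
        params = dict(attr.attrib)
        name = params.pop("name")
        model[name] = params
    return model

if __name__ == "__main__":
    print(parse_spec(SPEC))
```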

Citations

... However, it is limited to DSPS applications that are centrally hosted in the cloud. In addition, ref. [29] targets cloud-based applications. Hence, while standard holistic benchmarks exist in other related areas such as big data [5,7,[30][31][32], databases [3,4], RDF stream processing [33,34], the same cannot be said for IoT. ...
Article
Full-text available
With the increasing growth of IoT applications in various sectors (e.g., manufacturing, healthcare, etc.), we are witnessing a rising demand for IoT middleware platforms that host such IoT applications. Hence, there arises a need for new methods to assess the performance of IoT middleware platforms hosting IoT applications. While there are well-established methods for performance analysis and testing of databases, and some for the Big Data domain, such methods still lack support for IoT due to the complexity and heterogeneity of IoT applications and their data. To overcome these limitations, in this paper we present a novel situation-aware IoT data generation framework, namely SA-IoTDG. Given that a majority of IoT applications are event or situation driven, we leverage a situation-based approach in SA-IoTDG for generating situation-specific data relevant to the requirements of the IoT applications. SA-IoTDG includes a situation description system, a SysML model to capture IoT application requirements and a novel Markov chain-based approach that supports transitions of the generated IoT data based on the corresponding situations. The proposed framework will be beneficial for both researchers and IoT application developers to generate IoT data for their applications and enable them to perform initial testing before actual deployment. We demonstrate the proposed framework using a real-world example from IoT traffic monitoring. We conduct experimental evaluations to validate the ability of SA-IoTDG to generate IoT data similar to real-world data, as well as to enable performance evaluations of IoT applications deployed on different IoT middleware platforms using the generated data. Experimental results present some promising outcomes that validate the efficacy of SA-IoTDG. Lessons learnt from the experiments conclude the paper.
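As a rough illustration of the Markov-chain idea, the sketch below generates situation-tagged sensor readings from a two-state transition matrix. The states, transition probabilities and value ranges are invented for the example; SA-IoTDG's actual model is far richer.

```python
# Sketch of situation-driven data generation with a Markov chain.
# States, probabilities and sensor ranges are invented for illustration.
import random

TRANSITIONS = {                      # P(next situation | current situation)
    "free_flow": {"free_flow": 0.8, "congested": 0.2},
    "congested": {"free_flow": 0.3, "congested": 0.7},
}
SPEED_RANGE = {                      # per-situation value ranges (km/h)
    "free_flow": (60, 100),
    "congested": (5, 30),
}

def generate(n, state="free_flow", seed=42):
    rng = random.Random(seed)
    readings = []
    for _ in range(n):
        low, high = SPEED_RANGE[state]
        readings.append({"situation": state,
                         "speed_kmh": round(rng.uniform(low, high), 1)})
        # draw the next situation from the current row of the chain
        state = rng.choices(list(TRANSITIONS[state]),
                            weights=TRANSITIONS[state].values())[0]
    return readings

if __name__ == "__main__":
    for r in generate(5):
        print(r)
```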
... This problem has a direct impact on any predictive autoscaling system not tuned to the expected workload pattern; for example, a system adjusted to receive a periodic workload will underperform if it changes to an unpredictable workload. Creating a sufficiently generic predictive autoscaling system capable of using any workload on any cloud service is a very complex task [9] due to the limited capabilities of this type of system caused by this ambiguity. ...
Article
Full-text available
In recent years cloud computing has established itself as the computing paradigm that supports most distributed systems, which are essential in mobile communications, such as publish-subscribe (pub/sub) systems or complex event processing (CEP). The cornerstone of cloud computing is elasticity, and today’s autoscaling systems leverage that property by making scaling decisions based on estimates of future workload to satisfy service level agreements (SLAs). However, these autoscaling systems are not generic enough, as the workload definition is application-based. On the other hand, the workload prediction needs to be mapped in terms of SLA parameters, which introduces a double prediction problem. This work presents an empirical study on the relationship between different types of workloads in the literature and their relationship in terms of SLA parameters in the context of mobile communications. In addition, more than 30 prediction models have been trained using different techniques (time series analysis, regression, random forests) to test which ones offer better prediction results of the SLA parameters based on the type of workload and the prediction horizon. Finally, a series of conclusions on the predictive models to be used as a first step towards an autonomous decision system are presented.
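As a minimal illustration of one family of predictors evaluated in such studies, the sketch below fits a sliding-window linear trend to recent workload samples and extrapolates one step ahead. The window size, horizon and data are arbitrary assumptions, not one of the paper's trained models.

```python
# Sliding-window linear-trend forecast of a workload signal.
import numpy as np

def forecast(history, window=12, horizon=1):
    """Predict the workload `horizon` steps ahead from the last `window` samples."""
    y = np.asarray(history[-window:], dtype=float)
    x = np.arange(len(y))
    slope, intercept = np.polyfit(x, y, 1)   # least-squares linear fit
    return slope * (len(y) - 1 + horizon) + intercept

if __name__ == "__main__":
    reqs_per_min = [100, 110, 120, 135, 150, 160, 175, 190, 200, 215, 230, 240]
    print(f"next-minute estimate: {forecast(reqs_per_min):.0f} req/min")
```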
... Choice of data-centre will be made according to the value of config_max. Further, the synthetic workload generator [3] is employed for generating the workload. Multi-tier benchmark applications generated using this generator are deployed across several nodes in the cloud. ...
Article
Full-text available
Cloud computing data centres now carry out a considerable share of computing operations and account for enormous energy consumption, which increases with computing capacity. Reducing operating costs and energy consumption is beneficial both economically and environmentally. Previous work on data-centre energy optimization only involved scheduling jobs between servers based on thermal profiles or workload parameters. Dynamic power management, by shutting down idle data-centre components, was also considered in many models to reduce energy consumption. Further work focused on the energy consumed by the communication fabric. The proposed work focuses on minimizing energy consumption at both the computing servers and the communicating devices. A parameter named config is defined to capture the configuration of the system in its current state. This parameter assists the existing Dynamic Voltage and Frequency Scaling (DVFS) scheme in assigning tasks to virtual machines so as to minimize energy consumption at the computing servers. Moreover, the work extends Data-centre Energy-efficient Network-aware Scheduling (DENS) with a peer-to-peer load balancer to reduce the energy consumed by networking components. The proposed scheduling algorithm for the cloud data centre reduces energy consumption at both the server and the communication-fabric level. Based on the number of energy-consumption samples, a 95% confidence level is achieved. The energy consumed by the proposed P2BED-C model is 1610.22 Wh, while the existing FCFS and Round Robin approaches consumed 1684.32 Wh and 1678.35 Wh, respectively. The results show a considerable improvement in server power utilization, resulting in greater power savings.
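The sketch below conveys the flavour of DVFS-aware energy minimisation with a toy greedy placement; the cubic power model and all numbers are illustrative assumptions, not the P2BED-C algorithm itself.

```python
# Toy greedy scheduler: each task goes to the server whose estimated
# increase in power draw is smallest. Power model and numbers are invented.

SERVERS = [
    {"name": "s1", "idle_w": 100.0, "dyn_w": 200.0, "load": 0.2},
    {"name": "s2", "idle_w": 100.0, "dyn_w": 200.0, "load": 0.7},
]

def power(server, load):
    # DVFS-style assumption: dynamic power grows roughly cubically with
    # the frequency needed to serve the load.
    return server["idle_w"] + server["dyn_w"] * load ** 3

def place(task_util):
    best = min(SERVERS,
               key=lambda s: power(s, s["load"] + task_util) - power(s, s["load"]))
    best["load"] += task_util
    return best["name"]

if __name__ == "__main__":
    for util in (0.1, 0.3, 0.2):
        print("task ->", place(util))
```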
... Secondly, we performed tests to determine the average number of requests that each class of our application servers can handle without violating the SLA. We created workloads using the workload model proposed by Bahga et al. (2011). ...
Conference Paper
Web-based business applications commonly experience user request spikes called flash crowds. Flash crowds in web applications might result in resource failure and/or performance degradation. To alleviate these challenges, this class of applications would benefit from a targeted load balancer and deployment architecture in a multi-cloud environment. We propose a decentralised system that effectively distributes the workload of three-tier web-based business applications using geographical dynamic load balancing to minimise performance degradation and improve response time. Our approach improves a dynamic load distribution algorithm that utilises five carefully selected server metrics to determine the capacity of a server before distributing requests. Our first experiments compared our algorithm with multi-cloud benchmarks. Secondly, we experimentally evaluated our solution on a multi-cloud test-bed comprising one private cloud and two public clouds. Our experimental evaluation imitated flash crowds by sending varying requests using a standard exponential benchmark, and simulated resource failure by shutting down virtual machines in some of our chosen data centres. We then carefully measured the response times of these various scenarios. Our experimental results showed that our solution improved application performance by 6.7% during resource-failure periods, and by 4.08% and 20.05% during flash-crowd situations when compared to the Admission Control and Request Queuing benchmarks, respectively.
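A minimal sketch of capacity-based routing of this kind is given below; the five metrics and their weights are assumptions for illustration, not the metrics selected in the paper.

```python
# Score each server from a few normalised load metrics and route the
# request to the highest-scoring (least loaded) one.

WEIGHTS = {"cpu": 0.3, "memory": 0.2, "connections": 0.2,
           "response_time": 0.2, "bandwidth": 0.1}

def capacity_score(metrics):
    """Metrics normalised to [0, 1], where higher means more loaded."""
    return 1.0 - sum(WEIGHTS[k] * metrics[k] for k in WEIGHTS)

def route(servers):
    return max(servers, key=lambda s: capacity_score(s["metrics"]))["name"]

if __name__ == "__main__":
    servers = [
        {"name": "private-dc", "metrics": {"cpu": 0.9, "memory": 0.6,
         "connections": 0.8, "response_time": 0.7, "bandwidth": 0.5}},
        {"name": "public-dc1", "metrics": {"cpu": 0.3, "memory": 0.4,
         "connections": 0.2, "response_time": 0.3, "bandwidth": 0.4}},
    ]
    print("route to:", route(servers))
```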
... These factors make it difficult to design such models and generators fitting different workload types and attributes. In the current state of the art, effort is instead deployed to design specialized workload modeling techniques focusing mainly on specific user profiles and application types [1,[3][4][5][6]. ...
... Since job execution times are not considered, this evaluation incurs a "drift" in the rate of resource usage, since the system keeps processing jobs stacked in its queue while new jobs arrive. Dynamic workload evaluation such as in [1,6] overcomes this issue by adding a probabilistic and/or distribution-based (e.g., normal distribution, exponential distribution) approach to job arrivals and execution times, which provides a more accurate representation of the impact of workload types and attributes on system resources (see the sketch after these excerpts). ...
... This type of optimization, on the other hand, focuses on long-term analysis of predictable workloads and scheduled tasks, rather than on quick bursts in demand for a specific application in a nonpredictable manner. The performance evaluation of Web and cloud applications ( [1,5,6]) is another popular domain worthy of consideration. Among other things, this domain usually involves the evaluation of user behavior, which is less prevalent in other domains. ...
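The sketch referenced above shows how exponentially distributed inter-arrival and execution times yield such a probabilistic workload; the rates are arbitrary illustration values.

```python
# Exponential inter-arrival times (a Poisson process) and exponential
# service times for a stream of synthetic jobs.
import random

def jobs(n, arrival_rate=2.0, service_rate=1.5, seed=7):
    """Yield (arrival_time, execution_time) for n synthetic jobs."""
    rng = random.Random(seed)
    t = 0.0
    for _ in range(n):
        t += rng.expovariate(arrival_rate)      # gap to the next arrival
        yield t, rng.expovariate(service_rate)  # job execution time

if __name__ == "__main__":
    for arrival, exec_time in jobs(5):
        print(f"arrival={arrival:6.2f}s  exec={exec_time:5.2f}s")
```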
Article
Full-text available
Workload models are typically built based on user and application behavior in a system, limiting them to specific domains. Undoubtedly, such a practice creates a dilemma in a cloud computing (cloud) environment, where a wide range of heterogeneous applications are running and many users have access to these resources. The workload model in such an infrastructure must adapt to the evolution of the system configuration parameters, such as job load fluctuation. The aim of this work is to propose an approach that generates generic workload models that (1) are independent of user behavior and of the applications running in the system, and can fit any workload domain and type, (2) model the sharp workload variations that are most likely to appear in cloud environments, and (3) fit observed data with a high degree of fidelity within a short execution time. We propose two approaches for workload estimation: the first combines the Hull-White model with a Genetic Algorithm (GA), while the second combines Support Vector Regression (SVR) with a Kalman filter. Thorough experiments are conducted on real CPU and throughput datasets from virtualized IP Multimedia Subsystem (IMS), Web and cloud environments to study the efficiency of both propositions. The results show a higher accuracy for the Hull-White-GA approach with marginal overhead over the SVR-Kalman-Filter combination.
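As a rough sketch of the second combination, the code below smooths noisy CPU samples with a 1-D Kalman filter and fits an SVR on the smoothed series to predict the next value. The hyperparameters and noise variances are illustrative assumptions, not the paper's tuned values.

```python
# SVR on Kalman-smoothed samples: a toy version of an SVR + Kalman-filter
# workload estimator. All parameters are illustrative assumptions.
import numpy as np
from sklearn.svm import SVR

def kalman_smooth(samples, q=1e-3, r=0.5):
    """Constant-level model: x_k = x_{k-1} + w, z_k = x_k + v."""
    x, p, out = samples[0], 1.0, []
    for z in samples:
        p += q                      # predict
        k = p / (p + r)             # Kalman gain
        x += k * (z - x)            # update with measurement z
        p *= (1 - k)
        out.append(x)
    return np.array(out)

def predict_next(samples, lag=4):
    smoothed = kalman_smooth(samples)
    X = np.array([smoothed[i:i + lag] for i in range(len(smoothed) - lag)])
    y = smoothed[lag:]
    model = SVR(kernel="rbf", C=10.0).fit(X, y)
    return model.predict(smoothed[-lag:].reshape(1, -1))[0]

if __name__ == "__main__":
    cpu = [20, 22, 25, 24, 30, 35, 33, 40, 46, 50, 55, 53, 60, 66]
    print(f"next CPU sample estimate: {predict_next(cpu):.1f}%")
```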
... Time between two sequential requests of an emulated client. A thorough discussion and experiments are presented in [43]. ...
Article
Full-text available
We present robust dynamic resource allocation mechanisms to allocate application resources meeting Service Level Objectives (SLOs) agreed between cloud providers and customers. Two filter-based robust controllers are proposed: an $\mathcal{H}_\infty$ filter and a Maximum Correntropy Criterion Kalman filter (MCC-KF). The controllers are self-adaptive, with process noise variances and covariances calculated using previous measurements within a time window. In the allocation process, a bounded client mean response time (mRT) is maintained. Both controllers are deployed and evaluated on an experimental testbed hosting the RUBiS (Rice University Bidding System) auction benchmark web site. The proposed controllers offer improved performance under abrupt workload changes, shown via rigorous comparison with the current state of the art. On our experimental setup, the Single-Input-Single-Output (SISO) controllers can operate on the same server where the resource allocation is performed, while the Multi-Input-Multi-Output (MIMO) controllers run on a separate server where all the data are collected for decision making. SISO controllers make decisions that do not depend on other system states (servers), whereas MIMO controllers are characterized by increased communication overhead and potential delays. While SISO controllers offer improved performance over MIMO ones, the latter enable a more informed decision-making framework for the resource allocation problem of multi-tier applications.
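The self-adaptive idea, estimating noise statistics from a sliding time window of measurements, can be sketched with a toy 1-D filter as below; this is an assumption-laden illustration, not the paper's $\mathcal{H}_\infty$ or MCC-KF controller.

```python
# Estimate measurement-noise variance from a sliding window of recent
# response-time samples and feed it into a simple 1-D Kalman update.
from collections import deque
import statistics

class AdaptiveFilter:
    def __init__(self, window=10, process_var=1e-3):
        self.window = deque(maxlen=window)
        self.q = process_var
        self.x = None        # state estimate (mean response time)
        self.p = 1.0         # estimate variance

    def update(self, z):
        self.window.append(z)
        r = statistics.pvariance(self.window) if len(self.window) > 1 else 1.0
        if self.x is None:
            self.x = z
            return self.x
        self.p += self.q
        k = self.p / (self.p + r)   # gain shrinks when the window is noisy
        self.x += k * (z - self.x)
        self.p *= (1 - k)
        return self.x

if __name__ == "__main__":
    f = AdaptiveFilter()
    for z in [120, 118, 125, 400, 130, 128, 122]:   # ms, with one outlier
        print(f"measured {z:3d} ms -> estimate {f.update(z):6.1f} ms")
```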
... It supports multiple generation strategies and can be extended via user-defined requests. In [16], the authors propose a framework to analyze and extract the characteristics of multi-tenant cloud platforms; they then build some basic workload elements and finally generate the required workload using a specification language. In [17] the authors develop an extension to the cloud simulation program CloudSim. ...
... Specifically, in Cloud computing, some relevant approaches are based on task resource consumption patterns (Mishra et al. 2010) and the usage of storage systems (Aggarwal, Phadke, and Bhandarkar 2010). The analysis of behaviour patterns and derived models has been discussed in (Bahga and Madisetti 2011; Chen et al. 2010; Smith and Sommerville 2011). Yang et al. (2012) present the principal component analysis (PCA) technique used to retrieve relations between configuration, resource usage and performance in Cloud computing. ...
Article
This paper discusses the design of a Digital Twin (DT) demonstrator for Smart Manufacturing, following an open source approach for implementation. Open source technology can comprise software, hardware and hybrid solutions that nowadays drive Smart Manufacturing. The major potential of open source technology in Smart Manufacturing lies in enabling interoperability and in reducing the capital costs of designing and implementing new manufacturing solutions. After presenting our motivation to adopt an open source approach for the design of a DT demonstrator, we identify the major implementation requirements of Smart Cyber Physical Systems (CPSs) and DTs. A conceptualisation of the core components of a DT demonstrator is provided and three technology building blocks for the realisation of a DT have been identified. These technology building blocks include components for the management of data, models and services. From the conceptual model of the DT demonstrator, we derived a high-level micro-services architecture and provided a case study infrastructure for the implementation of the DT demonstrator based on available open source technologies. The paper closes with research questions to be addressed in the future.
... Google has demonstrated that an energy saving of 40% can be achieved by implementing machine learning to manage its data center [22]. Many researchers investigating data center management utilize synthetic data to represent workloads for their simulated data centers [23] and for predicting cloud computing resources [24]. This allows the data centers to be simulated in a wider range of scenarios than would otherwise be possible. ...
... It is hoped that the strengths of OTF synthetic data generation can benefit researchers in a broad range of domains and disciplines. There are multiple problems that OTF can have a positive impact, including generating: stock market data [14], patient data [26], electrical load data [20], cloud data [23], etc. These are just a small sample of the problems that the OTF framework is applicable to. ...
Preprint
Full-text available
Collecting, analyzing and gaining insight from large volumes of data is now the norm in an ever-increasing number of industries. Data analytics techniques, such as machine learning, are powerful tools used to analyze these large volumes of data. Synthetic data sets are routinely relied upon to train and develop such data analytics methods for several reasons: to generate larger data sets than are available, to generate diverse data sets, to preserve anonymity in data sets with sensitive information, etc. Processing, transmitting and storing data are key issues faced when handling large data sets. This paper presents an "On the fly" framework for generating big synthetic data sets, suitable for these data analytics methods, that is both computationally efficient and applicable to a diverse set of problems. An example application of the proposed framework is presented along with a mathematical analysis of its computational efficiency, demonstrating its effectiveness.
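A minimal sketch of the on-the-fly idea, under the assumption of a trivial record schema, is a generator that produces records only as they are consumed, so an arbitrarily large synthetic data set never has to be stored or transmitted in full.

```python
# Stream synthetic records lazily instead of materializing the data set.
import random

def synthetic_records(seed=0):
    rng = random.Random(seed)        # fixed seed => reproducible stream
    i = 0
    while True:
        yield {"id": i, "value": rng.gauss(0.0, 1.0)}
        i += 1

if __name__ == "__main__":
    stream = synthetic_records()
    for record in (next(stream) for _ in range(3)):
        print(record)
```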