Cloud System Architecture

Source publication
Conference Paper
Full-text available
Performance and availability are key aspects to evaluate the quality of cloud computing systems. The assessment of these systems should consider the effects of queuing and failure/recovery behavior of data center subsystems and disaster occurrences. Additionally, penalties may be applied if the defined quality level of SLA contracts is not satisfied...
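The failure/recovery and SLA-penalty relationship mentioned in the abstract can be illustrated with a short sketch. The Python snippet below is a minimal illustration, not the paper's model: it computes steady-state availability from assumed MTTF/MTTR values and estimates a penalty for downtime exceeding a hypothetical SLA target.

```python
# Minimal sketch (illustrative only): steady-state availability and an
# SLA-penalty estimate from hypothetical failure/recovery parameters.

HOURS_PER_YEAR = 8760

def availability(mttf_h: float, mttr_h: float) -> float:
    """Steady-state availability: A = MTTF / (MTTF + MTTR)."""
    return mttf_h / (mttf_h + mttr_h)

def annual_downtime_h(a: float) -> float:
    """Expected downtime per year for availability a."""
    return (1.0 - a) * HOURS_PER_YEAR

def sla_penalty(a: float, sla_target: float, penalty_per_h: float) -> float:
    """Penalty applies only to downtime exceeding what the SLA allows."""
    excess_h = max(0.0, annual_downtime_h(a) - annual_downtime_h(sla_target))
    return excess_h * penalty_per_h

a = availability(mttf_h=1200.0, mttr_h=2.0)          # hypothetical values
print(f"A = {a:.6f}, downtime = {annual_downtime_h(a):.2f} h/year")
print(f"penalty = ${sla_penalty(a, 0.999, penalty_per_h=500.0):.2f}")
```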

Context in source publication

Context 1
... section presents a case study to illustrate the importance of software tools for helping cloud computing designers to estimate performability metrics. Figure 8 presents a cloud architecture located in Brazil, composed of a data center in Recife (Data Center 1), another in Rio de Janeiro (Data Center 2), and a backup server in São Paulo. Each data center consists of two physical machines, and each machine is capable of running up to two virtual machines. ...
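As a rough illustration of how such a topology maps onto a reliability block diagram, the sketch below combines component availabilities in series and parallel. The case study itself uses SPN/RBD tooling and measured parameters; the availability values here are hypothetical placeholders.

```python
# RBD-style sketch of the case-study topology (illustrative only; the
# component availabilities below are hypothetical, not the paper's data).
from math import prod

def parallel(avails):
    """Redundant components: the system is down only if all are down."""
    return 1.0 - prod(1.0 - a for a in avails)

def series(avails):
    """Components that must all work: availabilities multiply."""
    return prod(avails)

A_PM = 0.995   # assumed physical-machine availability
A_VM = 0.999   # assumed virtual-machine availability

# A PM delivers service if it is up and at least one of its two VMs is up.
a_pm_stack = series([A_PM, parallel([A_VM, A_VM])])
# Each data center has two such PMs in parallel; the two DCs are redundant too.
a_dc = parallel([a_pm_stack, a_pm_stack])
a_cloud = parallel([a_dc, a_dc])

print(f"per-DC availability: {a_dc:.8f}")
print(f"two-DC availability: {a_cloud:.12f}")
```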

Similar publications

Conference Paper
Full-text available
Concurrent Kleene algebras support equational reasoning about computing systems with concurrent behaviours. Their natural semantics is given by series(-parallel) rational pomset languages, a standard true concurrency semantics, which is often associated with processes of Petri nets. We use constructions on Petri nets to provide two decision procedu...
Article
Full-text available
Petri nets and B-Method represent a pair of formal methods for computer systems engineering with interesting complementary features. Petri nets have a nice graphical representation and valuable analytical properties, and can express concurrency. B-Method supports verified software development. To benefit from this complementarity, a mapping from Petri nets t...
Article
Full-text available
This dissertation presents theoretical and practical results on computer system reliability and availability growth modeling. Two kinds of system behavior characterizations are considered: first, with respect to time, and second, with respect to the number of executions performed. The dissertation is centered on two reliability growth models: the c...
Article
Full-text available
Modern port terminals are equipped with various local transport systems, whose main task is to transport cargo between local storehouses and transport resources (ships, trains, trucks) in the fastest and most efficient way, and at the lowest possible cost. These local transport systems consist of fully automated transport units (AGV- automati...
Article
Full-text available
Performance evaluation of cloud computing systems studies the relationships among system configuration, system load, and performance indicators. However, such evaluation is not feasible through measurement or simulation methods, due to properties of cloud computing such as large scale, diversity, and dynamics. To overcome those chal...

Citations

... Similarly, Silva et al. (2014) developed a tool that adopts Reliability Block Diagrams (RBDs) and SPNs to analyse geographically distributed DCs. ...
Article
System outages can have disastrous effects on businesses, such as data loss, customer dissatisfaction, and subsequent revenue loss. Disaster recovery (DR) solutions have been adopted by companies to minimise the effects of these outages. However, the selection of an optimal DR solution is difficult, since there is no single solution that suits the requirements of every company (e.g., availability and costs). In this paper, we propose an integrated model-experiment approach to evaluate DR solutions. We perform experiments on different real-world DR solutions and propose analytic models to evaluate these solutions regarding DR key-metrics: steady-state availability, recovery time objective (RTO), recovery point objective (RPO), downtime, and costs. The results reveal that DR solutions can significantly improve availability and minimise costs. Also, a sensitivity analysis identifies the parameters that most affect the RPO and RTO of the adopted DR solutions.
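To make the availability/cost trade-off concrete, the sketch below estimates how a DR solution might change disaster-related downtime and whether its cost pays off. This is not the paper's analytic model; the disaster frequency, recovery times, and prices are hypothetical placeholders.

```python
# Sketch: effect of a DR solution on disaster-related downtime and cost
# (frequency, recovery times, and prices below are hypothetical).

disasters_per_year = 0.1        # assumed long-run disaster frequency

def annual_disaster_downtime_h(recovery_h: float) -> float:
    return disasters_per_year * recovery_h

without_dr_h = annual_disaster_downtime_h(recovery_h=72.0)  # rebuild from scratch
with_dr_h    = annual_disaster_downtime_h(recovery_h=1.0)   # failover within RTO

dr_cost_per_year = 12_000.0     # hypothetical DR subscription cost
revenue_loss_per_h = 4_000.0    # hypothetical cost of one hour of outage

saving = (without_dr_h - with_dr_h) * revenue_loss_per_h - dr_cost_per_year
print(f"downtime without DR: {without_dr_h:.1f} h/year, with DR: {with_dr_h:.2f} h/year")
print(f"net annual benefit of adopting DR: ${saving:,.2f}")
```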
... It is worth mentioning that to define this architecture, we use the work of Silva et al. [30]. However, unlike the authors, we consider the network link flow as an essential factor in the data transfer time for distributed redundancy purposes. ...
... To accomplish the first step of this research methodology (Get Component's MTTF and MTTR metrics), we used the data obtained in the work of Silva et al. [30], according to Table 2. ...
Article
Full-text available
The increasing number of companies migrating their IT infrastructure to cloud environments has motivated many studies on distributed backup strategies to improve the availability of these companies' systems. In this scenario, it is essential to study mechanisms that evaluate network conditions in order to minimize transmission time and thus improve system availability. The goal of this study is to build models to evaluate the availability of services running in a cloud data center infrastructure, emphasizing the impact of throughput variation on data redundancy and, consequently, on service availability. Based on this, the research proposes smart models that can be deployed in each data center of a distributed arrangement of data centers and help the system administrator choose the best data center to restore the services of a faulty one. To analyze the impact of network throughput on service availability, we gathered the MTTF and MTTR metrics of the data center's components and services, generated a reliability block diagram to obtain the MTTF of the system as a whole, and developed a formalism to model the network component. Based on the results, we built an SPN model to represent the system and obtain its availability under many network conditions. We then analyzed the system's availability to discuss the impact of network conditions on it. A plot of the annual downtime over a year makes the considerable impact of network conditions on the system's availability evident. Using the models developed to study system availability, we developed smart agents capable of predicting the transfer time of a bulk of data and, with it, choosing the data center with the best network conditions to restore the services of a faulty one.
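The data-center selection step described above reduces to estimating a bulk-transfer time per candidate and picking the minimum. The sketch below shows that idea only; the data-center names, throughput figures, and backup size are hypothetical, not the paper's agents or measurements.

```python
# Sketch: pick the surviving data center whose measured network throughput
# minimizes the predicted time to transfer the backup data (all values
# below are hypothetical placeholders).

def transfer_time_s(data_gb: float, throughput_mbps: float) -> float:
    """Bulk-transfer estimate: size divided by effective throughput."""
    return (data_gb * 8_000.0) / throughput_mbps  # GB -> megabits

candidates_mbps = {            # hypothetical measured throughputs (Mbps)
    "dc-recife": 850.0,
    "dc-rio": 430.0,
    "dc-saopaulo": 610.0,
}
data_to_restore_gb = 120.0     # hypothetical size of the VM backups

for dc, mbps in candidates_mbps.items():
    print(f"{dc}: {transfer_time_s(data_to_restore_gb, mbps) / 60:.1f} min")

best = min(candidates_mbps, key=lambda dc: transfer_time_s(data_to_restore_gb,
                                                           candidates_mbps[dc]))
print("restore target:", best)
```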
... Araujo et al. [3] presented an approach for analyzing cloud infrastructures based on stochastic models and used a multiple-criteria method to compute and rank dependability-related metrics (availability, capacity-oriented availability, and costs). Silva et al. [4] and Nguyen et al. [5] presented works evaluating Infrastructure-as-a-Service (IaaS) systems deployed in geographically distributed data centers (DCs). These studies analyzed the transmission of virtual machines (VMs) across DCs, using stochastic modeling as a strategy to improve applications' performability and availability. ...
... Model-based evaluation techniques such as Markov chains and Petri nets have been employed to analyze different characteristics of systems. For instance, Petri nets [11] are a widespread formalism for evaluating systems with a focus on dependability, concurrency, or performance [5,12,4]. Petri net models are based on states (places) and activities (transitions). ...
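A minimal example of the formalism: a single component modeled as an SPN with two places (Up, Down) and two exponentially timed transitions (fail, repair), evaluated by a simple token-game simulation. The rates below are hypothetical, and real tools solve such nets analytically or with far more elaborate simulators.

```python
# Token-game sketch of a two-place failure/repair SPN with exponential
# transition delays (rates are hypothetical placeholders).
import random

def simulate_availability(fail_rate, repair_rate, horizon_h, seed=42):
    rng = random.Random(seed)
    t, up_time, marking = 0.0, 0.0, "Up"          # one token in place Up
    while t < horizon_h:
        if marking == "Up":
            delay = rng.expovariate(fail_rate)    # timed transition 'fail'
            up_time += min(delay, horizon_h - t)
            marking = "Down"
        else:
            delay = rng.expovariate(repair_rate)  # timed transition 'repair'
            marking = "Up"
        t += delay
    return up_time / horizon_h

mttf_h, mttr_h = 1200.0, 2.0
print(f"simulated A ≈ {simulate_availability(1/mttf_h, 1/mttr_h, 1e7):.6f}")
print(f"analytic  A = {mttf_h / (mttf_h + mttr_h):.6f}")   # sanity check
```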
Conference Paper
The consequences for a company of losing its data or having its IT system disrupted are severe and can negatively impact business operations. It can also cause customer dissatisfaction and subsequent revenue loss. In a competitive global market, companies have been adopting disaster recovery (DR) strategies as an attempt to keep IT systems operational, prevent data loss, and ensure business continuity. However, there is no single DR strategy that meets the requirements of every business (e.g., availability and cost). Besides, most of the time, these requirements are conflicting. Therefore, efficient and accurate analysis of DR strategies before their deployment is crucial to choose the strategy that best suits a company's needs and budget. In this paper, we propose the adoption of a multiple-criteria decision-making (MCDM) method and stochastic models to evaluate and rank DR strategies for IT infrastructures. The stochastic models are used for quantitatively assessing distinct DR strategies regarding five DR key metrics: availability, downtime, Recovery Time Objective (RTO), Recovery Point Objective (RPO), and cost. We also use an MCDM method to rank the strategies according to multiple criteria (e.g., availability maximization and cost minimization). A case study demonstrates the feasibility and usefulness of the proposed approach for finding the best DR strategies according to multiple criteria.
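One common way to realize such an MCDM ranking is to min-max normalize each metric and order the strategies by Euclidean distance to the ideal point. The sketch below shows that mechanic only; the three strategies, their metric values, and the unweighted distance are hypothetical illustrations, not the paper's case-study data or its exact method.

```python
# Sketch: rank DR strategies by Euclidean distance to the ideal point after
# min-max normalization (all strategies and values are hypothetical).
from math import sqrt

strategies = {
    "hot-standby":  {"availability": 0.99999, "downtime_h": 0.09, "rto_h": 0.2,
                     "rpo_h": 0.0, "cost": 90.0},
    "warm-standby": {"availability": 0.9999,  "downtime_h": 0.88, "rto_h": 1.5,
                     "rpo_h": 0.5, "cost": 45.0},
    "cold-standby": {"availability": 0.999,   "downtime_h": 8.76, "rto_h": 12.0,
                     "rpo_h": 24.0, "cost": 15.0},
}
higher_is_better = {"availability": True, "downtime_h": False,
                    "rto_h": False, "rpo_h": False, "cost": False}

def normalized(metric, value):
    vals = [s[metric] for s in strategies.values()]
    lo, hi = min(vals), max(vals)
    x = (value - lo) / (hi - lo) if hi > lo else 1.0
    return x if higher_is_better[metric] else 1.0 - x   # 1.0 is always best

def distance_to_ideal(s):
    return sqrt(sum((1.0 - normalized(m, s[m])) ** 2 for m in higher_is_better))

for name, s in sorted(strategies.items(), key=lambda kv: distance_to_ideal(kv[1])):
    print(f"{name}: d = {distance_to_ideal(s):.4f}")
```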
... The authors in [6] present performance models for evaluating cloud computing systems distributed across several data centers, taking disaster occurrence into account. That work presents an approach to evaluate performability in IaaS systems deployed in geographically distributed data centers. ...
... Considering the Euclidean distance results for both maintenance policies, the lowest value, 0.8259, is obtained for SLA 3 with one corrective maintenance team. It is worth noting that the level-three contract with only one preventive maintenance team presents very close results, with a distance of 0.8481. ...
Article
Full-text available
Due to the growth of cloud computing, the data center environment has grown in importance and use. Data centers are responsible for maintaining and processing several critical applications. Therefore, data center infrastructures must be evaluated in order to achieve the high availability and reliability demanded of such environments. This work adopts Stochastic Petri Nets (SPN) to evaluate the impact of maintenance policies on data center dependability. The main goal is to analyze maintenance policies associated with SLA contracts and to propose improvements. To accomplish this, an optimization strategy based on Euclidean distance is adopted to indicate the most appropriate solution under conflicting requirements (e.g., cost and availability). To illustrate the applicability of the proposed models and approach, this work presents case studies comparing different SLA contracts and maintenance policies (preventive and corrective) applied to data center electrical infrastructures.
... Numerical analysis indicated that by using redundant components, the system reduces the probability of failure and the number of lost requests. Nguyen et al. [8] and Silva et al. [12] presented studies on disaster-tolerant data centers (DCs). In the proposed solutions, they adopted an automatic backup mechanism to store the virtual machines' (VMs) data and, in case of a disaster, a mechanism to restore the VMs in another DC that remains operational. ...
Conference Paper
Systems unavailability may produce severe consequences for modern business, such as data loss, customer dissatisfaction, and subsequent revenue loss. Disaster recovery (DR) solutions have been adopted by many organizations as an attempt to prevent data loss and ensure business continuity. With the expansion of cloud computing, different cloud providers have been offering low-cost solutions for DR purposes, such as Backup-as-a-Service (BaaS). Therefore, in this paper, we present an integrated model-experiment approach to evaluate a BaaS environment for DR purposes. We use analytic models and fault-injection experiments to evaluate DR key-metrics such as availability, downtime, Recovery Time Objective (RTO), and Recovery Point Objective (RPO) in a real-world BaaS environment. The results revealed that the environment availability can vary according to the amount of data to be backed up and restored. Besides, a sensitivity analysis shows that the RTO and RPO are mainly influenced by the mean time to recover from a disaster and the backup interval, respectively.
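The sensitivity result quoted above has a simple back-of-the-envelope form: worst-case RPO tracks the backup interval, while RTO is dominated by detection time plus the bulk-restore transfer. The sketch below encodes those two relations with hypothetical parameter values; it is not the paper's analytic model.

```python
# Sketch: first-order estimates of the two DR key metrics
# (all parameter values below are hypothetical placeholders).

def worst_case_rpo_h(backup_interval_h: float) -> float:
    """At worst, all data written since the last completed backup is lost."""
    return backup_interval_h

def estimated_rto_h(data_gb: float, restore_mbps: float,
                    detect_and_switch_h: float) -> float:
    """RTO ~ detection/decision time plus the bulk-restore transfer time."""
    restore_h = (data_gb * 8_000.0) / restore_mbps / 3600.0  # GB -> Mb -> h
    return detect_and_switch_h + restore_h

for interval_h in (1, 6, 24):
    print(f"backup every {interval_h:>2} h -> worst-case RPO = {worst_case_rpo_h(interval_h)} h")
print(f"RTO ≈ {estimated_rto_h(data_gb=200, restore_mbps=400, detect_and_switch_h=0.5):.2f} h")
```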
... However, nowadays they are vital to ensure business operation [35]. Moreover, losing data is expensive for companies because it can severely impact company revenue, for instance, through interruption of business transactions, SLA violations, or loss of confidence [33,36,39,40]. In this context, one key challenge is to define a low-cost and yet effective business continuity strategy [49]. ...
... However, this strategy tends to be costly due to the secondary infrastructure necessary to keep IT systems operating in case of a disaster. For this reason, some papers have discussed the adoption of warm-standby and cold-standby models for this strategy as a way to reduce cost [35,39,70,74]. Considering the three site types, hot-standby tends to have a higher cost than warm-standby and cold-standby, but it can present better results in case of a disaster [74]. ...
... Stochastic Petri nets (SPNs) and Markov chains were each used in three primary studies. Both are state-based models widely employed to evaluate different types of complex systems, including IT infrastructures [34,39]. In the primary studies, these approaches were adopted to propose models that evaluate DR solutions regarding RTO, RPO, availability, performability, and costs. ...
Article
Context: Organizations are spending an unprecedented amount of money on keeping Information Technology (IT) systems operational. Hence, these systems need to be designed using effective fault-tolerant techniques like Disaster Recovery (DR) solutions. Even though research has been done in the DR field, it is necessary to assess the current state of research and practice to provide practitioners with evidence that helps foster its further development. Objective: This paper has the following goals: to investigate state-of-the-art solutions for DR, to systematically analyze the current published research, and to identify the different strategies available in the literature. Method: A systematic mapping study was conducted, in which 49 studies, dated from 2007 to 2017, were evaluated. Results: Various DR practices are being investigated. The results identified a number of relevant issues, including reasons to adopt DR solutions, strategies used to implement DR solutions, approaches employed to analyze DR solutions, and metrics considered during the analyses of DR solutions. Conclusion: The number of strategies and reasons for adopting DR solutions is overwhelming. Hence, there was a need to provide a consolidated view of the field. The results can also help direct future research efforts in this critical area.
... This section presents a high-level model to represent the proposed architecture. • Vp represents the finite set of VMs assigned to the PM at cloud system start-up; ...
Article
Because of the dependence on Internet‐based services, many efforts have been conceived to mitigate the impact of disasters on service provision. In this context, cloud computing has become an interesting alternative for implementing disaster tolerant services due to its resource on‐demand and pay‐as‐you‐go models. This paper proposes a sensitivity analysis approach to assess the parameters that most impact the availability of cloud data centers, taking into account disaster occurrence, hardware and software failures, and disaster recovery mechanisms for cloud systems. The analysis adopts continuous‐time Markov chains, and the results indicate that disaster issues should not be neglected. Hardware failure rate and time for migration of virtual machines (VMs) are the critical factors pointed out for the system modeled in our analysis. Moreover, the location where data centers are placed has a significant impact on system availability, due to the time for migrating VMs from a backup server.
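The CTMC machinery behind such an analysis fits in a few lines: build the infinitesimal generator Q, solve πQ = 0 with Σπ = 1, and read availability off the UP-state probability. The three-state chain and all rates below are hypothetical placeholders, not the chains or parameters used in the paper.

```python
# Sketch: steady-state availability of a tiny CTMC with states
# {UP, FAILED, DISASTER}; the generator and rates are hypothetical.
import numpy as np

lam   = 1 / 1000.0    # failure rate (per hour)
mu    = 1 / 2.0       # repair rate
delta = 1 / 87600.0   # disaster rate (~once per ten years)
rho   = 1 / 48.0      # disaster-recovery (VM migration/restore) rate

# States: 0 = UP, 1 = FAILED, 2 = DISASTER
Q = np.array([
    [-(lam + delta), lam,  delta],
    [mu,             -mu,  0.0 ],
    [rho,            0.0,  -rho],
])

# Solve pi @ Q = 0 with sum(pi) = 1 by replacing one balance equation.
A = np.vstack([Q.T[:-1], np.ones(3)])
b = np.array([0.0, 0.0, 1.0])
pi = np.linalg.solve(A, b)

print(f"steady-state availability: {pi[0]:.8f}")
print(f"expected downtime: {(1 - pi[0]) * 8760:.2f} h/year")
```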
... The proposed environment offers useful features that are not easily found in other tools, such as: more than 25 probability distributions supported in SPN simulation, solution of RBDs through simulation, sensitivity analysis of CTMC and RBD models, computation of reliability importance indices, and moment matching of empirical data. The tool's web page [1] lists the papers (e.g., [6]) that adopted Mercury as their evaluation engine. ...
... The proposed environment has been adopted by the MODCS Research Group. The tool's web page [1] lists the papers (e.g., [6]) that adopted Mercury as their evaluation engine. ...
Presentation
Full-text available
The evaluation of the dependability or performance of general systems is not a trivial task. Therefore, the assistance of software tools to obtain the desired metrics is of utmost importance. This paper introduces the Mercury environment, an integrated software tool that enables creating and evaluating Reliability Block Diagrams, Stochastic Petri Nets, Continuous Time Markov Chains, and Energy Flow Models. Mercury provides a graphical user interface for these modeling formalisms and a script language that allows using it through a command-line interface and integrating it with external applications. The set of features available in the Mercury tool makes it helpful for the dependability and performance evaluation of various systems in both academic and industrial scenarios.