Figure 2 - uploaded by Aleksander Fabijan
The "Xbox deals" experiment. 

The "Xbox deals" experiment. 

Source publication
Conference Paper
Full-text available
Software development companies increasingly aim to become data-driven by continuously experimenting with the products their customers use. Although familiar with the competitive edge that A/B testing delivers, they seldom succeed in evolving and adopting the methodology. In this paper, and based on an exhaustive and...

Context in source publication

Context 1
... one of the experiments, a product team at Xbox aimed to identify whether showing prices (the original price and the discount) in the weekly deals stripe, and using algorithmic rather than editorial ordering of the items in the stripe, impacts engagement and purchases. They experimented with two different variants. In Figure 2, we illustrate the experiment control (A) and both treatments (B, C). At Xbox, instrumentation is well established and a reliable pipeline for data collection exists. Metrics that measure user engagement and purchases are established and consist of a combination of different signals from the logs, aggregated per user, session, and other analysis units. In contrast to the Office Word experiment above, the Xbox team set up their experiments autonomously; however, they still required assistance with the execution and monitoring of the experiment and, at the analysis stage, with interpreting the results. The two-week experiment showed that, compared to control, treatment B decreased engagement with the stripe. Purchases, however, did not decrease. By showing prices upfront, treatment B provided a better user experience, engaging the users interested in a purchase and sparing a click for those not interested. Treatment C provided even greater benefit, increasing both engagement with the stripe and purchases made. In this experiment the team learned that (1) showing prices upfront results in a better user experience, and (2) algorithmic ordering of deals beats manual editorial ...
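The design described above — deterministic assignment of users to a control and two treatments, plus metrics aggregated per user from log signals — can be sketched in a few lines. This is a minimal illustration, not the Xbox team's actual pipeline; all names (assign_variant, aggregate_metrics, the event schema) are assumptions.

```python
import hashlib
from collections import defaultdict

# Control (A), prices upfront (B), prices + algorithmic ordering (C)
VARIANTS = ["A", "B", "C"]

def assign_variant(user_id: str, experiment: str) -> str:
    """Deterministically hash a user into one of the variants."""
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    return VARIANTS[int(digest, 16) % len(VARIANTS)]

def aggregate_metrics(log_events):
    """Aggregate raw log signals into per-user engagement and purchase counts."""
    per_user = defaultdict(lambda: {"stripe_clicks": 0, "purchases": 0})
    for event in log_events:
        counters = per_user[event["user_id"]]
        if event["type"] == "stripe_click":
            counters["stripe_clicks"] += 1
        elif event["type"] == "purchase":
            counters["purchases"] += 1
    return per_user
```

Hashing on a combined experiment-and-user key keeps assignment stable across sessions, which is what makes per-user aggregation over the two-week window meaningful.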

Similar publications

Article
Full-text available
Interspecific interactions are contingent upon organism phenotypes, and thus phenotypic evolution can modify interspecific interactions and affect ecological dynamics. Recent studies have suggested that male–male competition within a species selects for capability to reproductively interfere with a closely related species. Here, we examine the effe...

Citations

... The goal of an RCE is to simplify the initial adoption of A/B testing for software teams and accelerate their transition from running zero experiments to the Crawl and Walk phases of the Experimentation Evolution Model [12], [13] by covering the technical, organizational, and business experimentation needs of a company willing to adopt data-driven experimentation. Thus, a development team can import and reuse a pre-implemented controlled experiment in a plug-and-play manner, similar to how libraries and frameworks can be imported and used in an existing project. ...
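The plug-and-play idea might look like the following sketch: a component that ships with variant assignment and event logging built in, so a team can drop it into an existing project like a library. The class and its interface are hypothetical illustrations of the RCE concept, not the cited authors' implementation.

```python
import random

class ReusableExperiment:
    """A pre-implemented controlled experiment bundled as a reusable component."""

    def __init__(self, name: str, variants: list[str]):
        self.name = name
        self.variants = variants
        self.events = []  # built-in event log

    def assign(self, user_id: str) -> str:
        """Deterministic per-user variant assignment, seeded on experiment + user."""
        rng = random.Random(f"{self.name}:{user_id}")
        return rng.choice(self.variants)

    def log(self, user_id: str, event: str) -> None:
        """Record an event together with the user's assigned variant."""
        self.events.append(
            {"user": user_id, "variant": self.assign(user_id), "event": event}
        )
```

A team adopting it would only instantiate the component and call log() from its existing code paths, which is the "import and reuse" step the excerpt describes.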
... A software team can practice A/B testing at different levels of sophistication, ranging from occasional ad-hoc experimentation to continuous, structured experimentation at scale, and this progression is the journey a company has to take to become truly data-driven. Fabijan et al. [12] explored the phases of this journey and described it through the Experimentation Evolution Model (EEM). EEM outlines (Fig. 2) that a company typically undergoes four phases, namely Crawl, Walk, Run, and Fly, and has to evolve the technical, organizational, and business aspects of its operations. ...
... EEM outlines (Fig. 2) that a company typically undergoes four phases, namely Crawl, Walk, Run, and Fly, and has to evolve the technical, organizational, and business aspects of its operations. [12] Interestingly, the same group of scholars proposed the Experimentation Growth Model [15] in an attempt to define what a mature company does with regard to controlled experimentation. The model argues that a mature company should run experiments across most of its functionality and that all proposed software changes should be subject to experimentation. ...
Conference Paper
Full-text available
Online controlled experimentation, and more specifically A/B testing, is an effective method for assessing the impact of software changes. However, when adopting A/B testing, a development team faces various organizational and technical challenges. In this paper, we propose a new notion of reusable controlled experiments (RCE) to simplify and accelerate the adoption of A/B testing for software teams. In its essence, an RCE is a reusable software component supplied with built-in A/B testing functionality. We provide a proof-of-concept implementation of an RCE, integrate it into a mobile application in the field of educational technology, and run an experiment to validate the proposed solution. We conclude by checking the resulting integration against the six criteria categories of the Experimentation Evolution Model (EEM) to identify the maturity phase for each category. The resulting RCE is found to correspond to the experimentation evolution model's Walk maturity phase in three out of six categories, and to the Crawl phase in the other three categories.
... Halper and Stodder 2017). On the other side, authors such as Anderson (2015), Fabijan et al. (2017), Kearny et al. (2016), or Thusoo and Sarma (2017) combine two or more characteristics to craft more complex, multi-characteristic DDO understandings, thereby referring to culture, technology, abilities, data assets, processes and value-creation mechanisms. Illustratively, Fabijan et al. (2017) state that "data-driven companies acquire, process, and leverage data in order to create efficiencies, iterate on and develop new products, and navigate the competitive landscape" (p. 1). ...
... On the other side, authors such as Anderson (2015), Fabijan et al. (2017), Kearny et al. (2016), or Thusoo and Sarma (2017) combine two or more characteristics to craft more complex, multi-characteristic DDO understandings, thereby referring to culture, technology, abilities, data assets, processes and value-creation mechanisms. Illustratively, Fabijan et al. (2017) state that "data-driven companies acquire, process, and leverage data in order to create efficiencies, iterate on and develop new products, and navigate the competitive landscape" (p. 1). Likewise, Anderson (2015) combines "tools, abilities, and, most crucially, a culture that acts on data" (p. 1) to explain his DDO understanding. ...
Conference Paper
Full-text available
In today’s data-centric era, organizations increasingly aim to operate more data-driven and therefore engage in digital transformations toward becoming a data-driven organization (DDO). To govern such transformations, top managers develop digital transformation strategies (DTS) characterized by different organizational ambidexterity approaches. This study analyzes how such DTS influence the process and (intermediate) outcomes of organizations’ digital transformations toward becoming a DDO by studying two organizations undertaking such DDO transformations using the concept of organizational ambidexterity as a theoretical lens. On this empirical basis, we find that DTS characterized by different organizational ambidexterity approaches lead to different transformation processes and (intermediate) outcomes. Thereby, this study contributes to existing academic literature in the field of DDOs and DTS, as such transformation journeys toward becoming a DDO have not been studied in its entirety yet. Furthermore, our paper offers practical guidance for top managers to develop and implement a DTS suitable for their organization.
... using the search string described above. Furthermore, when analyzing our review sample, it became apparent that practitioner work, such as Patil (2011) and Anderson (2015), is frequently cited in the academic literature as well (e.g., in Fabijan et al., 2017; Hupperz et al., 2021). Therefore, we ...
... For example, three studies in our review sample present the sourcing and processing of data, combined with the goal of using data to gain competitive advantage, as key DDO characteristics. A corresponding understanding is evident in Fabijan et al. (2017) who state that "data-driven companies acquire, process, and leverage data in order to create efficiencies, iterate on and develop new products, and navigate the competitive landscape" (p. 1). Likewise, Gualo et al. (2021) put particular emphasis on the importance of the quality of the obtained data and name better service to the organization's customer as a DDO characteristic. ...
Article
Full-text available
With companies and other organizations increasingly striving to become (more) data-driven, there has been growing research interest in the notion of a data-driven organization (DDO). In existing literature, however, different understandings of such an organization emerged. The study at hand sets forth to synthesize the fragmented body of research through a review of existing DDO definitions and implicit understandings of this concept in the information systems and related literatures. Based on the review results and drawing on the established concept of the “knowing organization,” our study identifies five core dimensions of a DDO—namely, data sourcing & sensemaking, data capabilities, data-driven culture, data-driven decision-making, and data-driven value creation—which we integrate into a conceptual DDO framework. Most notably, the proposed framework suggests that—like its predecessor, the knowing organization—a DDO may draw on an outside-in view; however, it may also draw on an inside-out view, or even combine the two views, thereby setting itself apart from the knowing organization. To illustrate our conceptual DDO framework and demonstrate its usefulness, we apply this framework to three empirical examples. Theoretical and practical contributions as well as directions for future research are discussed.
... Central to this is innovation by exploring new software features or experimenting with software changes. In order to enable such innovation in practice, software companies often employ A/B testing [92,58,106,71]. A/B testing, also referred to as online controlled experimentation or continuous experimentation, is a form of hypothesis testing where two variants of a piece of software are evaluated in the field (ranging from variants with a slightly altered GUI layout to variants of software with new features). In particular, the merits of the two variants are analyzed using metrics such as click rates of website visitors, members' lifetime values (LTV) in a subscription service, and user conversions in marketing [82,161,48]. ...
... To achieve this, A/B testing is used to set up and evaluate online controlled experiments in the software system. Fabijan et al. [58], for example, perform a case study on the evolution of scaling up continuous experimentation at Microsoft, providing guidelines for other companies to conduct continuous experimentation. ...
... Application of A/B testing: 51 studies [175,123,95,102,16,15,121,171,99,148,33,70,66,52,20,174,107,63,155,150,65,163,170,27,143,2,5,135,149,7,98,147,141,173,19,26,8,114,6,122,50,97,136,125,22,124,128,159,67,3,176]
Improving efficiency of A/B testing: 20 studies [1,28,127,164,23,85,47,39,44,86,40,45,46,109,100,83,18,78,37,64]
Beyond standard A/B testing: 18 studies [166,38,82,48,77,79,112,134,72,139,126,29,118,75,49,117,30,151]
Concrete A/B testing problems: 17 studies [138,73,168,105,146,43,162,14,103,111,71,24,153,137,96,101,25]
Pitfalls and challenges of A/B testing: 13 studies [91,88,54,60,167,42,169,120,11,41,110,140,90]
Experimentation frameworks and platforms: 13 studies [144,154,106,156,108,9,131,74,21,36,179,177,152]
A/B testing at scale: 9 studies [89,160,58,165,81,157,76,57,56] ...
Preprint
In A/B testing, two variants of a piece of software are compared in the field from an end user's point of view, enabling data-driven decision making. While widely used in practice, no comprehensive study has been conducted on the state of the art in A/B testing. This paper reports the results of a systematic literature review that analyzed 141 primary studies. The results show that the main targets of A/B testing are algorithms and visual elements. Single classic A/B tests are the dominant type of test. Stakeholders have three main roles in the design of A/B tests: concept designer, experiment architect, and setup technician. The primary types of data collected during the execution of A/B tests are product/system data and user-centric data. The dominant uses of the test results are feature selection, feature rollout, and continued feature development. Stakeholders have two main roles during A/B test execution: experiment coordinator and experiment assessor. The main reported open problems are enhancement of proposed approaches and their usability. Interesting lines for future research include strengthening the adoption of statistical methods in A/B testing, improving the process of A/B testing, and enhancing the automation of A/B testing.
... Awareness could be raised by training engineers in interdisciplinary work so that it becomes easier to integrate HF experts in agile teams (as in I3). In addition, research is needed to determine how to increase the ability of agile teams to manage open questions (see I6) as well as their experimentation infrastructure (see I2) (Fagerholm et al., 2017b; Schermann et al., 2018; Fabijan et al., 2017). ...
... In particular, the need to have AV developers participate in (or even run) HF experiments (I1) requires the attention of researchers. In continuous software development, there is a trend towards data-driven decision making and experimentation (Fabijan et al., 2017; Schermann et al., 2018; Meyer, 2015; Kohavi et al., 2009; Kevic et al., 2017). ...
... Experimentation maturity models (Fabijan et al. 2017, 2018; Optimizely 2018; Wider Funnel 2018; Brooks Bell 2015) consist of the phases organizations are likely to go through on the way to being data-driven and running every change through A/B experiments: Crawl, Walk, Run, and Fly. ...
Chapter
Full-text available
Many good resources are available with motivation and explanations about online controlled experiments (Kohavi et al. 2009a, 2020; Thomke 2020; Luca and Bazerman 2020; Georgiev 2018, 2019; Kohavi and Thomke 2017; Siroker and Koomen 2013; Goward 2012; Schrage 2014; King et al. 2017; McFarland 2012; Manzi 2012; Tang et al. 2010). For organizations running online controlled experiments at scale, Gupta et al. (2019) provide an advanced set of challenges. We provide a motivating visual example of a controlled experiment that ran at Microsoft’s Bing. The team wanted to add a feature allowing advertisers to provide links to the target site. The rationale is that this will improve ads quality by giving users more information about what the advertiser’s site provides and allow users to directly navigate to the sub-category matching their intent. Visuals of the existing ads layout (Control) and the new ads layout (Treatment) with site links added are shown in Fig. 1.
... INTRODUCTION A/B testing enables companies to make trustworthy data-driven decisions at scale and has been a research area in the software industry for many years [1], [2]. Companies run A/B tests to assess ideas and to safely validate [3] what delivers value to their customers. ...
... However, in the case of B2B partnerships there is an alternative: integrators can sometimes reuse the A/B platform's UI, and there are good reasons to do this. A/B testing platform teams have been publishing research for many years on the importance of an intuitive and comprehensive user interface (UI) for running and operating an A/B testing platform [1], [2]. An A/B testing UI can range from a notebook with sample code on how to make an API call to start an A/B test, for teams just starting to run A/B tests, to a well-designed and comprehensive user experience. ...
Conference Paper
Full-text available
A/B tests are the gold standard for evaluating product changes. At Microsoft, for example, we run tens of thousands of A/B tests every year to understand how users respond to new designs, new features, bug fixes, or any other ideas we might have on what will deliver value to users. In addition to testing product changes, however, A/B testing is starting to gain momentum as a differentiating feature of platforms or products whose primary purpose may not be A/B testing. As we describe in this paper, organizations such as Azure PlayFab and Outreach have integrated experimentation platforms and offer A/B testing to their customers as one of the many features in their product portfolio. In this paper, and based on multiple case studies, we present the lessons learned from enabling A/B integrations: integrating A/B testing into software products. We enrich each of the learnings with a motivating example, share the trade-offs made along this journey, and provide recommendations for practitioners. Our learnings are most applicable for engineering teams developing experimentation platforms, integrators considering embedding A/B testing into their products, and researchers working in the A/B testing domain.
... While agile methods emphasise customer value [6], building the right product appears to be a feat that few startups achieve. Continuous experimentation (CE) is a software engineering method where product development is driven by field experiments with real users [29,8,9,41,2]. It strives to establish virtuous feedback loops between business, development, and operations [11], and reportedly improves product quality and business performance [7,8], with promising implications for startups. ...
... Continuous experimentation (CE) is a software engineering method where product development is driven by field experiments with real users [29,8,9,41,2]. It strives to establish virtuous feedback loops between business, development, and operations [11], and reportedly improves product quality and business performance [7,8], with promising implications for startups. ...
... CE approaches software product development through experiments with real users [29,8,9,41,2]. This includes collecting and analysing experimental data to test product hypotheses, gaining insights for new feature ideas to be evaluated in subsequent experiments. ...
Preprint
Full-text available
Background: Continuous experimentation (CE) has been proposed as a data-driven approach to software product development. Several challenges with this approach have been described in large organisations, but its application in smaller companies with early-stage products remains largely unexplored. Aims: The goal of this study is to understand what factors could affect the adoption of CE in early-stage software startups. Method: We present a descriptive multiple-case study of five startups in Finland which differ in their utilisation of experimentation. Results: We find that practices often mentioned as prerequisites for CE, such as iterative development and continuous integration and delivery, were used in the case companies. CE was not widely recognised or used as described in the literature. Only one company performed experiments and used experimental data systematically. Conclusions: Our study indicates that small companies may be unlikely to adopt CE unless 1) at least some company employees have prior experience with the practice, 2) the company's limited available resources are not exceeded by its adoption, and 3) the practice solves a problem currently experienced by the company, or the company perceives almost immediate benefit of adopting it. We discuss implications for advancing CE in early-stage startups and outline directions for future research on the approach.
... However, the experiments are often defined by the product owner of the company, while the software developers implement them during product development [15]. In turn, experiments can be implemented with different techniques, such as feature toggles [26] or API traffic management [13]. This results in additional communication and synchronization effort before experimentation results can be used directly during development. ...
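A feature toggle, one of the implementation techniques mentioned above, can be sketched as a percentage rollout gate that decides which variant a user sees. This is a minimal illustration under assumed names (FEATURE_TOGGLES, is_enabled, render_deals_stripe), not any cited framework's API.

```python
import zlib

# Toggle configuration: which features are on, and for what share of users.
FEATURE_TOGGLES = {"new_deals_stripe": {"enabled": True, "rollout_percent": 50}}

def is_enabled(feature: str, user_id: str) -> bool:
    """Return True if the user falls inside the feature's rollout bucket."""
    toggle = FEATURE_TOGGLES.get(feature)
    if toggle is None or not toggle["enabled"]:
        return False
    # Stable bucket in [0, 100) so a user always lands in the same group.
    bucket = zlib.crc32(f"{feature}:{user_id}".encode()) % 100
    return bucket < toggle["rollout_percent"]

def render_deals_stripe(user_id: str) -> str:
    """Serve the treatment or the control depending on the toggle."""
    return "treatment" if is_enabled("new_deals_stripe", user_id) else "control"
```

Because the bucket is derived from a stable checksum rather than a random draw per request, the same user consistently sees the same variant, which a split test requires.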
... Continuous experimentation describes the concept of continuously testing the underlying assumptions of the software product with the users based on experiments [27]. Controlled experiments split the software product into different variants and test those variants on distinct user groups [13]. Here, split-testing (or A/B testing with only two variants) allows the analysis and comparison of a single variable over time, while multivariate testing allows the comparison of multiple variables simultaneously. ...
... In addition to the adoption of cross-functional teams, sprints, and iterative development, innovation initiatives that aim to generate new and recurring revenue streams require a shift towards customer-driven innovation and lean start-up ways-of-working [21], [22], [23], [24]. In addition, companies need experimentation practices and mechanisms that help them continuously deploy, measure and evaluate what constitutes customer value [25], [26], [27], [28]. As recognized in [24], agile development methods help answer 'how' to build products and how to increase speed in development. ...