Michael Pradel
Universität Stuttgart · Department of Computer Science

About

121

Publications

11,085

Reads

3,987

Citations

Publications

PyTy: Repairing Static Type Errors in Python

Conference Paper

Apr 2024

Fuzz4All: Universal Fuzzing with Large Language Models

Conference Paper

Apr 2024

Resource Usage and Optimization Opportunities in Workflows of GitHub Actions

Conference Paper

Feb 2024

LExecutor: Learning-Guided Execution

Conference Paper

Nov 2023

That’s a Tough Call: Studying the Challenges of Call Graph Construction for WebAssembly

Conference Paper

Jul 2023

Beware of the Unexpected: Bimodal Taint Analysis

Conference Paper

Jul 2023

Where to Look When Repairing Code? Comparing the Attention of Neural Models and Developers

Preprint

May 2023

Neural network-based techniques for automated program repair are becoming increasingly effective. Despite their success, little is known about why they succeed or fail, and how their way of reasoning about the code to repair compares to human developers. This paper presents the first in-depth study comparing human and neural program repair. In part...

SecBench.js: An Executable Security Benchmark Suite for Server-Side JavaScript

Conference Paper

May 2023

MorphQ: Metamorphic Testing of the Qiskit Quantum Computing Platform

Conference Paper

May 2023

When to Say What: Learning to Find Condition-Message Inconsistencies

Conference Paper

May 2023

Figure 2: Overview of TraceFixer approach.

TraceFixer: Execution Trace-Driven Program Repair

Preprint

Full-text available

Apr 2023

When debugging unintended program behavior, developers can often identify the point in the execution where the actual behavior diverges from the desired behavior. For example, a variable may get assigned a wrong value, which then negatively influences the remaining computation. Once a developer identifies such a divergence, how to fix the code so t...

VULGEN: Realistic Vulnerability Generation Via Pattern Mining and Deep Learning

Conference Paper

Full-text available

Feb 2023

Building new, powerful data-driven defenses against prevalent software vulnerabilities needs sizable, quality vulnerability datasets, so does large-scale benchmarking of existing defense solutions. Automatic data generation would promisingly meet the need, yet there is little work aimed to generate much-needed quality vulnerable samples. Meanwhile,...

LExecutor: Learning-Guided Execution

Preprint

Feb 2023

Executing code is essential for various program analysis tasks, e.g., to detect bugs that manifest through exceptions or to obtain execution traces for further dynamic analysis. However, executing an arbitrary piece of code is often difficult in practice, e.g., because of missing variable definitions, missing user inputs, and missing third-party de...

Beware of the Unexpected: Bimodal Taint Analysis

Preprint

Jan 2023

Static analysis is a powerful tool for detecting security vulnerabilities and other programming problems. Global taint tracking, in particular, can spot vulnerabilities arising from complicated data flow across multiple functions. However, precisely identifying which flows are problematic is challenging, and sometimes depends on factors beyond the...

CrystalBLEU: Precisely and Efficiently Measuring the Similarity of Code

Conference Paper

Jan 2023

Generating Realistic Vulnerabilities via Neural Code Editing: An Empirical Study

Conference Paper

Full-text available

Nov 2022

The availability of large-scale, realistic vulnerability datasets is essential for both benchmarking existing techniques and developing effective new ones, especially those using data-driven (e.g., machine/deep-learning based) approaches, for software security. Yet such datasets are critically lacking. A promising solution is to generate such datas...

The evolution of type annotations in python: an empirical study

Conference Paper

Nov 2022

DynaPyt: a dynamic analysis framework for Python

Conference Paper

Nov 2022

CrystalBLEU: precisely and efficiently measuring the similarity of code

Conference Paper

Oct 2022

Code Search: A Survey of Techniques for Finding Code

Article

Oct 2022

The immense amounts of source code provide ample challenges and opportunities during software development. To handle the size of code bases, developers commonly search for code, e.g., when trying to find where a particular feature is implemented or when looking for code examples to reuse. To support developers in finding relevant code, various code...

Nalin: learning from runtime behavior to find name-value inconsistencies in jupyter notebooks

Conference Paper

Jul 2022

Nessie: automatically testing JavaScript APIs with asynchronous callbacks

Conference Paper

Jul 2022

Finding the Dwarf: Recovering Pecise Types from WebAssembly Binaries

Conference Paper

Jun 2022

MorphQ: Metamorphic Testing of Quantum Computing Platforms

Preprint

Jun 2022

As quantum computing is becoming increasingly popular, the underlying quantum computing platforms are growing both in ability and complexity. This growth may cause bugs in the platforms, which hinders the adoption of quantum computing. Unfortunately, testing quantum computing platforms is challenging due to the relatively small number of existing q...

Mutants generated by our FSLM-based tool and by Major [34].

Line coverage achieved by the tests generated by our FSLM-based test...

Code Generation Tools (Almost) for Free? A Study of Few-Shot, Pre-Trained Language Models on Code

Preprint

Full-text available

Jun 2022

Few-shot learning with large-scale, pre-trained language models is a powerful way to answer questions about code, e.g., how to complete a given code example, or even generate code snippets from scratch. The success of these models raises the question whether they could serve as a basis for building a wide range code generation tools. Traditionally,...

Wobfuscator: Obfuscating JavaScript Malware via Opportunistic Translation to WebAssembly

Conference Paper

May 2022

Bugs in Quantum computing platforms: an empirical study

Article

Apr 2022

The interest in quantum computing is growing, and with it, the importance of software platforms to develop quantum programs. Ensuring the correctness of such platforms is important, and it requires a thorough understanding of the bugs they typically suffer from. To address this need, this paper presents the first in-depth study of bugs in quantum c...

Meta Learning for Code Summarization

Preprint

Full-text available

Jan 2022

Source code summarization is the task of generating a high-level natural language description for a segment of programming language code. Current neural models for the task differ in their architecture and the aspects of code they consider. In this paper, we show that three SOTA models for code summarization work well on largely disjoint subsets of...

Neural software analysis

Article

Jan 2022

Developer tools that use a neural machine learning model to make predictions about previously unseen code.

DiffSearch: A Scalable and Precise Search Engine for Code Changes

Article

Jan 2022

The source code of successful projects is evolving all the time, resulting in hundreds of thousands of code changes stored in source code repositories. This wealth of data can be useful, e.g., to find changes similar to a planned code change or examples of recurring code improvements. This paper presents DiffSearch, a search engine that, given a qu...

Nalin: Learning from Runtime Behavior to Find Name-Value Inconsistencies in Jupyter Notebooks

Preprint

Full-text available

Dec 2021

Variable names are important to understand and maintain code. If a variable name and the value stored in the variable do not match, then the program suffers from a name-value inconsistency, which is due to one of two situations that developers may want to fix: Either a correct value is referred to through a misleading name, which negatively affects...

Preventing Dynamic Library Compromise on Node.js via RWX-Based Privilege Reduction

Conference Paper

Nov 2021

Thinking Like a Developer? Comparing the Attention of Humans with Neural Models of Code

Conference Paper

Nov 2021

Fuzzm: Finding Memory Bugs through Binary-Only Instrumentation and Fuzzing of WebAssembly

Preprint

Oct 2021

WebAssembly binaries are often compiled from memory-unsafe languages, such as C and C++. Because of WebAssembly's linear memory and missing protection features, e.g., stack canaries, source-level memory vulnerabilities are exploitable in compiled WebAssembly binaries, sometimes even more easily than in native code. This paper addresses the problem...

Bugs in Quantum Computing Platforms: An Empirical Study

Preprint

Oct 2021

Semantic bug seeding: a learning-based approach for creating realistic bugs

Conference Paper

Full-text available

Aug 2021

Finding data compatibility bugs with JSON subschema checking

Conference Paper

Jul 2021

Continuous test suite failure prediction

Conference Paper

Jul 2021

Automatic Program Repair

Article

Jul 2021

Programming mistakes of all kinds-in source code, configurations, tests, or other artifacts-are a wide-ranging and expensive problem. Developers dedicate a significant proportion of engineering time and effort to finding and fixing bugs in their code, businesses lose market share when vulnerabilities in the software they sell impact customers, and...

Learning to make compiler optimizations more effective

Conference Paper

Jun 2021

IdBench: Evaluating Semantic Representations of Identifier Names in Source Code

Conference Paper

May 2021

An Empirical Study of Real-World WebAssembly Binaries: Security, Languages, Use Cases

Conference Paper

Apr 2021

Learning to Make Compiler Optimizations More Effective

Preprint

Feb 2021

Because loops execute their body many times, compiler developers place much emphasis on their optimization. Nevertheless, in view of highly diverse source code and hardware, compilers still struggle to produce optimal target code. The sheer number of possible loop optimizations, including their combinations, exacerbates the problem further. Today's...

Neural Software Analysis

Preprint

Nov 2020

Many software development problems can be addressed by program analysis tools, which traditionally are based on precise, logical reasoning and heuristics to ensure that the tools are practical. Recent work has shown tremendous success through an alternative way of creating developer tools, which we call neural software analysis. The key idea is to...

TypeWriter: neural type prediction with search-based validation

Conference Paper

Nov 2020

Mir: Automated Quantifiable Privilege Reduction Against Dynamic Library Compromise in JavaScript

Preprint

Oct 2020

Third-party libraries ease the development of large-scale software systems. However, they often execute with significantly more privilege than needed to complete their task. This additional privilege is often exploited at runtime via dynamic compromise, even when these libraries are not actively malicious. Mir addresses this problem by introducing...

Satisfying Increasing Performance Requirements with Caching at the Application Level

Preprint

Full-text available

Oct 2020

Application-level caching is a form of caching that has been increasingly adopted to satisfy performance and throughput requirements. The key idea is to store the results of a computation, to improve performance by reusing instead of recomputing those results. However, despite its provided gains, this form of caching imposes new design, implementat...

Satisfying Increasing Performance Requirements With Caching at the Application Level

Article

Oct 2020

Scaffle: bug localization on millions of files

Conference Paper

Jul 2020

Extracting taint specifications for JavaScript libraries

Conference Paper

Jun 2020

A Survey of Compiler Testing

Article

Full-text available

Feb 2020

Virtually any software running on a computer has been processed by a compiler or a compiler-like tool. Because compilers are such a crucial piece of infrastructure for building software, their correctness is of paramount importance. To validate and increase the correctness of compilers, significant research efforts have been devoted to testing comp...

Figure 5: Precision/Recall curves for different...

Figure 6: Distribution of types found by TypeWriter and Pyre Infer.

Effectiveness of neural type prediction.

Effectiveness of various search strategies for type inference.

TypeWriter: Neural Type Prediction with Search-based Validation

Preprint

Full-text available

Dec 2019

Maintaining large code bases written in dynamically typed languages, such as JavaScript or Python, can be challenging: simple data compatibility errors proliferate, IDE support is lacking and APIs are harder to comprehend. Recent work attempts to address those issues through either static analysis or probabilistic type inference. Unfortunately, sta...

Type Safety with JSON Subschema

Preprint

Full-text available

Nov 2019

JSON is a popular data format used pervasively in web APIs, cloud computing, NoSQL databases, and increasingly also machine learning. JSON Schema is a language for declaring the structure of valid JSON data. There are validators that can decide whether a JSON document is valid with respect to a schema. Unfortunately, like all instance-based testing...

Automated program repair

Article

Nov 2019

Automated program repair can relieve programmers from the burden of manually fixing the ever-increasing number of programming mistakes.

An Empirical Study of Information Flows in Real-World JavaScript

Conference Paper

Nov 2019

Information flow analysis prevents secret or untrusted data from flowing into public or trusted sinks. Existing mechanisms cover a wide array of options, ranging from lightweight taint analysis to heavyweight information flow control that also considers implicit flows. Dynamic analysis, which is particularly popular for languages such as JavaScript...

Evaluating Semantic Representations of Source Code

Preprint

Oct 2019

Learned representations of source code enable various software developer tools, e.g., to detect bugs or to predict program properties. At the core of code representations often are word embeddings of identifier names in source code, because identifiers account for the majority of source code vocabulary and convey important semantic information. Unf...

Fig. 6. Dendrogram showing concrete edits merged into more abstract...

Fig. 12. Accuracy of top-1, top-5 and top-∞ predictions (fraction of...

Accuracy of predicting exactly the human fix for different kinds of bugs.

Time taken by Getafix for 10-fold experiment of training and prediction.

Getafix: learning to fix bugs automatically

Article

Full-text available

Oct 2019

Static analyzers help find bugs early by warning about recurring bug categories. While fixing these bugs still remains a mostly manual task in practice, we observe that fixes for a specific bug category often are repetitive. This paper addresses the problem of automatically fixing instances of common bugs by learning from past fixes. We present Get...

Interactive metamorphic testing of debuggers

Conference Paper

Jul 2019

When improving their code, developers often turn to interactive debuggers. The correctness of these tools is crucial, because bugs in the debugger itself may mislead a developer, e.g., to believe that executed code is never reached or that a variable has another value than in the actual execution. Yet, debuggers are difficult to test because their...

An Empirical Study of Information Flows in Real-World JavaScript

Preprint

Jun 2019

Neural Bug Finding: A Study of Opportunities and Challenges

Preprint

Jun 2019

Static analysis is one of the most widely adopted techniques to find software bugs before code is put in production. Designing and implementing effective and efficient static analyses is difficult and requires high expertise, which results in only a few experts able to write such analyses. This paper explores the opportunities and challenges of an...

Anything to Hide? Studying Minified and Obfuscated Code in the Web

Conference Paper

May 2019

JavaScript has been used for various attacks on client-side web applications. To hinder both manual and automated analysis from detecting malicious scripts, code minification and code obfuscation may hide the behavior of a script. Unfortunately, little is currently known about how real-world websites use such code transformations. This paper presen...

NL2Type: Inferring JavaScript Function Types from Natural Language Information

Conference Paper

Full-text available

May 2019

Wasabi: A Framework for Dynamically Analyzing WebAssembly

Conference Paper

Apr 2019

WebAssembly is the new low-level language for the web and has now been implemented in all major browsers since over a year. To ensure the security, performance, and correctness of future web applications, there is a strong need for dynamic analysis tools for WebAssembly. However, building such tools from scratch requires knowledge of low-level deta...

Small World with High Risks: A Study of Security Threats in the npm Ecosystem

Preprint

Feb 2019

The popularity of JavaScript has lead to a large ecosystem of third-party packages available via the npm software package registry. The open nature of npm has boosted its growth, providing over 800,000 free and reusable software packages. Unfortunately, this open nature also causes security risks, as evidenced by recent incidents of single packages...

Figure 1: Overview of the Chameleon framework and its four steps.

Table 1 : Example files to illustrate the metrics.

Easy to Fool? Testing the Anti-evasion Capabilities of PDF Malware Scanners

Preprint

Full-text available

Jan 2019

Malware scanners try to protect users from opening malicious documents by statically or dynamically analyzing documents. However, malware developers may apply evasions that conceal the maliciousness of a document. Given the variety of existing evasions, systematically assessing the impact of evasions on malware scanners remains an open challenge. T...

Feedback-directed differential testing of interactive debuggers

Conference Paper

Oct 2018

To understand, localize, and fix programming errors, developers often rely on interactive debuggers. However, as debuggers are software, they may themselves have bugs, which can make debugging unnecessarily hard or even cause developers to reason about bugs that do not actually exist in their code. This paper presents the first automated testing te...

Table 2 . Test generation approaches used for the evaluation.

Table 3 . Comparison of different test generation approaches in terms...

Table 7 . Statement coverage for 1,000 generated tests.

Test generation for higher-order functions in dynamic languages

Article

Full-text available

Oct 2018

Test generation has proven to provide an effective way of identifying programming errors. Unfortunately, current test generation techniques are challenged by higher-order functions in dynamic languages, such as JavaScript functions that receive callbacks. In particular, existing test generators suffer from the unavailability of statically known typ...

Table 1 . Examples of name-related bugs detected by DeepBugs.

Table 2 . Examples of identifier names and literals extracted for...

Table 3 . Statistics on extraction and generation of training data.

Table 4 . Results of inspecting and classifying warnings in real-world...

DeepBugs: a learning approach to name-based bug detection

Article

Full-text available

Oct 2018

Natural language elements in source code, e.g., the names of variables and functions, convey useful information. However, most existing bug detection tools ignore this information and therefore miss some classes of bugs. The few existing name-based bug detection approaches reason about names on a syntactic level and rely on manually designed and tu...

Pinpointing and repairing performance bottlenecks in concurrent programs

Article

Full-text available

Oct 2018

Developing concurrent software that is both correct and efficient is challenging. Past research has proposed various techniques that support developers in finding, understanding, and repairing concurrency-related correctness problems, such as missing or incorrect synchronization. In contrast, existing work provides little support for dealing with c...

Is this class thread-safe? inferring documentation using graph-based learning

Conference Paper

Sep 2018

Thread-safe classes are pervasive in concurrent, object-oriented software. However, many classes lack documentation regarding their safety guarantees under multi-threaded usage. This lack of documentation forces developers who use a class in a concurrent program to either carefully inspect the implementation of the class, to conservatively synchron...

How many of all bugs do we find? a study of static bug detectors

Conference Paper

Sep 2018

Static bug detectors are becoming increasingly popular and are widely used by professional software developers. While most work on bug detectors focuses on whether they find bugs at all, and on how many false positives they report in addition to legitimate warnings, the inverse question is often neglected: How many of all real-world bugs do static...

Wasabi: A Framework for Dynamically Analyzing WebAssembly

Preprint

Aug 2018

WebAssembly is the new low-level language for the web and has now been implemented in all major browsers since over a year. To ensure the security, performance, and correctness of future web applications, there is a strong need for dynamic analysis tools for WebAssembly. Unfortunately, building such tools from scratch requires knowledge of low-leve...

Context2Name: A Deep Learning-Based Approach to Infer Natural Variable Names from Usage Contexts

Preprint

Full-text available

Aug 2018

Most of the JavaScript code deployed in the wild has been minified, a process in which identifier names are replaced with short, arbitrary and meaningless names. Minified code occupies less space, but also makes the code extremely difficult to manually inspect and understand. This paper presents Context2Name, a deep learningbased technique that par...

ConflictJS: finding and understanding conflicts between JavaScript libraries

Conference Paper

Full-text available

May 2018

It is a common practice for client-side web applications to build on various third-party JavaScript libraries. Due to the lack of namespaces in JavaScript, these libraries all share the same global namespace. As a result, one library may inadvertently modify or even delete the APIs of another library, causing unexpected behavior of library clients....

DeepBugs: A Learning Approach to Name-based Bug Detection

Preprint

Full-text available

Apr 2018

Synthesizing programs that expose performance bottlenecks

Conference Paper

Feb 2018

Software often suffers from performance bottlenecks, e.g., because some code has a higher computational complexity than expected or because a code change introduces a performance regression. Finding such bottlenecks is challenging for developers and for profiling techniques because both rely on performance tests to execute the software, which are o...

Synthesizing programs that expose performance bottlenecks

Conference Paper

Feb 2018

SYNODE: Understanding and Automatically Preventing Injection Attacks on NODE.JS

Conference Paper

Jan 2018

Detecting argument selection defects

Article

Full-text available

Oct 2017

Identifier names are often used by developers to convey additional information about the meaning of a program over and above the semantics of the programming language itself. We present an algorithm that uses this information to detect argument selection defects, in which the programmer has chosen the wrong argument to a method call in Java program...

Saying ‘Hi!’ is not enough: Mining inputs for effective test generation

Conference Paper

Oct 2017

Detecting Argument Selection Defects

Article

Aug 2017

Identifier names are often used by developers to convey additional information about the meaning of a program over and above the semantics of the programming language itself. We present an algorithm that uses this information to detect argument selection defects in which the programmer has chosen the wrong argument to a method call in Java programs...

A Survey of Dynamic Analysis and Test Generation for JavaScript

Article

Jul 2017

JavaScript has become one of the most prevalent programming languages. Unfortunately, some of the unique properties that contribute to this popularity also make JavaScript programs prone to errors and difficult for program analyses to reason about. These properties include the highly dynamic nature of the language, a set of unusual language feature...

An actionable performance profiler for optimizing the order of evaluations

Conference Paper

Jul 2017

The efficiency of programs often can be improved by applying rel- atively simple changes. To find such optimization opportunities, developers either rely on manual performance tuning, which is time-consuming and requires expert knowledge, or on traditional profilers, which show where resources are spent but not how to optimize the program. This pap...

Systematic black-box analysis of collaborative web applications

Conference Paper

Jun 2017

Web applications, such as collaborative editors that allow multiple clients to concurrently interact on a shared resource, are difficult to implement correctly. Existing techniques for analyzing concurrent software do not scale to such complex systems or do not consider multiple interacting clients. This paper presents Simian, the first fully autom...

Systematic black-box analysis of collaborative web applications

Article

Jun 2017

Efficient Detection of Thread Safety Violations via Coverage-Guided Generation of Concurrent Tests

Conference Paper

May 2017

Making Malory Behave Maliciously: Targeted Fuzzing of Android Execution Environments

Conference Paper

May 2017

Monkey see, monkey do: effective generation of GUI tests with inferred macro events

Conference Paper

Jul 2016

Automated testing is an important part of validating the behavior of software with complex graphical user interfaces, such as web, mobile, and desktop applications. Despite recent advances in UI-level test generation, existing approaches often fail to create complex sequences of events that represent realistic user interactions. As a result, these...

SyncProf: detecting, localizing, and optimizing synchronization bottlenecks

Conference Paper

Full-text available

Jul 2016

Writing concurrent programs is a challenge because developers must consider both functional correctness and performance requirements. Numerous program analyses and testing techniques have been proposed to detect functional faults, e.g., caused by incorrect synchronization. However, little work has been done to help developers address performance pr...

Performance issues and optimizations in JavaScript: an empirical study

Conference Paper

May 2016

As JavaScript is becoming increasingly popular, the performance of JavaScript programs is crucial to ensure the responsiveness and energy-efficiency of thousands of programs. Yet, little is known about performance issues that developers face in practice and they address these issues. This paper presents an empirical study of 98 fixed performance is...

Nomen est omen: exploring and exploiting similarities between argument and parameter names

Conference Paper

May 2016

Programmer-provided identifier names convey information about the semantics of a program. This information can complement traditional program analyses in various software engineering tasks, such as bug finding, code completion, and documentation. Even though identifier names appear to be a rich source of information, little is known about their pro...

Performance Problems You Can Fix: A Dynamic Analysis of Memoization Opportunities

Article

Oct 2015

Performance bugs are a prevalent problem and recent research proposes various techniques to identify such bugs. This paper addresses a kind of performance problem that often is easy to address but difficult to identify: redundant computations that may be avoided by reusing already computed results for particular inputs, a technique called memoizati...

JITProf: pinpointing JIT-unfriendly JavaScript code

Conference Paper

Full-text available

Sep 2015

Most modern JavaScript engines use just-in-time (JIT) compilation to translate parts of JavaScript code into efficient machine code at runtime. Despite the overall success of JIT compilers, programmers may still write code that uses the dynamic features of JavaScript in a way that prohibits profitable optimizations. Unfortunately, there currently i...

DLint: Dynamically Checking Bad Coding Practices in JavaScript

Conference Paper

Full-text available

Jul 2015

JavaScript has become one of the most popular programming languages, yet it is known for its suboptimal design. To effectively use JavaScript despite its design flaws, developers try to follow informal code quality rules that help avoid correctness, maintainability, performance, and security problems. Lightweight static analyses, implemented in "li...

TypeDevil: Dynamic Type Inconsistency Analysis for JavaScript

Conference Paper

May 2015

Poster: Automatically Fixing Real-World JavaScript Performance Bugs

Conference Paper

May 2015

EventBreak

Article

Dec 2014

Event-driven user interface applications typically have a single thread of execution that processes event handlers in response to input events triggered by the user, the network, or other applications. Programmers must ensure that event handlers terminate after a short amount of time because otherwise, the application may become unresponsive. This...

EventBreak: Analyzing the Responsiveness of User Interfaces through Performance-Guided Test Generation

Article

Oct 2014

Performance regression testing of concurrent classes

Article

Jul 2014

Developers of thread-safe classes struggle with two opposing goals. The class must be correct, which requires synchronizing concurrent accesses, and the class should pro- vide reasonable performance, which is difficult to realize in the presence of unnecessary synchronization. Validating the performance of a thread-safe class is challenging because...

Bita: Coverage-guided, automatic testing of actor programs

Conference Paper

Nov 2013

Actor programs are concurrent programs where concurrent entities communicate asynchronously by exchanging messages. Testing actor programs is challenging because the order of message receives depends on the non-deterministic scheduler and because exploring all schedules does not scale to large programs. This paper presents Bita, a scalable, automat...