
A Review of Memory Errors Exploitation in x86-64

June 2020
Computers 9(2):48

DOI:10.3390/computers9020048

License
CC BY 4.0

Authors:

Hector Marco-Gisbert

Universitat Politècnica de València

Memory errors are still a serious threat affecting millions of devices worldwide. Recently, bounty programs have reached a new record, paying up to USD 2.5 million for one single vulnerability in Android and up to USD 2 million for Apple’s operating system. In almost all cases, it is common to exploit memory errors in one or more stages to fully compromise those devices. In this paper, we review and discuss the importance of memory error vulnerabilities, and more specifically stack buffer overflows, to provide a full view of how memory errors are exploited. We identify the root causes that make those attacks possible on the modern x86-64 architecture in the presence of modern protection techniques. We have analyzed how unsafe library functions are prone to buffer overflows, revealing that although there are secure versions of those functions, they do not actually prevent buffer overflows from happening. Using secure functions does not result in software free from vulnerabilities and it requires developers to be security-aware. To overcome this problem, we discuss the three main security protection techniques present in all modern operating systems: the non-eXecutable bit (NX), the Stack Smashing Protector (SSP) and the Address Space Layout Randomization (ASLR). After discussing their effectiveness, we conclude that although they provide a strong level of protection against classical exploitation techniques, modern attacks can bypass them.


computers

Review


Conor Pirry, Hector Marco-Gisbert * and Carolyn Begg

School of Computing, Engineering and Physical Sciences, University of the West of Scotland, High Street,

Paisley PA1 2BE, UK; B00283255@studentmail.uws.ac.uk (C.P.); Carolyn.Begg@uws.ac.uk (C.B.)

*Correspondence: hector.marco@uws.ac.uk; Tel.:+44-141-849-4418

Received: 23 April 2020; Accepted: 2 June 2020; Published: 8 June 2020






Keywords: memory errors; x86-64; stack buffer overflows; SSP; ASLR; NX

1. Introduction

Every computing device, from the servers that comprise the infrastructure of the internet to

electronic cars, to the internet-of-things devices that are increasingly making their way into people’s

homes, has digital memory.

Unfortunately, vulnerabilities exploiting memory errors are still a major threat. In fact, a quantitative study [ ] found that memory buffer (overflow) errors account for 14% of all vulnerabilities reported to MITRE’s Common Vulnerabilities and Exposures (CVE) database and the National Institute of Standards and Technology’s National Vulnerability Database (NIST NVD) from 1988 to 2012. This makes them the most common vulnerability type reported throughout these years [ ]. Figure 1 illustrates the top vulnerability types, where buffer overflows due to memory errors are still at the top of the ranking.

Preventing memory errors, and particularly buffer overflows, has been a challenge since the early 1970s, as James P. Anderson [ ] highlighted in his 1972 report, “Computer Security Technology Planning Study” [ ]. Those vulnerabilities are still considered an open security problem, which requires solutions that do not introduce significant overhead and allow programmers to fully use fast languages like C.

Computers 2020,9, 48; doi:10.3390/computers9020048 www.mdpi.com/journal/computers

[Figure 1: pie chart. Legible slices: Buffer Errors 14%, XSS 13%, Access Control 11%, SQL Injection 10%, Input Validation 10%, Others 8%. Remaining slices, whose percentages are not reliably recoverable from the extracted text: Not Enough Info, Code Injection, Information Leak, Resource Management, Path Traversal, Configuration, Numeric Errors.]

Figure 1. Top vulnerability types.

In February 2019, a CVE rated high across all three impact metrics (confidentiality, integrity, availability) was released that describes a stack buffer overflow vulnerability in libcurl, affecting versions from 7.36.0 to 7.64.0 [ ]. This library, the most used C-based highly portable transfer library in the world [ ], provides functionality for end-users to carry out network-based resource transfers. Stenberg [ ] claims that the library is used in every imaginable sort of embedded device where Internet transfers are needed.

Ever since Aleph One published the famous article “Smashing The Stack For Fun And Profit” in Phrack magazine in 1996 [ ], many memory protection mechanisms have been developed. The three main protection techniques employed by all modern operating systems are: the Stack Smashing Protector (SSP), which adds a canary/cookie/guard value in stack frames to mitigate the overwriting of the frame return address; the Address Space Layout Randomisation (ASLR), which randomises the virtual memory layout of a process to thwart attacks that rely on known memory locations; and the Non-eXecutable (NX) bit, which allows writable memory to be marked as non-executable, meaning that any malicious code injected, for example, into the stack as part of an attack payload will not be executed by the CPU. Although those protection techniques make exploitation harder, they do not eliminate the vulnerabilities themselves.

From the developer’s side, careful programming practices are an essential part of any memory corruption mitigation approach. This means, for example, avoiding the use of unsafe functions, checking array bounds and, when possible, deallocating memory that is no longer required. However, due to the need to maintain software on internet time [ ], in which developers have to meet narrow deadlines, there is not much time for code and security testing. On many occasions, this leads to unnoticed, critical programming mistakes in publicly released software written in languages that allow the freedom to, for example, interact directly with memory, such as C and C++.

Programmers working with other languages that cannot interact directly with memory, such as Java or Python, are not affected by these kinds of memory errors because those languages have been built with protection mechanisms, such as array bounds checking and the automatic deallocation of memory space (a.k.a. garbage collection), so that memory corruption errors are less prevalent.

The main contributions of this paper are:

• A literature review on memory errors, including new protection mechanisms that have not been considered in previous reviews.
• We identify the root causes that make memory error attacks possible on the modern x86-64 architecture.
• We reveal that “safe” functions do not actually prevent memory error exploitation and that developers should not wholly trust them.
• We examine the techniques developed to mitigate memory errors, showing that no protection mechanism provides complete protection against known attack vectors.

This paper is organised as follows: Section 3 introduces the reader to the x86-64 architecture and provides the knowledge required to understand real memory error exploitation. In Section 3.6 we detail both the process and stack layout on the x86-64 architecture. In Section 4 we review the literature of buffer overflows and attack vectors used by adversaries. Section 5 presents attack approaches when probing buffer overflow vulnerabilities. Section 6 introduces the three main protection techniques present in all modern operating systems that have been developed in an effort to mitigate memory errors, including the recent literature that bypasses all those protection techniques. Finally, we conclude the paper in Section 7.

2. Literature Review

2.1. Memory Errors

The term “memory error” refers to a wide range of programming faults relating to how a processor interprets the contents of the main memory and what happens when that memory is misused or misinterpreted by a program. It is important to distinguish physical errors caused by the underlying hardware (memory cells), due to electrical or mechanical defects, from logical errors caused by incorrect coding. In the sense used here, the root cause of a memory error is a programming error and is therefore not related to hardware issues.

As cited by Steve McConnell [ ], “A pair of studies performed [in 1973 and 1984] found that,

of total errors reported, roughly 95% are caused by programmers, 2% by systems software (the compiler

and the operating system), 2% by some other software, and 1% by the hardware. Systems software

and development tools are used by many more people today than they were in the 1970s and 1980s,

and so my best guess is that, today, an even higher percentage of errors are the programmers’ fault.”

Application code is prone to containing programming errors.

2.2. Techniques to Protect against Memory Errors

Compilers are now able to analyse the code and detect, alert and even generate the correct code for certain types of memory error. Most of the newer programming languages (Ada, Java, Python) try to hide the complexity of memory management altogether by avoiding direct memory manipulation, which is an effective way to prevent most memory errors, albeit at a high cost in expressiveness or overhead. Unfortunately, not all memory errors can be detected or captured before the code is released.

A decade ago, buffer overflows, especially stack buffer overflows, were the most dangerous threat to computer system security. Over the last few years, several techniques have been developed to mitigate the ability to exploit this kind of programming fault [7,10].

The last line of defence to mitigate those programming faults is formed by a set of

mitigation techniques that are active while the program is running, namely NX (Non-eXecutable),

SSP (Stack Smashing Protector) and ASLR (Address Space Layout Randomisation), which do not

remove the vulnerabilities per se but at least make them harder to exploit. These three mitigation

techniques are very effective and easy to implement, and so most systems (Windows, Linux, Mac,

Android, etc.) include most of them or slightly modified versions thereof. Most of the research

effort focusing on the NX, SSP and ASLR techniques was carried out during the early 2000s.

Once the techniques were consolidated and proved to be effective, they were gradually introduced

into products.


2.3. Attackers Evolve and Innovate

Following the classic measure/counter-measure sequence, a few years after the introduction of each protection technique, new methods to bypass or reduce their effectiveness were introduced. The SSP can be bypassed using brute force or by overwriting non-shielded data [ – ], the ASLR can be bypassed using brute force attacks [ – ] and the NX, which effectively blocks the execution of injected code, can be bypassed using ROP (return-oriented programming) [ – ]. In spite of these bypass methods, the protection techniques are still effective, and in some cases they are the only barrier against attacks until software is upgraded to remove a specific vulnerability.

2.4. Description of Memory Error Solutions

In response to those attacks, new and improved techniques have been proposed. The renew stack-smashing protector (RenewSSP) [ ] is a new proposal which improves the stack-smashing protector (SSP) [ ]. There is also a version named SSPMD [ ], which is RenewSSP adapted to Android. The address-space layout randomisation next generation (ASLR-NG) [ ] is the successor of the address-space layout randomisation (ASLR) [ ]. Recently, Rick Edgecombe presented an improvement for the Non-eXecutable [ ] technique by adding the concept of “execution-only” [ ]. Those techniques do not remove the underlying error which leads to the vulnerability, but they do prevent or hinder exploitation of the fault. The key idea behind the four techniques (SSP, RenewSSP, ASLR and ASLR-NG) is to introduce a secret that must be known by the attacker in order to bypass it, while the NX technique restricts the execution capabilities of processes.

2.5. Address Space Layout Randomisation (ASLR)

The ASLR is an abstract idea which has multiple implementations [ – ]. PaX published the first design and implementation of ASLR [ ] in July 2001. The PaX project implementation is the most complete and advanced, also providing kernel stack randomisation from October 2002 onward. It also continues to provide the most entropy for each randomised layout compared to other implementations.

Two years after ASLR was invented and published as part of the PaX project, a popular security patch for Linux, OpenBSD became the first mainstream operating system to support partial ASLR (and to activate it by default) [ ]. OpenBSD completed its ASLR support after Linux, in 2008, when it added support for PIE binaries [31].

Microsoft Windows Vista (released January 2007) was the first version of the Windows operating system to support ASLR [ ]. All subsequent versions of Windows have also supported ASLR [ ]. There is a wide range of implementations with different levels of entropy, depending on the version and the security configuration: the Enhanced Mitigation Experience Toolkit (EMET), High Entropy ASLR or ForceASLR. For the purpose of this paper, we are only interested in the relative positions where each section of the program is loaded.

Apple first introduced the randomisation of some library offsets in Mac OS X v10.5 (released October 2007) [ ]. However, because this initial implementation was limited to only certain system libraries, it was unable to protect against many attacks that a full ASLR implementation is designed to defeat. In Mac OS X Lion 10.7, Apple expanded its ASLR implementation to also cover application code. Apple stated that “address space layout randomisation (ASLR) has been improved for all applications. It is now available for 32-bit apps (as is heap memory protection), making 64-bit and 32-bit applications more resistant to attack.” As of OS X Mountain Lion 10.8, the kernel, as well as kexts and zones, are randomly relocated during the system boot. As in the case of Windows, all applications see a given library at the same address. In 2019, ASLR-NG, a new ASLR design, was proposed [ ], which greatly increases the absolute entropy and removes correlation attacks, such as the offset2lib attack [15].


2.6. Stack Smashing Protector (SSP)

The other main protection technique present in all modern operating systems is the stack smashing protector (SSP). The first proposal was presented in 1998 [ ] and has improved over the years. The SSP technique [ ] is a compiler extension which adds a guard (the canary) between the protected region of the stack and the local buffers. Later, the RAF SSP technique [ ] was introduced, which greatly increases (by several orders of magnitude) the difficulty of an attack when the three protection techniques (SSP+ASLR+NX) are employed, as is the case in most systems. Regarding the problems that changing the canary of a running process may cause, the authors identified that the error confinement that each forked thread represents is also a de facto stack confinement, which allows the value of the reference canary to be changed with no impact on the correct operation of the process. Recently, in 2019, the technique was applied to Android OS [21].
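The guard placement described above can be sketched in a few lines of C. This is a simplified model and not how a compiler implements SSP: the struct frame layout, the function names unsafe_copy and canary_intact, and the reference value used by a caller are all hypothetical, chosen only to show why a linear overflow from a local buffer has to cross the canary before it reaches the protected data.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical, simplified frame: the local buffer sits at the lowest
 * address, the canary above it, and the protected data above the canary. */
struct frame {
    char buffer[16];
    uint64_t canary;
    uint64_t return_address;
};

/* Deliberately unchecked copy that starts at the buffer and may run past
 * its end. It stands in for an unsafe library call; len is only capped at
 * the size of the whole frame so that the model stays well defined. */
void unsafe_copy(struct frame *f, const void *src, size_t len)
{
    assert(f != NULL && src != NULL && len <= sizeof *f);
    memcpy((unsigned char *)f, src, len);
}

/* Check performed before returning: 1 if the canary still holds the
 * reference value, 0 if something overwrote it. */
int canary_intact(const struct frame *f, uint64_t reference)
{
    assert(f != NULL);
    return f->canary == reference;
}
```

In this model, a copy of at most 16 bytes leaves the canary untouched, while any copy long enough to reach return_address must also have changed the canary, which is what the check detects.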

2.7. Non-eXecutable (NX)

The Non-eXecutable [ ] technique is a protection technique that allows a memory region to be writable but not executable. Recently, using a similar idea, Rick Edgecombe extended this technique by adding the concept of “execution-only” [ ]. The idea is quite similar to the original NX, where a memory page can be either executable or writable but not both at the same time. The “execution-only” approach allows a memory page to be marked as “executable only”, which means that the page cannot be modified or read. This is motivated by current knowledge about attacks that first read the memory content of the process in order to later exploit the target. These attacks [ ] allow ROP attack techniques even in scenarios where attackers have no knowledge about the target binary.

3. Background: x86-64 Architecture Overview

In order to properly understand memory errors and their exploitation, it is essential to choose a specific architecture as a reference because these attacks are architecture-dependent. Although the same ideas and concepts can be applied to other architectures with minimal changes, this section is intended to provide the reader with the information required to understand memory errors on the x86-64 architecture.

This section will outline some key instructions that the CPU uses during software execution.

Afterwards, stack memory and stack frames will be introduced.

3.1. Endianness

Endianness is the order in which bytes are stored in memory. There are two forms: big-endian and little-endian. The x86-64 architecture uses the little-endian form [ ]. Little-endian means that the least-significant byte (LSB) is stored at the lowest memory address and the most-significant byte (MSB) at the highest, so that the LSB is the first byte in memory and the MSB is the last.

In the context of memory error exploitation, knowing the endianness is important, especially in scenarios where attackers have the ability to produce memory overflows at byte level when exploiting vulnerabilities. For example, attacks such as return-to-csu, the byte-for-byte SSP attack or even the blind ROP attack all assume that attackers have the capability to overwrite an arbitrary number of bytes (not words or half-words) in memory. In this scenario, attackers need to know which byte they are overwriting, and this depends on the endianness. It is widely known that on some architectures one of the bytes of the Stack Smashing Protector canary is set to zero, and therefore attackers could also guess the endianness from that observation and quickly identify whether they are overwriting the canary, the saved base pointer, etc.

Figure 2 shows a little-endian example which stores the variable dbeef_var that has been initialised with the value 0xDEADBEEF.

[Figure 2: memory diagram of dbeef_var = 0xDEADBEEF, drawn over the addresses 0x10000000001 to 0x10000000006. The byte placed at each address is not recoverable from the extracted text.]

Figure 2. Little-endian memory layout example after storing 0xDEADBEEF in memory. The variable content starts at lower addresses.
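The byte order can be checked directly from C by copying an integer’s in-memory representation into a byte array. The sketch below is illustrative only: the helper names value_bytes and is_little_endian are hypothetical, and the value is the same 0xDEADBEEF used in the figure.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Copy the in-memory bytes of a 32-bit value into out[0..3], lowest
 * address first. */
void value_bytes(uint32_t value, unsigned char out[4])
{
    assert(out != NULL);
    memcpy(out, &value, sizeof value);
}

/* Return 1 when the machine running this code stores the
 * least-significant byte at the lowest address (little-endian). */
int is_little_endian(void)
{
    unsigned char bytes[4];
    value_bytes(0xDEADBEEFu, bytes);
    return bytes[0] == 0xEF;
}
```

On a little-endian machine such as x86-64, value_bytes(0xDEADBEEF, out) fills out with EF, BE, AD, DE in ascending address order, so an overflow that writes one byte past a buffer reaches the least-significant byte of the next integer first.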

3.2. Strings

Whilst little-endian order applies to how integers and memory addresses are stored, strings are not affected by it. This is due to the way strings are created and handled in the C programming language.

In C and C++, string literals are written as sequences of characters surrounded by double quotes [ ], and adjacent literals are concatenated together during compilation to make a single string. As will be discussed in Section 3.6 Stack Memory, strings (stored as local variables) grow towards higher memory addresses: each character is stored at its own memory address, starting with the first character at the lowest address.

To define a string using C, the programmer creates an array of the char type. This is demonstrated in Listing 1:

Listing 1: Defining and initialising a string in C.

int main() {
    char greeting[] = "Hello World\0";
    return 0;
}

This code creates an array of characters called greeting and places the characters in “Hello World” into each element of the array, such that the first element contains “H” (0x48 in hexadecimal), the second element contains “e” (0x65 in hexadecimal) and so on.

Listing 2 shows in hexadecimal the memory content (string_memory) where greeting[] resides in the process memory. Note that the memory addresses (left-hand side) are in order of lowest to highest, confirming that strings grow upwards in memory.

Listing 2: String greeting[] process memory content growing upwards.

$ xxd -s 0x754 -l 32 c string_memory
00000754: 4865 6c6c 6f20 576f 726c 6400 0000 0000 Hello World.....
00000764: 011b 033b 3800 0000 0600 0000 ccfd ffff ...;8...........
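The difference between string and integer layout can also be seen from C. The following sketch is an illustration under stated assumptions: the helper names are hypothetical, and the array reuses the initialiser from Listing 1. Note that the explicit "\0" in that initialiser is in addition to the terminator that the compiler appends to every string literal, so the array occupies 13 bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Same initialiser as Listing 1. */
static const char greeting[] = "Hello World\0";

/* Bytes occupied by the array: 11 characters, the explicit "\0" and the
 * terminator appended by the compiler. */
size_t greeting_size(void)
{
    return sizeof greeting;
}

/* Byte stored at the lowest address of a string: always its first
 * character, whatever the endianness. */
unsigned char lowest_byte_of_string(const char *s)
{
    assert(s != NULL);
    return (unsigned char)s[0];
}

/* Byte stored at the lowest address of a 32-bit integer: the
 * least-significant byte on a little-endian machine. */
unsigned char lowest_byte_of_u32(uint32_t value)
{
    unsigned char bytes[sizeof value];
    memcpy(bytes, &value, sizeof value);
    return bytes[0];
}
```

For example, lowest_byte_of_string("ABCD") is 'A' (0x41), whereas on x86-64 lowest_byte_of_u32(0x41424344) is 0x44: the string keeps its written order in memory and the integer does not.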

3.3. Processor Registers

Processor registers are rapid-access storage locations embedded within a processor. In x86-64 systems, a processor contains sixteen general purpose registers (GPRs) [ ]. Registers are utilised in assembly languages and can be equated with variables in higher-level programming languages; they are used as operands in an instruction to perform a task, such as storing a value. Table 1 shows the sixteen general purpose registers and their sub-registers.

Table 1. The sixteen x86-64 general purpose registers and their sub-registers.

Size (in bits)
64        32          16          8
RAX       EAX         AX          AH/AL
RBX       EBX         BX          BH/BL
RCX       ECX         CX          CH/CL
RDX       EDX         DX          DH/DL
RDI       EDI         DI          DIL
RSI       ESI         SI          SIL
RBP       EBP         BP          BPL
RSP       ESP         SP          SPL
R8∼R15    R8D∼R15D    R8W∼R15W    R8L∼R15L

Although these registers are general purpose and can be used in many assembly operations, some do have conventional uses [40]. The RAX, the “accumulator”, stores the result of logical or arithmetic operations; the RBX points to data in the Data segment; the RCX is usually a counter for string and loop operations; the RDX is employed for I/O pointers; the RDI is the destination pointer for string operations; the RSI is the source pointer for string operations; the RBP, the stack base pointer, points to the bottom of the stack; and finally, the RSP, the stack pointer, points to the top of the stack.

As Table 1 shows, there are many more register names than the sixteen GPRs. This is because, in x86-64, the GPRs have had their width (used to refer to storage capacity, in this case) extended from 32 bits to 64 bits, but the legacy 32-bit, 16-bit and 8-bit registers are all still accessible.

The R prefix tells the CPU to use the entire 64-bit width of the register; the E prefix tells it to use the last (lowest) 32 bits. Omitting a prefix tells the CPU to use only the last 16 bits, and the H and L suffixes specify to use only the first (higher) or last (lower) 8 bits of the last 16 bits, respectively. Writing to a 32-bit sub-register zeroes the upper 32 bits of the full register, whereas writing to a 16-bit or 8-bit sub-register leaves the remaining bits unchanged. Figure 3 visualises the RAX along with its sub-registers.

[Figure 3: bit layout of RAX. RAX covers bits 63 to 0, EAX bits 31 to 0, AH bits 15 to 8 and AL bits 7 to 0.]

Figure 3. The RAX register and its sub-registers (in bits).
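The relationship between RAX and its sub-registers can be modelled with masks and shifts on a 64-bit integer. This is a plain C model rather than real register access, and the function names are hypothetical. The two write helpers reflect x86-64 behaviour: a write to a 32-bit sub-register clears the upper half of the full register, while a write to a 16-bit sub-register preserves the other bits.

```c
#include <stdint.h>

/* Read views of a 64-bit value, mirroring RAX's sub-registers. */
uint32_t eax_of(uint64_t rax) { return (uint32_t)(rax & 0xFFFFFFFFu); }
uint16_t ax_of(uint64_t rax)  { return (uint16_t)(rax & 0xFFFFu); }
uint8_t  ah_of(uint64_t rax)  { return (uint8_t)((rax >> 8) & 0xFFu); }
uint8_t  al_of(uint64_t rax)  { return (uint8_t)(rax & 0xFFu); }

/* Model of a write to EAX: bits 63..32 of RAX become zero. */
uint64_t write_eax(uint64_t rax, uint32_t value)
{
    (void)rax;                      /* the old contents are discarded */
    return (uint64_t)value;
}

/* Model of a write to AX: bits 63..16 of RAX are preserved. */
uint64_t write_ax(uint64_t rax, uint16_t value)
{
    return (rax & ~(uint64_t)0xFFFF) | value;
}
```

With rax = 0x1122334455667788, the views are EAX = 0x55667788, AX = 0x7788, AH = 0x77 and AL = 0x88.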

The processor needs to know which instruction is next in a program. To do this, it uses the 64-bit wide RIP (instruction pointer) register. This register stores the address of the next instruction to be executed [40]; that is, it points to it.

Once an instruction has been executed, the processor (1) fetches from the RIP the address of the following instruction and jumps to that address, (2) reads (decodes) the value (an instruction opcode) stored at that address and (3) executes it. This is known as the fetch-decode-execute cycle.
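The cycle can be illustrated with a toy interpreter. The three opcodes below are hypothetical and have nothing to do with real x86-64 encodings; the sketch only shows how an instruction pointer drives execution and how a jump simply overwrites it.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical one-byte opcodes for a toy machine. */
enum { OP_HALT = 0, OP_INC = 1, OP_JMP = 2 };

/* Run a toy program and return the final accumulator value. */
uint64_t run(const uint8_t *program, size_t length)
{
    assert(program != NULL);
    size_t rip = 0;          /* index of the next instruction */
    uint64_t acc = 0;

    while (rip < length) {
        uint8_t opcode = program[rip];   /* fetch */
        rip++;                           /* rip now points past the opcode */
        switch (opcode) {                /* decode */
        case OP_INC:
            acc++;                       /* execute: increment accumulator */
            break;
        case OP_JMP:
            assert(rip < length);        /* the operand byte must exist */
            rip = program[rip];          /* execute: operand becomes rip */
            break;
        case OP_HALT:
        default:
            return acc;
        }
    }
    return acc;
}
```

For the program { OP_INC, OP_JMP, 4, OP_INC, OP_INC, OP_HALT }, the jump skips the instruction at index 3 and the function returns 2. Whoever controls the value loaded into the instruction pointer controls what runs next, which is the property that return address overwrites rely on.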


3.4. System V Calling Convention

An application binary interface (ABI) is responsible for specifying how a binary executable should exchange information with some service [ ]. The ABI in *nix systems is called System V. The System V ABI specifies, amongst many other things, how a function is called within software at a low level. The function calling convention under System V is as follows [39]:

• The first argument passed to a function is moved into the RDI register;
• The second argument is moved to the RSI register;
• The third to RDX;
• The fourth to RCX;
• The fifth to R8;
• The sixth to R9;
• The system call number to RAX.

Any arguments beyond the sixth argument are passed on the stack, although this is a rare occurrence. It is important to note that arguments are typically set up in reverse order. That is, for a function that requires three arguments, the third argument is moved to RDX, then the second to RSI and the first to RDI. In the case of system calls, the syscall number is moved to RAX.
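The mapping from argument position to location can be written down as a small lookup. This is a sketch for integer and pointer arguments only, following the register order in the list above; the names ARG_REGISTERS, argument_location and sum8 are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Registers carrying the first six integer or pointer arguments, in
 * order, as listed above. */
static const char *const ARG_REGISTERS[] = {
    "RDI", "RSI", "RDX", "RCX", "R8", "R9"
};

/* Where argument number n (counting from 1) of a function call is
 * passed: a register name for the first six, "stack" afterwards. */
const char *argument_location(unsigned n)
{
    assert(n >= 1);
    size_t count = sizeof ARG_REGISTERS / sizeof ARG_REGISTERS[0];
    return n <= count ? ARG_REGISTERS[n - 1] : "stack";
}

/* A function with eight parameters: under the convention above,
 * a..f travel in RDI, RSI, RDX, RCX, R8 and R9, while g and h are
 * passed on the stack. */
long sum8(long a, long b, long c, long d, long e, long f, long g, long h)
{
    return a + b + c + d + e + f + g + h;
}
```

Compiling a call to sum8 with gcc -S and reading the generated assembly is one way to see the register loads and the two stack-passed arguments.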

3.5. Instruction Set

The x86-64 architecture includes a massive collection of instructions, and discussion of every instruction is outside the scope of this paper. However, there are some instructions that are extensively used when developing exploits. Table 2 shows those instructions, including stack, data transfer and control transfer instructions.

Table 2. x86 stack instructions used in exploiting.

Instruction     Description
push src        Pushes the src register onto the stack and decrements RSP by 8 bytes (towards lower memory addresses)
pop dst         Copies the top stack value into the dst register and increments RSP by 8 bytes (towards higher memory addresses)
mov dst, src    Copies the data stored in the src register to the dst register
jmp dest        Jumps to dest
call func       Pushes the address of the next instruction onto the stack and jumps to func
leave           Carries out the function epilogue (restores RSP from RBP, then pops the saved RBP)
ret             Pops the top of the stack into RIP and jumps to the address in RIP

Stack instructions are those which directly affect the stack. There are only two of these instructions: push and pop, which add data to and remove data from the top of the stack (i.e., the lowest address in use), respectively [42].
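The effect of push and pop on the stack pointer can be modelled with an array and a byte offset. This is a simulation and not real stack access: the struct sim_stack type, its size and the function names are hypothetical. The point it shows is that pushing moves the stack pointer towards lower addresses and popping moves it back up.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define STACK_SLOTS 16   /* hypothetical size of the simulated stack */

/* Simulated stack: rsp is a byte offset into memory, and the stack is
 * empty when rsp equals the size of the memory. */
struct sim_stack {
    uint64_t memory[STACK_SLOTS];
    size_t rsp;          /* byte offset of the top of the stack */
};

void sim_init(struct sim_stack *s)
{
    assert(s != NULL);
    s->rsp = sizeof s->memory;
}

/* push: move rsp 8 bytes towards lower addresses, then store the value. */
void sim_push(struct sim_stack *s, uint64_t value)
{
    assert(s != NULL && s->rsp >= 8);
    s->rsp -= 8;
    s->memory[s->rsp / 8] = value;
}

/* pop: load the value at the top, then move rsp 8 bytes towards higher
 * addresses. */
uint64_t sim_pop(struct sim_stack *s)
{
    assert(s != NULL && s->rsp < sizeof s->memory);
    uint64_t value = s->memory[s->rsp / 8];
    s->rsp += 8;
    return value;
}
```

Starting from an empty 128-byte stack, two pushes leave rsp at offset 112, and the two matching pops return the values in last-in first-out order and restore rsp to 128.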

3.6. Stack Memory

The stack is a contiguous array of memory locations [ ]. It is used by functions to store any local variables they may have, as well as information about how to return control to a calling function (the return address), in a structure called a “stack frame”. Yurichev [ ] describes the stack as a fundamental data structure in computer science because, without stacks, functions would not be able to call each other and, furthermore, recursion would be impossible.

Historically, the stack was also used to store any arguments passed to the function. However, this was changed under the x86-64 architecture; the first six arguments are now passed via registers. Any arguments beyond the sixth are passed on the stack [ ]. Further discussion of the calling procedure is in Section 3.4 System V Calling Convention.

On x86-64 the stack grows downwards towards lower memory addresses [ ], using last-in first-out data organisation [ ]. That is, when a function is called, space is made for it in the stack and its stack frame is placed after (at a lower address than) the calling function’s stack frame; the stack’s overall size increases by the size required for the new frame. When the function has finished executing, its stack frame is removed from the stack; this also means that the stack size shrinks.

As mentioned in the previous section, two GPRs are used by the stack: RBP and RSP. RBP serves as a static point for referencing stack-based information [ ], such as local variables. It also points to the highest address in memory [ ], in the context of the current stack frame. The RSP always points to the top of the current stack frame (the lowest address) and is used as an operand in the calculation to allocate sufficient memory space for any local variables of a function.

Figure 4 provides a simplified view of a process memory layout where three nested functions have been called and three stack frames are stacked in the process stack.

[Figure 4: two diagrams. Process memory space layout, with address labels 0x7FFFFFFFFFF and 0x400000: Text segment, Data segment, BSS segment, Heap, Stack. Stack segment layout: Process Environment, func1() stack frame, func2() stack frame, func3() stack frame, empty memory, with each call adding a frame.]

Figure 4. Process and stack layout after calling 3 nested functions.

A function that is called from another function is referred to as a callee function, for example func2() and func3() in Figure 4. On the other hand, a function that calls another function is referred to as a caller function, for example func1() and func2() in Figure 4. After each call, a new stack frame is created for the callee function and added to the stack at a lower memory address than the previous one; note that the entire stack frame is not added to the stack at once and is, instead, created through register manipulation combined with assembly instructions.

After a call, the CPU needs to know where to continue execution from in the caller function when the callee function has finished executing. To do this, the address of the instruction immediately after the call instruction is pushed onto the stack. This is done by the caller function and is included in the functionality of the call instruction. In the callee function’s stack frame, this will be the return address [7].

After the return address is pushed to the stack, the instruction pointer jumps to the address of the function and then a process known as the function prologue takes place. The function prologue is the process of creating a stack frame to hold callee function information. It is done by the callee function: the code to create the frame is located at the start of the callee function. There are three steps to the function prologue:

1. The current value of RBP is pushed onto the stack. This will allow the calling function’s stack frame to be rebuilt after the callee function finishes;
2. The current value of RSP is moved into RBP;
3. Space is allocated for any local variables. This is done by subtracting their collective size (in hexadecimal form) from RSP.

Similarly, there is a function epilogue, whose purpose is to get rid of the frame, clean up the space it occupied and return control to the calling function. The function epilogue also has three steps and is a simple reverse of the prologue:

1. RBP is moved into RSP;
2. RBP is popped from the stack;
3. The return address is read from the top of the stack (where RSP is now pointing) and the instruction pointer jumps to that address.
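The call, prologue and epilogue steps above can be traced on a simulated machine. The sketch below is a model under stated assumptions, not real stack manipulation: struct machine, its field names, the slot count and the addresses a caller passes in are all hypothetical. It follows the steps literally, including the fact that the final step loads whatever value sits in the return address slot.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define SLOTS 32   /* hypothetical number of 8-byte stack slots */

struct machine {
    uint64_t stack[SLOTS];
    uint64_t rsp;  /* byte offset of the top of the stack */
    uint64_t rbp;  /* byte offset of the current frame base */
    uint64_t rip;  /* address of the next instruction */
};

static void push(struct machine *m, uint64_t v)
{
    assert(m->rsp >= 8);
    m->rsp -= 8;
    m->stack[m->rsp / 8] = v;
}

static uint64_t pop(struct machine *m)
{
    assert(m->rsp < sizeof m->stack);
    uint64_t v = m->stack[m->rsp / 8];
    m->rsp += 8;
    return v;
}

void machine_init(struct machine *m, uint64_t entry)
{
    assert(m != NULL);
    m->rsp = sizeof m->stack;
    m->rbp = sizeof m->stack;
    m->rip = entry;
}

/* call: push the return address, then jump to the callee. */
void do_call(struct machine *m, uint64_t return_address, uint64_t callee)
{
    push(m, return_address);
    m->rip = callee;
}

/* Prologue: push RBP; move RSP into RBP; subtract the locals' size. */
void prologue(struct machine *m, uint64_t locals_size)
{
    assert(locals_size % 8 == 0);
    push(m, m->rbp);
    m->rbp = m->rsp;
    assert(m->rsp >= locals_size);
    m->rsp -= locals_size;
}

/* Epilogue: move RBP into RSP; pop RBP; pop the return address into RIP. */
void epilogue(struct machine *m)
{
    m->rsp = m->rbp;
    m->rbp = pop(m);
    m->rip = pop(m);   /* jumps to whatever value is stored in the slot */
}
```

After do_call, prologue and epilogue, rsp and rbp are back at their starting values and rip holds the pushed return address. If the slot at offset rbp + 8 is overwritten while the frame is live, epilogue loads the overwritten value into rip instead.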

It is important to note that at step 3, the instruction pointer will jump to whatever address is stored in the return address slot, be it the legitimate address to return to or not. In fact, this forms the basis of exploiting buffer overflows, as will be discussed in Section 4 Buffer Overflows. Figure 5 shows the func3 stack frame containing the local variables, the saved RBP and the return address.

[Figure 5 has two panels. Stack space layout: from the beginning of the stack downwards, the func1(), func2() and func3() stack frames followed by empty memory, with the address labels 0x7FFFFFFFFFF and 0x7FFDF55D500. Stack frame layout: func2()'s local variables, then the return address, the saved RBP and func3()'s local variables, with RBP and RSP marking the func3() frame.]

Figure 5. Layout of a stack frame.

4. Unsafe GLIBC Functions

A buffer overflow is the result of stuffing more data into a buffer than it can handle [ ]. Although the software defects that cause buffer overflows can be present in different parts of the software, on Linux the functions that most commonly end up overwriting memory are the memory-handling functions of the GNU C Library, GLIBC. This library is not built into the C language itself; it complements the language and is linked to C programs at build time. The library contains a huge collection of functions that enable a programmer to accomplish tasks that are not included as built-in language functionality, such as managing memory, receiving input, sending output and processing strings. These functions are declared across multiple header files (.h file extension) that are included at the start of a C source code file (.c extension).

Unfortunately, this library contains numerous functions that are considered unsafe [

];

some modern compilers will warn the programmer during compilation that an unsafe function

has been used.

Listing 3 shows an example of the GNU Compiler Collection (GCC) detecting an unsafe function. The get_username() function uses gets(buff) to fill buff from stdin.


Listing 3: GCC warning message: dangerous function detected.

$ gcc test.c -o test
/usr/bin/ld: /tmp/ccDjwSDi.o: in function ‘get_username’:
c.c:(.text+0x24): warning: the ‘gets’ function is dangerous and should not be used.

The call to gets(buff) passes no information about the length of the buffer, so the gets() function cannot check whether it is copying too many bytes, and therefore overflows are possible. The C standard library contains other functions that are considered unsafe and should be avoided [47,49]. Table 3 lists the most important ones to avoid.
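Before turning to the other functions, the gets() case can be made concrete. The sketch below shows one common way to read a line with an explicit bound using the standard fgets() function. The helper name `read_line_bounded` is hypothetical, and silently truncating over-long input is an assumption made for brevity; a real program may prefer to reject it.

```c
#include <limits.h>
#include <stdio.h>
#include <string.h>

/* Reads at most size - 1 characters of one line from `in` into buf and
 * always NUL-terminates it. A trailing newline, if it fit, is removed.
 * Returns 0 on success and -1 on EOF, read error or invalid arguments. */
int read_line_bounded(FILE *in, char *buf, size_t size)
{
    if (in == NULL || buf == NULL || size == 0 || size > INT_MAX)
        return -1;
    if (fgets(buf, (int)size, in) == NULL)
        return -1;
    buf[strcspn(buf, "\n")] = '\0';  /* strip the newline if present */
    return 0;
}
```

Unlike gets(), the caller states the capacity of the buffer, typically with `sizeof buff`, so the library never writes past its end even when the input is longer than expected.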

Table 3. Main unsafe functions included in the C standard library.

Function Signature | Description | Potential Problem
strcpy(char *dest, const char *src) | Copies the string pointed to by src into the buffer pointed to by dest | May overflow dest
strcat(char *dest, const char *src) | Appends the string pointed to by src to the buffer pointed to by dest | May overflow dest
getwd(char *buf) | Returns the absolute path of the current working directory | May overflow buf
gets(char *s) | Reads a line from stdin and stores it into the buffer pointed to by s | May overflow s
fscanf(FILE *stream, const char *format, ...) | Reads from the file stream, formats according to the format argument | May overflow arguments
scanf(const char *format, ...) | Reads from stdin, formats according to the format argument | May overflow arguments
realpath(char *path, char resolved_path[]) | Resolves symbolic links in path and writes the canonical path to resolved_path | May overflow resolved_path
sprintf(char *str, const char *format, ...) | Formats a string according to format and writes the formatted string to str | May overflow str

The first function listed, strcpy, is the most infamous for being the cause of buffer overflows. It copies a string, src, into a buffer, dest. However, there is no check to ensure that the src string is smaller, in length, than the dest buffer; it is up to the programmer to carry out this check and handle any errors appropriately, as is the modus operandi of the C language. The other unsafe functions can be exploited in a similar way in the sense that they also do not include the length as a parameter.

An updated version of strcpy exists, named strncpy, which attempts to address strcpy's weaknesses. The strncpy function has been accepted by the C community and is included in the GLIBC library. This version of the function adds a third parameter, n. It operates in the same way as strcpy, except that it copies at most a specified number of bytes, n, from src to dest. Although the strncpy copy stops earlier if the source string ends before n bytes have been copied, under an attack the input length is controlled by the attacker, so this early stop cannot be relied upon.

Therefore, this does not make the function safe; it passes to the developers the responsibility of choosing a proper value of n. If the developers incorrectly use a value of n larger than the destination buffer, the overflow is still possible. Unfortunately, Miller and de Raadt [ ] found that many programmers failed to grasp the subtleties of the API when trying to use this version and ended up using it incorrectly.
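One of those subtleties is that strncpy() does not write a terminating NUL byte when the source is at least n bytes long. A minimal sketch of a wrapper that derives n from the destination and always terminates the result is shown below. The name `copy_bounded` and its return convention are assumptions made for illustration and are not part of GLIBC.

```c
#include <string.h>

/* Copies src into dest, whose capacity is dest_size bytes, and always
 * NUL-terminates dest.
 * Returns 0 if the whole string fit, 1 if it was truncated and -1 on
 * invalid arguments. */
int copy_bounded(char *dest, size_t dest_size, const char *src)
{
    if (dest == NULL || src == NULL || dest_size == 0)
        return -1;
    /* n comes from the DESTINATION capacity, never from the source. */
    strncpy(dest, src, dest_size - 1);
    dest[dest_size - 1] = '\0';
    return strlen(src) >= dest_size ? 1 : 0;
}
```

A call such as `copy_bounded(dest, sizeof dest, input)` keeps the write inside dest regardless of how long the attacker-controlled input is. The caller still has to decide what a truncated value means for the application.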

A simple but illustrative example of using the unsafe strcpy function is shown in Figure 6. If a src string that is 24 bytes long is copied, using strcpy, into a dest buffer that only has 8 bytes of memory allocated for it, the dest buffer will be overflowed. If the dest buffer is a local variable, it will be stored in the stack frame and the extra bytes from src will overwrite the saved frame pointer and, potentially, the return address of the stack frame. Note that the overflow is still possible when using strncpy, the supposedly safe version of the string copy function.


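[Figure 6 shows the func3() frame layout after executing the following code:

char dest[8];
strcpy(dest, "AAAAAAAAAAAAAAAAAAAA");
strncpy(dest, "AAAAAAAAAAAAAAAAAAAAAAAA", 24);

From top to bottom, the frame holds func2()'s local variables, the return address (overwritten with AAAAAAAA), the saved RBP (overwritten with AAAAAAAA) and dest (filled with AAAAAAAA), with RBP and RSP marking the frame.]

Figure 6. Buffer overflow using strcpy() and strncpy() to overflow a buffer.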

Although Figure 6 illustrates a simple example, real-world applications contain similar bugs even when using safe functions. For example, instead of using a hard-coded value of 24, it is quite common to find code that calculates the length of the source string using strlen and later uses this variable as the third argument (n) of the safe strncpy function. If the source string is longer than the destination buffer, the overflow will take place.
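The strlen pattern is not wrong in itself. The bug is that the measured length is never compared with the capacity of the destination. The sketch below, with the hypothetical name `copy_checked`, still measures the source with strlen but refuses to copy when the string does not fit. Rejecting instead of truncating is a design choice assumed here for illustration.

```c
#include <string.h>

/* Copies src into dest only if it fits completely, including the NUL.
 * Returns 0 on success and -1 if src is too long or an argument is
 * invalid; on failure dest is left untouched. */
int copy_checked(char *dest, size_t dest_size, const char *src)
{
    if (dest == NULL || src == NULL || dest_size == 0)
        return -1;
    size_t len = strlen(src);
    if (len >= dest_size)        /* compare with the destination capacity */
        return -1;
    memcpy(dest, src, len + 1);  /* len + 1 also copies the terminator */
    return 0;
}
```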

5. Attack Approaches

One of the ﬁrst goals for attackers when probing for buffer overﬂow vulnerabilities is gaining

the ability to overwrite the stack frame return address [

]. When it is possible to overwrite the return

address of a stack frame, and an attacker does so, the CPU will jump to whatever address is stored in

the return address when the function attempts to return to its caller [52].

5.1. Denial of Service

There are situations and bugs that do not allow attackers to arbitrarily overwrite the return address in order to redirect the execution flow to a desired location. In those situations, attackers cannot actually execute their desired code and are limited to overwriting the return address with an invalid value.

This overwrite will cause the process to access invalid memory, and the operating system will raise a segmentation fault and halt the process's execution. By repeating this action, attackers can achieve a denial of service. This can be particularly devastating if the process provides a service that has no way of re-spawning.

5.2. Code Injection

On the other hand, if attackers have the capability to control the execution flow, they can try to inject and execute some code. Once the malicious code is executed, it allows an attacker to assume the privilege level of the vulnerable process. If the process is running with root privileges, the attackers can spawn a shell and gain control of the entire system, free to upload or download data as they please, although this is just one of many things that can be done. The injected, malicious code is most commonly referred to as ‘shellcode’. Listing 4 shows an example of shellcode that can be inserted into a vulnerable process to reboot a Linux x86-64 machine [53].


Listing 4: Code to reboot (POWER_OFF) a Linux x86-64 system.

char shellcode_reboot[] =
    "\xBA\xDC\xFE\x21\x43"
    "\xBE\x69\x19\x12\x28"
    "\xBF\xAD\xDE\xE1\xFE"
    "\xB0\xA9"
    "\x0F\x05";

The code in Listing 4 is essentially the machine code that invokes the reboot(2) system call. It loads three integer arguments and the system call number into registers:

1. EDX receives the command, 0x4321FEDC (POWER_OFF), which is the third argument of the system call.

2. ESI receives a magic number, 0x28121969, which is the second argument.

3. EDI receives another magic number, 0xFEE1DEAD, which is the first argument.

4. Finally, AL receives the syscall number 0xA9 for sys_reboot, and the syscall instruction is executed.

Tools like rasm2 can be used to assemble instructions into opcodes, which greatly facilitates the task of converting from assembler to the opcodes that compose the shellcode. Listing 5 shows the rasm2 output when converting from assembler instructions to the opcodes listed in Listing 4.

Listing 5: Reboot shellcode: From assembler to opcodes.

$ rasm2 -a x86 -b 64 'mov edx, 0x4321FEDC'
badcfe2143
$ rasm2 -a x86 -b 64 'mov esi, 0x28121969'
be69191228
$ rasm2 -a x86 -b 64 'mov edi, 0xFEE1DEAD'
bfaddee1fe
$ rasm2 -a x86 -b 64 'mov al, 0xA9'
b0a9
$ rasm2 -a x86 -b 64 'syscall'
0f05
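The constants embedded in Listing 4 can be cross-checked without executing anything. The sketch below stores the bytes as inert data and decodes the 32-bit little-endian immediate that follows each mov opcode. The helper `read_imm32_le` and the hard-coded offsets 1, 6 and 11 are assumptions that follow from the instruction lengths in the listing. Decoding the EDX operand this way yields 0x4321FEDC.

```c
#include <stddef.h>
#include <stdint.h>

/* Bytes of Listing 4, kept as inert data; they are never executed here. */
const unsigned char shellcode_reboot[] = {
    0xBA, 0xDC, 0xFE, 0x21, 0x43,  /* mov edx, imm32 */
    0xBE, 0x69, 0x19, 0x12, 0x28,  /* mov esi, imm32 */
    0xBF, 0xAD, 0xDE, 0xE1, 0xFE,  /* mov edi, imm32 */
    0xB0, 0xA9,                    /* mov al, imm8   */
    0x0F, 0x05                     /* syscall        */
};
const size_t shellcode_reboot_len = sizeof shellcode_reboot;

/* Reads a 32-bit little-endian immediate starting at p. */
uint32_t read_imm32_le(const unsigned char *p)
{
    return (uint32_t)p[0]
         | ((uint32_t)p[1] << 8)
         | ((uint32_t)p[2] << 16)
         | ((uint32_t)p[3] << 24);
}
```

Each mov with a 32-bit immediate takes one opcode byte plus four operand bytes, so the three immediates start at offsets 1, 6 and 11, and the whole sequence is 5 + 5 + 5 + 2 + 2 = 19 bytes long.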

In order for the injected shellcode to be executed, the attacker must redirect execution to where it

ends up in memory. This can be done simply by changing the return address of a vulnerable function

to the location of the shellcode in the stack. In this particular example, the shellcode is 19 bytes long so

the attacker must craft the attack very precisely.

5.3. Return Orientated Programming

There are scenarios where injected code can not be executed because the system is protected

against this. This protection is known as Non-eXecutable bit and is described in Section 6.1. In those

scenarios the attackers need to ﬁnd the shellcode assembler instructions in the already executing code.

To achieve this, a very popular attacking technique known as Return orientated programming (ROP)

is used.

Unlike code injection, ROP requires no code to be injected into the process and, instead, uses parts of code that the process already has access to, “each of which ends in a "return" instruction” [ ]. These small parts of code are referred to as ‘gadgets’ and are rife within programs and libraries. In fact, Buchanan et al. [ ] theorise that “(in the absence of code-flow integrity) any sufficiently large program codebase → arbitrary attacker computation and behaviour, without code injection”; that is to say, access to a large enough collection of code (the standard C library, for example) will result in an attacker being able to build a ROP exploit from the codebase [52].

The attack redirects the execution flow to the application itself or to any of its shared libraries, either to find those instructions or to execute different ones that together act as an equivalent shellcode. That is, the attackers can execute selected assembler instructions (i.e., gadgets) to load the registers with the desired values and finally jump to a memory position containing the syscall assembler instruction.


To make use of gadgets, the attacker must first locate them within the code of the process. Many tools automate this process; one of the most widely known is ROPgadget, which uses the Capstone disassembly engine to search a given binary file for snippets of assembly code that end with a ret instruction.
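The core idea of such a search can be sketched in a few lines of C. This is a deliberately naive illustration and not how ROPgadget is implemented: it only reports the offsets of 0xC3 bytes (the x86-64 near return opcode) in a buffer, without disassembling the instructions that precede them. The function name `find_ret_offsets` is hypothetical.

```c
#include <stddef.h>

#define OPCODE_RET 0xC3  /* x86-64 near return */

/* Records up to max_out offsets of 0xC3 bytes found in code[0..len).
 * Returns the total number of 0xC3 bytes found, which may exceed max_out. */
size_t find_ret_offsets(const unsigned char *code, size_t len,
                        size_t *out, size_t max_out)
{
    size_t found = 0;
    for (size_t i = 0; i < len; i++) {
        if (code[i] == OPCODE_RET) {
            if (out != NULL && found < max_out)
                out[found] = i;
            found++;
        }
    }
    return found;
}
```

For the byte sequence 5F C3 90 5E C3 (pop rdi; ret; nop; pop rsi; ret), the function reports offsets 1 and 4. A real tool then disassembles backwards from each candidate to decide which preceding bytes form a usable gadget.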

6. Memory Protection Techniques

This section examines the developments in memory protection to mitigate buffer overflow errors. We discuss the three most important protection techniques present in all modern operating systems: the Non-eXecutable bit (NX), the Stack Smashing Protector (SSP) and, finally, Address Space Layout Randomisation (ASLR).

6.1. Non-Executable Bit (NX)

Non-eXecutable (NX) is a mitigation technique that seeks to thwart attacks that rely on executing injected code. To do this, memory regions are marked as writable or executable, but not both at the same time. As a result, attackers can inject code but they cannot execute it. This protection technique requires CPU support, and most modern CPUs provide it.

There is a software implementation of NX introduced by PaX [

], which implements NX on

architectures where the Memory Management Unit (MMU) has no direct support. The NX protection

mechanism is a very effective protection that prevents the execution of injected code but does not

prevent injected data being used in return orientated programming attacks.

For example, attackers can exploit a stack buffer overflow by overwriting the return address on the stack, but they can also keep overwriting more data on the stack, where they can place arbitrary data. The overwrite of the return address allows the attackers to redirect the control flow, and the arbitrary data written after it assists ROP attacks. The idea of the attack is to use code that already exists in the application, named “ROP gadgets”, to control the program state in order to execute arbitrary code.

Therefore, attackers just need to know the code being executed and do some offline preprocessing to find the ROP gadgets that are equivalent to the execution of the injected code. Recent attacks such as offset2lib [ ] and return-to-csu [ ] need to bypass the NX protection technique in order to succeed. Later, we describe how to fully bypass the NX protection with a minimal C program, demonstrating that even a simple “hello world” application has enough ROP gadgets.

6.2. Stack Smashing Protector (SSP)

First introduced in 1997 by Immunix Inc., StackGuard is an extension of the GNU Compiler

Collection (GCC) that aims to mitigate the effectiveness of buffer overﬂow attacks with only modest

performance penalties [20].

In the first versions of StackGuard, to accomplish mitigation, a canary value was inserted next to the return address of the current stack frame to prevent an attacker from overwriting the return address undetected. The canary value is checked before the instruction pointer loads the return address of the stack frame. If the canary value has been altered, the check concludes that an attack has been attempted and execution is aborted. Figure 7 shows a stack frame protected with the SSP.


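[Figure 7 shows the func3() stack frame layout with the SSP. From top to bottom: func2()'s local variables, the return address, the saved RBP, the canary and the local variables, with RBP and RSP marking the frame.]

Figure 7. Layout of a stack frame with the SSP technique.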

The canary value is placed on the stack, just below the saved registers by the function

prologue code. StackGuard includes three potential canary value types: random XOR, random

and terminator [56,57]:

Random XOR: Generated by performing a bitwise exclusive OR operation on a randomly generated canary with some or all [ ] of the information on the stack used to return to a calling function, such as the return address.

Terminator: This type of canary takes advantage of the fact that strings end with a terminator value and that most stack buffer overflows involve string operations [ ]. By using a canary that contains terminator values, the exploit will (should) fail, as the attacker cannot write the terminator character sequence for the particular string operation being used to memory and then continue writing [ ]. For example, if 1 byte of the canary is 0x00, then the strcpy and strncpy library functions will stop copying after copying the 0x00 and therefore no memory beyond the canary will be overwritten.

Random: The canary is a randomly generated 64 bit value. This is the canary generation approach followed by the standard GLIBC. It can vary slightly depending on the architecture; for example, in x86_64, 7 bytes are fully random but one is 0x00, a terminator to stop overflows from str*-like library functions [58].

RenewSSP: A modification of the Stack Smashing Protector (SSP) technique that can be applied to forking server applications to prevent brute force attacks [ ]. The technique changes the reference canary every time a new child process is created. In practice, this means that each child process has a different random canary and therefore brute force attacks are no longer possible.
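The canary mechanism can be illustrated with a small simulation. This is a sketch under stated assumptions rather than the code a compiler emits: the frame is modelled as a struct with the buffer placed below the canary, the reference value is passed explicitly, and the zeroed byte is assumed to be the lowest one. All names (`toy_frame`, `make_canary`, `frame_enter`, `frame_check`, `toy_overflow`) are hypothetical.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Simulated frame: the buffer sits below the canary, as in Figure 7. */
struct toy_frame {
    char     buf[8];
    uint64_t canary;
    uint64_t saved_rbp;
    uint64_t ret_addr;
};

/* Random canary with one terminator byte: 7 random bytes plus 0x00.
 * Which byte is zeroed is an assumption of this sketch. */
uint64_t make_canary(uint64_t random_bits)
{
    return random_bits & ~(uint64_t)0xFF;
}

/* Prologue side: store the reference canary in the frame. */
void frame_enter(struct toy_frame *f, uint64_t reference)
{
    f->canary = reference;
}

/* Epilogue side: returns 1 if the canary is intact, 0 if it was altered. */
int frame_check(const struct toy_frame *f, uint64_t reference)
{
    return f->canary == reference;
}

/* Simulates an unchecked copy that starts at buf and may run past it.
 * The write stays inside the struct so the simulation itself is safe. */
void toy_overflow(struct toy_frame *f, const void *data, size_t len)
{
    assert(len <= sizeof *f);
    memcpy((unsigned char *)f, data, len);
}
```

Writing 8 bytes leaves the canary intact, while writing 16 bytes replaces it and makes `frame_check` fail. The check says nothing about data that was overwritten below the canary, which is why the SSP detects a changed canary rather than the overflow itself.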

Unfortunately, the Stack Smashing Protector (SSP) is not perfect and it can be bypassed. The SSP cannot detect overflows as such; what it actually detects is whether the canary value has changed before the function returns. Therefore, if the vulnerability allows an attacker to perform an overflow without overwriting the canary value, the SSP will not detect any overflow and the attack will succeed. Although this scenario is not very common, it is plausible [59].

The second approach to bypass the SSP is to find another vulnerability in the target application that allows attackers to perform info leak [ ] attacks. Using info leaks to obtain the canary value, attackers can create a payload that overflows the buffer while writing the expected canary value. Because the overwritten canary is the same as the original, the SSP will not detect the attack.


The third approach to bypass the SSP is to guess the canary value. Attackers can always try to guess the canary value; if the guessed value does not match, the SSP will send a signal to kill its own process. On remote servers with multiple clients, the SSP will kill the process associated with the attackers' connection. However, if the target application is a forking server where a parent process launches child processes to attend clients, attackers can perform brute force attacks. Those attacks are much more efficient than just guessing [ ]. Brute force attacks are possible because children always inherit the canary value from their parent and therefore the canary value always remains the same. This allows attackers to discard previously guessed values. Note that this is different from the scenario where a server is re-launched, because there the canary value changes every time. Attacks in that scenario are named trial and test attacks [61], and attackers cannot perform brute force attacks.

Brute force attacks against the SSP are still used in modern attacks because the SSP is a barrier widely used to protect applications. For example, the modern return-to-csu [ ] and offset2lib [15] attacks bypass the NX, ASLR and SSP protection techniques, and both perform an SSP byte-for-byte brute force attack.
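The efficiency of the byte-for-byte approach can be seen in a purely local simulation. The sketch below does not attack anything: the "server" is a struct holding a fixed secret, and `toy_child_survives` is a hypothetical stand-in for observing whether a child process survived a partial overwrite. Because every child shares the same canary, each byte is confirmed independently, giving at most 8 × 256 = 2048 attempts for an 8-byte canary instead of searching the whole value space.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define CANARY_LEN 8

/* Stand-in for a forking server: every "child" shares the same canary. */
struct toy_server {
    unsigned char canary[CANARY_LEN];
    unsigned long attempts;  /* number of children "spent" so far */
};

/* Returns 1 if the first len guessed bytes match (the child survives)
 * and 0 otherwise (the child would be killed by the SSP). */
int toy_child_survives(struct toy_server *s,
                       const unsigned char *guess, size_t len)
{
    s->attempts++;
    return len <= CANARY_LEN && memcmp(s->canary, guess, len) == 0;
}

/* Recovers the canary one byte at a time; returns the attempts used. */
unsigned long recover_canary(struct toy_server *s,
                             unsigned char out[CANARY_LEN])
{
    assert(s != NULL && out != NULL);
    for (size_t i = 0; i < CANARY_LEN; i++) {
        for (unsigned v = 0; v < 256; v++) {
            out[i] = (unsigned char)v;
            if (toy_child_survives(s, out, i + 1))
                break;  /* byte i confirmed, move on to the next one */
        }
    }
    return s->attempts;
}
```

The simulation also shows why renewing the canary for each child defeats the attack: if the secret changed between calls to `toy_child_survives`, a confirmed byte would carry no information about the next child.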

6.3. Address Space Layout Randomisation (ASLR)

Address space layout randomisation (ASLR) is a protection technique that attempts to render exploits that depend on predetermined memory addresses useless [56].

It randomizes the memory address layout to prevent attacks that rely on knowing the location of an application's memory map. Similarly to source code analysis tools [ ], ASLR does not increase security by removing vulnerabilities from the system, but it makes existing vulnerabilities more difficult to exploit.

When a process is loaded into memory, it is mapped to an address space in virtual memory. When ASLR is enabled in the operating system, the different parts of the process, such as the shared libraries, the stack or the heap, are placed at random addresses in virtual memory [ ]. By doing so, attacks that rely on knowing the memory locations of the application or libraries will simply fail [63].

The effectiveness of ASLR's mitigation depends on the amount of randomness that can be applied, a.k.a. the entropy [ ]. With more randomness, it becomes less likely that an attacker will be able to guess an address via brute-force attacks. On 32 bit systems, this entropy is worryingly limited [ ] because only 8 bits of an address are randomised, resulting in $2^{8} = 256$ possible variations. However, on 64 bit systems, 28 bits of an address are randomised, resulting in $2^{28} \approx 268.4$ million possible variations. Brute-forcing an ASLR-enabled 64 bit system address is impractical for an attacker [65].

The PaX ASLR implementation has the added benefit of being able to apply different amounts of randomisation [ ] to the different memory areas, as the base addresses used to map each group are not related. PaX ASLR uses 40 bits of a memory address for entropy [ ], increasing the randomness and the time it would take for an attacker to brute-force the address. Compared with the regular 28 bits of entropy under the vanilla Linux ASLR implementation, this adds 12 bits of entropy, which multiplies the number of possible variations by $2^{12} = 4096$.
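The arithmetic behind these figures is simple enough to check in code. The sketch below uses a hypothetical helper, `aslr_positions`, that returns the number of equally likely placements for a given number of random bits; it assumes a uniform distribution, which real implementations only approximate.

```c
#include <assert.h>
#include <stdint.h>

/* Number of equally likely placements for `bits` bits of entropy.
 * Assumes a uniform distribution and bits < 64. */
uint64_t aslr_positions(unsigned bits)
{
    assert(bits < 64);
    return (uint64_t)1 << bits;
}
```

With this helper, 8 bits give 256 placements, 28 bits give 268,435,456, and the ratio between 40 and 28 bits is 4096, that is, 12 additional bits of entropy.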

Listing 6 shows the addresses of the three top memory regions in two executions of Firefox. Because ASLR is disabled in both executions, the addresses of the memory regions are the same across multiple executions and can therefore be determined in advance by attackers. However, when ASLR is on, as shown in Listing 7, the addresses where the three memory regions are loaded are different and therefore attackers will not be able to predict the addresses and their exploits will fail.

Listing 6: ASLR disabled in two executions of Firefox.

ASLR OFF (execution 1) ASLR OFF (execution 2)

--------------------- -----------------------

7ffff7c2a000-7ffff7c4c000 || 7ffff7c2a000-7ffff7c4c000 r--p /usr/lib/x86_64-linux-gnu/libc-2.28.so

7ffff7fcb000-7ffff7fcc000 || 7ffff7fcb000-7ffff7fcc000 r--p /usr/lib/firefox-esr/libmozgtk.so

7ffffffde000-7ffffffff000 || 7ffffffde000-7ffffffff000 rw-p [stack]


Listing 7: ASLR enabled in two executions of Firefox.

ASLR ON (execution 1) ASLR ON (execution 2)

--------------------- -----------------------

7f494b86d000-7f494b88f000 || 7f8e4df95000-7f8e4dfb7000 r--p /usr/lib/x86_64-linux-gnu/libc-2.28.so

7f494bc0e000-7f494bc0f000 || 7f8e4e336000-7f8e4e337000 r--p /usr/lib/firefox-esr/libmozgtk.so

7ffef630d000-7ffef632e000 || 7ffd7e410000-7ffd7e431000 rw-p [stack]

However, the same attacks developed to bypass the NX protection, such as “offset2lib” and “return-to-csu” [ ], can be used to bypass the ASLR. The offset2lib attack takes advantage of the way Linux places ASLR-enabled objects in memory. Its authors were able to predict where those objects would be placed at execution time by discovering that the relative distance between the executable and the libraries was always constant.

By leaking an address belonging to the application, it is possible to find out where the libraries are stored in memory [ ]. This is due to the fact that the offset from the process to the libraries loaded above it (at a higher address in memory) remains constant throughout each process instance. Once the library address is known, a ROP gadget chain can be built from gadgets contained within the library, which will further defeat NX. Recently, another attack named return-to-csu [ ] showed that even with a minimal C program there are enough gadgets to fully bypass the ASLR.
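The role of a constant distance can be illustrated with the addresses already shown in Listing 7. The listing does not include the executable, so the sketch below uses the distance between the two libraries instead, which is an assumption made only to reuse the published numbers. The helper names `mapping_delta` and `base_from_leak` are hypothetical.

```c
#include <stdint.h>

/* Distance between two mappings of the same process. */
uint64_t mapping_delta(uint64_t higher_base, uint64_t lower_base)
{
    return higher_base - lower_base;
}

/* Given one leaked base address and a distance measured offline,
 * derives the base address of the other mapping. */
uint64_t base_from_leak(uint64_t leaked_base, uint64_t delta)
{
    return leaked_base + delta;
}
```

In both executions of Listing 7 the distance between libmozgtk.so and libc-2.28.so is 0x3a1000, even though the absolute addresses differ. A single leaked address therefore removes the uncertainty for every object whose distance to it is fixed.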

6.4. Effectiveness Summary

Besides the direct exploitation of the buffer overflow, we need to consider the presence of different attack vectors. Table 4 shows a summary of the main protection techniques and whether a particular protection technique can prevent the attack, where:

High: the technique provides good protection;

Med: the technique provides some protection but in some scenarios fails;

Low: the technique provides no practical protection;

-: the technique does not apply or provide protection.

Table 4. Main protection techniques effectiveness.

Protection Technique | Brute Force | Ret2-* | ROP
Non-eXecutable (NX) | - | Low | Low
Stack Smashing Protector (SSP) [XOR/Terminator/Random] | Med | - | -
Address Space Layout Randomization (ASLR) | Med | Low | High
Address Space Layout Randomization Next-Generation (ASLR-NG) | High | Low | High
Renew Stack Smashing Protector (RenewSSP) | High | - | -
Stack Smashing Protector for Mobile Devices (SSPFA) | High | - | -

As Table 4 shows, there is no protection mechanism that can provide effective protection against all forms of attacks. The NX [ ] is considered completely defeated by ret2-* [ ] and ROP attacks [ ]. NX is considered a complementary protection, but it provides no protection on its own. The SSP is an effective protection against stack buffer overflows, but it is not a mechanism to protect against memory errors in general [ ]. For example, it can deter full brute force attacks but not byte-for-byte attacks. However, the RenewSSP is able to fully prevent brute force attacks against the stack smashing protector [ ]. The SSPFA, a RenewSSP modification for the Android operating system, prevents all kinds of brute force attacks against the SSP of individual Android applications [21].

On the other hand, ASLR is a more generic technique that can help to mitigate wider forms of attack [ ]. ROP attacks need to know the location of the code they reuse, and ASLR randomizes the memory layout; therefore, having both techniques (NX and ASLR) provides a security level higher than the sum of the protection levels provided individually. ASLR-NG provides stronger security since it provides more absolute and relative entropy, which defeats not only classical attacks but also attacks that exploit the correlation between virtual memory objects to de-randomize libraries [15].


7. Conclusions

In this paper, we have highlighted the importance of memory error vulnerabilities and more

speciﬁcally stack buffer overﬂows. We have identiﬁed the root causes that make those attacks possible

on modern x86-64 architecture.

We have analyzed how unsafe library functions are prone to buffer overﬂows, revealing that

although there are secure versions of those functions, they are not actually preventing buffer overﬂows

from happening. In fact, if they are incorrectly used, attackers can exploit them in a similar way as

when unsafe functions are employed.

Therefore, using secure functions does not result in software free from vulnerabilities, and it requires developers to be security-aware. Furthermore, our analysis of the three main security protection techniques present in all modern operating systems, namely the non-eXecutable bit (NX), the Stack Smashing Protector (SSP) and Address Space Layout Randomization (ASLR), concluded that although they provide a strong level of protection against classical exploitation techniques, recent advanced attacks have demonstrated effective approaches to bypass them.

For the future, novel protection techniques will need to effectively protect against the exploitation of memory errors. The techniques should focus on attack approaches, and they must be effective against attack vectors while also remaining fully backward compatible with old, current and future applications. Since many protection techniques could coexist, they should introduce very little overhead in order to be accepted by the security community.

Author Contributions:

Writing—original draft, C.P., H.M.-G. and C.B. All authors have read and agreed to the

published version of the manuscript.

Funding: This research received no external funding.

Conﬂicts of Interest: Authors declare no conﬂict of interest.

References

1. Younan, Y. 25 Years of Vulnerabilities: 1988–2012; Sourcefire, 2013. Available online: https://maxedv.com/wp-content/uploads/2011/12/Sourcefire-25-Years-of-Vulnerabilities-Research-Report.pdf (accessed on 12 February 2019).

2. Meer, H. Memory Corruption Attacks: The (almost) Complete History. 2010. Available online: https://media.blackhat.com/bh-us-10/whitepapers/Meer/BlackHat-USA-2010-Meer-History-of-Memory-Corruption-Attacks-wp.pdf (accessed on 12 February 2019).

3. Anderson, P.J. Computer Security Technology Planning Study; Deputy for Command and Management Systems HQ Electronic Systems Division (AFSC) Technical Report; 1972; Volume 2. Available online: http://seclab.cs.ucdavis.edu/projects/history/papers/ande72.pdf (accessed on 12 February 2019).

4. Fowler, S. CVE-2019-3822 curl: NTLMv2 type-3 Header Stack Buffer Overflow. 2019. Available online: https://bugzilla.redhat.com/show_bug.cgi?id=CVE-2019-3822 (accessed on 5 March 2019).

5. curl. 2019. Available online: https://curl.haxx.se/libcurl/features.html (accessed on 5 March 2019).

6. Stenberg, D. Everything Curl; GitBook: Lyon, France, 2015.

7. Aleph One. Smashing the Stack for Fun and Profit. Phrack 1996, 7.

8. Cowan, C.; Pu, C. Death, Taxes and Imperfect Software: Surviving the Inevitable. In Proceedings of the 1998 Workshop on New Security Paradigms, Charlottesville, VA, USA, 22–25 September 1998; pp. 54–70, doi:10.1145/310889.310915.

9. McConnell, S. Code Complete, 2nd ed.; Microsoft Press: Redmond, WA, USA, 2004.

10. Younan, Y.; Pozza, D.; Piessens, F.; Joosen, W. Extended protection against stack smashing attacks without performance loss. In Proceedings of the ACSAC, Shanghai, China, 6–8 September 2006.

11. Bulba; Kil3r. Bypassing StackGuard and StackShield. Phrack 2002, 10, 56.

12. Richarte, G. Four different tricks to bypass StackShield and StackGuard protection. World Wide Web. 2002. Available online: https://www.cs.purdue.edu/homes/xyzhang/spring07/Papers/defeat-stackguard.pdf (accessed on 12 February 2019).


13.

Marco-Gisbert, H.; Ripoll-Ripoll, I. Return-to-csu: A New Method to Bypass 64-bit Linux ASLR; Black Hat. 2018

Available online: https://i.blackhat.com/brieﬁngs/asia/2018/asia- 18-Marco-return-to-csu-a-new-method-

to-bypass-the-64-bit-Linux-ASLR-wp.pdf (accessed on 12 February 2019).

14. Shacham, H.; Page, M.; Pfaff, B.; Goh, E.J.; Modadugu, N.; Boneh, D. On the effectiveness of address-space

randomization. In Proceedings of the 11th ACM Conference on Computer and Communications Security,

Washington, DC, USA, 25–29 October 2004; pp. 298–307, doi:10.1145/1030083.1030124.

15.

Marco-Gisbert, H.; Ripoll-Ripoll, I. On the Effectiveness of Full-ASLR on 64-bit Linux. In Proceedings of the

In-Depth Security Conference (DeepSec), Vienna, Austria, 18–21 November 2014.

16.

Tran, M.; Etheridge, M.; Bletsch, T.; Jiang, X.; Freeh, V.; Ning, P. On the expressiveness of return-into-libc

attacks. In Proceedings of the 14th International Conference on Recent Advances in Intrusion Detection,

Menlo Park, CA, USA, 20–21 September 2011; pp. 121–141, doi:10.1007/978-3-642-23644-0_7.

17.

Wojtczuk, R. The advanced return-into-lib(c) exploits: PaX case study. Phrack

2001

,58. Available online:

http://phrack.org/issues/58/4.html (accessed on 12 February 2019).

18.

Roemer, R.; Buchanan, E.; Shacham, H.; Savage, S. Return-Oriented Programming: Systems, Languages,

and Applications. ACM Trans. Inf. Syst. Secur. 2012,15, 2:1–2:34, doi:10.1145/2133375.2133377.

19.

Marco-Gisbert, H.; Ripoll, I. Preventing Brute Force Attacks Against Stack Canary Protection on Networking

Servers. In Proceedings of the 12th International Symposium on Network Computing and Applications,

Cambridge, MA, USA, 22–24 August 2013; pp. 243–250. doi:10.1109/NCA.2013.12.

20.

Cowan, C.; Pu, C.; Maier, D.; Walpole, J.; Bakke, P.; Beattie, S.; Grier, A.; Wagle, P.; Zhang, Q. StackGuard:

Automatic Adaptive Detection and Prevention of Buffer-Overﬂow Attacks. In Proceedings of the 7th

USENIX Security Symposium, San Antonio, TX, USA, 26–29 January 1998.

21.

Marco-Gisbert, H.; Ripoll-Ripoll, I. SSPFA: Effective stack smashing protection for Android OS. Int. J. Inf.

Secur. 2019,18, 519–532, doi:10.1007/s10207-018-00425-8.

22.

Marco-Gisbert, H.; Ripoll-Ripoll, I. Address Space Layout Randomization Next Generation. Appl. Sci.

2019

9, 2928.

23.

Pax Team. PaX Address Space Layout Randomization (ASLR). 2003. Available online: http://pax.grsecurity.

net/docs/aslr.txt (accessed on 17 July 2019).

24. Paulson, L.D. New Chips Stop Buffer Overﬂow Attacks. Computer 2004,37, 28–30.

25. Edgecombe, R. Touch But Don't Look: Running the Kernel in Execute Only Memory. In Proceedings of the Linux Plumbers Conference, Lisbon, Portugal, 9–11 September 2019. Available online: https://linuxplumbersconf.org/event/4/contributions/283/attachments/357/588/Touch_but_dont_look__Running_the_kernel_in_execute_only_memory-presented.pdf (accessed on 17 July 2019).

26. Xu, J.; Kalbarczyk, Z.; Iyer, R. Transparent runtime randomization for security. In Proceedings of the 22nd International Symposium on Reliable Distributed Systems, Florence, Italy, 6–8 October 2003; pp. 260–269, doi:10.1109/RELDIS.2003.1238076.

27. Zhan, X.; Zheng, T.; Gao, S. Defending ROP Attacks Using Basic Block Level Randomization. In Proceedings of the 2014 IEEE Eighth International Conference on Software Security and Reliability-Companion (SERE-C), San Francisco, CA, USA, 30 June–2 July 2014; pp. 107–112, doi:10.1109/SERE-C.2014.28.

28. Kil, C.; Jun, J.; Bookholt, C.; Xu, J.; Ning, P. Address space layout permutation (ASLP): Towards fine-grained randomization of commodity software. In Proceedings of the Computer Security Applications Conference, Miami Beach, FL, USA, 11–15 December 2006; pp. 339–348.

29. Iyer, V.; Kanitkar, A.; Dasgupta, P.; Srinivasan, R. Preventing Overflow Attacks by Memory Randomization. In Proceedings of the 2010 IEEE 21st International Symposium on Software Reliability Engineering (ISSRE), San Jose, CA, USA, 1–4 November 2010; pp. 339–347, doi:10.1109/ISSRE.2010.22.

30. Raadt, T.D. Exploit Mitigation Techniques (Updated to Include Random Malloc and MMAP). In Proceedings of the OpenCON 2005, Venice, Italy, 5–6 November 2005.

31. Miller, K. OpenBSD's Position Independent Executable (PIE) Implementation. In Proceedings of the NYCBSDCon, New York, NY, USA, 11–12 October 2008.

32. Russinovich, M. Inside the Windows Vista Kernel: Part 3; Microsoft, 2007. Available online: https://docs.microsoft.com/en-us/previous-versions/technet-magazine/cc162458(v=msdn.10) (accessed on 17 July 2019).

33. Whitehouse, O. An Analysis of Address Space Layout Randomization on Windows Vista; Technical Report; Symantec Advanced Threat Research, Black Hat, 2007. Available online: https://www.blackhat.com/presentations/bh-dc-07/Whitehouse/Paper/bh-dc-07-Whitehouse-WP.pdf (accessed on 12 February 2019).

34. Ruoho, C. ASLR: Leopard Versus Vista; Laconic Security: Broomfield, CO, USA, 2008.

35. 'xorl'. Linux GLibC Stack Canary Values. 2010.

36. Bittau, A.; Belay, A.; Mashtizadeh, A.; Mazières, D.; Boneh, D. Hacking Blind. In Proceedings of the 35th IEEE Symposium on Security and Privacy, Berkeley, CA, USA, 18–21 May 2014. Available online: http://www.ieee-security.org/TC/SP2014/papers/HackingBlind.pdf (accessed on 17 July 2019).

37. AMD64 Architecture Programmer's Manual, Volume 1: Application Programming; AMD, 2017. Available online: https://www.amd.com/system/files/TechDocs/24592.pdf (accessed on 17 July 2019).

38. ISO/IEC. Working Draft, Standard for Programming Language C++ [Online, C++ International Standard, N4800]. Available online: https://www.iso.org/standard/74528.html (accessed on 18 February 2019).

39. Matz, M.; Hubička, J.; Jaeger, A.; Mitchell, M. System V Application Binary Interface. 2014. Available online: https://uclibc.org/docs/psABI-x86_64.pdf (accessed on 17 July 2019).

40. Intel. Intel 64 and IA-32 Architecture Software Developer's Manual. 2016. Available online: https://www.intel.com/content/dam/www/public/us/en/documents/manuals/64-ia-32-architectures-software-developer-instruction-set-reference-manual-325383.pdf (accessed on 17 July 2019).

41. Kerrisk, M. The Linux Programming Interface: A Linux and UNIX System Programming Handbook; No Starch Press: San Francisco, CA, USA, 2010.

42. Anley, C.; Heasman, J.; Lindner, F.; Richarte, G. The Shellcoder's Handbook: Discovering and Exploiting Security Holes, 2nd ed.; John Wiley & Sons, Inc.: New York, NY, USA, 2007.

43. Yurichev, D. Reverse Engineering for Beginners (Understanding Assembly Language). 2018. Available online: https://yurichev.com/writings/RE4B-EN.pdf (accessed on 17 July 2019).

44. Eilam, E. Reversing: Secrets of Reverse Engineering; Wiley Publishing, Inc.: Hoboken, NJ, USA, 2005.

45. Foster, J.C.; Osipov, V.; Bhalla, N.; Heinen, N. Buffer Overflow Attacks: Detect, Exploit, Prevent; Syngress Publishing Inc.: Rockland, MA, USA, 2005.

46. Weidman, G. Penetration Testing: A Hands-On Introduction to Hacking; William Pollock: San Francisco, CA, USA, 2014.

47. Baratloo, A.; Singh, N.; Tsai, T. Transparent Run-time Defense Against Stack Smashing Attacks. In Proceedings of the 2000 USENIX Annual Technical Conference, San Diego, CA, USA, 18–23 June 2000.

48. Wheeler, D.A. Secure Programming HOWTO. 2015. Available online: https://www.tldp.org/HOWTO/pdf/Secure-Programs-HOWTO.pdf (accessed on 17 July 2019).

49. Gustedt, J. Modern C; Manning Publications: Shelter Island, NY, USA, 2016.

50. Miller, T.C.; de Raadt, T. strlcpy and strlcat - Consistent, Safe, String Copy and Concatenation. In Proceedings of the FREENIX Track: 1999 USENIX Annual Technical Conference, Monterey, CA, USA, 6–11 June 1999.

51. Kc, G.S.; Keromytis, A.D. e-NeXSh: Achieving an Effectively Non-Executable Stack and Heap via System-Call Policing. In Proceedings of the 21st Annual Computer Security Applications Conference, Tucson, AZ, USA, 5–9 December 2005.

52. Sayeed, S.; Marco-Gisbert, H.; Ripoll-Ripoll, I.; Birch, M. Control-Flow Integrity: Attacks and Protections. Appl. Sci. 2019, 9, 4229.

53. "zbt". Linux/x86-64 - reboot(POWER_OFF) - 19 Bytes. Available online: http://shell-storm.org/shellcode/files/shellcode-602.php (accessed on 16 February 2019).

54. Buchanan, E.; Roemer, R.; Savage, S.; Shacham, H. Return-Oriented Programming: Exploitation without Code Injection. Black Hat, 2008. Available online: https://www.blackhat.com/presentations/bh-usa-08/Shacham/BH_US_08_Shacham_Return_Oriented_Programming.pdf (accessed on 12 February 2019).

55. PaX. NOEXEC. 2003. Available online: https://pax.grsecurity.net/docs/noexec.txt (accessed on 12 February 2019).

56. Silberman, P.; Johnson, R. A Comparison of Buffer Overflow Prevention Implementations and Weaknesses. Black Hat, 2004. Available online: https://www.blackhat.com/presentations/bh-usa-04/bh-us-04-silberman/bh-us-04-silberman-paper.pdf (accessed on 12 February 2019).

57. Wagle, P.; Cowan, C. StackGuard: Simple Stack Smash Protection for GCC. In Proceedings of the GCC Developers Summit, Ottawa, ON, Canada, 25–27 May 2003.

58. Cowan, C.; Beattie, S.; Pu, C.; Wagle, P.; Walthinsen, E. Protecting Systems from Stack Smashing Attacks with StackGuard; Institute of Science & Technology: Hillsboro, OR, USA, 1999.

59. Marco, H. Root Shell on Sniffit. Available online: http://hmarco.org/bugs/CVE-2014-5439-sniffit_0.3.7-stack-buffer-overflow.html (accessed on 25 May 2020).

60. Huawei Technologies Co., Ltd. Information Leak Vulnerability in Some Huawei Products. Available online: https://www.huawei.com/en/psirt/security-advisories/huawei-sa-20191030-01-phone-en (accessed on 25 May 2020).

61. Marco-Gisbert, H.; Ripoll-Ripoll, I. On the Effectiveness of NX, SSP, RenewSSP and ASLR against Stack Buffer Overflows; IEEE: New York, NY, USA, 2014.

62. Jelinek, J. Object Size Checking to Prevent (Some) Buffer Overflows (GCC FORTIFY). 2004. Available online: http://gcc.gnu.org/ml/gcc-patches/2004-09/msg02055.html (accessed on 17 July 2019).

63. Kc, G.S.; Keromytis, A.D.; Prevelakis, V. Countering Code-Injection Attacks With Instruction-Set Randomization. In Proceedings of the 10th ACM Conference on Computer and Communications Security, Washington, DC, USA, 27–31 October 2003.

64. Marco-Gisbert, H. Cyber-Security Protection Techniques to Mitigate Memory Errors Exploitation. Ph.D. Thesis, Universitat Politècnica de València, València, Spain, 2015.

65. Gras, B.; Razavi, K.; Bosman, E.; Bos, H.; Giuffrida, C. ASLR on the Line: Practical Cache Attacks on the MMU; Network and Distributed System Security Symposium (NDSS): San Diego, CA, USA, 2017.

66. Göktas, E.; Athanasopoulos, E.; Bos, H.; Portokalidis, G. Out of Control: Overcoming Control-Flow Integrity. In Proceedings of the 2014 IEEE Symposium on Security and Privacy, San Jose, CA, USA, 18–21 May 2014.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

