Termination of Ethereum’s Smart Contracts

∗

Thomas Genet, Thomas Jensen and Justine Sauvage

Univ. Rennes, Inria, CNRS, IRISA, France

Keywords:

Formal Methods for Security, Ethereum, Smart Contracts, Security in Distributed Systems.

Abstract:

Ethereum is a decentralized blockchain technology equipped with so-called Smart Contracts. A contract is a

program whose code is public, which can be triggered by any user, and whose actual execution is performed by

miners participating in Ethereum. Miners execute the contract on the Ethereum Virtual Machine (EVM) and

apply its effect by adding new blocks to the blockchain. A contract that takes too much time to be processed

by the miners of the network may result into delays or a denial of service in the Ethereum system. To prevent

this scenario, termination of Ethereum’s Smart Contracts is ensured using a gas mechanism. Roughly, the

EVM consumes gas to process each instruction of a contract and the gas provided to run a contract is limited.

This technique could make termination of contracts easy to prove but the way the ofﬁcial deﬁnition of the

EVM speciﬁes gas usage makes the proof of this property non-trivial. EVM implementations and formal

analysis techniques of EVM’s Smart Contracts use termination of contracts as an assumption, so having a

formal proof of termination of contracts is crucial. This paper presents a mechanized, formal, and general

proof of termination of Smart Contracts based on a measure of EVM call stacks.

1 INTRODUCTION

A blockchain is a decentralized ledger, shared over a

network, on which all users agree. Users can submit

new elements to be added to this ledger. To add new

elements in the ledger, one needs to add a new block

(containing the new elements) to the blockchain. A

block will be added to the blockchain if most of the

participants agree on it. In Bitcoin, to add a new

block to the chain, one has to solve a cryptographic

puzzle on this new block in a limited amount of time

(around 10 minutes in Bitcoin). This is called mining

a block. Since the puzzle is computationally difﬁcult

it requires that most users participate in its resolution.

Users contributing to the resolution are called miners.

The fact that most miners try to solve the same puzzle

entails that they all agree on the block itself and on

all the added elements.

Bitcoin is equipped with a programming lan-

guage, called Script (script, 2014), that is used to de-

ﬁne programs reading inputs in the blockchain and

proposing outputs (new elements) to be added to

the blockchain. It is the role of the miners to exe-

cute the Script programs and to build the new blocks

containing the outputs of those programs. If one

∗

This work was partially supported by Laboratoire

d’excellence CominLabs.

Script program is non-terminating, this prevents min-

ers from building new blocks and adding them to

the blockchain within the 10 minutes time limit. If

many Script programs are non terminating, this could

cause a denial of service in the Bitcoin system. This

is the reason why the Script language is not Turing-

complete, in particular it has no loops.

Ethereum extends Bitcoin’s blockchain with a

Turing-complete programming language and the abil-

ity to store those programs (called contracts) in the

blockchain itself. Contracts are programmed into

dedicated high-level languages like Solidity (solid-

ity, 2014) or Vyper (vyper, 2017) and compiled to a

bytecode format executed by the so-called Ethereum

Virtual Machine (EVM). Since the programming lan-

guage is Turing-complete, Ethereum needs to prevent

looping contracts. In addition, Ethereum also targets

to accelerate the pace of block additions w.r.t. Bit-

coin. Thus, a terminating contract that takes too long

to complete is another source of denial of service for

Ethereum. Ethereum protects its system from non ter-

minating programs and too complex programs with

a single mechanism: the gas (Buterin, 2013). Intu-

itively, the EVM consumes gas to process each in-

struction of a contract and the gas provided to run a

contract is limited.

Though this mechanism looks simple and robust,

the protection it offers against denial of service is

Genet, T., Jensen, T. and Sauvage, J.

Termination of Ethereum’s Smart Contracts.

DOI: 10.5220/0009564100390051

In Proceedings of the 17th International Joint Conference on e-Business and Telecommunications (ICETE 2020) - SECRYPT, pages 39-51

ISBN: 978-989-758-446-6

fragile. For instance, in 2016, badly chosen gas val-

ues for some EVM instructions resulted into several

denial of service of Ethereum. This had to be ﬁxed

by two consecutive hard forks of the system (Hudson,

2016a; Hudson, 2016b). Independently of choosing

for the best gas cost for each instruction, a general

question to ask is whether the gas mechanism is suf-

ﬁcient to prove termination of any contract? Surpris-

ingly, proving formally that this is true is not trivial

because of the complexity of the EVM semantics (see

Section 5).

The goal of this paper is twofold: to prove that no

program can execute indeﬁnitely without consuming

gas in the EVM execution model, and to prove it in

a way that can be used in a mechanized proof. More

precisely, we present two termination proofs on two

slightly different EVM semantics. The ﬁrst model is

the formal semantics of the (foundational) Ethereum

Yellow Paper (Gavin, 2014), the Isabelle/HOL EVM

semantics (Hirai, 2017; Amani et al., 2018) and the

small-step formal semantics of (Grishchenko et al.,

2018b). The second model is the semantics of the

reference implementations of EVM such as (pevm,

2017; gevm, 2014). Noteworthy, the implementations

and the Yellow Paper disagree on the gas consumption

when calling a contract from another contract. In the

Yellow Paper, when a contract c

calls another con-

tract c

with, say, g units of gas, the gas associated

to c

is not charged immediately. In implementations,

this gas is immediately consumed. This little differ-

ence in the semantics makes a big difference when

we are interested in proving the termination of con-

tracts. Indeed, with the Yellow Paper semantics, a

contract c

calling itself can loop without consuming

gas, until it exhausts the call stack. This paper pro-

vides a termination proof of contracts for the two se-

mantics. Proving termination of contracts when gas is

charged immediately is natural and will be brieﬂy dis-

cussed in Section 7. Proving termination of the con-

tracts for the Yellow Paper semantics is more difﬁcult

and requires a complex termination measure on call

stacks. Though the Yellow Paper semantics differs

from the reference implementations, having a termi-

nation proof for this semantics is important. First, this

termination proof contributes evidence that the Yel-

low Paper semantic model is indeed coherent. Sec-

ond, this semantics serves as a base for formal veriﬁ-

cation tools, such as (Grishchenko et al., 2018b; Gr-

ishchenko et al., 2018a), or for formal semantics such

as (Hirai, 2017; Amani et al., 2018). In those tools

and semantics, the termination of contracts is used as

an assumption. In particular, in the Isabelle/HOL for-

malization of (Hirai, 2017; Amani et al., 2018) the

termination of the contract evaluation is proven using

an internal step counter, which is not related to the

gas, and simpliﬁes the proofs.

Our proof comple-

ment their work by showing how the gas itself ensures

termination of contracts, and thus assuming termina-

tion of contracts in the Yellow Paper semantics was

indeed correct.

Contributions. This paper gives the ﬁrst formal and

mechanized proof of termination of EVM contracts,

written in EVM bytecode. The central part is a mea-

sure that can be used for the proof of termination in a

proof assistant (in our case Isabelle/HOL). We prove

termination for:

• the two variants of the semantics of the contract

call described above;

• a formal model where contracts can add and run

arbitrary new contracts;

• a formal model that safely over-approximates the

EVM semantics with minimal assumptions. In

particular, for non-zero cost byte code operations

(i.e. all operations except STOP, RETURN, RE-

VERT), we only require that they have any strictly

positive cost. Similarly, we only require the call

stack size is upper-bounded by any natural num-

ber greater than 0.

Note that having minimal assumptions on the con-

crete gas costs for each operation is valuable because

the gas cost has already changed several times dur-

ing the EVM’s lifetime

and is likely to evolve again

since gas pricing of operations is still not fully satis-

factory (Yang et al., 2019).

2 RELATED WORK

The Ethereum system has been formalized in the so-

called Yellow Paper (Gavin, 2014) which has been

updated recently (Gavin, 2019). This update does not

impact gas consumption but provides some new in-

structions which are taken into account in our formal

proof. A nice complementary reading is the White Pa-

per (Buterin, 2013) which provides useful intuitions

about the system. There are several available refer-

ence implementations of EVM such as (pevm, 2017;

gevm, 2014).

In the comments of the lem/evm.lem speciﬁcation ﬁle,

it becomes evident that the termination proof uses an artiﬁ-

cial step counter and not the gas mechanism. This choice

was made to simplify the proof as stated line 1859 of

lem/evm.lem (FEL, 2018).

There was a cost increase for 8 EVM instructions on

2016/10/18 (Hudson, 2016a) and a cost increase for one

EVM instruction on 2016/11/18 (Hudson, 2016b)

SECRYPT 2020 - 17th International Conference on Security and Cryptography

Grishchenko et al. have proposed

EtherTrust (ethertrust, 2017) a veriﬁcation framework

for the static analysis of contracts code (Grishchenko

et al., 2018a). The static analysis tools focus on

proving some security properties on contracts, such

as single-entrancy (Grishchenko et al., 2018b).

EtherTrust comes with a complete small-step seman-

tics for EVM (Grishchenko et al., 2018b) that uses

the Yellow Paper semantics for the contract call.

There are several attempts to deﬁne a mechanized

and formal semantics of EVM. The ﬁrst one was de-

ﬁned in Lem by Yoichi Hirai (Hirai, 2017). This se-

mantics was deﬁned to prove safety and security prop-

erties on speciﬁc contracts. It is partially executable

and can be used to export Isabelle/HOL theories. The

objective here was to compile EVM bytecode to Is-

abelle/HOL theories so that properties on those spe-

ciﬁc contracts can be proved in Isabelle/HOL. This

semantics is very precise w.r.t. speciﬁcation of low

level operations of EVM but it does not precisely fol-

low the gas consumption during calls (see Section 6.2

of (Hirai, 2017)). Thus, this mechanized semantics

is not usable, as is, for the proof we want to carry

out. Another mechanized semantics is the one by Ev-

erett Hildenbrandt et al. (Hildenbrandt et al., 2018)

in the K framework. This semantics is fully exe-

cutable and passes ofﬁcial test suite of EVM (ETS,

2015). This semantics consumes gas at the call

point (see rule <k> callWithCode in https://github.

com/kframework/evm-semantics/blob/master/evm.md). In

Section 7, we will discuss termination of contracts in

this speciﬁc setting.

A contract running out of gas stops without com-

pleting its task and becomes useless. Thus estimat-

ing gas consumption of contracts is an active research

subject. For instance, (Grech et al., 2018) proposes a

static analysis of contract’s code to detect resumable

loops, loops bounded by inputs, etc. that can lead to

an execution running out of gas. Our objective here

is different since we aim at proving that whatever the

contract code, it cannot loop forever while not spend-

ing gas.

3 ETHEREUM

The blockchain of Ethereum describes the global state

of the system, noted σ. In Ethereum a global state

σ contains accounts. An account is a structure com-

posed of 4 elements: a nonce, a balance (an amount

of money in the virtual currency called Ether), a data

storage and a code. In Ethereum, there exists two

types of accounts: external accounts with an empty

code and contracts with a non-empty code.

Calling a Contract. External accounts are used to

store information and Ether. Like in Bitcoin, it is

possible to transfer Ether from an account to another

through a transaction. When a transaction is sent

to an account having a code, i.e. a contract, a part

of the money is used to pay for the execution of the

code

. This is called calling a contract. When call-

ing a contract, the sent money is not collected by the

contract itself but by the miner who accepts to exe-

cute contract’s code and to add the updated accounts

and blocks to the blockchain. In other words, from

a given global state σ, the miners produce the new

global state σ

resulting of the transactions (and con-

tracts) application on σ. Since adding blocks to the

blockchain costs computation power, the miner needs

a way to estimate if the reward (money sent to the

contract) is competitive with its own computational

effort. In Ethereum, this estimation is made possible

through the gas mechanism. Every basic instruction

of contract’s code has a ﬁxed cost in gas and every

contract claims an (estimated) maximal cost in gas to

run its code. Besides, when an account calls a con-

tract it also ﬁxes a gas price in Ether. This is used to

motivate miners to execute one particular transaction

by increasing the gas price and thus their reward.

Ethereum

Account a

Balance 130

Storage

Code i++

Balance

Storage

Code

Account a

Transaction T

Account a

Balance 130

Storage

Code i++

Balance

Storage

Code

Account a

[i : 5][i : 4]

20-g+g

Account m ( miner of T)

Balance += g-g'

State σ State σ

(1)

(2)

(3)

Figure 1: Account a

calls contract a

and miner m process

the transaction.

Example 1. On the left-hand side of Figure 1, in the

state σ, there are two accounts a

and a

with a re-

spective balance 130 and 20. Account a

is a contract

and a

is an external account. Account a

has a stor-

age called i whose value is 4. The code of a

is simply

i++, i.e., it increments i. Assume that the estimated

maximal cost of contract a

is g. Assume that account

builds a transaction T towards a

, where a

calls

with g gas. To simplify the presentation, we do not

consider gas price and assume that one gas costs one

Ether. Assume that a miner m processes the transac-

In addition, it is possible to transfer money to a con-

tract, but this part is not important for our termination proof

and will not be modeled here.

Termination of Ethereum’s Smart Contracts

tion T and then adds the new blocks encoding the new

values of accounts a

and a

in the new blockchain

global state σ

. In σ

, in the account a

, i is now 5 (1).

Note that balance of a

has not evolved. Balance of a

has been decreased of g gas unit and increased by g

which is a (possible) gas refund (2). Indeed, contract

claims to need g gas units to run its code but less

gas may actually be needed. Here we assume that

there were g

gas left after the execution of a

. This

gas is refunded to a

. Finally, the miner m who adds

the blocks in σ

is rewarded by g −g

gas (3). An-

other possibility would have been that execution of a

needs more than g gas. In this case, the execution of

runs out of gas, an exception is thrown, the value

of i in a

does not change, the g gas are lost by a

and the miner m wins g gas. Precise estimation of

gas consumption for contracts is, in itself, a research

subject (Grech et al., 2018).

Creating a Contract. Any contract c

can create a

(new) contract c

with any arbitrary code, provided

that c

is given enough gas to store all the instruc-

tions of the bytecode of c

in the new global state σ

If contract creation succeeds, this makes contract c

publicly available in σ

4 ETHEREUM VIRTUAL

MACHINE: EVM

Contract code is run on the Ethereum Virtual Machine

(EVM). Contracts are written in high-level languages

such as Solidity (solidity, 2014) or Vyper (vyper,

2017) and compiled to a bytecode format speciﬁc to

EVM. A bytecode program is a list of instructions

and during the execution a program counter (pc) gives

the index of the next instruction to execute. EVM is

a stack machine and the effect of arithmetic instruc-

tions, test instructions, storage instructions is to read

and/or modify this stack, called the execution stack.

There are more than 60 different instructions in

EVM. We can split them in 5 families:

• Execution Stack Operations. This family en-

compasses all arithmetic, logic and test instruc-

tions like ADD, SUB, AND, OR, EQ, LT, etc.

This family also contains instructions that push,

pop, swap or duplicate the elements on the execu-

tion stack.

• Memory Access. This family contains instruc-

tions whose effect is to transfer data between the

execution stack and either the temporary local

memory (MLOAD, MSTORE) or into the perma-

nent memory (SLOAD, SSTORE). The temporary

local memory is a memory where a contract can

read and write during its execution and which is

erased after contract’s completion. The perma-

nent memory is in accounts’ storage (thus in the

blockchain) and will survive after contract’s com-

pletion, like variable i in contract a

of Example 1.

• Environment Operations. These are the opera-

tions that gather information on the current trans-

action and contract (who called this contract, how

many gas unit are left, etc.).

• Control Flow Operations. Those operations

modify the control ﬂow inside the same contract:

JUMP, JUMPI (conditional jump), JUMPDEST

(marks a jump destination), ...

• System Operations. This family gathers all the

operations that permit to create and destroy a con-

tract (CREATE, SUICIDE in (Gavin, 2014), or

SELFDESTRUCT in (Gavin, 2019)) and the call

and exit operations on contracts (CALL, CALL-

CODE, DELEGATECALL, RETURN) and ad-

ditional (REVERT, CALLSTATIC) in (Gavin,

2019).

The differences between the four types of call

(CALL, CALLCODE, DELEGATECALL, CALL-

STATIC) are subtle. The differences essentially lies in

the way the global state is affected by calling the con-

tract and not about the way gas is consumed. The con-

tract called by CALL changes the state of the callee,

like in Example 1. The contract called by CALL-

CODE changes the state of the caller, like when call-

ing a library code. In Example 1, assume that state

of account a

has a ﬁeld i, then a CALLCODE on

, would have incremented the value of this ﬁeld in

the state of account a

. The DELEGATECALL acts

as a CALLCODE except that the identity of the con-

tract caller is different. In a contract c

, if contract

is called with DELEGATECALL, the call to con-

tract c

happens like with a CALLCODE except that

identity of the caller is not c

but the identity of the

caller of c

. See (Grishchenko et al., 2018a) for de-

tails. Finally, CALLSTATIC is similar to CALL ex-

cept that no modiﬁcation of the state is permitted. It

can be considered as a “pure” function call without

side-effects. Since there is no difference between the

4 call instructions w.r.t. to gas consumption, we will

abstract them in the same way in Section 6.2.

As explained above, to implement the gas mecha-

nism, EVM’s designers have chosen to associate each

operation with a cost in gas. All operations, ex-

cept zero-cost operations (STOP, REVERT and RE-

TURN), have a cost strictly greater than zero. Some

instructions, like SELFDESTRUCT or SSTORE may

result into a gas refund. SELFDESTRUCT destroys

SECRYPT 2020 - 17th International Conference on Security and Cryptography

the current executed contract and the Ether which

may be present in the account is refunded. SSTORE

writes information in the permanent storage of the

account and, thus, in the blockchain. Refund with

SSTORE happens when it replaces a non-zero value

by a zero. This kind of erasure permits to save space

in the blockchain and is, thus, rewarded. Refunds ob-

tained using SELFDESTRUCT or SSTORE are ac-

cumulated during the execution in a separate counter

and given back after the completion of the whole con-

tract. As a result, during the contract execution, the

available gas is not increased by those speciﬁc re-

funds.

Now, to give some intuition about EVM’s be-

havior, we describe more precisely the semantics of

some particular instructions. We present all those in-

structions through their EtherTrust (ethertrust, 2017)

small-step semantic rules. Some examples of Yellow

Paper semantics and their EtherTrust counterpart can

be found here (Genet et al., 2020). The interest of

EtherTrust rules w.r.t to the Yellow Paper is that they

describe in the same place the effect of the instruction

on the state of the system and the gas consumption.

4.1 The ADD Instruction

The rules for the ADD instruction are given Figure 2-

1. In these rules, µ is the local state of the stack ma-

chine where µ.s denotes the execution stack, µ.pc the

program counter, µ.gas the available gas. The other

record ι represents the parameters of the transaction

where ι.code denotes the program under execution.

Thus ι.code[µ.pc] is the current instruction to execute.

Below the line of the semantic rules, (µ, ι,σ,η) :: S is

the current call stack. An element of the call stack

is called a frame, e.g., (µ,ι,σ, η) is the top frame of

the current call stack. The ﬁeld η is a transaction ef-

fect where the only information that could be relevant

for us w.r.t. gas consumption would be the refund

counter. However, as explained in Section 4, this re-

fund counter is separate from the gas available for op-

eration execution. Finally, σ is the current state of the

global state. Since, there are no side effects, every up-

date on this global state is propagated by the semantic

rules. In the ﬁrst rule, for ADD, there is enough gas to

execute ADD and an execution stack with at least two

elements. Thus, the call stack becomes (µ

,ι, σ,η) :: S

where µ

is µ with an updated execution stack, an in-

creased program counter µ.pc, and a µ.gas decreased

of 3 gas units. The second rule deﬁnes the execution

of ADD when there are not enough elements on the

execution stack or not enough gas to execute ADD.

This results into stacking an exception frame (EXC)

on top of the call stack.

4.2 The CALL Instruction

The rule for the CALL (Figure 2-2) deﬁnes the CALL

execution when everything is OK: the execution stack

contains enough arguments to perform the call (µ.s

has at least 7 elements), there is enough gas to per-

form the call µ.gas ≥ c, and there is room in the call

stack to add a new frame (|A|+ 1 < 1024). The cost c

is the sum of the costs for calling the CALL instruc-

tion itself (700 gas units) plus a variable cost depend-

ing on the size of the input and output of the con-

tract: this gas is paid when reading contract param-

eters and outputting its future result. On the lower

part of this rule, the call stack (µ,ι, σ,η) :: S becomes

(µ

,ι

,σ

,η

) :: (µ,ι,σ, η) :: S where (µ

,ι

,σ

,η

) is a

new frame stack which has been added on top of the

call stack, where µ

is a new record, where µ

.gas =

call

is the gas transferred to the new frame stack by

the old one and µ

.pc is set to 0. The code to exe-

cute in this new frame is ι

.code = σ(to).code where

σ(to) is the account receiving the call. Note that, like

it was stated in the above sections, the new call stack

is (µ

,ι

,σ

,η

) :: (µ, ι,σ, η) :: S where the gas sent to

the new frame (µ

.gas) has not been subtracted from

the frame (µ,ι,σ, η) (µ is the same, thus so is µ.gas).

The gas is retracted when the contracts returns. Note

also that this is compatible with the Yellow Paper se-

mantics where, to update the gas w.r.t. the execution

of the CALL, one has to know how much gas g

will

be refunded after the execution of the called contract,

see (Genet et al., 2020).

4.3 The RETURN Instruction

Contract returning is performed by two rules (Fig-

ure 2-3). The ﬁrst rule processes the RETURN

operation, where the current instruction to execute

ι.code[µ.pc] is abbreviated by ω

µ,ι

. The effect of this

rule is to replace the frame by an HALT frame with

the information that should be provided to the caller,

i.e., the possible updates on the global state σ, the re-

maining gas g, a result d and transaction effects η.

Finally, the HALT frame is popped by the second rule

of (Figure 2-3). We only present the rule for the stan-

dard case, i.e., in the frame below the HALT frame,

the current instruction is a CALL and the execution

stack contains all the information that were necessary

to perform the call. Then, we retract the gas units

necessary to perform the call (noted c) and refund gas

units of gas coming from the HALT frame. The global

store σ

coming from the HALT frame replaces σ in

the current frame. For the semantics of the CREATE

instruction, see (Genet et al., 2020).

Termination of Ethereum’s Smart Contracts

A Semantic Framework for the Security Analysis 251

– gas ∈ N

256

is the current amount of gas still available for execution;

– pc ∈ N

256

is the current program counter;

– m ∈ B

256

→ B

is a mapping from 256-bit words to bytes that represents the

local memory;

– i ∈ N

256

is the current number of active words in memory;

– s ∈ L(B

256

)isthelocal256-bitwordstackofthestackmachine.

The execution of each internal transaction starts in a fresh machine state, with

an empty stack, memory initialized to all zeros, and program counter and active

words in memory set to zero. Only the gas is instantiated with the gas value

available for the execution.

3.4 Small-Step Rules

In the following, we will present a selection of interesting small-step rules in

order to illustrate the most important features of the semantics.

For demonstrating the overall design of the semantics, we start with the

example of the arithmetic expression ADD performing addition of two values on

the machine stack. Note that as the word size of the stack machine is 256, all

arithmetic operations are performed modulo 2

256

ι.code [µ.pc]=ADD

µ.s = a :: b :: sµ.gas ≥ 3 µ

′

= µ[s → (a + b)::s][pc += 1][gas −= 3]

Γ ! (µ, ι, σ, η)::S → (µ

′

, ι, σ, η)::S

ι.code [µ.pc]=ADD (|µ.s| < 2 ∨ µ.gas < 3)

Γ ! (µ, ι, σ, η)::S → EXC :: S

We use a dot notation, in order to access components of the diﬀerent state

parameters. We name the components with the variable names introduced for

these components in the last section written in sans-serif-style. In addition, we

use the usual notation for updating components: t[c → v]denotesthatthe

component c of tuple t is updated with value v.Forexpressingincremental

updates in a simpler way, we additionally use the notation t[c += v] to denote

that the (numerical) component of c is incremented by v and similarly t[c −= v]

for decrementing a component c of t.

The execution of the arithmetic instruction ADD only p erforms local changes

in the machine state aﬀecting the local stack, the program counter, and the

gas budget. For deciding upon the correct instruction to execute, the currently

executed code (that is part of the execution environment) is accessed at the

position of the current program counter. The cost of an ADD instruction is

constantly three units of gas that get subtracted from the gas budget in the

machine state. As every other instruction, ADD can fail due to lacking gas or due

to underﬂows on the machine stack. In this case, the exception state is entered

and the execution of the current internal transaction is terminated. For better

readability, we use here the slightly sloppy ∨ notation for combining the two

error cases in one inference rule.