WHAT COST US CLOUD COMPUTING?
A Case Study on How to Decide for or Against IaaS based Virtual Labs
Nane Kratzke
L
¨
ubeck University of Applied Sciences
M
¨
onkhofer Weg 239, 23562 L
¨
ubeck, Germany
Keywords:
Cloud, Computing, Cost, Estimation, IaaS, Decision, Making, Model, Average, Peak, Load, Ratio, at p,
Weinman, Case Study.
Abstract:
Coud computing is characterized by ex ante cost intransparency making it difficult from a decision point
of view – to decide for or against a cloud based approach before a system enters its operational phase. This
contribution develops a four step decision making model and describe its application by a performed use
case analysis of the higher education domain which might be interesting for colleges, universities or other
IT training facilities planning to implement cloud based training facilities. The developed four step decision
making model of general IaaS
applicability can be used to decide whether a IaaS cloud based system approach
is more cost efficient than a dedicated approach.
1 INTRODUCTION
Cloud computing is one of the latest developments
within the business information systems domain and
describes a new delivery model for IT services based
on the Internet, and it typically involves the provi-
sion of dynamically scalable and often virtualized re-
sources.
A performed a literature review showed that most
of the overall cost efficiency is deduced by capac-
ity efficiency which is intensively proclaimed as a
key benefit by cloud service providers (see (Kratzke,
2011a) and (Kratzke, 2011b)). The simple fact that
only the used capacity of a cloud-based service has
to be paid inveigles to postulate the overall cost ef-
fectiveness of cloud-based approaches. Almost every
analyzed publication repeats this more or less unre-
flected even Talukader et al. (Talukader et al., 2010).
This paper does not denial this postulation in general
but advocates a more critical view like (Weinmann,
2011) or (Mazhelis et al., 2011) do.
According to (Truong and Dustdar, 2010) or
(Kratzke, 2012) cloud computing is also characterized
by an ex ante cost intransparency. This very important
weakness (from an IT management and decision mak-
ing point of view) is even little reflected in literature
so far. To answer the question whether a cloud-based
This contribution follows the IaaS, PaaS and SaaS def-
inition of NIST (Mell and Grance, 2011).
approach is more cost efficient than a dedicated ap-
proach it has to be answered the question what costs
will be generated per month before a cloud based ap-
proach enters operation (Walker et al., 2010). This
is very difficult to answer ex ante because it is influ-
enced by a bunch of interdependent parameters. But
for profound decision making exactly this question
has to be answered before a system is established in a
dedicated or cloud based manner.
Research Methodology. Especially the work of
(Weinmann, 2011) is very interesting from this deci-
sion making point of view because it shows how to de-
cide for or against a cloud based approach very prag-
matically. This contribution is about using the work of
Weinman to build up a simple decision making model
for or against cloud based approaches. This decision
making model has been applied in a presented case
study covering the higher education domain.
We want to find out whether it is more economi-
cal for practical courses covering web technologies to
provide classical dedicated educational labs or to use
IaaS in order to provide our students virtual labs for
their practical courses.
The analysed case study was a college lecture
for computer science students covering primarily web
technologies which was held at the L
¨
ubeck Univer-
sity of Applied Sciences in 2011. During the practical
courses of this lecture students formed groups of 5 or
6 persons in order to build up a website for a scientific
451
Kratzke N..
WHAT COST US CLOUD COMPUTING? - A Case Study on How to Decide for or Against IaaS based Virtual Labs.
DOI: 10.5220/0003880804510456
In Proceedings of the 2nd International Conference on Cloud Computing and Services Science (CLOSER-2012), pages 451-456
ISBN: 978-989-8565-05-1
Copyright
c
2012 SCITEPRESS (Science and Technology Publications, Lda.)
conference on robotic sailing
1
(project 1) or establish
a google map based automatic sailbot tracking ser-
vice (project 2) for the same conference. All groups
were assigned cloud service provider accounts from
Amazon Web Services. The groups were asked to
use these accounts in order to fulfill their projects in
a complete cloud based manner. Both projects were
developed according to the timetable shown in figure
1.
24x7
Training Project P M
CW 13
CW 14
CW 15
CW 16
CW 17
CW 18
CW 19
CW 20
CW 21
CW 22
CW 23
CW 24
CW 25
P = Presentation M = Migration
Figure 1: Project phases.
Outline. This contribution has the following outline.
The decision making model is described in section 2.
The case study is analysed in section 3. This contri-
bution sums up some findings of general as well as
educational interest and ends with a summary, con-
clusion and outlook in section 4.
2 DECISION MAKING MODEL
(Weinmann, 2011) is stressing the following interest-
ing fact which is a crucial input for pragmatic decision
making for or against cloud based system implemen-
tations especially on a IaaS level of cloud computing:
A pay-per-use solution obviously makes sense
if the unit cost of cloud services is lower than
dedicated, owned capacity. [...]
[...] a pure cloud solution also makes sense
even if its unit cost is higher, as long as
the peak-to-average ratio of the demand curve
is higher than the cost differential between
on-demand and dedicated capacity. In other
words, even if cloud services cost, say, twice
as much, a pure cloud solution makes sense
for those demand curves where the peak-to-
average ratio is two-to-one or higher. (Wein-
mann, 2011)
According to Weinman the peak-to-average ra-
tio is the essential indicator whether a cloud based
approach is economical reasonable or not. So it is
not necessary to estimate costs per month of a cloud
based solution exactly. It is sufficient to proof that
cloud based costs are smaller then a dedicated sys-
tem implementation. And this can be figured out by
1
World Robotic Sailing Conference 2011, see http://
www.wrsc2011.org
analysing the peak usage as well as the average usage
of a system.
Let us operationalize this by defining a usage char-
acteristic. A usage characteristic is a time sorted tu-
ple. Each element of the tuple state how many server
instances are operated within a specified atomic time-
frame t. This atomic timeframe is typically set by a
cloud service provider. It is the smallest possible us-
age reporting granularity of a cloud service provider.
For example Amazon Web Services has an atomic
timeframe of t = 1h.
uc := (i
t
1
,i
t
2
,...,i
t
n
)
t
1
< t
2
... t
n1
< t
n
(1)
Let us now define the peak usage peak as the max-
imum number of parallel operated server instances for
a given analytical timeframe [T
Start
,T
End
]. T must be
an even multiple of t.
peak(T
Start
,T
End
,uc) := max(i
t
k
,...,i
t
l
)
t
1
t
k
t
l
t
n
T
Start
t
k
t
l
T
End
(2)
In the same way we can define the average usage
avg for a given analytical timeframe [T
Start
,T
End
]:
avg(T
Start
,T
End
,uc) :=
l
z=k
i
t
z
T
End
T
Start
t
k
T
Start
i
l
T
End
(3)
So we can define the peak-to-avg ratio pta of a
given usage characteristic uc (see equation 1) within
a given analytical timeframe [T
Start
,T
End
] in the fol-
lowing way:
pta(T
Start
,T
End
,uc) :=
peak(T
Start
,T
End
,uc)
avg(T
Start
,T
End
,uc)
(4)
In the following we will also use the average to
peak ratio at p which is defined as the inverse of the
pta:
at p(T
Start
,T
End
,uc) :=
1
pta(T
Start
,T
End
,uc)
(5)
According to (Weinmann, 2011) we have to com-
pare the costs of a classical dedicated approach with
the costs of a cloud based approach. On the IaaS level
it is common to be billed per service usage with a
granularity of the atomic timeframe t level. Which
would be in case of Amazon Web Service that you
are billed for a server instance per complete (or par-
tial) hour usage. Let us name our dedicated costs per
atomic timeframe d and our cloud costs per atomic
timeframe c. Cloud costs c can be easily figured out
because they are provided as pricing by their cloud
CLOSER2012-2ndInternationalConferenceonCloudComputingandServicesScience
452
service providers
2
. Dedicated costs per atomic time-
frame d are a little more complex to calculate. Never-
theless for estimations we can assume, that they can
be defined via their amortizations. If a dedicated in-
stance can be procured for p value units
3
their dedi-
cated costs per atomic timeframe can be calculated as
follows
4
:
d
AT F
(p) =
p
AT F
d
3year
(p) =
p
3 · 365 · 24h
d
5year
(p) =
p
5 · 365 · 24h
(6)
Typical amortization timeframes are a 3 year or a 5
year hardware regeneration interval (see equation 6).
So a 500 $ server would have the following dedicated
costs per atomic timeframe of 1h over a amortization
interval of 3 years.
d
3year
(500$) =
500$
3 · 365 · 24h
0.019
$
h
(7)
According to (Weinmann, 2011) the peak-to-
average ratio pta should be greater than the relation
between the variable costs per atomic timeframe c and
the dedicated costs per atomic timeframe d
AT F
which
can be expressed in the following form:
pta(T
Start
,T
End
,uc) >
c
d
AT F
(p)
pta(T
Start
,T
End
,uc)d
AT F
(p) > c
c <
d
AT F
(p)
at p(T
Start
,T
End
,uc)
(8)
In other words equation 4 provides a clear deci-
sion criteria to decide for or against a cloud based ap-
proach. By knowing your average to peak ratio at p,
your hardware procurement costs per instance p as
well as your hardware amortization timeframes AT F
(which is typically 3 or 5 years) it is possible to calcu-
late a maximum of cloud costs per atomic timeframe
c
MAX
until a cloud based approach is economical (see
equation 9). Whenever a cloud service provider can
2
E.g. Amazon Web Services publishes these cloud costs
per atomic timeframe here: http://aws.amazon.com/de/
ec2/#pricing
3
E.g. US Dollars $ or Euro e
4
Be aware this assumption do not account typical ad-
ditional operational efforts like administration, cooling or
electricity. Nevertheless we do not want to calculate exact
costs we only want to know whether a cloud based approach
is more economical than a dedicated one. In this case it is
OK to give the dedicated side an advantage by not account-
ing aspects like administration, cooling, electricity, etc. al-
though these costs are included in the variable costs on the
cloud service provider side.
realize instance pricings below c
MAX
we decide for a
cloud based approach
5
in all other cases we should
realize the system in a dedicated approach
6
.
c
MAX
:=
d
AT F
(p)
at p(T
Start
,T
End
,uc)
(9)
The at p function generates values between
]0.0...1.0[. Equation 9 shows us that low at p values
that means high peak to average relations (peaky
usage scenarios) – will result into very high c
MAX
val-
ues. On the other side: High at p values – that means
very low peak to average ratios (constant usage sce-
narios) result into decreasing c
MAX
values. In the
absolutely worst case of at p = 1.0 (absolutely con-
stant usage) the dedicated costs become the maximum
costs which means that the cloud service provider has
to provide resources cheaper than the dedicated ones
(which is very very unlikely).
3 ANALYSED CASE STUDY
Table 1 shows all costs per group within the analysed
timeframe. In total the L
¨
ubeck University of Applied
Sciences had to spend 847.01$ in providing a (virtu-
ally) unlimited amount of server instances to 49 stu-
dents organized in 9 groups within a timeframe of 13
calendar weeks. This sounds impressive but says in
fact nothing about how cost efficient this performed
cloud based approach was. Could we had reached the
same results with a classical dedicated approach?
Table 1: Group overview.
Group Size Project Costs in $
WRSC 1 5 WRSC Website 88.39$
WRSC 2 6 WRSC Website 265.37$
WRSC 3 4 WRSC Website 88.14$
WRSC 4 6 WRSC Website 162.88$
GM 1 6 Sailbot Tracking 41.17$
GM 2 6 Sailbot Tracking 57.58$
GM 3 6 Sailbot Tracking 57.46$
GM 4 5 Sailbot Tracking 37.42$
GM 5 5 Sailbot Tracking 48.58$
3.1 Usage Analysis
As we have found out in our analyzed use case – main
cost driver was server/box usage. Thats why a de-
tailed box usage analysis has been performed and is
shown in figure 2.
Figure 2(A) shows the maximum (according to
equation 3) and average box usage (according to
5
Only from an economical point of view.
6
Also only from an economical point of view.
WHATCOSTUSCLOUDCOMPUTING?-ACaseStudyonHowtoDecidefororAgainstIaaSbasedVirtualLabs
453
13 14 15 16 17 18 19 20 21 22 23 24 25
Average Box Usage
Maximum Box Usage in an hour
(A)
Maximum and Average Box Usage
Calendar Week
Used Server Boxes
0 10 20 30 40 50
13 14 15 16 17 18 19 20 21 22 23 24 25
(B)
Accumulated Processing Hours per Week
Calendar Week
Processing Hours
0 500 1000 1500 2000
14 16 18 20 22 24
0.0 0.2 0.4 0.6 0.8 1.0
(C)
Average Box to Maximum Box Ratio
according to Weinman
Calendar Week
Avg to Max Box Usage Ratio
Figure 2: Analysed Box Usage.
equation 2) per calendar week measured within the
analysed timeframe (calendar week 13 - 25). Figure
2(B) shows the total sum of all processing hours gen-
erated by all used server boxes/instances per calendar
week (this is the usage characteristic shown in equa-
tion 1 aggregated to calendar weeks). Figure 2(C)
shows the average to peak load ratio (according to
equation 5) per calendar week.
Within the initial training phase (calendar week
13 - 21) the usage characteristic shows an extremely
high maximum box usage but an astonishing low av-
erage usage. This characterizes an extreme peak load
situation and results in extreme low at p ratios (see
equation 5 and figure 2(C)). According to equation
8 or the definition of c
MAX
(equation 9) this shows
a very ideal cloud computing (peaky) situation. So
training phases seems to be very economical inter-
esting cloud computing use cases.
Within the project phase (calendar week 13 - 15)
the usage characteristic shows dramatically reduced
maximum as well as average box usages. Neverthe-
less the at p ratios (see equation 5 and figure 2(C))
stay in a very comfortable situation for cloud com-
puting approaches. So we still have a peaky usage
situation but on a dramatical lower level. So also de-
velopment phases seems to be very economical in-
teresting cloud computing use cases.
The 24x7 phase (calendar week 21 - 24) shows
raised maximum as well as average box usages (see
figure 2(A). Also the at p ratios are raising within this
phase – nevertheless the peaky usage remains but less
distinctive. The 24x7 phase can be clearly seen in the
accumulated processing hours (see figure 2(B)) which
shows a clear peak in calendar weeks 22 - 24. We
have stated already in section 3 that 24x7 seems to be
expensive and we can harden this conclusion by our
usage analysis. This might surprise some readers.
The migration phase (calendar week 25) was
characterized by transferring the best solutions for the
website and sailbot tracking service into the opera-
tional environment. Within this phase the systems run
in a steady load scenario as can be seen in figure 2(A)
(almost the same average box as well as maximum
usage) and in figure 2(C) (an at p ratio near 1.0). Ac-
cording to equation 9 this shows an extreme uncom-
fortable situation for cloud computing situations – so
steady loads seem to be no economical interesting
cloud computing use cases.
3.2 Economical Decision Analysis
As we have seen in sections 3 and 3.1 we can iden-
tify different phases which are more cloud compatible
than others from an ecomomical point of view. Train-
ing and development phases show very low at p ratios
(see figure 2(C)) and therefore indicate peaky usage
characteristics of resources which advantages cloud
computing realizations
7
(check equation 9). Other
phases with less peaky usage characteristics (like our
24x7 or migration phase) disadvantage cloud comput-
ing realizations.
So we have identified pro and cons for a cloud
based realization of our educational labs. But how
to decide? Now we are going to apply our decision
model presented in section 2.
Step 1: Determine the at p Ratio
First of all we have to calculate our overall average
to peak load ratio at p. According to equation 3 we
have to define our timeframe basis (T
End
T
Start
). Our
analysis timeframe covered the calendar weeks 13 -
25. So an intuitive timeframe for average building
would be 13 weeks but this implicates a contin-
ual usage of an educational lab over a complete year
which is very uncommon. But in the university or col-
lege business educational labs are typically used one
time per semester. In most cases an educational lab
can be used only one time a year (per semester – that
means average building over 26 weeks) or even only
one time per year (every second semester that means
average building over 52 weeks which is a year). In
our analyzed timeframe 7612 hours of instance usage
were generated (the sum of all bars of figure 2(B)).
So equation 11 (which is an application of equation
3) shows the average amount of servers which would
7
From an economical point of view.
CLOSER2012-2ndInternationalConferenceonCloudComputingandServicesScience
454
be necessary to provide 7612 processing hours within
a 26 or 52 week timeframe.
avg
26w
=
7612h
26 · 7 · 24h
1.74
avg
52w
=
7612h
52 · 7 · 24h
0.87
(10)
Now we can build up our average to peak ratio.
Our maximum server usage within an atomic time-
frame of 1 hour was 49 servers (please check figure
2(A)). So by applying equations 2, 3, 4 and 5 we got
the following at p ratios for a 26 or 52 week time-
frame.
at p
26w
=
1.74
49
0.035
at p
52w
=
0.87
49
0.018
(11)
Step 2: Determine your Dedicated Costs
First of all we have to find out how much would cost
us a dedicated server. At the L
¨
ubeck University of
Applied Sciences our procurement office could pur-
chase the smallest possible server version
8
for about
3055$. Equation 6 told us to calculate our dedicated
costs per atomic timeframe (1h) in the following way
for a 5 year amortization interval:
d
5year
(3055$) =
3055$
5 · 365 · 24h
0.0697
$
h
(12)
Step 3: Determine your Maximal Economical
Cloud Costs
Equation 9 told us to calculate our c
MAX
costs in the
following way:
c
26w
MAX
=
d
5year
(3055$)
at p
26w
=
0.0697
0.035
$
h
1.99
$
h
c
52w
MAX
=
d
5year
(3055$)
at p
52w
=
0.0697
0.018
$
h
3.87
$
h
(13)
Do we find appropriate resources within our max-
imal costs? We have to figure this out in our last step
4 to make a profound decision for or against a cloud
based approach.
Step 4: Determine Appropriate Cloud Resources
Now we know our maximal cloud costs and have to
look if our cloud service provider can deliver appro-
priate and comparable resources. In our case this is
8
Dell PowerEdge Server R610, 2.13 GHz Intel Xeon
processor, 8GB memory, 140 GB hard drive (valid on 28th
October 2011) with approximately 2 ECU so the AWS
instance type pendant would be something between a Stan-
dard Small (1 ECU) or Large (4 ECU) instance type. Check
out detailed instance type informations of AWS here: http://
aws.amazon.com/de/ec2/instance-types/
Amazon Web Services, but it could be any other IaaS
cloud service provider as well. We do this exemplar-
ily for a 26 week timeframe
9
. But it works absolutely
the same for all other timeframes or IaaS cloud ser-
vice providers as well.
Table 2 shows all instance types of AWS and their
allocated costs. Remember section 3.2 told us, that all
instance types cheaper than 1.99 $/h result into cloud
based solutions which are more economical than ded-
icated approaches.
Table 2: AWS Instance Types and Pricings, according to
AWS pricing informations on 28th Oct. 2011, EU (Ire-
land) Region, On-Demand Instances, Linux/UNIX Operat-
ing System.
AWS Instance Type ECU Price/h
Economical
Comparable
Micro < 1 0.025$ yes -
Small (Standard) 1 0.095$ yes o
Large (Standard) 4 0.38$ yes o
XL (Standard) 8 0.76$ yes +
XL (High Memory) 6.5 0.57$ yes +
2x XL (High Memory) 13 1.14$ yes ++
4x XL (High Memory) 26 2.28$ no ++
Medium (High CPU) 5 0.19$ yes o
XL (High CPU) 20 0.76$ yes ++
As you can see in table 2 all provided instance
types (except one
10
) of AWS in the EU Region are
economical in the sense of section 2 and equations
8 and 9. The most similar instance types listed in
table 2 are marked as ’o’. (’-’ stands for worser, ’+’
for better and ’++’ for much better than a dedicated
reference system
11
).
So in our analysed use case the Medium (High
CPU) or may be even the Small (Standard) AWS in-
stance types (see table 2) are the most comparable
systems to our dedicated reference system (Dell Pow-
erEdge Server R610). Both provide variable cloud
costs clearly below our maximum costs of 1.99$/h
(see equation 13). So in our analysed use case a
cloud based approach is more economical than a
dedicated approach.
9
Because in our special case we can use our educational
lab every semester – so two times a year.
10
4x XL (High Memory) instance type is not economical
reasonable but this instance type is not comparable to our
reference system because it is much more powerful.
11
In our case the Dell PowerEdge Server R610
WHATCOSTUSCLOUDCOMPUTING?-ACaseStudyonHowtoDecidefororAgainstIaaSbasedVirtualLabs
455
4 CONCLUSIONS
Summary. This contribution presented a pragmatical
decision making model for or against IaaS based dis-
tributed systems inspired by (Weinmann, 2011). We
applied this decision making model (see section 2) in
a concrete use case of practical educational labs in the
higher education domain (colleges, universities, etc.,
see section 3) and showed that it is very economical to
use cloud based educational labs where ever it is pos-
sible. It turned out that cloud based educational labs
have a more than 25 to 50 times cost advantage (see
section 3.2 [step 3]) to classical dedicated approaches.
So cloud computing seems to be a very promising and
economical variant of providing educational labs
for university or college practical courses which is
mainly due to an inherent peaky usage characteristics
of practical university or college courses (see figure
2).
Conclusions. Nevertheless the decision making
model is applicable to all other domains and dis-
tributed system development approaches as well.
Crucial point of the here presented approach is the
first step (determine the average to peak ratio). For a
profound decision this average to peak ratio has to be
determined before a system enters its long-term oper-
ational phase. Problem is that this average to peak
ratio is hardly predictable in analysis and develop-
ment phase because it depends on a bunch of interde-
pendent and hardly predictable parameters (Kratzke,
2011a), (Kratzke, 2012). Our proposal is to plan
large-scale distributed systems generally IaaS based
so that in a first usage evaluation phase the average to
peak ratios can be analyzed from provided cloud ser-
vice provider usage data and the presented decision
making model can be applied. Dependent on the re-
sults of this evaluation the system can stay in the IaaS
cloud or can be transferred to a dedicated infrastruc-
ture.
Outlook. This contribution will not deny some short
comings so far. We analysed a box usage intensive
use case. In our ongoing research we plan to evaluate
how the here mentioned principles can be applied or
adapted to data storage or data transfer intensive use
cases as well. And finally – this contribution covered
only the IaaS level of cloud computing so far it is
a very interesting question for our ongoing research
whether the here mentioned principles can be applied
to the PaaS and SaaS level as well.
ACKNOWLEDGEMENTS
Thanks to Amazon Web Services for supporting our
ongoing research with several research as well as ed-
ucational grants. Thanks to my students and Michael
Breuker for using cloud computing in practical educa-
tion. This contribution would not exist without their
engagement. Let me thank Alexander Schlaefer and
Uwe Krohn for organizing the World Robotic Sailing
Championship 2011 in L
¨
ubeck and their confidence
in our students.
REFERENCES
Kratzke, N. (2011a). Cloud-based it management impacts.
In Proceedings of the 1st International Conference
on Cloud Computing and Services Science (CLOSER
2011), pages 145–151.
Kratzke, N. (2011b). Overcoming ex ante cost intrans-
parency of clouds. In Proc. of the 1st international
Conference on Cloud Computing and Services Science
(CLOSER2011, special session on Business Systems
and Aligned IT Services - BITS 2011), pages 707–716.
Kratzke, N. (2012). Cloud computing costs and bene-
fits. In Ivanov, van Sinderen, and Shishkov, editors,
Cloud Computing and Service Science, Lecture Notes
in Business Information Processing. Springer.
Mazhelis, O., Tyrv
¨
ainen, P., Eeik, T. K., and Hiltunen, J.
(2011). Dedicated vs.. on-demand infrastructure costs
in communications-intensive applications. In Proc. of
the 1st international Conference on Cloud Computing
and Services Science (CLOSER2011), pages 362–370.
Mell, P. and Grance, T. (2011). The nist definition of cloud
computing (draft). Technical report, National Insti-
tute of Standards and Technology (U.S. Department
of Commerce).
Talukader, A. K., Zimmermann, L., and Prahalad, H.
(2010). Cloud economics: Principles, costs and bene-
fits. In Cloud Computing - Computer Communications
and Networks, volume 4, pages 343–360. Springer.
Truong, H.-L. and Dustdar, S. (2010). Cloud computing for
small research groups in computational science and
engineering. Computing.
Walker, E., Brisken, W., and Romney, J. (2010). To lease
or not to lease from storage clouds. Computer, pages
44–50.
Weinmann, J. (2011). Mathematical proof of
the inevitability of cloud computing. http://
www.JoeWeinman.com/Resources/Joe Wein-
man Inevitability Of Cloud.pdf.
CLOSER2012-2ndInternationalConferenceonCloudComputingandServicesScience
456