SOFTWARE USABILITY EVALUATION

An Empirical Study

Tereza G. Kirner and Alessandra V. Saraiva

Graduate Program in Computer Science - Methodist University of Piracicaba

Rodovia do Açúcar, Km 156 - 13400-911, Piracicaba , SP - Brasil

Keywords: Software usability; Requirements usability evaluation; Empirical study; Goal-Question- Metrics.

Abstract: This article presents an empirical study performed to evaluate the usability of a software applied to the agri-

livestock area. The evaluation plan was prepared with basis on the Goal-Question-Metrics paradigm. The

research was performed in a government department of São Paulo state, in Brazil, and the subjects were

professionals that give assistance to small rural properties in the planning, execution and control of agri-

livestock activities, which can be supported by software systems. Usability is concerned with the suitability

of the software for its users, defined in this work through the following attributes: easiness of understanding,

easiness of learning, operationability, software attractiveness and user satisfaction, and usefulness and

accomplishment of the goals. The preparation and execution of the empirical study are described and the

data analysis and conclusions are presented. The obtained results indicate a satisfactory level of usability for

the considered software. Besides evaluating the software, the study aim to contribute to the detailing of a

process, based on GQM, to perform usability evaluations. The work also represents a contribution to the

software quality improvement, primarily to those systems applied to agri-livestock tasks.

1 INTRODUCTION

Usability is a quality requirement of software

products, which has been growing in importance

mainly due to the contribution that it brings to the

software improvement. Software with acceptable

usability leads to positive impacts, such as: decrease

in time and learning cost with the use of the system;

easiness of transition to new system versions; users´

better performance and productivity; increase in the

quality of tasks and decrease of mistakes due to data

input provided by the users; etc. Moreover, several

studies have already demonstrated that 80% of the

total maintenance costs are related to problems that

the users face on "what" the system makes and not

with technical bugs. Of these problems, 64% are

directly related to usability issues (Seffah, 2003).

Usability has been associated to attributes such

as: effective use of the software; easiness of learning

to use the software and exploring all its potentiality;

reaching of the users’ goals, through the software, in

supported tasks; software attractiveness and users’

satisfaction; conformity with specific patterns; and

adaptation to the users' experience (Avouris, 2001;

Bevan, 1991, 1995; Nielsen, 1993).

Studies have pointed out the need of methods

that effectively help to identify and specify the users'

needs, and to test and validate prototypes and

software products with their final users (Seffah,

2003). In Brazil, particularly in the agri-livestock

area, there are very few initiatives to identify and

improve the quality of software products, fact that

has been contributing to some retraction in the agro-

informatics market (Cócaro, 2005).

This work presents an empirical study on the

evaluation of the usability of a software applied to

the agri-livestock area, aiming to contribute to the

continuous improvement of software products in this

area. At the same time, it intends to define a plan of

usability evaluation together with a process of

conducting empirical studies on this theme.

Section 2 summarizes some empirical studies on

software usability, which were relevant for the

present work. Section 3 presents the concepts in

which the study was based on, including an

explanation of usability and Goal-Question-Metrics

(GQM). Section 4 presents the plan of usability

evaluation, based on the GQM approach. Section 5

presents the definition of the empirical study,

describing its accomplishment. The analysis of the

459

G. Kirner T. and V. Saraiva A. (2007).

SOFTWARE USABILITY EVALUATION - An Empirical Study.

In Proceedings of the Ninth International Conference on Enterprise Information Systems - ISAS, pages 459-465

DOI: 10.5220/0002352004590465

 SciTePress

results is presented in section 6, and the conclusions

are presented in section 7.

2 EMPIRICAL STUDIES ON

SOFTWARE USABILITY

Among recent works reported in the literature,

which provided a conceptual basis for the present

work, studies to test the usability of electronic

commerce systems (Rosa, 2004), education systems

(Granic, 2004), and software packages (Dag, 2001)

are highlighted. Moreover, a research about

evaluation of agricultural software in Brazil,

reported by Cócaro (2005), is indicated. These

studies were reviewed to obtain a basis for the

definition of usability and to determine the process

of conducting the empirical study.

Rosa (2004) presents an empirical study on the

usability of application forms in electronic

commerce systems. The usability was defined

through a series of aspects, such as: filling of the

fields of the form, legibility, correction of filling

mistakes, clarity of the instructions, explanatory and

error messages, layout and organization of the form.

To perform the study, two electronic commerce

systems for book and CD sale were selected. Six

subjects, from 20 to 40 years, who had already

purchased on-line, participated of the study,

responding to non-structured interviews. A

qualitative analysis of the obtained data was

performed, with the preparation of tables and

calculation of percentages.

Granic (2004) presents a methodology to

evaluate the usability of educational systems of ITS

(Intelligent Tutoring Systems). The usability was

identified through aspects related to the easiness of

using the software, easiness of learning the software,

and user satisfaction, such as: language and terms

used in the software human-computer interfaces,

supplied explanations, usefulness of the software,

timing response, navigation through screens, and

satisfaction provided by the accomplishment of the

tasks. As part of the study, an ITS was evaluated by

5 usability experts. They answered a questionnaire

with 10 questions, using a Lickert scale of seven

points (Pereira, 2004), ranging from "I disagree" to

"I agree". Data were tabulated, percentages where

calculated regarding the answers, and the results

were discussed.

Dag (2001) presents a case study for evaluating

the usability of a software package produced by a

Sweden software factory. The study used two

usability evaluation methods: a questionnaire to

allow a quantitative analysis, and a heuristic test,

which allowed a qualitative analysis.

- For the questionnaire, it was used the SUMI

(Software Usability Measurement Inventory)

model (SUMI, 2006), which includes a standard

framework for identifying usability issues. The

questionnaire was sent to 90 potential final users

of the considered software in Europe, and the

results were statistically analyzed.

- The heuristic evaluation presupposes the experts'

participation. In the study, 12 experts with

knowledge on human-computer interfaces and on

the software domain have participated. As result,

it was obtained a list of usability problems,

which should be corrected in order to produce an

improved software version.

In the agri-livestock area, there are few

experiences for obtaining and improving the quality

of software products. This fact has been contributing

for a retraction in the agro-informatics market

(Cócaro, 2005). A recent study about evaluation of

the quality of an agricultural software (Cócaro,

2005) focuses on the evaluation of two usability

attributes: use easiness (including customization

easiness and user's manual) and operational easiness

(including simplicity to register operations, easiness

of understanding the results, easiness of consulting

the data and easiness of modifying the data). The

evaluation methodology included the use of an

evaluation record containing questions, which

measure the quality attributes. Two specialists of the

agri-livestock area answered the research, attributing

concepts to the questions, followed by a discussion

to justify the evaluation. The obtained results were

organized with basis on the measured attributes.

The analysis of the empirical studies above

indicated, suggested the need to invest in some

aspects, such as:

- Definition of the usability requirement, aiming to

reflect the real demands of specific end-user

software;

- Preparation of the usability evaluation plan for

the software;

- Definition of the process for performing the

empirical study, including the definition of the

objectives, the definition of the sample, the

preparation of the instruments of data collection,

the survey and preparation of the data, and the

analysis of the results and identification of the

conclusions.

The aspects highlighted above were focused on

the present empirical study, described in the next

sections

ICEIS 2007 - International Conference on Enterprise Information Systems

460

3 CONCEPTUAL FOUNDATIONS

3.1 Software Usability

Usability is a software quality requirement that

comprises aspects related to the efficient and

effective use of the software. Several definitions are

reported in the literature, including attributes such

as: easiness of understanding the software

functioning, easiness of learning the software,

accomplishment of objectives and tasks,

attractiveness of the software for the users, the users'

satisfaction with the use of the software, adequacy

with specific patterns, adaptation to the users'

experience, etc. (Avouris, 2001; Bevan, 1991, 1995;

Nielsen, 1993).

In the present work, the software usability was

defined through the following attributes: easiness of

understanding, easiness of learning, operationability,

attractiveness and satisfaction of the user, and

usefulness and accomplishment of goals. These

attributes are defined as follows.

- Easiness of Understanding. Capacity of the

software of enabling to the user the

understanding for an appropriate use.

- Easiness of Learning. Allows the user to be able

to explore the software and to accomplish his/her

tasks without difficulties.

- Operationability. The software allows the user to

operate and control it appropriately.

- Attractiveness and User Satisfaction. Capacity of

the software to be attractive to the user, making

him/her satisfied with the system interaction.

- Usefulness and Accomplishment of the Goals.

The user reaches the goals proposed by the

software, through its use.

The users’ performance and expectations are

directly related with the software usability.

According to Govindarajulu (2003), the users of a

system include several people, such as operators,

developers and professionals that, in some way,

control an application. The understanding of the

users' profile and their roles are important for a study

on software usability.

The present study considered, as final users, the

agricultural technicians (agricultural engineers and

veterinary physicians) who worked at a Department

of Rural Development of São Paulo State, in Brazil,

who had experience in the agricultural area and

basic notions of Informatics.

3.2 Goal-Question-Metrics

The Goal-Question-Metrics (GQM) approach aims

to support the elaboration and implementation of

quality evaluation programs of products and

software processes. GQM is based on the principles

of Goal-Oriented Measurement and software Quality

Improvement Paradigm (Basili, 1994; Kirner, 1997,

Shull, 2006).

The preparation of a software measurement plan,

based on GQM, comprises the following steps:

- Identification of the goals. Each goal is defined

through the object, purpose, quality focus, point

of view, and environment related to the intended

study.

- Definition of the questions. For each goal, a set

of questions has to be defined. These should be

compatible with the proposed goal and include

the requirements and quality attributes that are

being evaluated.

- Definition of the metrics. Metrics to quantify the

proposed questions have to be defined. Each

metric should be related to at least one question,

and each question should have at least one

associated metric.

In this work, the GQM approach was used to

support the elaboration of the usability evaluation

plan for the considered software.

4 USABILITY EVALUATION

PLAN

The measured software was developed by a

Brazilian government organ, which support the

decision-making related to solve specific problems

of small, familiar properties, aiming at greater

profitability for the rural producers (Saraiva, 2006).

The usability evaluation plan of the agricultural

software was prepared with basis on the GQM

approach. The goal, questions and metrics are

presented as follows.

A brainstorm session was performed to start the

definition of the evaluation plan, with the

participation of 10 students enrolled in a software

engineering discipline of a master's degree program.

This session was very useful and it indicated a series

of important aspects related to software usability.

4.1 Definition of the Goal

According to the GQM approach, the goal of the

intended evaluation, which was to measure the

SOFTWARE USABILITY EVALUATION - An Empirical Study

461

usability of an agri-livestock, was detailed. The goal

is presented in Table 1.

Table 1: Goal of the Usability Empirical Study.

Goal

Object

to analyze Agri-livestock software

Purpose

with the

purpose of

Evaluation of the software

Quality Focus

regarding

Usability, evaluated through

the following attributes:

- easiness of understanding;

- easiness of learning;

- operationability;

- attractiveness and user

satisfaction;

- usefulness and

accomplishment of goals.

Point of View

under the

point of

view of

User - agriculture technician

Environment

in the

following

context

Department of Rural

Development

4.2 Definition of the Questions

After the definition of the goal, a series of guidelines

with 10 tasks was defined, to guide the users to work

with the software. Based on these tasks, the

questions were defined, which the users should

answer in order to identify the level of software

usability. Table 2 presents the defined questions for

each usability attribute.

4.3 Definition of the Metrics

According to the GQM approach, metrics for the

proposed questions, listed in Table 2, were defined.

These metrics were based on a five-point Likert

Scale (Pereira, 2004), according to which the users

were requested to inform his/her degree of

agreement or disagreement with each question, that

is: (1): Disagree Completely; (2): Disagree; (3)

Neither Agree nor Disagree; (4): Agree; (5): Agree

Completely.

5 EMPIRICAL STUDY

5.1 Objective of the Study and

Definition of the Sample

The empirical study aimed to evaluate the usability

of a software applied to the agriculture area,

measured through the following attributes: easiness

of understanding; easiness of learning; software

operationability; software attractiveness and user

satisfaction; and usefulness and accomplishment the

goals.

The population of the empirical study was

composed of the agricultural technicians of a

Brazilian Department of Rural Development. After

the population was established, the sampling

technique adopted to choose the elements that would

compose the sample was defined. Two aspects were

relevant for defining the sampling: the fact that the

participants have specific characteristics, such as,

being technicians with higher education, and having

basic notions of the use of computers and

Informatics. In this way, it was decided to use a

convenience sampling, according to

which the

subjects are chosen because they have characteristics

that are appropriate to the study.

The sample was composed of 11 subjects, whose

characteristics were according to those mentioned

above. From them, 54,55% were agriculture

engineers and 45,45% were veterinary physicians.

5.2 Instruments for Data Collection

The present study used three instruments for data

collection, which were prepared based on the

evaluation plan presented in the previous section.

Those instruments, available in Saraiva (2006), are:

- Questionnaire for General Information

Surveying. These questionnaires collected

information on the profile of the participants,

such as academic grades and level of experience.

- Guidelines for Using the Software. They

included 10 topics to guide the subjects to

perform tasks regarding the use of the software.

Starting from the accomplishment of the defined

tasks, the subjects would be able to evaluate the

software usability.

- Questionnaire of Software Usability Evaluation.

It was composed by 20 questions, to be answered

with basis on the Likert scale. The evaluators

should answer each question, underlining one of

the presented values, which are: (1) Agree

completely; (2) Agree; (3) Neither agree nor

disagree; (4) Disagree; (5) Disagree completely.

To guarantee the reliability of the collected data

and a greater index of answers, the questionnaires

were applied through individual interviews,

performed with the participants, in the facilities of

the Department. Besides, a pilot test was performed

to test the instruments of data survey before using

them in the research. The test had the collaboration

of a user of the agricultural area, with similar profile

of the further evaluators.

ICEIS 2007 - International Conference on Enterprise Information Systems

462

Table 2: Questions for the Empirical Study.

Questions related to the Software Easiness of Understanding

- Q1. In the "Main Screen", the illustrations of buttons help the understanding of the software.

- Q2. In case there is the need to use the Option HELP, the user is able to use it.

- Q3. During the use of the software, the user notices the objective of the application.

- Q4. Menu dispositions are easily understood during the sequence of tasks.

Questions related to the Software Easiness of Learning

- Q5. The language of the software in "Portuguese" is considered a relevant factor for the learning of the

software.

- Q6. The software is considered "Self-explanatory" in its functions.

- Q7. In the "Main Screen", the disposition of the illustrations of buttons helps the learning of the software.

- Q8. The use of the Menus provides the learning of the software.

Questions related to the Software Operationability

- Q9. The user considers that the navigation among screens enables the performance of operations.

- Q10. The icons are representative in the use of the software.

- Q11. The user can easily register the "property".

- Q12. The user can clearly interpret the graphic representation.

Questions related to the Software Attractiveness and Users’ Satisfaction

- Q13. The illustrations of buttons are attractive.

- Q14. The user feels pleased when using the software.

- Q15. The use of this software can collaborate with the planning of agricultural activities.

- Q16. The fact that it is a free distribution software is an attractive reason for its use.

Questions related to the Software Usefulness and Accomplishment of Goals

- Q17. The software is an advantageous tool for agricultural development.

- Q18. The software usefulness was insufficient in some applications.

- Q19. The software fulfilled the needs of your business.

- Q20. You could reach your goals with the use of the software.

He was guided to identify ambiguous or confused

aspects, in the three questionnaires. The identified

aspects were discussed, analyzed and corrected.

6 ANALYSIS OF RESULTS

The analysis of the results followed guidelines

supplied by statistics (Pereira, 2004). A suitable

methodology was adopted for qualitative data, using

descriptive techniques, Likert scale and the

Cronbach’s Alpha coefficient.

Data obtained through the Questionnaire of

Software Usability Evaluation allowed the

evaluation of the software usability regarding the

considered attributes - easiness of understanding,

easiness of learning, operationability, software

attractiveness and user satisfaction, and usefulness

and accomplishment of goals.

The totals (in percentages) of the values marked

by the participants to each usability attribute are

presented in Table 3. The answers, chosen by the

participants according to the Lickert scale, presented

the average of 4,2818. That means that the point of

balance of the informed answers pointed out a

relative agreement to the questions of the

Questionnaire of Software Usability Evaluation.

Table 3: Totals (in %) of Values per Attribute.

attributes

Complet

agree

Agree

Neither

agree nor

disagree

Dis-

agree

Complet

disagree

Easiness of

Underst.

47,73% 52,27% 0% 0% 0%

Easiness of

Learning

43,18% 50% 2,2% 4,55% 0%

Operationa-

bility

40,91% 56,82% 0% 2,27% 0%

Attract. and

User

Satisfaction

52,27% 43,19% 4,55% 0% 0%

The results obtained for the usability attributes

are presented in the next subsections.

6.1 Easiness of Understanding

The questions Q1 to Q4 of the Questionnaire of

Software Usability Evaluation measured the

software usability regarding to the “easiness of

understanding” attribute.

For these questions, the answers "Neither Agree

nor Disagree", “I Disagree", and “I Disagree

Completely", had null result in the users' opinion. In

the other way, the answers “I Agree Completely"

SOFTWARE USABILITY EVALUATION - An Empirical Study

463

and “I Agree" had a value of 47,73% and 52,27%,

respectively. This shows that the users, in a general

way, agree that the considered agricultural software

is easy to understand.

6.2 Easiness of Learning

The questions Q5 to Q8 of the Questionnaire of

Software Usability Evaluation measured the

software usability regarding to the “easiness of

learning” attribute.

For these questions, the answer was only given

once, that is, a single user was positioned as

indifferent to the easiness of learning characteristic

provided by the software, which represents 2,27% of

the responses. As for “I Disagree", two users chose

this answer, representing 4,55% of the total

percentile. The answer "I Agree Completely"

represented 43,18% of the total percentile.

Moreover, the most significant fact is that 50% of

the users that participated in the evaluation "Agree"

that the evaluated agricultural software is easy of

learning, during its use.

6.3 Operationability

The questions Q9 to Q12 of the Questionnaire of

Software Usability Evaluation measured the

software usability regarding to the “operationability”

attribute.

For these questions, the answer "I Disagree" was

chosen only by one user, which represents 2,27% of

the total percentage. The "Neither Agree nor

Disagree" and “I Disagree Completely" answers

were not marked. The answer "I Agree Completely"

had a percentage of 40,91% according the users'

evaluation, and the answer " I Agree" represented

56,82% of the marked answers. Such results

suggested that the operationability of the evaluated

software is an attribute noticed by the users while

they are using it.

6.4 Attractiveness and User

Satisfaction

The questions Q13 to Q16 of the Questionnaire of

Software Usability Evaluation measured the

software usability regarding to the “attractiveness

and the user satisfaction” attribute.

For these questions, only two users answered that

are indifferent (“Neither Agree nor Disagree”) to the

software attractiveness and/or the user satisfaction,

which represents only 4,55% of the total percentage.

From the remaining users, 43,19% "Agree" and

52,27% "Agree Completely" that the attractiveness

is present in the focused agricultural software and

that there is user satisfaction in using the software.

6.5 Usefulness and Accomplishment of

Goals

The questions Q17 to Q20 of the Questionnaire of

Software Usability Evaluation measured the

software usability regarding to the “usefulness and

achievement of the goals” attribute.

Concerning this attribute, a larger degree of

diversified answers was observed, since all the

answer categories were selected. The answers "I

Disagree Completely" presents 4,55%, the "Neither

Agree nor Disagree” option shows 9,19%, "I

Disagree" indicates 15,91%, and “I Agree

Completely" presents 31,82%cof the total

percentage. The most suitable option for the users,

with 38,64 % of the answers, is "I Agree".

6.6 Analysis of Consistency and

Reliability

The analysis of the intern consistence of the

Questionnaire of Software Usability Evaluation was

estimated through the Cronbach’s Alpha coefficient,

which allowed an evaluation of the reliability of the

research results (Pereira, 2004).

In the present study, the representativeness of

Alpha Coefficient is 0.8743, suggesting that it is

measuring 87,43% of the real impact. This value of

Alpha is satisfactory in terms of reliability of the

results, since it is very close to 100% of the

evaluation efficiency. The indicators show that the

questionnaire is concise and the questions regarding

the usability attributes evaluated in the empirical

study have coherence and authenticity.

7 CONCLUSION

This paper presented an empirical study about

usability evaluation of a software for the agri-

livestock area. The empirical study was carried out

in a Brazilian department of supporting for

agriculture and livestock in São Paulo state.

To guide the research, a plan of usability

evaluation was elaborated, based on the GQM

approach. This plan was useful for the complete

understanding of the problem, enabling the clear

accomplishment of the proposed goal for the study,

the identification of the questions to be answered,

ICEIS 2007 - International Conference on Enterprise Information Systems

464

and the definition of the metrics related to the

questions. The plan helped the definition and

preparation of the research instruments,

guaranteeing the creation of questionnaires that

included the usability aspects.

This empirical study followed a process based on

statistical techniques, which allowed a qualitative

analysis of data, providing clear identification of the

results. The use of Cronbach´s Alpha coefficient

could determine the reliability of data and results,

which were significant for the study.

In the area of agriculture, the study showed the

usability of a software used to support small and

medium rural properties. Thus, a contribution was

made to the continuous improvement of software in

this area. It is important to point out that the

obtained results indicated a satisfactory level of

usability for the considered software.

Besides evaluating the software, the study

contributed to the detailing of a process, based on

GQM, to perform usability evaluations. The work

also represents a contribution to the software quality

improvement, primarily to those systems applied to

the agriculture.

Finally, it is important to highlight the need of

investing in new research that could contribute to the

results of this study. Therefore, such results could be

extended to a broader spectrum of software, in the

agricultural area as well as in other ones.

ACKNOWLEDGEMENTS

The authors would like to thank Idemauro de Lara

and Maria Imaculada Montebelo, for the valuable

support in the data analysis.

REFERENCES

Avouris, N.M., 2001. An introduction to software

usability. In Workshop on Software Usability –

Proceedings of the 8 th Panhellenic Conference on

Informatics. v. 2, p. 514-522.

Basili, V.R.; Caldiera, G.; Rombach, H.D., 1994. Goal-

Question-Metrics paradigm. Encyclopedia of Software

Engineering, John Wiley & Sons. New York, vol. 1, p.

527-532.

Beavan N., Kirakowski, J.; Maissel, J., 1991. What is

usability? In Proceedings of the 4 th International

Conference on Human-Computer Interaction.

Stuttgart, Germany.

Beavan N., 1995. Usability is quality of use. In

Proceedings of the 6

International Conference on

Human-Computer Interaction. Tokio, Japan.

Cocaro, H.; Lopes, M. A.; Campos, F.C., 2005. Quality of

agri-livestock software: a case study. Science and

Agro-Technology, Lavras, MG, Brazil, v. 29, n. 5, p.

1075-1082.

Dag, J.N et all., 2001. An industrial case study of usability

evaluation. In Proceedings of the 9th International

Conference on Human-Computer Interaction.

Mahwah, NJ, p.425-429.

Granic, A.; Glavinic, V.; Stankov, S., 2004. Usability

evaluation methodology for web-based educational

systems. In Proceedings of the 8th ERCIM Workshop.

Govindarajulu, C., 2003. End Users: Who are they?

Communications of the ACM, Vol. 46, N. 9, p. 152-

159.

Kirner, T.G.; Abib, J.C., 1997. Use of the Goal-Question-

Metrics in the evaluation of software quality

requirements. In Proceedings of the IV Brazilian

Workshop on Software Quality. Fortaleza, CE, Brazil,

p.71-82.

Nielsen, J., 1993. Usability Engineering. Academic Press,

Cambridge, MA.

Pereira, J.C., 2004. Analysis of Qualitative Data. São

Paulo, University of São Paulo Press, 2004.

Rosa, J.G.; Moraes, A., 2004. Usability in electronic

commerce forms. In Congress of Ergonomy of the

Countries of Portuguese Language. Funchal, Island of

Madeira. (in Portuguese).

Saraiva, A.V., 2006. Use of the Goal-Question-Metrics in

theElaboration and Execution of Usability

Measurement Plans. Master Thesis, Graduate Program

in Computer Science, Methodist University of

Piracicaba, Brazil (in Portuguese).

Seffah, A.; Benn, J., 2003. Making usability a respectable

quality attribute in the engineering lifecycle. IEEE

Canadian Review, Fall 2003, p. 25-28.

Shull, F.; Seaman, C.; Zelkowitz, M., 2006. Victor Basili’s

contributions to software quality. IEEE Software,

Volume 23, Number 1, p. 16-18.

SUMI, 2006. The SUMI Model. Retrieved October 2006

from http://sumi.ucc.ie.

SOFTWARE USABILITY EVALUATION - An Empirical Study

465