Examining the Potential for Conversational Exploratory Search Using a
Smart Speaker Digital Assistant
Abhishek Kaushik
a
and Gareth J. F. Jones
b
ADAPT Centre, School of Computing, Dublin City University, Dublin 9, Ireland
Keywords:
Conversational Search, Exploratory Search, Dialogue System, Alexa.
Abstract:
Online Digital Assistants, such as Amazon Alexa, Google Assistant, Apple Siri are very popular and provide
a range or services to their users, a key function is their ability to satisfy user information needs from the
sources available to them. Users may often regard these applications as providing search services similar to
Google type search engines. However, while it is clear that they are in general able to answer factoid questions
effectively, it is much less obvious how well they support less specific or exploratory type search tasks. We
describe an investigation examining the behaviour of the standard Amazon Alexa for exploratory search tasks.
The results of our study show that it not effective in addressing these types of information needs. We propose
extensions to Alexa designed to overcome these shortcomings. Our Custom Alexa application extends Alexa’s
conversational functionality for exploratory search. A user study shows that our extended Alexa application
both enables users to more successfully complete exploratory search tasks and is well accepted by our test
users.
1 INTRODUCTION
There is currently much interest in conversational dig-
ital assistants embedded in smart speaker and mobile
platforms, e.g. Amazon Alexa, Google Assistant, Ap-
ple Siri. These applications offer users a range of
services including simple command and control of
networked smart home appliances, accessed through
conversational engagement. A widely promoted func-
tion is their ability to satisfy user information needs
from the sources available to them. Digital assistants
are often demonstrated using requests such as fetch-
ing recipes or latest weather forecasts. While it is
clear that these application are often able to address
such requests, which are generally satisfied by single
items or factoids, it is much less clear how well cur-
rent applications support more exploratory informa-
tion needs, and what additional functionality might
be required to address any identified shortcomings.
While conventional information retrieval (IR) sys-
tems, such as web search engines, rely on the
searcher’s ability to browse retrieved content in an
efficient manner, smart speaker systems are largely
driven by spoken interaction, sometimes involving
multi-modal output. User access to returned informa-
a
https://orcid.org/0000-0002-3329-1807
b
https://orcid.org/0000-0003-2923-8365
tion in spoken form has a much lower bandwidth than
visual review of textual. This suggests that digital as-
sistants must select information to be returned in spo-
ken form in search applications with higher precision
than is the case for conventional IR systems. One way
to limit delivery of extraneous information is to parti-
tion the search process into smaller incremental tasks
where the searcher engages with the digital assistant
using a conversational search process (Radlinski and
Craswell, 2017).
Interest in conversational search has developed
rapidly the IR research community in recent years,
and while it has been the focus of multiple workshops
exploring its principles and challenges, there is very
limited work exploring user interaction with working
systems. Good examples of existing work such as
(Trippas et al., 2017a; Trippas et al., 2018; Trippas
et al., 2017b) are limited to the use of Wizard type
conversational agents in limited search tasks.
While in some contexts spoken only engagement
is possible, the operational platforms of many digital
assistants enable some form of multi-modal interac-
tion. For example, smartphones, tablets and dedicated
platforms such as the Amazon Echo Show
1
.
In this paper we report an experimental study
1
https://www.amazon.co.uk/amazon-echo-show-5-
compact-smart-display-with-alexa/dp/B07KD7TJD6
Kaushik, A. and Jones, G.
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant.
DOI: 10.5220/0011798700003417
In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - Volume 2: HUCAPP, pages
305-317
ISBN: 978-989-758-634-7; ISSN: 2184-4321
Copyright
c
2023 by SCITEPRESS Science and Technology Publications, Lda. Under CC license (CC BY-NC-ND 4.0)
305
of search using a state-of-the-art digital assistant
to examine the ability of current assistants to sat-
isfy exploratory information needs. For our study
we adopt the Amazon Alexa operating on an Echo
Show platform embedded with a display screen. The
Echo Show enables conversational interaction with
the Alexa assistant and incorporates a tablet sized
screen to enable multi-modal engagement. The re-
sults of our study demonstrate that the existing Alexa
system is very limited in terms of its support for
search tasks of this type, leading to frustration and
user dissatisfaction. We then describe experiments us-
ing a prototype new Alexa customed skill which seeks
to overcome these shortcomings. Before describing
these studies, the next section reviews relevant exist-
ing work in conversational search.
2 BACKGROUND
In this section, we review literature relevant to the de-
velopment of conversational search applications. We
begin with a brief review of conversational systems,
then consider existing work on conversational search,
and look at dialogue strategies
2.1 Conversational Systems
Conversational engagement between human and com-
puter applications is currently a very active area of in-
vestigation (Radlinski and Craswell, 2017). Here we
focus on the area of digital conversational assistants.
A study on conversational agents (Alexa vs. Siri vs.
Cortona vs. Google Assistant) to investigate these us-
ability for different services such as access to music
services, agenda, news, weather, To-Do lists and maps
or directions and others is reported in (L
´
opez et al.,
2018). This showed that even though there are many
services already available, there is much scope to im-
prove the usability of these systems.
A review of multiple studies related to conversa-
tional agents examining their usability and the capa-
bilities of conversational agents is described in (Hoy,
2018). A study investigating user interactions with
Amazon Alexa focusing on the types of tasks re-
quested and the variables that affect user behaviours
can be found in (Lopatovska et al., 2018). The re-
sults indicate that across all age groups, Alexa was
primarily used for checking weather, playing music,
and controlling other devices. Users reported being
satisfied with Alexa even when it did not produce the
information sought, suggesting that the interaction ex-
perience is more important to the users than the inter-
action output.
2.2 Conversational Search
While users of search tools have become accustomed
to standard “single shot” interfaces, of the form seen
in current web search engines, and have learned to
use them to good effect, research Interest in conver-
sational interactions in search has expanded consider-
ably (Radlinski and Craswell, 2017).
Multiple studies have investigated the potential of
conversational search. However, these have gener-
ally involved the use of a human serving on the agent
manages to the search dialogue wizard (Avula et al.,
2018; Avula et al., 2019; Avula and Arguello, 2020).
While the results of these studies have been interest-
ing and insightful, they have an important limitation
in that the agent has full human intelligence. Thus,
they do not reveal the potential for artificial agents
to support search in terms of effectiveness and user
acceptance. Studies have also been conducted to in-
vestigate the user search behaviour in speech settings
where the searcher interacts with the agent (the hu-
man “wizard”) via speech. These have the limitation
of assuming both human intelligence and error free
speech recognition, which will generally not be the
case in a real system (Trippas et al., 2017a; Trippas
et al., 2017b; Trippas et al., 2018).
2.3 Dialogue Strategy
A number of frameworks have been proposed for
human-machine dialogue systems. There are two im-
portant points to be considered in developing dialogue
strategies. The primary one is to understand the cur-
rent intention of the dialogue, and the second is to
maintain the ongoing interactions between the user
and the system. The use of dialogue models is cur-
rently very limited in IR. In the study(Sitter and Stein,
1992), authors model information seeking dialogue as
a directive, commissive and assertive type of dialogue
act (e.g. asking, rejecting, offering and answering).
This research was extended by Belkin et. al. (Belkin
et al., 1995) to design an interactive IR system based
on scripts and cases, called MERIT (Multi-Media In-
formation System). This considers different infor-
mation seeking strategies scripts and cases at once
to make the system interactive. In the study (Leech
and Weisser, 2003), the authors conducted a study
in which they annotated the task-oriented dialogues
in speech and introduced a dialogue taxonomy. The
investigation was extended by (Loisel et al., 2009)
which proposed a model on the basis of the taxonomy
and analyzing a medical text corpus.
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications
306
3 SEARCH USING AMAZON
ALEXA
In this section we begin by outlining the features of
the Amazon Alexa application, and then introduce the
search tasks used for our investigation of its use for
exploratory search.
3.1 Amazon Alexa
The Amazon Alexa digital assistant provides a wide
range of information seeking services and control of
applications to users. Alexa can operate on a range
of dedicated hardware platforms including Amazon
Echo, Amazon Echo Show, Amazon Dot, and related
hardware, as well as an application running on more
general platforms. Alexa performs voice-operated
functions while communicating through a local WiFi
Internet connection or other wireless connection with
Amazon’s AWS cloud servers, or other networked
devices, to carry out these functions (Janarthanam,
2017; Website, 2020).
The workflow of a standard user engagement with
Alexa is divided into four steps: receiving a spoken
instruction or request, interaction mode (responsible
for speech recognition and intent identification), skill
application logic (action after triggering the intent),
and response. Where an intent is defined by Amazon
as actions that fulfill spoken requests from the user,
and a skill is an application which enables Alexa to
perform an operation. A key feature of Alexa as a re-
search tool is that new skills can be created to enable
Alexa to perform new or extended operations (Biehl,
2019); it is for this reason that we choose to use Alexa
for our investigations. Specifically, we base our study
on the use of an Amazon Echo Show which com-
bines spoken interaction with the availability of a high
quality 7-inch touchscreen display which can be used
within the applications to enable mulit-modal interac-
tion. Technical details of the Echo Show can be found
in (Akon, 2018).
3.2 Exploratory Search with Alexa
In this study we first examine the ability of the stan-
dard Alexa assistant to support exploratory informa-
tion seeking using its default conversational inter-
action features. As a source of information needs
for our study, we provide participants with backsto-
ries requiring information about an individual which
we anticipate users to able to address using a single
Wikipedia autobiography page. An example of back-
story expressing an information need of this sort is
shown in Figure 1). We developed twelve backstories
Martin Luther King Jr. was an American
Baptist minister and activist who became
the most visible spokesperson and leader
in the civil rights movement. You have
to find Information about the personality
using Alexa skills (as per the search
setting) and based on your information
gain, you have to write a short summary
(in the questionnaire) about the person
mentioned above and fill the
questionnaire accordingly.
Figure 1: Example backstory selected for Alexa task.
for which full review of the corresponding Wikipedia
autobiography page is a cognitively complex task,
such a task would be classified as class ”Analyze”
within the Taxonomy of Learning (Krathwohl, 2002).
3.3 Experimental Procedure
Participants in our study had to complete a structured
search session, as shown in Figure 2. They were given
printed details of the instructions for their search ses-
sion, and were provided with an opportunity to famil-
iarize themselves with using Alexa for 5-10 minutes
before starting the main study. Each participant had to
complete one search task using an assigned backstory.
They were given the printed backstory to study before
they began from the search, this was then withdrawn
to prevent them simply copying the details of the
backstory as the basis of their query. The search ses-
sion included completing questionnaires before and
after carrying out the search task. The questionnaire
included asking about the participant’s expectations
and experience of the search process, and required
them to write a short summary of their knowledge of
the topic of backstory before and after carrying out
the search process. After completing the search, they
were also required to attend a semi-structured inter-
view to gather details of their experience.
While participants carried the search tasks using
the Echo Show, they completed the questionnaires
online using a standard computer. All search activi-
ties were video recorded for post-collection review of
the user activities. Approval was obtained from our
university research ethics committee prior to begin-
ning the study. Search tasks were evaluated by ana-
lyzing the self-reported questionnaire interviews and
the recorded videos. All details from the interviews
and video recordings were assigned to response cate-
gories by independent analysts. This was done using
an annotation schema relevant to our research aims
designed after investigating the data; the complete
response dataset was then coded using these data-
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
307
Figure 2: Procedure of the Alexa information seeking study.
Table 1: Details of age distribution throughout this investi-
gation.
Age No. Male No. Female Ratio
(M) (F) (M/F)
18-25 14 3 14:3
26-35 8 6 4:3
36-45 0 0 NA
Total 22 9 22:9
derived codes (Braun and Clarke, 2013) . The inter-
rater reliability between the annotators was very high
(K = 0.85 and 0.82). Search and interaction behaviour
was analyzed in terms of queries used and time spent
on the search task.
3.3.1 Pilot Studies
Prior to the main study, a pilot study was carried out
by two undergraduate students in Computer Science
using two additional backstory search tasks. Feed-
back from the pilot study was used to refine the spec-
ification of the questionnaire, and to design the clas-
sification categories for the user responses. Results
from the pilot study are not included in our analysis
in this project. Each of the pilot search tasks took
around 25 minutes to complete.
3.3.2 Main Study Participants
Each participant in the main study was assigned one
of the selected backstories from 12 with the expecta-
tion that their session would last around 30-40 min-
utes. The sequence of the tasks was arranged using a
Latin square method to avoid sequence and learning
effects.
In total 33 subjects participated in the experiments
of which results for 2 subjects were not included for
analysis due discrepancies in their data. The complete
demographic details found in the Table 1.
4 BEHAVIOUR OF ALEXA FOR
NON-FACTOID AND
EXPLORATORY SEARCH
This study examined user expectations of the Alexa
assistant to support exploratory search tasks and their
experiences when using Alexa to address this type of
information need.
The following research questions were investi-
gated in the study:
1. What are the challenges and opportunities of ex-
ploratory conversational search using Alexa?
2. What characteristics of Alexa prevent it from
functioning as an effective tool for complex in-
formation seeking?
3. What are the main expectations of users for con-
versational search systems?
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications
308
4.1 RQ1: What Are the Challenges and
Opportunities of Exploratory
Conversational Search Using Alexa?
4.1.1 Challenges
Attempting to use the default Alexa assistant to ad-
dress the exploratory information needs expressed in
our backstories led to considerable user frustration
with poor success in addressing the information need.
From analysis of user feedback, we identified the fol-
lowing challenges.
1. Task Success: In 62% of cases, either Alexa did
not provide a response or gave irrelevant answers
for the user query.
2. User frustrations and feedback: On average par-
ticipants took approximately 5.5 minutes with an
average 14.1 interactions (turns). Survey feed-
back from the users clearly indicates high levels
of user frustration.
3. Major limitations of Alexa in exploratory search:
A number of limitations were identified from ob-
servation of user interactions and their feedback.
We divided these limitations into four broad cate-
gories as follows:
(a) Poor knowledge representation: From the re-
sponses to queries given by Alexa, it became
clear that Alexa only represents either fact-
based answers or simply starts reading from the
beginning of a long Wikipedia document. This
was noted by around 18% of participants who
claimed that Alexa had poor knowledge repre-
sentation.
(b) Poor speech recognition and high error rate:
While not directly related to its search capa-
bilities, around 52% of participants noted that
they experienced frustration arising from poor
speech recognition and high word error rates
while interacting with Alexa. For example, one
participant noted that “Alexa was not able to
understand my voice and its frustrating and tir-
ing to ask same thing again and again”. Such
errors can result in problems of participants be-
ing able to frame their desired query and recog-
nition errors leading to Alexa making mistakes
in interpreting the query correctly, leading to
incorrect responses.
(c) Difficulty in asking questions: This was the
most important reason identified by partici-
pants, 75% of whom indicated that they had dif-
ficulty in creating queries. They were unable to
search effectively since they had no background
knowledge about the subject. Alexa offered
no formal support to them in forming queries,
and its answers were too precise to enable the
searcher to build their known of the subject, as
they might with a standard web search engine.
(d) Others: A number of other factors were iden-
tified, the key ones were ”Interruptions” (2%)
and ”Cognitive Load” (4%). Participants ob-
served that they were unable to complete
queries due to interruptions by Alexa. In these
cases, Alexa took a partially completed query
as finished, and interrupted in the middle of the
process of entering the query, providing results
which may confuse the searcher or force them
to repeat or reformulate the query. This put
cognitive load and strain on the participants.
4.1.2 Opportunities
Our investigations using the standard Alexa applica-
tion highlight some critical areas which provide op-
portunities to improve exploratory search.
1. Background knowledge support and effective
knowledge representations: The Alexa applica-
tion provides fact-based answers, but does not
support the user in learning and refining the
search. After conducting this study, we propose
that the user should be provided with information
related to their search query based on facts which
could help them to create more effective queries.
2. Priming, dialogue-driven approach and interactive
search process: We observed that Alexa did not
actively engage user in the search process, with
high levels of user frustration. To reduce the frus-
tration and enhance search effectiveness, we pro-
pose to introduce a dialogue driven approach to
the search process.
4.2 RQ2: What Characteristics of Alexa
Prevent It from Functioning as an
Effective Tool for Complex
Information Seeking?
Our investigation found that Alexa did not support in-
formation seeking more complex than simple lookup
activities. This was reported by more than 45% of
participants. Its conversational agent has not been de-
signed to support typical information seeking strate-
gies to help the user. Two important factors in infor-
mation seeking strategies are: exploration and learn-
ing, which can be further subdivided into acquiring
knowledge, interaction with information sources, and
engagement with information sources, comparing,
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
309
reasoning, analysing evaluation, discovery, planning
and forecasting. The three major reasons for poor in-
formation seeking in standard Alexa are reported to
be: lack of background knowledge due, to which the
user was unable to create the right query, Alexa not
being able to correctly recognise the user’s query, and
poor representation of knowledge by Alexa.
4.3 RQ3: What Are the Main
Expectations of Conversational
Search Systems?
From our study we found five major expectations of
our participants for search in conversational systems.
4.3.1 Exploratory
A conversational search system should provide a
broad information space to the user give them the op-
portunity to explore a space of relevant information
and to narrow the exploration to focus on addressing
their information need.
4.3.2 Content Selection
Our investigation showed that the important variables
with respect to the user experience in search are as fol-
lows: the average number of interactions, the number
of successful interactions, the number of unsuccess-
ful interactions, the average time to complete a search
task, and the quality of the presented text. Based on
our results, we can conclude that the average interac-
tion failure rate is around 62%, which is very high.
For the total of 438 interactions the average total time
of interactions by a user is 5.6 minutes for the default
Alexa system. We can see from the above figures that
this engagement is very inefficient leading to demoti-
vation and frustration for the user.
4.3.3 Content Interactions
Searcher interactions include use of multiple Alexa
skills including navigation skills, presentation skills
and speech skills of conversational agents.
1. Navigation skills: A conversational search agent
should support the user in navigation through the
information space or the documents.
2. Presentation skills: The user expects presentation
in different modes. More than 85% of users con-
sidered the combination of all three dimensions
(Text, Speech and Images) are required to present
the information in the most appropriate model
during the search.
3. Speech skills: The speech skills can be classified
based on multiple parameters, including speech
speed, speech recognition, interruptions, speech
content and its length. Searchers expected speech
recognition to support standard speaking speeds,
normal day-to-day length of spoken input, low
levels of interruption and high accuracy speech
recognition.
4.3.4 Information Representations
In our study, we found that 18% of the searchers
reported that the information represented during the
search process (in default setting) was poor. They also
found difficulty in maintaining the contextual infor-
mation flow during the task.
We observed that the various factors can refine the
information representations. These factors are: length
of the text shown on the screen (optimal), query rel-
evant information, the structure of the presentation
(right combination of text, images and speech) and
the flow of conversation and information.
4.3.5 Conversational Properties
Our study also indicates properties that every con-
versational search system should have (Vyas, 2017)
(Staven, 2017).
1. On boarding: This is the initial interaction in
which the user is introduced to the system in
which it explains its competencies.
2. System as teacher: The user expects a system
should ease their interactions by revealing its ca-
pabilities and essentially teach the user how to use
the system. In our study we observed that users
who have previous experience with the conversa-
tional application interacted with it for longer (7.2
minutes) than users who using it for the first or
second time (5.8 minutes).
3. Incite: We observed that most of the interactions
were one-way, with the system unable to engage
in useful dialogue with the user. However, ide-
ally a conversational application should engage
with back-and-forth dialogue with the user to as-
sist them in reaching goal.
4. Diverge flow and course corrections: A conver-
sational application should be robust. As such, it
should be able to handle any unexpected entries
from the users and use this to guide the user to-
wards their goal.
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications
310
Figure 3: Flow Chart for Custom Alexa Dialogues strategy.
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
311
Figure 4: Alexa Custom Search.
5 DIALOGUE STRATEGIES TO
SUPPORT CONTENT
ENGAGEMENT
As a result of investigations with the standard Alexa
application, we sought to develop a new dialogue
strategy for the Alexa assistant with the goal of im-
proving its ability to support exploratory search. We
refer to this revised Alexa application as Alexa Cus-
tom search. We implemented this as an Alexa skill
designed to enable a user to carry out exploratory in-
teractive search with Alexa.We deployed this as a pro-
totype using the Amazon Echo show and investigated
its effectiveness using a study following the same ex-
perimental setup as used in our exploration of stan-
dard Alexa application for exploratory search.
A total 31 search sessions were conducted us-
ing same participants as the standard Alexa study,
but with a different backstory assigned to each user
following a Latin square backstory assignment pro-
cess to avoid biasing effects between participants and
assigned backstories. The participants were again
given the opportunity to familiarize themselves with
the application for 5-10 minutes prior to beginning
the search task, had to complete pre-search and post-
search questionnaires during the search session, and
also to participate in a semi-structured interview at
the conclusion of the search task. The Custom Alexa
skills were developed iteratively using a series of pi-
lot studies with informal feedback from participants
prior to the formal evaluation described below.
5.1 Dialogue Strategy
The dialogue strategy was designed to enable users
to search and explore long retrieved documents, and
to facilitate two-way interaction between Alexa and
the searcher via a dialogue. The dialogue strategy has
two major components: developing the skill (train-
ing Alexa based on the new skills, e.g., search and
greetings) using an Amazon Developer account, and
the second part to embed the search process into the
dialogues by using python and AMAZON Alexa de-
veloper interface.
Developing Alexa Skills: We designed two intents
for the Alexa custom skill. These actions were
developed to fulfill spoken requests entered by the
searcher. Each intent has at least one trigger utterance,
a predefined word or phrase which the user might
say to invoke the intent. The intents are Greeting
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications
312
(trained to answer greetings, unexpected questions
and non-relevant questions with respect to a search)
and Search (trained to identify a search query pass
it to the search system and to present the response
from the search system to the users). Each intent was
trained using likely user utterances with a correspond-
ing response which would be expected by the users.
We trained 80 alternative utterances for both intents.
The utterances were collected using a small survey
among a group of undergraduate students.
Embedding the Search Process with Dialogue: Alexa
skills supported us to identify and classify different
intents input by the user. Once Alexa identifies the
input as the search intent and passes the user input
to the search process which extracts the search query
from the user input. The extracted query is searched
based on similarity matching in the query archive and
which asks for confirmation from the user regard-
ing the search query. Each successful user-confirmed
query is saved in our query archive, which we can
subsequently utilise for similarity matching with fresh
queries. This is to confirm that the correct query has
been identified with the goal of reducing the error rate
and improving the reliability of relevant search re-
sults. This acknowledgement from the user regarding
confirmation of the query triggers the search process.
All the responses from the search process were em-
bedded within dialogue based on our dialogue strat-
egy.
5.2 Search Process
Figures 4 and 3 illustrates the complete search process
workflow. The search process triggered by the users
contains the following sub-modules:
1. Calling Duck Duck API: The query is passed to
the Duck Duck go search Application Program-
ming Interface (API). The titles of the top 3 doc-
uments returned are displayed on the Alexa Echo
Show screen. The searcher can then select one
of these by saying ’Open 1’ or ’Open <document
name>’, or they may request more results by re-
jecting the displayed items by saying ’No, show
me more results’. Alternatively, the searcher may
change their query and restart the search process
by saying ’Alexa start search’. Once the user has
selected an item from the displayed results, the
dialogue strategy triggers the Wikipedia API. The
role of the Duck Duck Go API here is essentially
to identify a focused short form query which can
be used for entry to the Wikipedia API to ex-
tract wikipedia documents to provide options to
the users.
2. Calling Wikipedia API: The title of the document
selected by the user is passed to the Wikipedia
search API. The section and subsection head-
ings of the highest-ranking retrieved item are then
shown to the user. The searcher can then se-
lect sections and subsections of the returned doc-
uments. These selected parts of the document are
then summarised using the summarisation com-
ponent outlined below. The document navigation
options enable the searcher to explore the individ-
ual summarised parts of the document.
3. Calling the Summarizer: A summarizer is used to
display the important content of a document sec-
tion. The Echo Show displays the summary along
with further sub-options, as shown in Figure 4
The summarizer splits a whole paragraph into sen-
tences. Each sentence is considered an individual
document. The ”frequency” (TF) of each word in
the document is calculated along with the inverse
document frequency (IDF) of each word in the
document. A normalised TF-IDF score of each
sentence is then by summing the TF-IDF scores of
each sentence and dividing by the word length of
the sentence.The top 50% of scoring sentences are
extracted. The Density-Based Spatial Clustering
of Applications with Noise (DBSCAN) clustering
algorithm divides the sentences into “clusters. A
cosine similarity score is calculated between the
number of clusters and the section name. Based
on the cosine similarity score, the top 70% of clus-
ters are selected
and are presented as the summary arranged in the
same order as they appear in the actual paragraph.
The searcher can explore further subsections or go
back to the previous view. As soon as the user
choses any section or sub-section to explore, the
summarizer extracts its summary.
5.3 Additional Functionality
Our Custom Alexa skill also displayed images asso-
ciated with the displayed subsections provided by the
Wikipedia API. Images are scored using cosine sim-
ilarity between the labels of the images and the title
of the selected Wikipedia subsection. The image with
the highest similarity is shown with the contents of
the selected subsection.
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
313
6 INVESTIGATING THE
EFFECTIVENESS OF ALEXA
CUSTOM SEARCH
6.1 RQ4: How Well Does the Custom
Alexa Dialogue System Support
Exploratory Search?
The research question is divided into three sub-
questions.
6.1.1 RQ4(1): How Effectively Does It
Communicate Information to the User?
1. Dialogue Strategy: In the semi-structured inter-
view, carried out in this second study, searchers
reacted positively to Custom Alexa. Around 45%
of them found it helpful, 18% found noted that
it was interactive. 12% of searchers claimed that
in the custom Alexa setting, Alexa provided them
suggestions. Users also found setting 2 interest-
ing, easy to understand with comfortable speed
and structure unlike what they experienced in the
default setting.
2. Structure of representation of information: Rep-
resentation of the information was one of the key
criteria. We broke down the information structure
into two major components: content of the docu-
ment and the representation of the information.
(a) Content of document: Around 87% of partic-
ipants were satisfied with the document infor-
mation provided by custom Alexa. The custom
setting was able to satisfy around 85% of the
user’s information needs.
(b) Structure of text: Around 77% of searchers
were satisfied with the text structured provided
by custom alexa, while 9.7% users were more
content with the standard Alexa text structure.
We can say that, the custom setting was able to
satisfy 75% of the searchers. The reason be-
hind choosing the Alexa custom settings were:
ease of use (7.5%), ease of information seeking
(37.5%), its interactive nature (30%) and that
the information is more relevant and informa-
tive (25%).
6.1.2 RQ4(2): How to Verify the User
Understanding, Satisfaction and Search
Success in the Dialogue-Based Exploratory
Search Process?
The statistically significant indicates where P<0.05.
This signifies the experience of the users in custom
Table 2: Comparison between Default Setting and Custom
Setting with statistical Testing: Two tailed T.
Variables
Default Custom P Value
Text Quality 3.1 3.9 0.00012
Navigation Skills 2.7 4.1 0.00001
Speech Skills 3 4.2 0.00013
Presentation Skills 3.3 3.9 0.00070
Better Understanding 3.3 3.7 0.85020
Knowledge Expansion 3.2 3.9 0.00146
Cognitive Engagement 3.4 3.9 0.16490
Search Session Success 3.1 3.7 0.01354
Suggesting skills 2.5 3.7 0.00140
Alexa stop 1.8 1.3 0.28242
Ease of Multimodal 3.7 4 0.16067
search is statistically significant.
1. Knowledge gain and search success: Searchers
rated (out of five) a range of variables comparing
the Default setting and Custom setting of Alexa as
shown in Table 2. Searchers rated Custom Alexa
skills higher for most of the variables (as shown
in Table 2) in comparison to the Default search
setting.
2. Summary comparison: To verify the expansion in
knowledge, we conducted a comparison of pre-
task summary and post-task summary in Default
and Custom Alexa settings using a standard com-
parison methodology. The summary comparison
is based on three standard parameters named as D-
Qual, D-Intrp and D-Crit as defined explained in
Table 3 and Table 4 (Wilson and Wilson, 2013).
The difference between all factors in pre-search
task with post-search task is greater in custom (C)
setting than the default (D) setting. This indi-
cates where searcher wrote a better summary with
more facts and analysis when using Custom Alexa
search.
6.1.3 RQ4(3): Can Priming Help in Information
Seeking and Reducing Error in
Conversation?
During our interview sessions, users were asked about
their experiences using the Alexa Custom setting.
They answered questions relating to two dimensions:
i) reasons to prefer Alexa Custom setting and ii) what
are the challenges of using the Alexa Custom setting.
1. Reasons to prefer Alexa Custom setting: The
top three reasons to choose Alexa custom set-
ting were: Navigation and Directed Search (13%),
Relevant and More Informative (21%), and Op-
tions and Suggestions (19%). Overall, the users
found it informative, well directed search, and that
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications
314
Table 3: Summary Comparison Metric (Wilson and Wilson, 2013).
Parameter Definition
Dqual Comparison of the quality of facts in the range 0-3 where
0 is irrelevant facts and 3 specific relevant facts.
Dintrp Measures the association of facts in a summary in the range 0-2
where 0 represents no association of the
facts and 2 that all facts in a summary are associated with each other in a meaning.
Dcrit Examines the quality of critiques of topic written by the author in range
the 0-1 where 0 represents facts are listed with a thought or
analysis of their value and 1 where both advantages and disadvantages of the facts are given.
Table 4: Summary comparison.
Parameters Default Custom P <0.05
Alexa Alexa
Dqual 14 35 0.0090
Dintrp 17 37 0.0157
Dcrit 15 21 0.0001
it provided options which gave them the opportu-
nity for exploration throughout the search process.
2. Challenges of using Alexa Custom setting:
Around 40% of searchers were happy and did
not find any challenges in using Alexa Cus-
tom setting, in contrast to the Default setting
where around 95% people found it challenging.
The three top challenges were too many options
(10%), slow speed (10%) and less freedom to ask
(10%). We considered these results to be pos-
itive outcomes for Alexa Custom setting since
searchers found the response of Default Alexa set-
ting too fast (i.e. spoken responses were delivered
too fast to be fully comprehended) and that it was
unable to provide suggestions and options during
the search process.
6.2 RQ5: What Is the User Search
Behaviour and Experience with the
Conversational System in an
Exploratory Search Setting?
Our final research question focuses on comparing user
behaviour patterns during an exploratory search us-
ing Default Alexa and Custom Alexa based on user
interaction and self-reported answers in the question-
naires.
1. Custom Alexa: Based on our analysis of user in-
teractions (Table 2), we observe that participants
found Alexa custom Search more cognitively en-
gaging than the default application. This obser-
vation implies two conclusions, that the custom
application can hold the participant’s interest in
the search process, and also that participants were
able to learn and understand more using the cus-
tom application. Some users reported a lower
level of knowledge of the topic before commenc-
ing the search task. However, they were interested
in the topic, which led them to engage with rele-
vant content with a very high of interaction during
which they explored in great depth. Other users
who also began with less knowledge had notably
less interest in the topic, showed strong engage-
ment with a limited amount of content but did
not explore the retrieved content so widely. Other
users with little initial knowledge of the topic but
a very high interest it, engaged more with diverse
sources of content, but with less interaction and
less detailed examination of specific areas of con-
tent. The majority of users were comfortable with
the multi-modality of their engagement and were
satisfied with the exploratory custom search inter-
face.
2. Default Alexa: In this setting, some users with
less background knowledge of the topic and en-
gaged repeatedly with the limited content by re-
peating the same queries to enhance their under-
standing and search experience with topic. Other
users restricted themselves to only few queries
since the poor speech recognition that they expe-
rienced led to frustration. Other users expected
more options and suggestions to be given by the
system as per convention of their previous expe-
riences with information retrieval systems. Gen-
erally, cognitive engagement with the system was
less in comparison to the Alexa custom applica-
tion. Most of the users were not very comfort-
able with the multi-modality available in the de-
fault setting. In general, we found that the indi-
vidual pieces of information provided in default
responses were not sufficient to develop a broad
knowledge of the topic, resulting in poor post-
search summaries.
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
315
7 CONCLUSIONS AND FURTHER
WORK
We have described a study examing the use the stan-
dard Alexa assistant for exploratory search tasks. This
demonstrated that while it is generally able to an-
swer factoid type questions quite successfully, it is not
able to support the requirements of more exploratory
search tasks. Our study highlighted these shortcom-
ings in terms of the need to examine multiple retrieved
items and specifically engaging with larger items in
order to locate relevant information. In response to
the identified shortcoming, we proposed and imple-
mented a customised Alexa application specifically
designed to address these. A second study examining
our Custom Alexa application showed that it is able
to successfully address the identified problems and is
well received in terms of usability by the participants
in our experimental study.
While our study shows how existing commercial
conversational digital assistant applications such as
Alexa can be successfully extended to support ex-
ploratory search. this is only an initial prototype. In
order to refine the features of our prototype appli-
cation we need to study the different components in
more detail to better understand the specific require-
ments of searchers and how these can be supported by
conversational features. Further, it would be interest-
ing to explore the possibility of the assistant capturing
knowledge to which the user is exposed while carry-
ing out an exploratory search task and using this to
directly help the user in a conversational manner as
they progress through the search process.
ACKNOWLEDGEMENT
This work was supported by Science Founda-
tion Ireland as part of the ADAPT Centre (Grant
13/RC/2106) at Dublin City University.
REFERENCES
Akon, H. (2018). Echo show simplified user manual: A
simplified step by step amazon echo show user guide
that will help you explore the full capabilities. of ama-
zon echo show. 100% simplified!
Avula, S. and Arguello, J. (2020). Wizard of oz interface
to study system initiative for conversational search. In
Proceedings of CHIIR 2020, pages 447–451.
Avula, S., Arguello, J., Capra, R., Dodson, J., Huang,
Y., and Radlinski, F. (2019). Embedding search
into a conversational platform to support collaborative
search. In Proceedings of CHIIR 2019, pages 15–23.
Avula, S., Chadwick, G., Arguello, J., and Capra, R. (2018).
Searchbots: User engagement with chatbots during
collaborative search. In Proceedings of CHIIR 2018,
pages 52–61. ACM.
Belkin, N. J., Cool, C., Stein, A., and Thiel, U. (1995).
Cases, scripts, and information-seeking strategies: On
the design of interactive information retrieval systems.
Expert systems with applications, 9(3):379–395.
Biehl, M. (2019). Making money with alexa skills.
Braun, V. and Clarke, V. (2013). Successful qualitative re-
search: A practical guide for beginners. sage.
Hoy, M. B. (2018). Alexa, siri, cortana, and more: An in-
troduction to voice assistants. Medical Reference Ser-
vices Quarterly, 37(1):81–88. PMID: 29327988.
Janarthanam, S. (2017). Hands-on chatbots and conver-
sational UI development: build chatbots and voice
user interfaces with Chatfuel, Dialogflow, Microsoft
Bot Framework, Twilio, and Alexa Skills. Packt Pub-
lishing Ltd.
Krathwohl, D. R. (2002). A revision of bloom’s taxonomy:
An overview. Theory into practice, 41(4):212–218.
Leech, G. and Weisser, M. (2003). Generic speech act an-
notation for task-oriented dialogues. In Proceedings
of the corpus linguistics 2003 conference, volume 16,
pages 441–446. Lancaster: Lancaster University.
Loisel, A., Chaignaud, N., and Kotowicz, J.-P.
(2009). Modeling human interaction to design a
human-computer dialog system. arXiv preprint
arXiv:0911.5652.
Lopatovska, I., Rink, K., Knight, I., Raines, K., Cosenza,
K., Williams, H., Sorsche, P., Hirsch, D., Li, Q.,
and Martinez, A. (2018). Talk to me: Explor-
ing user interactions with the amazon alexa. Jour-
nal of Librarianship and Information Science, page
0961000618759414.
L
´
opez, G., Quesada, L., and Guerrero, L. A. (2018). Alexa
vs. siri vs. cortana vs. google assistant: A compari-
son of speech-based natural user interfaces. In Nunes,
I. L., editor, Advances in Human Factors and Systems
Interaction, pages 241–250, Cham. Springer Interna-
tional Publishing.
Radlinski, F. and Craswell, N. (2017). A theoretical frame-
work for conversational search. In Proceedings of the
2017 Conference on Conference Human Information
Interaction and Retrieval, pages 117–126. ACM.
Sitter, S. and Stein, A. (1992). Modeling the illocutionary
aspects of information-seeking dialogues. Informa-
tion Processing & Management, 28(2):165–180.
Staven, T. (2017). What makes a good bot or not?
Trippas, J. R., Spina, D., Cavedon, L., Joho, H., and Sander-
son, M. (2018). Informing the design of spoken con-
versational search: Perspective paper. In Proceedings
of CHIIR 2018, CHIIR ’18, pages 32–41, New York,
NY, USA. ACM.
Trippas, J. R., Spina, D., Cavedon, L., and Sanderson, M.
(2017a). Crowdsourcing user preferences and query
judgments for speech-only search. In 1st SIGIR Work-
shop on Conversational Approaches to Information
Retrieval (CAIR’17).
HUCAPP 2023 - 7th International Conference on Human Computer Interaction Theory and Applications
316
Trippas, J. R., Spina, D., Cavedon, L., and Sanderson, M.
(2017b). How do people interact in conversational
speech-only search tasks: A preliminary analysis. In
Proceedings of CHIIR 2017, pages 325–328.
Vyas, B. (2017). 6 key metrics to measure the performance
of your chatbot.
Website, A. O. (2020). Get started with the alexa skills kit.
Wilson, M. J. and Wilson, M. L. (2013). A comparison
of techniques for measuring sensemaking and learn-
ing within participant-generated summaries. Journal
of the Association for Information Science and Tech-
nology, 64(2):291–306.
Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
317