MovieOcean: Assessment of a Personality-based Recommender System

Luca Rolshoven

, Corina Masanti

, Jhonny Pincay

2,3 a

, Luis Ter

, Jos

e Mancera

and Edy Portmann

2 b

University of Bern, Hochschulstrasse 6, Bern, Switzerland

Human-IST Institute, University of Fribourg, Boulevard de P

erolles 90, Fribourg, Switzerland

Pontiﬁcia Universidad Cat

olica del Ecuador, Av. 12 de Octubre 1076, Quito, Ecuador

{jhonny.pincaynieves, luis.teran, josealberto.manceraandrade@unifr.ch, edy.portmann}@unifr.ch

Keywords:

Recommender Systems, Personality-based Recommenders, Personalized Recommendations, Big Five Model.

Abstract:

This research effort explores the incorporation of personality treats into user-user collaborative ﬁltering

algorithms. To explore the performance of such a method, MovieOcean, a movie recommender system

that uses a questionnaire based on the Big Five model to generate personality proﬁles, was implemented.

These personality proﬁles are used to precompute personality-based neighborhoods, which are then used

to predict movie ratings and generate recommendations. In an ofﬂine analysis, the root mean square error

metric is computed to analyze the accuracy of the predicted ratings and the F1-score to assess the relevance

of the recommendations for the personality-based and a standard-rating-based approach. The obtained

results showed that the root mean square error of the personality-based recommender system improves when

the personality has a higher weight than the information about the user ratings. A subsequent t-test was

conducted for the proposed personality-based approach underperformed based on the root mean square error

metric. Furthermore, interviews with users suggested that including aspects of personality when computing

recommendations is well-perceived and can indeed help improve current recommendation methods.

1 INTRODUCTION

While collaborative ﬁltering methods are widely

used in implementing recommender systems (RS)

with good results, there is still room for improvement

in the quality of recommendations. The advent of

streaming platforms in the last few years (e.g.,

Netﬂix, Disney+, and Paramount+) and the easiness

of accessing a broad amount of media content have

led many companies and researchers to ﬁnd methods

that provide better recommendations to engage users.

Because personality is about thinking, feeling, and

behaving, it seems coherent to think that it is possible

to improve the utility of recommendations by

harnessing personality data.

Authors such as Aaker (1999) had found that

brands that were associated with a set of personality

traits were perceived as more favorable when the

individuals were schematic on the personality

dimension that was also highly descriptive for the

https://orcid.org/0000-0003-2045-8820

https://orcid.org/0000-0001-6448-1139

brand. Mulyanegara et al. (2009) employed the Big

Five personality scale to assess the personality of 251

subjects in ﬁve dimensions and used those

assessments to measure the relationship between

consumer personality and brand personality in the

context of fashion products. Although the results

may differ in each domain, they suggest that

including personality aspects in the implementation

of RS engines could enhance the quality of

recommendations. The results of the research work

of Weaver Weaver III (1991) support the conception

that there is a correlation between personality traits

and media preferences across a variety of media.

This work addresses the following questions: (i)

how reliable are personality-based neighborhoods for

the computation of recommendations, and (ii) to

what extent are personality-based recommendations

perceived as more valuable than those based on

collaborative ﬁltering? To answer these questions,

the authors implemented MovieOcean, a

personality-based movie recommender system that

uses the Big Five model to generate personality

proﬁles for each user. These proﬁles are used to

690

Rolshoven, L., Masanti, C., Pincay, J., Terán, L., Mancera, J. and Portmann, E.

MovieOcean: Assessment of a Personality-based Recommender System.

DOI: 10.5220/0011002500003179

In Proceedings of the 24th International Conference on Enterprise Information Systems (ICEIS 2022) - Volume 1, pages 690-698

ISBN: 978-989-758-569-2; ISSN: 2184-4992

precompute personality-based neighborhoods, then

used to predict movie ratings and generate

recommendations. Furthermore, the resulting

recommendations are assessed by conducting

experiments and measuring the root mean square

error (RMSE) and F1-score.

This article is structured as follows: Section 2

presents the theoretical background in which this

research effort is grounded. The methods developed

and applied in the execution of this work are

described in Section 3. Results are presented in

Section 4. Section 5 ﬁnishes the article with a

summary of the work conducted and the lessons

learned.

2 THEORETICAL BACKGROUND

Through the presentation of related work, this section

introduces the concepts and theories used to address

the above mentioned research questions.

2.1 Personality and the Big Five Model

The American Psychological Association (APA)

deﬁnes personality as follows (American

Psychological Association, 2020): “Personality

refers to individual differences in characteristic

patterns of thinking, feeling, and behaving. The study

of personality focuses on two broad areas:

Understanding individual differences in particular

personality characteristics, such as sociability or

irritability. The other is understanding how the

various parts of a person come together as a whole.”

When discussing personality and personality

characteristics in differential psychology, it is

essential to provide a shared taxonomy. The Big Five

personality model was developed as a result of the

ﬁndings of many study initiatives. It is made up of

ﬁve components (John et al., 2008):

• Openness: People with high openness tend to be

more original, open-minded, experimental, and

creative.

• Conscientiousness: People with high

conscientiousness tend to be more orderly,

responsible, goal-oriented, and organized.

• Extraversion: People with high extroversion

tend to be more energetic, enthusiastic, active,

sociable, and assertive.

• Agreeableness: People with high agreeableness

tend to be more altruistic, affectionate, modest,

trustful, and cooperative.

• Neuroticism: People with high neuroticism tend

to be more nervous, anxious, sad, and tense.

Which can be abbreviated as OCEAN.

The Big in Big Five was chosen to emphasize the

fact that these ﬁve factors are extensive and that they

were selected to represent personality at the broadest

level of abstraction (Pervin and John, 1999).

2.2 Personality Correlations

People differ signiﬁcantly from each other based on

their personality traits. In psychology, researchers

aim to ﬁnd the most deﬁning characteristics detailed

enough to capture a personality but general enough

to avoid high complexity. Various studies have

proven a strong correlation and have also shown how

it can produce relevant predictions.

For instance, Rentfrow and Gosling (2003)

demonstrated how personality traits inﬂuence the

preferred music type. Also, Barrick and Mount

(1991) found out that there is a close relationship

between personality characteristics and job

performance. Other studies showed how personality

could inﬂuence the everyday online behavior of a

person. Preferences for speciﬁc social networks or

the amount of time that one spends online are also

correlated with the personality of the user, as Zhong

et al. (2011) were able to present in their manuscript.

Besides, Hu and Pu (2013) managed to show the

correlation between personality characteristics and

user rating behavior for retail products.

Cantador et al. (2013) looked at the relationship

between personality types and tastes in different

entertainment ﬁelds. Movies, TV shows, music, and

books were among the domains they selected. They

looked at data from 53 226 Facebook users to see any

signiﬁcant links between personality traits and

preferences in the domains listed. In each of the

chosen domains, they generated personality-based

user stereotypes and association rules for 16 genres.

2.3 Other Related Work

In the literature, it is possible to ﬁnd several

approaches to personality-based recommender

systems. Paiva et al. (2017) proposed a hybrid

recommendation approach that combines the Big

Five personality traits of the users with a correlation

between car fronts and power and sociability

perceptions to be used for semantic searches in

vehicle sales portals. They used the BFI-10 inventory

(Rammstedt and John, 2007), which solely consists

of 10 questions, to assess the Big Five traits of their

users. Then, they used the Euclidean distance

MovieOcean: Assessment of a Personality-based Recommender System

691

between users to form a neighborhood of size k = 3,

which is used with the correlations described above

to offer recommendations.

Braunhofer et al. (2015) used the Five-Item

Personality Inventory (FIPI), which includes even

fewer questions, to assess the personalities of the

users and then incorporate them into their

recommender system for Point of Interest (POIs)

recommendations. Their recommendation algorithm

makes use of matrix factorization (Koren et al.,

2009) and incorporates the demographic data of the

users as well as their personalities.

Another approach is to use a standard user-user

collaborative ﬁltering algorithm and change the

similarity metrics to include personality information.

Tkal

c et al. (2009) used this method and

determined that their personality-based similarities

led to an F1-score that was statistically equivalent to

the one obtained by using rating-based similarities.

However, they state that the personality-based

approach has three advantages: (i) an initial

questionnaire solves the new user problem, (ii) the

similarity computation between users is less

expensive, and (iii) the impact of the sparsity

problem is lowered. To assess the personalities of the

users, the authors used a version of the International

Personality Item Pool (IPIP) with 50 questions

(Goldberg et al., 2006).

In the realm of movies, Nalmpantis and Tjortjis

(2017) investigated the impact of integrating

personality into the recommendation process. They

used a hybrid approach to compute the

recommendations, in which a questionnaire was used

to measure the user’s personality characteristics. The

personality scores are then used together with the

ﬁndings of Cantador et al. (2013) to identify genres

that the consumer might enjoy. They estimated

movie ratings using collaborative ﬁltering, for which

users had to rate 20 movies in advance. Finally, the

recommendations were generated by considering the

movie ratings and the user’s personality with a

weight of 0.5 each. They compared their system to

one that only considers recommendations based on

ratings and discovered that users preferred the

50%-50% approach to the rating-based method.

Considering previous research ﬁndings, using

personality characteristics to perform movie

suggestions seems to be a suitable way of enhancing

the quality and usefulness of the recommendations.

Compared to previous efforts, this work aims to

evaluate the effect of creating a personality-based

neighborhood and determine whether incorporating

personality traits increases the perceived value of the

recommendations among users.

3 METHOD AND USE CASE

The approach used in this study consists of three

major stages: (i) personality assessment, (ii)

recommendation engine design, and (iii) evaluation.

The following sections go into greater detail about

these phases and the implementation of MovieOcean.

Besides that, the movie information was obtained

from The Movie Database (TMBD)

through its free

developer API.

3.1 Personality Assessment

The personality of a user was determined with the

help of the Big Five model introduced in Section 2.1.

A standardized questionnaire containing 50

questions

from the (IPIP) was used. This

questionnaire is managed by the Oregon Research

Institute (Goldberg et al., 2006).

When the questionnaire is applied, each

participant can score up to 50 points in each

personality dimension (Openness,

Conscientiousness, Extraversion, Agreeableness, and

Neuroticism). The higher the score in a dimension,

the more the person can be described by this speciﬁc

personality trait. The particular questions consist of

statements that have to be rated on a 5-point Likert

scale from Very Inaccurate to Very Accurate. There

were precisely ten questions for each personality

dimension, some of them being positively keyed and

some of them being negatively keyed. For a question

that is positively keyed, the answer Very Inaccurate

would add one point to the corresponding personality

trait. In contrast, the answer Very Accurate would

result in ﬁve points added to that trait. For questions

that were negative keyed, the opposite order applies.

After completing the questionnaire, users could

see how many points they had obtained in each

personality dimension.

3.2 Recommendation Engine Design

To build the recommendation engine, a user-user

collaborative ﬁltering algorithm was used.

Neighborhoods of size ﬁve were built for each user.

Neighborhoods were recalculated each time a new

user completed the IPIP questionnaire.

Recomputation of the neighborhoods was also

triggered if a user with an existing personality proﬁle

decided to change one or more of their answers in the

questionnaire. Equation 1 was used to calculate the

similarity of two users a and k.

https://www.themoviedb.org/documentation/api

https://ipip.ori.org/new ipip-50-item-scale.htm

ICEIS 2022 - 24th International Conference on Enterprise Information Systems

692

sim(a, k) =

1 + p



sim

cos

(a, k) + p · sim

pers

(a, k)



(1)

The variable p is the personality factor used to

change the importance we want to give to the

personality in contrast to the ratings. In MovieOcean,

the value of p was set to 2, as the main focus was on

the personality more than on the ratings. Equation 2

was used for calculating the cosine similarity

between users a and k.

sim

cos

(a, k) =

∑

i∈I

k,i

· r

a,i

∑

i∈I

k,i

)

∑

i∈I

a,i

)

, (2)

where I is the set of movies that were rated by

both users and r

k,i

is the rating of user k for movie

i. The personality similarity sim

pers

(a, k) of the two

users can be calculated by using Equation 3:

sim

pers

(a, k) =

kφ(a) − φ(k)k + ε

(3)

In Equation 3, φ(a) corresponds to the personality

embedding, (i.e., a vector containing the scores for

each personality dimension of user a). Moreover, the

term ε is a small value used to avoid a division by

zero. The predicted rating of a movie i for user a is

then calculated using the standard prediction formula

in Equation 4.

a,i

= ¯r

∑

u∈N

u,i

− ¯r

) · sim(a, u)

∑

u∈N

|sim(a, u)|

(4)

In the formula above, N

is a set of users in the

neighborhood of the active user a, ¯r

is the mean of

all the ratings issued by user a and ¯r

the mean rating

of user u respectively. This approach works as long

as some users already have an established personality

proﬁle and ratings in the system. Otherwise, it

cannot deliver good results. This issue is known as

the cold start or new user problem. To address this

problem, movies with high ratings that were of a

genre well received by this particular personality

type were recommended to the user. To establish the

link between the personality and movie genres, the

results of Cantador et al. (2013) were used. Then the

three movie genres that had the least Euclidean

distance to the user’s personality were selected. The

user would then receive recommendations based on

these three genres and existing movie ratings, fetched

from TMDB.

3.3 Evaluation Method

The personality-based approach presented in this

work was compared with the standard rating-based

approach of user-user collaborative ﬁltering to

evaluate the recommendation engine. Experiments

with different similarity functions and various values

for the neighborhood size and the personality factor

were conducted. Besides, it was veriﬁed how the

results changed by considering additional weighted

demographic data. To numerically capture which

approach performs better, two measurements were

applied: RMSE for assessing the accuracy of the

estimated ratings and the F1-score to measure the

relevance of the recommendations.

3.3.1 Similarity and Neighborhood

Experiments with different similarity functions and

different values for the neighborhood size were

conducted to ﬁnd the best possible combination. For

the similarity function, four different methods were

chosen Choi and Suh (2013): the cosine similarity,

the Pearson correlation coefﬁcient (PCC), the

inverted Euclidean distance, and the Jaccard index.

Our grid search procedure looked at neighborhoods

of sizes 5, 7, 9, 11, 13, and 15. In practice, the

neighborhood size is often larger than 15. In this

case, however, large neighborhoods were not used

because the dataset was small. A neighborhood of

ﬁfteen users is already big compared to the total

number of users that we could include in the

evaluation. We ﬁrst used the similarity function to

build the neighborhoods and then calculated the

predicted ratings of each user. In each step of the

grid search, one of the four similarity metrics was

used to build the neighborhood, either in the space of

user ratings or in the space of user personalities.

However, there was one exception: when looking at

the Jaccard Index, it was not used to build the

personality-based neighborhoods but only to

compute the rating similarity, which was still

considered in the personality-based predictions of the

ratings. Instead, the PCC was used to compute the

neighborhoods for the personality-based approach.

The Jaccard Index was not used to compute the

neighborhood of similar personalities because it does

not make sense in this scenario since the Jaccard

Index would always yield 1.

The following equations were used to compute

the predicted ratings, which are a modiﬁed version of

the standard collaborative ﬁltering algorithm

(McLaughlin and Herlocker, 2004):

MovieOcean: Assessment of a Personality-based Recommender System

693

Rating: sim(a, k) =

1 + d



sim

rating

(a, k) + d · sim

dem

(a, k)



(5)

Personality:

sim(a, k) =

1 + p + d



sim

rating

(a, k) + p · sim

pers

(a, k) + d · sim

dem

(a, k)



(6)

In these equations, sim

rating

corresponds to the

rating similarity between two users, sim

pers

to the

similarity between the personalities of two users and

sim

dem

is the demographic similarity between the

users. The factors p and d correspond to the

personality factor and demographic factor. They are

used as a weight to give importance to the

personality or demographic data. In the ﬁrst

experiment, p = d = 1 was set, and only the different

similarity metrics and neighborhood sizes were

changed. For sim

rating

and sim

pers

, the same

similarity that was applied to build the

neighborhoods was used.

The demographic similarity function sim

dem

was

the same for all experiments. It takes into account the

country of residence of the users, their age, and their

gender and is deﬁned as follows:

sim

dem

(a, k) =

ˆc + ˆg + ˆa



ˆc · δ

c(a)c(k)

+ ˆg · δ

g(a)g(k)

+ ˆa ·

|a(a) − a(k)| + 1



(7)

In Equation 7, the following helper functions were

used: c(a): returns the country of user a; g(a): returns

the gender of user a; and a(a): returns the age of user

Additionally, the Kronecker delta δ is used. It

returns 1 if the two values are equal (e.g.,

c(a) = c(k)) or 0 otherwise. The variables ˆc, ˆg, and ˆa

are the weights we give to each of the attributes. The

assumption behind this was that in some cases, it

might be important to give more importance to one

trait than another, especially if, for a given domain,

there exists knowledge about the strength of

correlations between certain demographic attributes

and the type of items that are used in the

recommender system. Finally, however, it was

decided to use the same weight for every

demographic attribute in our experiments (i.e.,

ˆc = ˆg = ˆa = 1).

3.3.2 Personality Factor

The second experiment used the best neighborhood

size and similarity measure that was found in the ﬁrst

experiment. Note that these values might vary

depending on the approach that is being tested

(rating-based vs. personality-based). The goal of this

experiment was to determine the best personality

factor in the personality-based approach. The RMSE

and F1-score were computed for several personality

factors p, each time applying 3-fold cross-validation

for each user in the dataset. First, it was attempted to

look at personality factors with p = 0, 1, . . . , 10.

However, it was noticed that this magnitude of

change did not reﬂect itself much in the evaluated

metrics. Therefore, it was decided to use larger

personality factors with p = 0, 5, 10, . . . , 50. It is also

important to note that during this experiment, the

demographic factor d was set to 1.

3.3.3 Demographic Factor

The last experiment used different values for the

demographic factor d to see whether demographic

data could improve prediction accuracy and

recommendation relevance. The best conﬁgurations

found in the two previous experiments were applied,

and only the value for d was changed. More

precisely, the demographic factor was chosen to be

d = 0, 1, . . . , 10. The results of the three experiments

are presented in detail in section 4.

3.3.4 Interviews

Besides doing a quantitative analysis of the

personality-based collaborative ﬁltering method,

opinions from the users of MovieOcean were

gathered. A small questionnaire was designed, and

interviews were conducted with three of the

registered users. Some of the questions asked

included: How did you ﬁnd the questionnaire?, How

many questions do you ﬁnd appropriate for such a

questionnaire in this setting?, Did you feel that it

captured your personality well?, and How accurate

were your recommendations?.

4 RESULTS

This section presents the results of implementing a

prototype following the methods explained in the

previous section.

ICEIS 2022 - 24th International Conference on Enterprise Information Systems

694

4.1 Prototype Implementation

A prototype of MovieOcean was implemented to

explore the incorporation of personality traits into a

recommender system. This artifact made it possible

to collect data from the users and test our

personality-based approach in movie

recommendations. In the following section, we will

give a brief overview of the implementation and

features of MovieOcean. The code of MovieOcean,

including its evaluation, is available in a public

repository on GitHub

In total, 42 users registered on the platform for

one month. To complete the registration process, the

users needed to answer a few questions about their

demographic background. These questions included

the age, the country where they lived, and the gender

of the user. Most of the users were aged between 20

to 30 years old and lived in Switzerland or Mexico.

The majority of the users were women, followed by

men, and three users picked another gender variant.

After the registration, the personality questionnaire

was presented to them. For the users that completed

the questionnaire, a personality proﬁle that contains

the scores they reached in each personality

dimension was provided. Figure 1 presents an

example of the personality radar chart shown to the

users. Moreover, by issuing a few movie ratings, the

users had the chance to improve their

recommendations.

Figure 1: Screenshot of the personality scores that a user

would see after completing the questionnaire.

Each movie can be rated on a 5-star scale, and it

is possible to give half stars. The minimum rating

that can be issued is 0.5 stars. In total, 35 out of the

42 users completed the questionnaire and had,

therefore, a personality proﬁle attached to their

account. However, only 28 users issued ratings. It

available at https://github.com/rolshoven/

MovieOcean/

was possible to collect a total of 445 movie ratings

during the studied period.

Figure 2: Comparison of the personality-based and rating-

based approach when using different similarity measures

and different neighborhood sizes.

4.2 Evaluation

The results of the ﬁrst experiment are summarized in

the plots in Figure 2. It can be observed that the

metrics improved with a more signiﬁcant number of

neighbors that were considered in the collaborative

ﬁltering algorithm. The rating-based approach with

cosine similarity achieved the best RMSE of 1.694.

The best F1-score was achieved using the

personality-based strategy and the inverse Euclidean

distance as the similarity measure. The best

conﬁguration was unambiguous for the rating-based

approach: cosine similarity and a neighborhood size

of 15 yielded the best results, both for the RMSE and

the F1-score. However, the results were more

ambiguous in the personality-based approach: The

best RMSE score was achieved using cosine

similarity and 15 neighbors. The best F1-score

occurred when using the inverse Euclidean distance

and 15 neighbors. We decided to use the inverse

MovieOcean: Assessment of a Personality-based Recommender System

695

Euclidean distance given that, in our opinion, the

F1-score is more relevant to the user than the

accuracy of the predicted ratings.

Regarding the second experiment, against our

initial assumptions, the personality factor did not

inﬂuence the F1-score of the personality-based

approach. However, it did improve the RMSE values

(as shown in Figure 3). It could be that an increasing

personality factor improves the accuracy for low or

high ratings but not much for ratings near the

threshold, which would result in better RMSE values

but the same F1-scores. On the left-hand side of

Figure 3, it can be observed that the RMSE

improvement of a more signiﬁcant personality factor

decays exponentially. While increasing the

personality factor a lot does not inﬂuence the

computational complexity, we suggest not choosing a

too high value because it will limit the importance of

the rating similarity and demographic similarity. A

good value for p seems to be somewhere between 30

and 50. In our case, p = 50 was chosen before

starting the third experiment.

Figure 3: Comparison of the RMSE and F1-Scores using

different personality factors.

The last experiment was about the importance of

demographics. The results showed that considering

additional demographic data during the prediction of

movie ratings did not inﬂuence the performance of

the personality-based approach. We infer that a

correlation between personality scores and our

demographic attributes (i.e., country, gender, and

age) could explain these results. The personality

scores could then be seen as a proxy to the

underlying demographic data, therefore already

considering certain aspects of the demographics

implicitly. On the other hand, Figure 4 shows that the

demographics inﬂuenced the results of the

rating-based approach. When we did not consider

demographic data, the RMSE value for the

rating-based approach was 1.841, whereas it was

1.694 when we set d = 1, which is an improvement

of 0.147 or almost 8%. Our results suggest that

demographic data should be included in a

rating-based approach. However, the algorithm

should not give too much weight to demographic

similarities because it will not further improve the

performance. If the recommender system already

uses information about the user’s personality, there

seems to be no need to include additional

demographic information.

Figure 4: Comparison of the RMSE and F1-scores using

different demographic factors.

As a ﬁnal comparison, a t-test on the RMSE and

F1-scores on the rating-based and personality-based

approach data in the third experiment with a

signiﬁcance level of α = 0.05 was performed. It was

done in two scenarios: First, considering no

additional demographic data (d = 0) and second,

using additional demographic data (d = 1). The

results of this analysis are illustrated in Table 1.

There was no signiﬁcant difference between the

F1-scores of the rating-based and the

personality-based method. Unfortunately, there was

a signiﬁcant difference between the RMSE values of

the two approaches, with the personality-based

method performing poorer. The RMSE values for the

rating-based approach improved when using

additional demographic data, leading to an even

smaller p-value.

On the other hand, the personality-based

approach does have some advantages over the

rating-based approach. Suppose we know that the

F1-scores of these two approaches do not differ

signiﬁcantly. In that case, we could assume that the

quality of recommendations will not worsen when

using the personality-based method.

Table 1: RMSE and F1-scores for the rating-based and

the personality-based approach with and without additional

demographic data. The last row shows the p-values of the

t-test that we performed.

Without demographics

(d = 0)

With demographics

(d = 1)

Approach RMSE F1-Score RMSE F1-Score

Rating-based 1.874 0.770 1.717 0.799

Personality-based 2.022 0.784 1.955 0.796

p-value 0.035 0.397 0.013 0.920

4.3 Interviews Outcome

The conception of recommending movies based on

the personality and the implementation of

MovieOcean was well-received by users. The design

ICEIS 2022 - 24th International Conference on Enterprise Information Systems

696

of MovieOcean was perceived as intuitive and

aesthetically pleasing. Only one out of the three

users had a problem ﬁnding a movie but also stated

that it could have been because of a typo in the

movie name. The users found it easy to navigate

through their recommendations and ﬁnd other

movies that were linked as similar movies at the end

of each movie detail view. Moreover, the personality

assessment seemed to reﬂect what the user thought

about themselves. It was mentioned that the

overview of the personality scores and the short

explanations were received well. Another point is

that the star rating system was a good idea and

brought some life into the recommender system.

5 SUMMARY AND LESSONS

LEARNED

This work explored the inclusion of personality

characteristics as an enhancement of user-user

collaborative ﬁltering algorithms. To this end,

MovieOcean, a personality-based movie

recommender system, was implemented.

MovieOcean uses a questionnaire based on the Big

Five model to generate personality proﬁles of the

users registered on the platform. With the personality

proﬁles, personality-based neighborhoods were

created to predict movie ratings and provide

recommendations. Furthermore, to assess how the

consideration of personality traits inﬂuences the

recommendations, the RMSE metric was calculated

to analyze the accuracy of the predicted ratings and

the F1-score to evaluate the relevance of the

recommendations for the personality-based and a

standard rating-based approach.

Unfortunately, it was not possible to ﬁnd

evidence that incorporating personality in

collaborative ﬁltering, in the way it was tested, leads

to better performance. The performance is

signiﬁcantly worse in terms of the RMSE of the

predicted ratings. However, the F1-scores do not

differ signiﬁcantly, which makes us think that the

recommendation quality is the same in both

approaches. If this is the case, it is still possible to

suggest using the personality-based approach

because it has several advantages as per our ﬁndings:

1. The neighborhoods of personalities can be

precomputed, which leads to a faster

recommendation process.

2. It is possible to use correlations between

personalities and movie genre tastes to overcome

the cold-start problem.

3. Users seem to enjoy the personality-based

approach, as the literature and interviews

conducted for this study showed.

Regarding the research questions deﬁned in

section 1, we conclude that recommender systems

cannot rely more on a personality-based

neighborhood than on a traditional rating-based

neighborhood. However, given the case study results,

one could claim that it is valid to use

personality-based neighborhoods without

deteriorating the quality of the recommendations.

Moreover, only one method to incorporate

personality into recommender systems was explored.

The second question was about the accuracy of

the recommendations. Based on our ﬁndings, the

recommendation quality – measured by the F1-score

– does not differ signiﬁcantly. The signiﬁcantly

worse RMSE of the personality-based approach

could also inﬂuence the recommendations, but this

did not seem to be the case in the conducted

experiments. A slightly better F1-Score was found

for the personality-based approach, when no

additional demographic data was considered.

However, when taking demographic data into

account, the rating-based method had a slightly

higher F1-score. Table 1 shows that in both cases, the

difference was not signiﬁcant. Suppose the F1-score

is more important than the RMSE of the predicted

ratings. In that case, the authors of this work believe

that it is possible to use a personality-based approach

without worrying about a negative effect on the

recommendation quality. However, based on the

obtained results, there are no improvements either.

In addition, the authors consider that the main

advantage of personality-based recommender

systems is that the neighborhood of similar users can

be precomputed, which results in a faster

recommendation process. In general, personalities

seem to be much more stable than rating vectors, and

we do not think the users would frequently change

the answers to their questionnaire. Even if they

change their answers, they probably only do so a few

times, resulting in a low number of recomputations

compared to the rating neighborhoods, which might

change very often. Moreover, the personality

information can be leveraged to solve the cold-start

recommendation problem by using correlations

between the user personalities and a prototype

personality of users that like movies of a speciﬁc

genre. While many users seem to like the

personality-based approach, there are probably also

other users that do not want to ﬁll out the entire

questionnaire before using a recommender system.

In future work, the impact of different kinds of

MovieOcean: Assessment of a Personality-based Recommender System

697

Big Five questionnaires on the performance of a

personality-based recommender system will be

studied. Furthermore, other ways of including

personality aspects are going to be explored. For

instance, one approach would be to build the

neighborhoods using a standard rating similarity and

then only consider the users’ personality in the

similarity measure during the prediction of ratings.

REFERENCES

Aaker, J. L. (1999). The malleable self: The role of

self-expression in persuasion. Journal of marketing

research, 36(1):45–57.

American Psychological Association (2020). Personality.

https://www.apa.org/topics/personality. Accessed:

2020-09-19.

Barrick, M. R. and Mount, M. K. (1991). The big ﬁve

personality dimensions and job performance: a meta-

analysis. Personnel psychology, 44(1):1–26.

Braunhofer, M., Elahi, M., and Ricci, F. (2015). User

personality and the new user problem in a context-

aware point of interest recommender system. In

Information and communication technologies in

tourism 2015, pages 537–549. Springer.

Cantador, I., Fern

andez-Tob

ıas, I., and Bellog

ın, A. (2013).

Relating personality types with user preferences in

multiple entertainment domains. In CEUR workshop

proceedings. Shlomo Berkovsky.

Choi, K. and Suh, Y. (2013). A new similarity function

for selecting neighbors for each target item in

collaborative ﬁltering. Knowledge-Based Systems,

37:146–153.

Goldberg, L. R., Johnson, J. A., Eber, H. W., Hogan,

R., Ashton, M. C., Cloninger, C. R., and Gough,

H. G. (2006). The international personality item pool

and the future of public-domain personality measures.

Journal of Research in personality, 40(1):84–96.

Hu, R. and Pu, P. (2013). Exploring relations between

personality and user rating behaviors. In UMAP

Workshops.

John, O. P., Naumann, L. P., and Soto, C. J. (2008).

Paradigm shift to the integrative big ﬁve trait

taxonomy. Handbook of personality: Theory and

research, 3(2):114–158.

Koren, Y., Bell, R., and Volinsky, C. (2009). Matrix

factorization techniques for recommender systems.

Computer, 42(8):30–37.

McLaughlin, M. R. and Herlocker, J. L. (2004). A

collaborative ﬁltering algorithm and evaluation metric

that accurately model the user experience. In

Proceedings of the 27th annual international ACM

SIGIR conference on Research and development in

information retrieval, pages 329–336.

Mulyanegara, R. C., Tsarenko, Y., and Anderson, A. (2009).

The big ﬁve and brand personality: Investigating

the impact of consumer personality on preferences

towards particular brand personality. Journal of brand

management, 16(4):234–247.

Nalmpantis, O. and Tjortjis, C. (2017). The 50/50

recommender: a method incorporating personality

into movie recommender systems. In International

Conference on Engineering Applications of Neural

Networks, pages 498–507. Springer.

Paiva, F. A., Costa, J. A., and Silva, C. R. (2017). A

personality-based recommender system for semantic

searches in vehicles sales portals. In International

Conference on Hybrid Artiﬁcial Intelligence Systems,

pages 600–612. Springer.

Pervin, L. A. and John, O. P. (1999). Handbook of

personality: Theory and research, chapter 4, pages

102–105. Guilford Press, 2 edition.

Rammstedt, B. and John, O. P. (2007). Measuring

personality in one minute or less: A 10-item

short version of the big ﬁve inventory in english

and german. Journal of research in Personality,

41(1):203–212.

Rentfrow, P. J. and Gosling, S. D. (2003). The do re mi’s of

everyday life: the structure and personality correlates

of music preferences. Journal of personality and

social psychology, 84(6):1236.

Tkal

c, M., Kunaver, M., Tasi

c, J., and Ko

sir, A.

(2009). Personality based user similarity measure for

a collaborative recommender system. In Proceedings

of the 5th Workshop on Emotion in Human-Computer

Interaction-Real world challenges, pages 30–37.

Weaver III, J. B. (1991). Exploring the links between

personality and media preferences. Personality and

individual differences, 12(12):1293–1299.

Zhong, B., Hardin, M., and Sun, T. (2011). Less

effortful thinking leads to more social networking?

the associations between the use of social network

sites and personality traits. Computers in Human

Behavior, 27(3):1265–1271.

ICEIS 2022 - 24th International Conference on Enterprise Information Systems

698