sim
π
: A Concept Similarity Measure under an Agent’s Preferences in
Description Logic ELH
Teeradaj Racharak
1,2
, Boontawee Suntisrivaraporn
1
and Satoshi Tojo
2
1
School of Information, Computer and Communication Technology, Sirindhorn International Institute of Technology,
Thammasat University, Pathumthani, Thailand
2
Graduate School of Information Science, Japan Advanced Institute of Science and Technology, Nomi, Japan
Keywords:
Concept Similarity Measures, Non-standard Reasoning Services, Preference Profile, Description Logics.
Abstract:
In Description Logics (DLs), concept similarity measures (CSMs) aim at identifying a degree of commonality
between two given concepts and are often regarded as a generalization of the classical reasoning problem of
equivalence. That is, any two concepts are equivalent if their similarity degree is one, and vice versa. When
two concepts are not equivalent, the level of similarity varies depending not only on the objective factors
(i.e. the concept descriptions) but also on the subjective factors (i.e. the agent’s preferences). This work
presents the notion of a general preference profile to be used in existing similarity measures and exemplifies
its applicability with the similarity measure for the DL ELH , called sim
π
. We show that our measure is
expressible for all aspects of preference profile and prove that sim
π
is preference-invariant w.r.t. equivalence,
i.e. similarity between two equivalent concepts is always one regardless of agents’ preferences.
1 INTRODUCTION
Agents’ preferences are used in a variety of related,
but not identical, ways in their daily life: to ex-
press what they like and dislike, to express their de-
sired goals when choosing routes for travelling (Son
et al., 2003), etc. In psychology, preferences may
be conceived of as an agent’s attitude towards a set
of objects when making decisions (Lichtenstein and
Slovic, 2006). Alternatively, preferences can be inter-
preted as a judgment in a sense of liking or disliking
an object (Scherer, 2005).
In Description Logics (DLs), concept similarity
measures (CSMs) aim at identifying a degree of com-
monality between two given concept names and are
often regarded as a generalization of the classical rea-
soning problem of equivalence. That is, any two con-
cepts are equivalent if their similarity degree is one,
and vice versa. To date, many semantic CSMs have
been developed (cf. Section 4). These developments
can induce efficient similarity-oriented DL reasoning
services, i.e., to measure if two concepts are similar,
to check if a given instance is a relaxed instance of
a concept, and to retrieve those instances similar to
a given instance. However, relatively limited efforts
have been placed on addressing real-world similarity
services executed by a user agent, i.e., finding similar-
ity w.r.t. the needs and preferences of an agent. These
issues can be illustrated with the following example:
Example 1.1. Suppose that Bob, a Ph.D student,
wants to visit a place for active activities, and he feels
like a place where he can enjoy walking. According
to his world, a terminology might have been modeled
in DL as follows:
ActivePlace v Place u canWalk.Trekking
u∃canSail.Kayaking
Mangrove v Place u canWalk.Trekking
Beach v Place u canSail.Kayaking
canWalk v canDo
canSail v canDo
Considering merely the objective aspects of the
world, it is reasonable to conclude that both
Mangrove and Beach are equally similar to the notion
of ActivePlace. Taking into account also Bob’s pref-
erences, however, Mangrove appears more suitable to
his perception of ActivePlace.
The example shows that preferences of an agent
play a decisive role in the choice of alternatives. Thus,
we need to be able to fine-tune the degree of simi-
larity by employing aspects apart from the objective
factors (i.e. the concept descriptions themselves). It
is worth observing that, with a few exceptions like
480
Racharak, T., Suntisrivaraporn, B. and Tojo, S.
sim
π
: A Concept Similarity Measure under an Agent’s Preferences in Description Logic ELH .
DOI: 10.5220/0005813404800487
In Proceedings of the 8th International Conference on Agents and Artificial Intelligence (ICAART 2016) - Volume 2, pages 480-487
ISBN: 978-989-758-172-4
Copyright
c
2016 by SCITEPRESS Science and Technology Publications, Lda. All rights reserved
sim and simi (cf. Section 4), most CSMs do not al-
low user agents to specify their preferences and use
them to identify a degree of similarity between two
concepts. The responsibility of finding similar con-
cepts w.r.t. the needs and preferences of an agent rests
solely on that agent.
In this work, we exemplify the applicability of the
so-called preference profile (Racharak et al., 2015),
which is a design guideline for the development of
concept similarity measures under an agent’s pref-
erences, to the similarity measure sim, in symbols
sim
π
. We also exhibit that sim
π
is expressible for
all aspects of preference profile and prove that sim
π
is preference-invariant w.r.t. equivalence, i.e. similar-
ity between two equivalent concepts is always one re-
gardless of agents’ preferences (cf. Section 3).
2 PRELIMINARIES
In Description Logics (DLs), concept descriptions
are inductively defined by the help of a set of con-
structors, a set of concept names CN, and a set of role
names RN. The set of concept descriptions, or simply
concepts, for a specific DL L is denoted by Con(L).
The set Con(ELH ) of all E LH concepts can be in-
ductively defined by the following grammar,
C, D A | > | C u D | r.C
where > denotes the top concept, C, D Con(ELH ),
A CN and r RN. Conventionally, concept names
are denoted by A and B, concept descriptions are de-
noted by C and D, and role names are denoted by r
and s.
A terminology or TBox O is a finite set of (possi-
bly primitive) concept definitions and role hierarchy
axioms, whose syntax is an expression of the form
(A v D) A D, and r v s, respectively. A TBox is
called unfoldable if it contains at most one concept
definition for each concept name in CN and does not
contain cyclic dependencies. Concept names occur-
ring on the left-hand side of a concept definition are
called defined concept names (denoted by CN
def
), all
other concept names are primitive concept names (de-
noted by CN
pri
). A primitive definition A v D can
easily be transformed into a semantically equivalent
full definitions A X u D where X is a fresh con-
cept name. When a TBox O is unfoldable, concept
names can be expanded by exhaustively replacing all
defined concept names by their definitions until only
primitive concept names remain. Such concept names
are called fully expanded concept names. In what fol-
lows, we assume that concepts are fully expanded,
and as such the TBox can be omitted. Like primi-
tive definitions, a role hierarchy axiom r v s can be
transformed in to a semantically equivalent role def-
inition r t u s where t is a fresh role name. Role
names occurring on the left-hand side of a role defini-
tion are called defined role names, denoted by RN
def
.
All others are primitive role names, collectively de-
noted by RN
pri
. We also denote a set of all rs super
roles by R
r
= {s RN|r = s or r
i
v r
i+1
O where
1 i n, r
1
= r, r
n
= s}.
In order to define a formal semantics for a spe-
cific DL L, we consider an interpretation I = h
I
, ·
I
i,
which consists of a nonempty set
I
as the domain
of the interpretation and an interpretation function ·
I
which assigns to every concept name A a set A
I
I
and to every role name r a binary relation r
I
I
×
I
. The interpretation function ·
I
is inductively ex-
tended to ELH concepts in the usual manner:
>
I
= ; (C u D)
I
= C
I
D
I
;
(r.C)
I
= {a
I
| b
I
: (a, b) r
I
b C
I
}.
An interpretation I is said to be a model of a TBox O
(in symbols, I |= O) if it satisfies all axioms in O. I
satisfies axioms A v, A C, and r v s, respectively, if
A
I
C
I
, A
I
= C
I
, and r
I
s
I
. One of the main clas-
sical reasoning problems is the subsumption problem.
That is, given two concept descriptions C and D and a
TBox O, C is subsumed by D w.r.t. a TBox O (written
as C v
O
D) if C
I
D
I
in every model I of O. Fur-
thermore, C and D are equivalent w.r.t. O (written as
C
O
D) if C v
O
D and D v
O
C. When a TBox O is
empty or is clear from the context, we omit to denote
O, i.e. C v D and C D.
Concept Similarity Measure (CSM). is one of
non-standard DL reasoning services. It determines
how similar two concepts are. Formally, given two
concept descriptions C, D Con(L) for a specific DL
L. Then, a concept similarity measure w.r.t. a TBox
O is a function
O
: Con(L) × Con(L) [0, 1] such
that C
O
D = 1 iff C
O
D (total similarity) and
C
O
D = 0 indicates total dissimilarity between C
and D. When a TBox O is clear from the context, we
simply write C D.
Since we present an extension to sim (Suntisri-
varaporn, 2013; Tongphu and Suntisrivaraporn, 2015)
for taking into account an agent’s preferences, the
original definitions of homomorphism degree and sim
are included here for self-containment. Let C
Con(ELH ) be a fully expanded concept to the form:
P
1
u ·· ·uP
m
u r
1
.C
1
u · · · u r
n
.C
n
where P
i
CN
pri
, r
j
RN, C
j
Con(ELH ) in the
same format, 1 i m, and 1 j n. The set
P
1
, . . . , P
m
and the set r
1
.C
1
, . . . , r
n
.C
n
are denoted
by P
C
and E
C
, respectively. An ELH concept de-
sim
π
: A Concept Similarity Measure under an Agent’s Preferences in Description Logic ELH
481
scription can be structurally transformed into the cor-
responding ELH description tree. The root v
0
of the
ELH description tree T
C
has {P
1
, . . . , P
m
} as its label
and has n outgoing edges, each labeled with r
j
to a
vertex v
j
for 1 j n. Then, a subtree with the root
v
j
is defined recursively relative to the concept C
j
.
Definition 2.1 (Homomorphism Degree (Tongphu
and Suntisrivaraporn, 2015)). Let T
ELH
be a set of
all ELH description trees and T
C
, T
D
T
ELH
cor-
responds to two ELH concept names C and D, re-
spectively. The homomorphism degree function hd :
T
ELH
× T
ELH
[0, 1] is inductively defined as fol-
lows:
hd(T
D
, T
C
) = µ · p-hd(P
D
, P
C
)
+ (1 µ) · e-set-hd(E
D
, E
C
), (1)
where | · | represents the set cardinality, µ =
|P
D
|
|P
D
E
D
|
and 0 µ 1;
p-hd(P
D
, P
C
) =
(
1 if P
D
=
/
0
|P
D
P
C
|
|P
D
|
otherwise,
(2)
e-set-hd(E
D
, E
C
) =
1 if E
D
=
/
0
0 if E
D
6=
/
0 and E
C
=
/
0
e
(E
D
, E
C
) otherwise,
(3)
where
e
(E
D
, E
C
) =
ε
i
E
D
max{e-hd(ε
i
, ε
j
) : ε
j
E
C
}
|E
D
|
(4)
with ε
i
, ε
j
existential restrictions; and
e-hd(r.X, s.Y ) = γ(ν + (1 ν) · hd(T
X
, T
Y
))
(5)
where γ =
|R
r
R
s
|
|R
r
|
and 0 ν < 1.
Definition 2.2 (E LH Similarity Degree (Tongphu
and Suntisrivaraporn, 2015)). Let C and D be ELH
concept names and T
C
, T
D
be the corresponding de-
scription trees. Then, the ELH similarity degree be-
tween C and D (in symbols, sim(C, D)) is defined as
follows:
sim(C, D) =
hd(T
C
, T
D
) + hd(T
D
, T
C
)
2
(6)
Example 2.1 (Continuation of Example 1.1). Each
primitive definition can be transformed to a corre-
sponding equivalent full definition as shown in the
following.
ActivePlace X uPlace
u∃canWalk.Trekking
u∃canSail.Kayaking
Mangrove Y u Placeu
u∃canWalk.Trekking
Beach Z u Place
u∃canSail.Kayaking
where X ,Y and Z are fresh primitive concept names.
Furthermore, R
canWalk
= {t, canDo} and R
canSail
=
{u, canDo} where t and u are fresh primitive role
names. For brevity, let ActivePlace, Mangrove,
Place, Trekking, Kayaking, canWalk, and canSail be
abbreviated as AP, M, P, T, K, cW, and cS, respec-
tively. Using Definition 2.1, the homomorphism de-
gree from ActivePlace to Mangrove, or hd(T
AP
, T
M
)
= (
2
4
)(
1
2
) + (
2
4
)
(
max{e-hd(cW.T,cW.T)}
2
+
max{e-hd(cS.K,cW.T)}
2
)
= (
2
4
)(
1
2
) + (
2
4
)(
0.5+0.1
2
) = 0.55
Similarly, hd(T
M
, T
AP
) = 0.67, hd(T
AP
, T
B
) = 0.55,
and hd(T
M
, T
AP
) = 0.67. Thus, sim(M, AP) = 0.61
and sim(B, AP) = 0.61
2.1 Preference Profile
Preference profile is first proposed in (Racharak et al.,
2015) as a guideline for developing CSMs under
preferences. It is a quintuple of preference func-
tions which exhibit five aspects for preference expres-
sions. It can be adopted into the development of ar-
bitrary CSMs and thereby influencing the calculation
of CSMs. The syntax and semantics of each aspect
are given in term of partial functions since different
agents can have different perspectives of preferences.
Any CSMs that expose those syntactic forms and sat-
isfy their corresponding semantics will infer a similar-
ity value w.r.t. the needs and preferences of an agent.
Each syntax and semantic is presented formally as
follows:
Definition 2.3 (Primitive Concept Importance). Let
CN
pri
(O) be a set of primitive concept names oc-
curring in O. Then, a primitive concept importance
is a partial function i
c
: CN R
0
, where CN
CN
pri
(O).
For any A CN
pri
(O), i
c
(A) = 1 captures an ex-
pression of normal importance for A, i
c
(A) > 1 (and
i
c
(A) < 1) indicates that A has higher (and lower, re-
spectively) importance, and i
c
(A) = 0 indicates that A
is entirely ignored by an agent. For example, suppose
Bob is keenly interested to visit places. Therefore, he
can express as i
c
(Place) = 2 for his preference profile.
ICAART 2016 - 8th International Conference on Agents and Artificial Intelligence
482
Definition 2.4 (Role Importance). Let RN(O) be a
set of role names occurring in O. Then, a role im-
portance is a partial function i
r
: RN R
0
, where
RN RN(O).
For any r RN(O), i
r
(r) = 1 captures an ex-
pression of normal importance for r, i
r
(r) > 1 (and
i
r
(r) < 1) indicates that r has higher (and lower, re-
spectively) importance, and i
r
(r) = 0 indicates that r
is entirely ignored by an agent. For example, Bob is
interested to visit places where he can enjoy walking.
Therefore, he can also express as i
r
(canWalk) = 2 for
his preference profile.
Definition 2.5 (Primitive Concepts Similarity). Let
CN
pri
(O) be a set of primitive concept names occur-
ring in O. For A, B CN
pri
(O), a primitive concepts
similarity is a partial function s
c
: CN × CN [0, 1],
where CN CN
pri
(O), such that s
c
(A, B) = s
c
(B, A)
and s
c
(A, A) = 1.
For A, B CN
pri
(O), s
c
(A, B) = 1 captures an
expression of total similarity between A and B
and s
c
(A, B) = 0 captures an expression of to-
tal dissimilarity between A and B. For example,
Bob believes that trekking and kayaking are a bit
similar in some sense. Hence, he can express
s
c
(Trekking, Kayaking) = 0.1 for his preference pro-
file.
Definition 2.6 (Primitive Roles Similarity). Let
RN
pri
(O) be a set of primitive role names occurring in
O. For r, s RN
pri
(O), a primitive roles similarity is
a partial function s
r
: RN × RN [0, 1], where RN
RN
pri
(O), such that s
r
(r, s) = s
r
(s, r) and s
r
(r, r) = 1.
For r, s RN(O), s
r
(r, s) = 1 captures an expres-
sion of total similarity between r and s and s
r
(r, s) = 0
captures an expression of total dissimilarity between
r and s. For example, Bob believes that walking is
a bit similar to sailing. Hence, he can also express
s
r
(t, u) = 0.1 for his preference profile.
Definition 2.7 (Role Discount Factor). Let RN(O)
be a set of role names occurring in O. Then, a role
discount factor is a partial function d : RN [0, 1],
where RN RN(O).
For any r RN(O), d(r) = 1 captures an expres-
sion of total importance on a role (over a correspond-
ing nested concept) and d(r) = 0 captures an expres-
sion of total importance on a nested concept (over a
corresponding role). For example, Bob does not con-
cern much if places permit to either walk or to sail.
He would rather consider on actual activities which he
can perform. Thus, he may express d(canWalk) = 0.3
and d(canSail) = 0.3 for his preference profile.
Definition 2.8 (Preference Profile). (Racharak et al.,
2015) A preference profile, in symbol π, is a quintuple
hi
c
, i
r
, s
c
, s
r
, di where i
c
, i
r
, s
c
, s
r
, and d are as defined
above and the default preference profile, in symbol π
0
,
is the quintuple hi
c
0
, i
r
0
, s
c
0
, s
r
0
, d
0
i where
i
c
0
(A) = 1 for all A CN
pri
(O),
i
r
0
(r) = 1 for all r RN(O),
s
c
0
(A, B) = 0 for all (A, B) CN
pri
(O) × CN
pri
(O),
s
r
0
(r, s) = 0 for all (r, s) RN(O) × RN(O), and
d
0
(r) = 0.4 for all r RN(O).
3 CSM UNDER AGENT’S
PREFERENCES
A numerical value obtained by CSMs indicates the
similarity between two concept descriptions. For
instance, sim(ActivePlace, Mangrove) = 0.61 and
sim(ActivePlace, Beach) = 0.61 indicates that the
similarity between ActivePlace and Mangrove, and
that between ActivePlace and Beach are equivalently
61%. This means both Mangrove and Beach match
equally to the general notion of ActivePlace. Un-
fortunately, this is not true because it does not corre-
spond with his needs and his preferences. Indeed, the
similarity degree between ActivePlace and Mangrove
should be greater than the similarity degree between
ActivePlace and Beach in order to consistent with
Bob’s perspectives (as exhibited by Figure 1).
In this section, we adopt those aspects of prefer-
ence profile into our development of concept simi-
larity measure under an agent’s preferences for DL
ELH . In the following, we have presented formal
definition of concept similarity measures under pref-
erence profile.
Definition 3.1. Given a CSM , a preference profile
π, and two concepts C, D Con(L). Then, a con-
cept similarity measure under preference profile π is
a function
π
: Con(L) × Con(L) [0, 1]. A CSM
is called preference invariant w.r.t. equivalence if
C D iff C
π
D = 1 for any π
By developing a concept similarity measure un-
der preference profile for ELH , we have generalized
the measure sim. To avoid confusion, we write
π
when referring to an arbitrary CSM in a generic sense,
whereas specific function symbols, e.g. sim
π
or hd
π
,
are used when talking about specific CSMs or func-
tions.
In order to consider those aspects of preference
profile, we have presented a total importance func-
tion as
ˆ
i : CN
pri
RN R
0
based on a concept im-
sim
π
: A Concept Similarity Measure under an Agent’s Preferences in Description Logic ELH
483
Figure 1: Similarity value with (and without) respect to Bob’s preferences.
portance and a role importance.
ˆ
i(x) =
i
c
(x) if x CN
pri
and i
c
is defined on x
i
r
(x) if x RN and i
r
is defined on x
1 otherwise
(7)
A total similarity function is also presented as
ˆ
s : (CN
pri
× CN
pri
) (RN
pri
× RN
pri
) [0, 1] using a
primitive concept similarity and a primitive role sim-
ilarity.
ˆ
s(x, y) =
1 if x = y
s
c
(x, y) if (x, y) CN
pri
× CN
pri
and s
c
is defined on (x, y)
s
r
(x, y) if (x, y) RN
pri
× RN
pri
and s
r
is defined on (x, y)
0 otherwise
(8)
Similarly, a total role discount factor function is pre-
sented in the following in term of a function
ˆ
d : RN
[0, 1] based on a role discount factor.
ˆ
d(x) =
(
d(x) if d is defined on x
0.4 otherwise
(9)
Let C and D be ELH concept names and r and s
be role names. Let T
C
, T
D
, P
C
, P
D
, E
C
, E
D
, R
r
, and
R
s
are as defined in Definition 2.1. Let T
ELH
be a set
of all ELH description trees and π = hi
c
, i
r
, s
c
, s
r
, di
be a preference profile. The homomorphism degree
under preference profile π can be formally defined as
follows:
Definition 3.2. The homomorphism degree under
preference profile π is a function hd
π
: T
ELH
×
T
ELH
[0, 1] defined inductively as follows:
hd
π
(T
D
, T
C
) = µ
π
· p-hd
π
(P
D
, P
C
)
+ (1 µ
π
) · e-set-hd
π
(E
D
, E
C
), (10)
where µ
π
=
AP
D
ˆ
i(A)
AP
D
ˆ
i(A) +
r.X E
D
ˆ
i(r)
; (11)
p-hd
π
(P
D
, P
C
) =
1 if
AP
D
ˆ
i(A) = 0
0 if
AP
D
ˆ
i(A) 6= 0 and
BP
C
ˆ
i(B) = 0
p
π
(P
D
, P
C
) otherwise,
(12)
where
p
π
(P
D
, P
C
) =
AP
D
ˆ
i(A) · max{
ˆ
s(A, B) : B P
C
}
AP
D
ˆ
i(A)
;
(13)
e-set-hd
π
(E
D
, E
C
) =
1 if
r.X E
D
ˆ
i(r) = 0
0 if
r.X E
D
ˆ
i(r) 6= 0
and
s.Y E
C
ˆ
i(s) = 0
e
π
(E
D
, E
C
) otherwise,
(14)
where
e
π
(E
D
, E
C
) =
r.X E
D
ˆ
i(r) · max{e-hd
π
(r.X, ε
j
) : ε
j
E
C
}
r.X E
D
ˆ
i(r)
(15)
with ε
j
existential restriction; and
e-hd
π
(r.X, s.Y ) = γ
π
(
ˆ
d(r) + (1
ˆ
d(r)) · hd
π
(T
X
, T
Y
))
(16)
where γ
π
=
1 if
r
0
R
r
ˆ
i(r
0
) = 0
r
0
R
r
ˆ
i(r
0
)·max{
ˆ
s(r
0
,s
0
):s
0
R
s
}
r
0
R
r
ˆ
i(r
0
)
, otherwise.
(17)
ICAART 2016 - 8th International Conference on Agents and Artificial Intelligence
484
It is obvious to see that Definition 3.2 exposes all
elements of preference profile, viz. i
c
, i
r
, s
c
, s
r
, and d
since it was generalized alongside the use of the func-
tions
ˆ
i,
ˆ
s, and
ˆ
d.
Intuitively, Equation 10 is defined as the weighted
sum of the degree under π of primitive concepts and
the degree under π of matching edges. Equation 11
indicates the weight of primitive concept names w.r.t.
the importance function. Equation 12 calculates the
proportion of best similarity between primitive con-
cept names. Similarly, Equation 14 calculates the pro-
portion of best similarity between existential informa-
tion from Equation 16 and Equation 17. Equation 16
calculates the degree of similarity between matching
edges. Finally, Equation 17 calculates the proportion
of best similarity between role names.
Let C and D be ELH concept names, T
C
and T
D
be the corresponding description trees, and π =
hi
c
, i
r
, s
c
, s
r
, di be a preference profile. The follow-
ing definition formally describes the ELH similarity
degree under preference profile π.
Definition 3.3. The ELH similarity degree under
preference profile π between C and D (denoted by
sim
π
(C, D)) is defined as follows:
sim
π
(C, D) =
hd
π
(T
C
, T
D
) + hd
π
(T
D
, T
C
)
2
(18)
Lemma 3.1. For T
D
, T
C
T
ELH
, hd
π
0
(T
D
, T
C
) =
hd(T
D
, T
C
).
Proof
Recall by Definition 2.8 that the default preference
profile π
0
is the quintuple hi
c
0
, i
r
0
, s
c
0
, s
r
0
, d
0
i. Also, sup-
pose a concept name D is of the form:
P
1
u · · · u P
m
u r
1
.D
1
u · · · u r
n
.D
n
where P
i
CN
pri
, r
j
CN
pri
, D
j
Con(E LH ), 1
i m, 1 j n, P
1
u ··· u P
m
is denoted by P
D
,
and r
1
.D
1
u ··· u r
n
.D
n
is denoted by E
D
. Let d
be the depth of T
D
. We prove that, for any d N,
hd
π
0
(T
D
, T
C
) = hd(T
D
, T
C
) with mathematical induc-
tion.
When d = 0, we know that D = P
1
u · ·· u
P
m
. To show that hd
π
0
(T
D
, T
C
) = hd(T
D
, T
C
), we
need to show that µ
π
0
= µ and p-hd
π
0
(P
D
, P
C
) =
p-hd(P
D
, P
C
). Let us derive as follows:
µ
π
0
=
AP
D
ˆ
i(A)
AP
D
ˆ
i(A) +
r.XE
D
ˆ
i(r)
=
m
i=1
1
m
i=1
1 + 0
=
m
m + 0
= µ.
Furthermore, we only need to show
AP
D
max{
ˆ
s(A, B) : B P
C
} = |P
D
P
C
| in or-
der to show p-hd
π
0
(P
D
, P
C
) = p-hd(P
D
, P
C
). We
know that s
c
0
maps name identity to 1 and otherwise
to 0. Thus,
AP
D
max{
ˆ
s(A, B) : B P
C
} = |{x : x
P
D
and x P
C
}| = |P
D
P
C
|.
We must now prove that if hd
π
0
(T
D
, T
C
) =
hd(T
D
, T
C
) holds for d = h 1 where h > 1
and D = P
1
u · · · u P
m
u r
1
.D
1
u · · · u r
n
.D
n
then hd
π
0
(T
D
, T
C
) = hd(T
D
, T
C
) also holds
for d = h. To do that, we have to show
e-set-hd
π
0
(E
D
, E
C
) = e-set-hd(E
D
, E
C
). This
can be done by showing in the similar manner
that γ
π
0
= γ and hd
π
0
(T
X
, T
Y
) = hd(T
X
, T
Y
) from
e-hd
π
0
(r.X, s.Y ) = e-hd(r.X , s.Y ), where
r.X E
D
and s.Y E
C
. Consequently, it
follows by induction that, for T
D
, T
C
T
ELH
,
hd
π
0
(T
D
, T
C
) = hd(T
D
, T
C
).
Theorem 3.1. For C, D Con(ELH ),
sim
π
0
(C, D) = sim(C, D).
The above theorem follows from Lemma 3.1, Def-
inition 2.2, and Definition 3.3.
Lemma 3.2. For T
D
, T
C
T
ELH
, hd(T
D
, T
C
) = 1 iff
hd
π
(T
D
, T
C
) = 1 for any π.
Proof
(Sketch) Let π = hi
c
, i
r
, s
c
, s
r
, di be an arbitrary pref-
erence profile and π
0
= hi
c
0
, i
r
0
, s
c
0
, s
r
0
, d
0
i be the de-
fault preference profile. This lemma can be shown
by mathematical induction on the depth of T
D
.
() hd(T
D
, T
C
) = 1 implies that there exists a ho-
momorphism mapping from the root of T
D
to the root
of T
C
. Consequently, any setting on π does not influ-
ence the calculation on hd
π
(T
D
, T
C
).
() hd
π
(T
D
, T
C
) = 1 for any π. In particular,
this holds for the default preference profile π
0
. By
Lemma 3.1, it is the case that hd(T
D
, T
C
) = 1.
Theorem 3.2. sim
π
is preference invariant w.r.t.
equivalence.
Proof
(Sketch) Given two concepts C and D and an arbi-
trary preference profile π, we have to show C D
iff sim
π
(C, D) = 1.
() By Proposition 7 in (Tongphu and Suntisri-
varaporn, 2015), we can derive that sim(C, D) = 1.
With the usage of Lemma 3.2, Definition 2.2, and
Definition 3.3, we can derive that sim
π
is preference
invariant w.r.t. equivalence.
() This can be shown similarly as in the forward
direction.
Example 3.1. (Continuation of Example 1.1) Let
enrich the example by assuming Bob’s preference
profile is expressed as follows: (i) i
c
(Place) = 2;
(ii) i
r
(canWalk) = 2; (iii) s
c
(Trekking, Kayaking) =
0.1; (iv) s
r
(t, u) = 0.1; (v) d(canWalk) = 0.3 and
d(canSail) = 0.3. Let ActivePlace, Mangrove, Place,
sim
π
: A Concept Similarity Measure under an Agent’s Preferences in Description Logic ELH
485
Trekking, Kayaking, canWalk, and canSail are rewrit-
ten shortly as AP, M, P, T, K, cW, and cS, respec-
tively. Using Definition 3.2, hd
π
(T
AP
, T
M
)
= (
3
6
) · p-hd
π
(P
AP
, P
M
) + (
3
6
) · e-set-hd
π
(E
AP
, E
M
)
= (
3
6
) · (
i(X)·max{s(X,Y ),s(X ,P)}+i(P)·max{s(P,Y),s(P,P)})
i(X)+i(P)
)
+(
3
6
) · e-set-hd
π
(E
AP
, E
M
)
= (
3
6
)(
1·max{0,0}+2·max{0,1}
1+2
) + (
3
6
)·e-set-hd
π
(E
AP
, E
M
)
= (
3
6
)(
2
3
)
+(
3
6
)
h
i(cW)·max{e-hd
π
(cW.T,cW,T)}+1·max{0.2035}
i(cW)+i(cS)
i
= (
3
6
)(
2
3
) + (
3
6
)
h
2·max{(1)(0.3+0.7(1))}+1·max{0.2035}
i(cW)+i(cS)
i
= (
3
6
)(
2
3
) + (
3
6
)
h
(2)(1)+(1)(0.2035)
2+1
i
0.70
Following the same step, we obtain hd
π
(T
M
, T
AP
) =
0.75. Hence, sim
π
(M, AP) 0.73 by using Definition
3.3.
Furthermore, using Definition 3.2, hd
π
(T
AP
, T
B
)
= (
3
6
) · p-hd
π
(P
AP
, P
B
) + (
3
6
) · e-set-hd
π
(E
AP
, E
B
)
= (
3
6
) · (
i(X)·max{s(X,Z),s(X,P)}+i(P)·max{s(P,Z),s(P,P)})
i(X)+i(P)
)
+(
3
6
) · e-set-hd
π
(E
AP
, E
B
)
= (
3
6
)(
1·max{0,0}+2·max{0,1}
1+2
) + (
3
6
) · e-set-hd
π
(E
AP
, E
B
)
= (
3
6
)(
2
3
)
+(
3
6
)
h
2·max{0.2035}+i(cS)·max{e-hd
π
(cS.K,cS,K)}
i(cW)+i(cS)
i
= (
3
6
)(
2
3
) + (
3
6
)
h
2·max{0.2035}+1·max{(1)(0.3+(0.7)(1))}
i(cW)+i(cS)
i
= (
3
6
)(
2
3
) + (
3
6
)
h
(2)(0.2035)+(1)(1)
2+1
i
0.57
Following the same step, we obtain hd
π
(T
B
, T
AP
) =
0.75. Hence, sim
π
(B, AP) 0.66 by using Definition
3.3.
These results, i.e. sim
π
(M, AP) > sim
π
(B, AP),
corresponds with Bob’s needs and preferences.
4 RELATED WORK
Several CSMs abound, but here we investigate those
CSMs that exhibit preferential elements and contrast
them with aspects of our preference profile. Ex-
cept the following two works, most CSMs do not
consider any of preferential elements. Hence, we
omit discussions thereof and merely refer interested
readers to their references for further details, namely
(Janowicz and Wilkes, 2009; Racharak and Suntisri-
varaporn, 2015; D’Amato et al., 2006; Fanizzi and
D’Amato, ) for structural-based similarity measures
and (D’Amato et al., 2009; D’Amato et al., 2008) for
interpretation-based measures.
In an extended work of sim, a range of number
for discount factor (ν) is used in the similarity ap-
plication of SNOMED CT. For instance, when the
roleGroup is found, the value of ν is set to 0. That
approach can be viewed as a specific application of d
function of preference profile. In simi, pairs of prim-
itive concept names and pairs of role names are per-
mitted to impose the similarity values via the function
pm. For instance, given two primitive concept names
A and B, we can establish the 50% similarity between
A and B by defining pm(A, B) = 0.5. In the similar
manner, given two role names r and s, we can estab-
lish the 50% similarity between r and s by defining
pm(r, s) = 0.5. The former is identical to s
c
of pref-
erence profile; however, the latter differs from s
r
of
preference profile in a sense that pm does not con-
sider primitive role names which contribute to simi-
larity between two arbitrary role names. Furthermore,
each primitive concept name and each existential re-
striction atoms (i.e., those concepts of the form r.C)
is permitted to be weighted w.r.t. a positive real num-
ber via the function g. However, we believe that the
imposition on existential restriction atoms will be im-
practical to use. After all, there can be infinitely many
existential restriction atoms. Thus, our sim
π
is de-
veloped according to preference profile by allowing
to define an importance over each role name instead.
Table 1 shows a summary of existing CSMs which
expose elements of preference profile, together with
our proposed sim
π
, where 4 denotes totally identical
to the specified function whereas 3 denotes partially
identical to the specified function.
Table 1: State-of-the-art CSMs embeded preference profile.
CSM i
c
i
r
s
c
s
r
d
sim
π
4 4 4 4 4
sim 4
simi 3 4
5 CONCLUDING REMARKS
In this work, the applicability of the so-called pref-
erence profile is first exemplified by generalizing the
mechanism of the similarity measure sim for the DL
ELH , called sim
π
. Our sim
π
can nicely utilize pref-
erences of an agent, which are represented in form of
a preference profile, for influencing the calculation.
We also prove that sim
π
is backward compatible in the
sense that sim
π
under the default preference profile
coincides with sim. This finding together with Propo-
sition 7 in (Tongphu and Suntisrivaraporn, 2015) are
important to show that sim
π
is preference-invariant
w.r.t. equivalence, i.e. similarity between two equiva-
lent concepts is always one regardless of agents’ pref-
erences. We also investigate existing CSMs and find
ICAART 2016 - 8th International Conference on Agents and Artificial Intelligence
486
that none of them, to the best of our knowledge, en-
tirely comply with the preference profile.
There are some directions for our future work.
Firstly, it appears to be a natural next step to ver-
ify desirable properties in which any CSMs under a
preference profile must have. Secondly, we are go-
ing to investigate deeply on the possibility of other
reasonable aspects to be included in the preference
profile, especially when considering more expressive
DLs. Thirdly, we intend to carry out an implementa-
tion of sim
π
and perform experiments on realistic on-
tologies. Finally, it is interesting to explore the possi-
bility to extend preference profile beyond other kinds
of similarity-based reasoning services. i.e., relaxed
instance checking and relaxed instance retrieval.
ACKNOWLEDGEMENTS
This research is partially supported by Thammasart
University Research Fund under the TU Research
Scholar, Contract No. TOR POR 1/13/2558; the Cen-
ter of Excellence in Intelligent Informatics, Speech
and Language Technology, and Service Innova-
tion (CILS), Thammasat University; and the JAIST-
NECTEC-SIIT dual doctoral degree program.
REFERENCES
D’Amato, C., Fanizzi, N., and Esposito, F. (2006). A dis-
similarity measure for alc concept descriptions. In
Proceedings of the 2006 ACM Symposium on Applied
Computing, pages 1695–1699.
D’Amato, C., Fanizzi, N., and Esposito, F. (2009). A se-
mantic similarity measure for expressive description
logics. In CoRR, abs/0911.5043.
D’Amato, C., Staab, S., and Fanizzi, N. (2008). On the in-
fluence of description logics ontologies on conceptual
similarity. In Proceedings of Knowledge Engineering:
Practice and Patterns, pages 48–63.
Fanizzi, N. and D’Amato, C. A similarity measure for the
aln description logic. In Proceedings of CILC 2006
- Italian Conference on Computational Logic, pages
26–27.
Janowicz, K. and Wilkes, M. (2009). Sim-dla: A novel
semantic similarity measure for description logics re-
ducing inter-concept to inter-instance similarity. In
Proceedings of the 6th European Semantic Web Con-
ference on The Semantic Web: Research and Applica-
tions, pages 353–367.
Lehmann, K. and Turhan, A.-Y. (2012). A framework for
semantic-based similarity measures for elh-concepts.
In del Cerro, L. F., Herzig, A., and Mengin, J., editors,
JELIA, volume 7519 of Lecture Notes in Computer
Science, pages 307–319. Springer.
Lichtenstein, S. and Slovic, P., editors (2006). The con-
struction of preference. Cambridge University Press,
New York.
Racharak, T. and Suntisrivaraporn, B. (2015). Similar-
ity measures for fl0 concept descriptions from an
automata-theoretic point of view. In Information and
Communication Technology for Embedded Systems
(IC-ICTES), 2015 6th International Conference of,
pages 1–6.
Racharak, T., Suntisrivaraporn, B., and Tojo, S. (2015).
Identifying an Agent’s Preferences Toward Similar-
ity Measures in Description Logics. In Proceedings
of The 5th Joint International Semantic Technology
(JIST 2015).
Scherer, K. (2005). What are emotions? and how can they
be measured? Social Science Information.
Son, T. C., Cao, T., and Pontelli, E. (2003). Planning with
preferences using logic programming. In In Proc. LP-
NMR’04, pages 247–260. Springer.
Suntisrivaraporn, B. (2013). A similarity measure for the
description logic el with unfoldable terminologies. In
INCoS, pages 408–413.
Tongphu, S. and Suntisrivaraporn, B. (2015). Algorithms
for measuring similarity between elh concept descrip-
tions: A case study on snomed ct. Journal of Comput-
ing and Informatics (accepted on May 7; to appear).
sim
π
: A Concept Similarity Measure under an Agent’s Preferences in Description Logic ELH
487