Continuous Set Packing and Near-Boolean Functions
Giovanni Rossi
Department of Computer Science and Engineering (DISI), University of Bologna,
Mura Anteo Zamboni 7, 40126, Bologna, Italy
Keywords:
Set Packing, Pseudo-Boolean Function, Polynomial Multilinear Extension, Local Search.
Abstract:
Given a family of feasible subsets of a ground set, the packing problem is to find a largest subfamily of pairwise
disjoint family members. Non-approximability renders heuristics attractive viable options, while efficient
methods with worst-case guarantee are a key concern in computational complexity. This work proposes a
novel near-Boolean optimization method relying on a polynomial multilinear form with variables ranging
each in a high-dimensional unit simplex. These variables are the elements of the ground set, and distribute
each a unit membership over those feasible subsets where they are included. The given problem is thus
translated into a continuous version where the objective is to maximize a function taking values on collections
of points in a unit hypercube. Maximizers are shown to always include collections of hypercube disjoint
vertices, i.e. partitions of the ground set, which constitute feasible solutions for the original discrete version of
the problem. A gradient-based local search in the expanded continuous domain is designed. Approximations
with polynomials of bounded degree and near-Boolean coalition formation games are also finally discussed.
1 INTRODUCTION
Consider a finite set N = {1, ..., n} of items to be
packed into feasible subsets, where these latter con-
stitute a family F 2
N
= {A : A N}. The prob-
lem is to find a subfamily F
F of pairwise dis-
joint feasible subsets with largest size |F
|. In the
weighted version, a function w : F R
+
identifies
as optimal those such subfamilies F
with maximum
weight W (F
) =
AF
w(A). Maximizing |F
| is
equivalent to setting w(A) = 1 for all A F . Thus
this work proposes to use the polynomial multilinear
extension, or MLE for short, of set functions (such
as w) in order to evaluate families of fuzzy feasible
subsets. Although unfeasible, such families shall still
drive the search towards locally optimal feasible ones.
Set packing is a key combinatorial optimization
problem (Korte and Vygen, 2002) extensively stud-
ied in computational complexity, where the aim is
to find efficient algorithms whose output approxi-
mates optimal solutions within a provable bounded
factor. In that field, the focus is placed mostly on non-
approximability results for k-set packing (Trevisan,
2001), where the size of every feasible subset is no
greater than some k n (and with unit weight for all
as above). Recall that if all feasible subsets have size
k = 2, then the problem is to find a maximal match-
ing in a graph with vertex set N, and an efficient (i.e.
with polynomial running time) algorithm capable to
output an exact solution is known to exist (Papadim-
itriou, 1994). In fact, if k > 2, then k-set packing
may be rephrased in terms of vertex clouring in hy-
pergraphs, with special focus on the d-regular and
k-uniform case, where every element of the ground
set is present in precisely d > 1 feasible subsets (i.e.
|{A : i A F }| = d for every i N), each of which,
in turn, has size k (i.e. |A| = k for every A F )
(Hazan et al., 2006).
Set packing aslo has important applications,
among which combinatorial auctions constitute a
main and lucrative example: the ground set may con-
sist of items to be sold in bundles (or subsets) to-
wards revenue maximization, and once bids are pro-
cessed the issue may be tackled as a maximum-weight
set packing problem, with maximum received bids on
bundles as weights (Sandholm, 2002). Given the ex-
ponentially large size of the search space, revenue
maximization often leads to use heuristics with no
worst-case guarantee or, more simply, to sell each
item independently but simultaneously over a suffi-
ciently long time period (Milgrom, 2004).
The approach to set packing problems proposed
in the sequel replaces standard pseudo-Boolean op-
timization (Boros and Hammer, 2002) with a novel
near-Boolean method. While the former employs
MLE to switch from {0,1} to [0,1] as the domain
of each of the n variables, the proposed near-Boolean
method relies on n variables ranging each in the 2
n1
-
84
Rossi, G.
Continuous Set Packing and Near-Boolean Functions.
DOI: 10.5220/0005697800840096
In Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2016), pages 84-96
ISBN: 978-989-758-173-1
Copyright
c
2016 by SCITEPRESS Science and Technology Publications, Lda. All rights reserved
set of extreme points of a unit simplex, and employs
MLE to include the continuum provided by the whole
simplex. The n variables correspond to the elements
i N of the ground set, while the extreme points of
each simplex are indexed by those (feasible) subsets
where each element is included. Then, the MLE of the
resulting near-Boolean function evaluates collections
of fuzzy subsets of N or, equivalently, fuzzy subfami-
lies of feasible subsets A F . The objective function
to be maximized takes thus values over the n-product
of 2
n1
1-dimensional unit simplices, allowing to
design a flexible gradient-based local search.
The following section comprenshively details the
framework for the full-dimensional case F = 2
N
.
This seems useful in general and also allows to clearly
see next that by simply introducing the empty set
/
0
and all n singletons {i} 2
N
into the family F of fea-
sible subsets (with null weights w(
/
0) = 0 = w({i} if
{i} / F ) the whole class of set packing problems may
be framed within the proposed method. The gradient-
based local search differs when switching from the
full-dimensional case to the lower-dimensional one
F 2
N
, as with the latter a cost function c : F N
also enters the picture, in line with greedy approaches
to weighted set packing (Chandra and Halldorsson,
2001). The cost c(A) = |{B : B F ,B A 6=
/
0}| of in-
cluding a feasible subset in the packing is the number
of members with which it has non-empty intersection
(itself included, hence c(A) N,A F ).
Note that maximum-weight set packing may be
tackled through constrained maximization of standard
pseudo-Boolean function v : {0,1}
|F |
R
+
given by v
x
A
1
,. .. ,x
A
|F |
=
1k≤|F |
x
A
k
w(A
k
) , s.t.
A
k
A
l
6=
/
0 x
A
k
+ x
A
l
1 for all 1 k < l |F |,
where x
A
{0,1} for all A F = {A
1
,. .. A
|F |
}. Also,
v can be replaced with xMx ' v, where M is a suit-
able |F | × |F |-matrix and x = (x
A
1
,. .. ,x
A
|F |
) (Ali-
daee et al., 2008). An heuristic then finds a con-
strained maximizer x
, while the corresponding so-
lution is F
= {A : x
A
= 1}. This differs from what is
proposed here, in many respects, the most evident of
which being that v has |F | constrained Boolean vari-
ables, while the expanded MLE developed below has
n unconstrained near-Boolean variables.
2 FULL-DIMENSIONAL CASE
The 2
n
-set {0,1}
n
of vertices of the n-dimensional
unit hypercube [0,1]
n
corresponds one-to-one to the
(power) set 2
N
of subsets A N through character-
istic functions χ
A
: N {0,1},A 2
N
defined by
χ
A
(i) = 1 if i A and χ
A
(i) = 0 if i N\A = A
c
,
while collection {ζ(A,·) : A 2
N
} is a linear basis of
the vector space R
2
n
of real-valued functions w on 2
N
,
where zeta function ζ : 2
N
×2
N
R is the element of
the incidence algebra (Rota, 1964b; Aigner, 1997) of
Boolean lattice (2
N
,, ) defined by ζ(A,B) = 1 if
B A and ζ(A,B) = 0 if B 6⊇ A. Linear combination
w(B) =
A2
N
µ
w
(A)ζ(A,B) =
AB
µ
w
(A) for B 2
N
applies to any w, with M
¨
obius inversion µ
w
: 2
N
R
uniquely given by ( is strict inclusion) µ
w
(A) =
=
BA
(1)
|A\B|
w(B)
with ζ(B,A) = (1)
|A\B|
= w(A)
BA
µ
w
(B) (recursion, with w(
/
0) = 0).
Given this essential combinatorial “analog of the fun-
damental theorem of the calculus” (Rota, 1964b), the
MLE f
w
: [0,1]
n
R of w takes values w(B) =
= f
w
(χ
B
) =
A2
N
iA
χ
B
(i)
!
µ
w
(A) =
AB
µ
w
(A)
on vertices, and f
w
(q) =
A2
N
iA
q
i
!
µ
w
(A) (1)
on any point q = (q
1
,. .. ,q
n
) [0,1]
n
. Convention-
ally,
i
/
0
q
i
:= 1 (Boros and Hammer, 2002, p. 157).
Let 2
N
i
= {A : i A 2
N
} =
{
A
1
,. .. ,A
2
n1
}
be the
2
n1
-set of subsets containing each i N. Simplex
i
=
(
q
A
1
i
,. .. ,q
A
2
n1
i
R
2
n1
+
:
1k2
n1
q
A
k
i
= 1
)
has dimension 2
n1
1 and generic point q
i
i
.
Definition 1. A fuzzy cover q specifies a membership
distribution for each i N over the 2
n1
subsets con-
taining it, i.e. q = (q
1
,. .. ,q
n
)
N
= ×
1in
i
.
Equivalently, q =
q
A
:
/
0 6= A 2
N
,q
A
[0,1]
n
is a 2
n
1-set whose elements q
A
=
q
A
1
,. .. ,q
A
n
are
n-vectors corresponding to non-empty subsets A 2
N
and specifying a membership q
A
i
for each i N, with
q
A
i
[0,1] if i A while q
A
i
= 0 if i A
c
. Fuzzy cov-
ers being collections of points in [0,1]
n
, and the MLE
f
w
of w allowing precisely to evaluate such points,
the global worth W (q) of q
N
is the sum over all
q
A
,A 2
N
of f
w
(q
A
) as defined by (1). That is,
W (q) =
A2
N
f
w
(q
A
) =
A2
N
"
BA
iB
q
A
i
!
µ
w
(B)
#
,
or W (q) =
A2
N
"
BA
iA
q
B
i
!#
µ
w
(A). (2)
Continuous Set Packing and Near-Boolean Functions
85
Example 2. For N = {1, 2,3}, consider w defined by
w({1}) = w({2}) = w({3}) = 0.2, w({1, 2}) = 0.8,
w({1,3}) = 0.3, w({2,3}) = 0.6, w(N) = 0.7. Mem-
bership distributions of elements i = 1,2, 3 over 2
N
i
are q
1
1
,q
2
2
,q
3
3
,
q
1
=
q
1
1
q
12
1
q
13
1
q
N
1
, q
2
=
q
2
2
q
12
2
q
23
2
q
N
2
, q
3
=
q
3
3
q
13
3
q
23
3
q
N
3
.
If ˆq
12
1
= ˆq
12
2
= 1, then any membership q
3
3
yields
W ( ˆq
1
, ˆq
2
,q
3
) = w({1,2})
+
q
3
3
+ q
13
3
+ q
23
3
+ q
N
3
µ
w
({3})
= w({1,2}) +w({3}) = 1.
This means that there is a continuum of fuzzy covers
achieving maximum worth, i.e. 1. In order to select
the one
ˆ
q = ( ˆq
1
, ˆq
2
, ˆq
3
) where ˆq
3
3
= 1, attention must
be placed only on exact ones, defined hereafter.
For any two fuzzy covers q = {q
A
:
/
0 6= A 2
N
}
and
ˆ
q = { ˆq
A
:
/
0 6= A 2
N
}, define
ˆ
q to be a shrinking
of q if there is a subset A, with
iA
q
A
i
> 0 and
ˆq
B
i
=
q
B
i
if B 6⊆ A
0 if B = A
for all B 2
N
,i N,
BA
ˆq
B
i
= q
A
i
+
BA
q
B
i
for all i A.
In words, a shrinking reallocates the whole member-
ship mass
iA
q
A
i
> 0 from A 2
N
to all proper sub-
sets B A, involving all and only those elements i A
with strictly positive membership q
A
i
> 0.
Definition 3. Fuzzy cover q
N
is exact as long as
W (q) 6= W(
ˆ
q) for all shrinkings
ˆ
q of q.
Proprosition 4. If q is exact, then for all A 2
N
i A : q
A
i
> 0
{0,|A|}.
Proof. For
/
0 A
+
(q) =
i : q
A
i
> 0
A 2
N
, with
α = |A
+
(q)| > 1, notice that
f
w
(q
A
) =
BA
+
(q)
iB
q
A
i
!
µ
w
(B). Let shrinking
ˆ
q,
with ˆq
B
0
= q
B
0
if B
0
6∈ 2
A
+
(q)
, satisfy conditions
B2
N
i
2
A
+
(q)
ˆq
B
i
= q
A
i
+
B2
N
i
2
A
+
(q)
q
B
i
for all i A
+
(q)
and
iB
ˆq
B
i
=
iB
q
B
i
+
iB
q
A
i
for all B 2
A
+
(q)
,|B| > 1.
These are 2
α
1 equations with
1kα
k
α
k
> 2
α
variables ˆq
B
i
,B A
+
(q). Thus there is a continuum
of solutions, each providing precisely a shrinking
ˆ
q
where
B2
A
+
(q)
f
w
( ˆq
B
) = f
w
(q
A
) +
B2
A
+
(q)
f
w
(q
B
).
This entails that q is not exact.
Partitions (Aigner, 1997) P = {A
1
,. .. ,A
|P|
} 2
N
of N are families of pairwise disjoint subsets called
blocks, i.e. A
k
A
l
=
/
0,1 k < l |P|, with union
N =
1k≤|P|
A
k
. Any P corresponds to the collection
{χ
A
: A P} of those |P| hypercube vertices identi-
fied by the characteristic functions of its blocks (see
above). Partitions P can thus be seen as p
N
where
p
A
i
= 1 for all A P,i A, i.e. exact fuzzy covers
where each i N concentrates its whole membershisp
on a unique A 2
N
i
, thus justifying the following.
Definition 5. Fuzzy partitions are exact fuzzy covers.
Ojective function W :
N
R includes among
its extremizers (non-fuzzy) partitions. This expands
a result from pseudo-Boolean optimization. Denote
by ex(
i
) the 2
n1
-set of extreme points of
i
. For
q
N
,i N, let q = q
i
|q
i
, with q
i
i
as well as
q
i
N\i
= ×
jN\i
j
. Then, for any w,
W (q) =
A2
N
i
f
w
(q
A
) +
A
0
2
N
\2
N
i
f
w
(q
A
0
) =
=
A2
N
i
BA\i
jB
q
A
j
!
q
A
i
µ
w
(B i) + µ
w
(B)
+
+
A
0
2
N
\2
N
i
B
0
A
0
j
0
B
0
q
A
0
j
0
!
µ
w
(B
0
)
at all q
N
and for all i N. Now define
W
i
(q
i
|q
i
) =
A2
N
i
q
A
i
"
BA\i
jB
q
A
j
!
µ
w
(B i)
#
,
W
i
(q
i
) =
A2
N
i
"
BA\i
jB
q
A
j
!
µ
w
(B)
#
+
+
A
0
2
N
\2
N
i
"
B
0
A
0
j
0
B
0
q
A
0
j
0
!
µ
w
(B
0
)
#
,
yielding W (q) = W
i
(q
i
|q
i
) +W
i
(q
i
). (3)
Proprosition 6. For all q
N
, there are q,q
N
such that
(i) W (q) W (q) W (q) and,
(ii) q
i
,q
i
ex(
i
) for all i N.
Proof. For i N, q
i
N\i
, define w
q
i
: 2
N
i
R by
w
q
i
(A) =
BA\i
jB
q
A
j
!
µ
w
(B i). (4)
Let A
+
q
i
= argmax w
q
i
and A
q
i
= argmin w
q
i
,
with A
+
q
i
6=
/
0 6= A
q
i
at all q
i
. Most importantly,
W
i
(q
i
|q
i
) =
A2
N
i
q
A
i
· w
q
i
(A)
= hq
i
,w
q
i
i, (5)
ICPRAM 2016 - International Conference on Pattern Recognition Applications and Methods
86
where , ·i denotes scalar product. Thus for given
membership distributions of all j N\i, global worth
is affected by is membership distribution through a
scalar product. In order to maximize (or minimize)
W by suitably choosing q
i
for given q
i
, the whole
of is membership mass must be placed over A
+
q
i
(or
A
q
i
), anyhow. Hence there are precisely |A
+
q
i
| > 0
(or |A
q
i
| > 0) available extreme points of
i
. The
following procedure selects (arbitrarily) one of them.
ROUNDUP(w,q)
Initialize: Set t = 0 and q(0) = q.
Loop: While there is a i N with q
i
(t) 6∈ ex(
i
),
set t = t + 1 and:
(a) select some A
A
+
q
i
(t)
,
(b) define, for all j N,A 2
N
,
q
A
j
(t) =
q
A
j
(t 1) if j 6= i
1 if j = i and A = A
0 otherwise
.
Output: Set q = q(t).
Every change q
A
i
(t 1) 6= q
A
i
(t) = 1 (for any
i N,A 2
N
i
) induces a non-decreasing variation
W (q(t)) W(q(t 1)) 0. Hence, the sought q is
provided in at most n iterations. Analogously, replac-
ing A
+
q
i
with A
q
i
yields the sought minimizer q (see
also (Boros and Hammer, 2002, p. 163)).
Remark 7. For i N, A 2
N
i
, if all j A\i 6=
/
0 sat-
isfy q
A
j
= 1, then (4) yields w
q
i
(A) = w(A) w(A\i),
while w
q
i
({i}) = w({i}) regardless of q
i
.
Corollary 8. Some partition P satisfies W (p) W (q)
for all q
N
, with W (p) =
AP
w(A).
Proof. Follows from propositions 4 and 6, with the
above notation associating p
N
to partition P.
Defining global maximizers is clearly immediate.
Definition 9. Fuzzy partition
ˆ
q
N
is a global max-
imizer if W (
ˆ
q) W (q) for all q
N
.
Concerning local maximizers, consider a vector
ω = (ω
1
,. .. ,ω
n
) R
n
++
of strictly positive weights,
with ω
N
=
jN
ω
j
, and focus on the (Nash) equilib-
rium (Mas-Colell et al., 1995) of the game with ele-
ments i N as players, each strategically choosing its
membership distribution q
i
i
while being rewarded
with fraction
ω
i
ω
N
W (q
1
,. .. ,q
n
) of the global worth at-
tained at any strategy profile (q
1
,. .. ,q
n
) = q
N
.
Definition 10. Fuzzy partition
ˆ
q
N
is a local max-
imizer if for all q
i
i
and all i N inequality
W
i
( ˆq
i
|
ˆ
q
i
) W
i
(q
i
|
ˆ
q
i
) holds (see (3)).
This definition of local maximizer entails that the
neighborhood N (q)
N
of any q
N
is
N (q) =
[
iN
n
˜
q :
˜
q = ˜q
i
|q
i
, ˜q
i
i
o
.
Definition 11. The (i,A)-derivative of W at q
N
is
W (q)/q
A
i
= W (q(i, A)) W (q(i,A)) =
= W
i
q
i
(i,A)|q
i
(i,A)
W
i
q
i
(i,A)|q
i
(i,A)
,
with q(i,A) =
q
1
(i,A), .. ., q
n
(i,A)
given by
q
B
j
(i,A) =
q
B
j
for all j N\i, B 2
N
j
1 for j = i, B = A
0 for j = i, B 6= A
,
and q(i,A) =
q
1
(i,A), .. ., q
n
(i,A)
given by
q
B
j
(i,A) =
q
B
j
for all j N\i, B 2
N
j
0 for j = i and all B 2
N
i
,
thus W (q) = {W (q)/q
A
i
: i N,A 2
N
i
} R
n2
n1
is the (full) gradient of W at q. The i-gradient
i
W (q) R
2
n1
of W at q = q
i
|q
i
is set function
i
W (q) : 2
N
i
R defined by
i
W (q)(A) = w
q
i
(A)
for all A 2
N
i
, where w
q
i
is given by (4).
Remark 12. Membership distribution q
i
(i,A) is the
null one: its 2
n1
entries are all 0, hence q
i
(i,A) 6∈
i
.
The setting obtained thus far allows to conceive
searching for a local maximizer partition p
from
given fuzzy partition q as initial candidate solution,
and while maintaing the whole search within the con-
tinuum of fuzzy partitions. This idea may be speci-
fied in alternative ways yielding different local search
methods. One possibility is the following.
LOCALSEARCH(w,q)
Initialize: Set t = 0 and q(0) = q, with require-
ment |{i : q
A
i
> 0}| {0,|A|} for all A 2
N
.
Loop 1: While 0 <
iA
q
A
i
(t) < |A| for a A 2
N
,
set t = t + 1 and
(a) select a A
(t) 2
N
such that
iA
(t)
w
q
i
(t1)
(A
(t))
jB
w
q
j
(t1)
(B)
for all B 2
N
such that 0 <
iB
q
B
j
(t) < |B|,
(b) for i A
(t) and A 2
N
i
, define
q
A
i
(t) =
1 if A = A
(t),
0 if A 6= A
(t),
Continuous Set Packing and Near-Boolean Functions
87
(c) for j N\A
(t) and A 2
N
j
with A A
(t) =
/
0,
define q
A
j
(t) = q
A
j
(t 1)+
+
w(A)
B2
N
j
BA
(t)6=
/
0
q
B
j
(t 1)
B
0
2
N
j
B
0
A
(t)=
/
0
w(B
0
)
1
(d) for j N\A
(t) and A 2
N
j
with A A
(t) 6=
/
0,
define
q
A
j
(t) = 0.
Loop 2: While q
A
i
(t) = 1,|A| > 1 for a i N and
w(A) < w({i}) + w(A\i), set t = t + 1 and define:
q
ˆ
A
i
(t) =
1 if |
ˆ
A| = 1
0 otherwise
for all
ˆ
A 2
N
i
,
q
B
j
(t) =
1 if B = A\i
0 otherwise
for all j A\i, B 2
N
j
,
q
ˆ
B
j
0
(t) = q
ˆ
B
j
0
(t 1) for all j
0
A
c
,
ˆ
B 2
N
j
0
.
Output: Set q
= q(t).
Both ROUNDUP above and LOCALSEARCH yield
a sequence q(0), .. ., q(t
) = q
where q
i
ex(
i
) for
all i N. In the former at the end of each iteration t
the novel q(t) N (q(t 1)) is in the neighborhood
of its predecessor. In the latter q(t) 6∈ N (q(t 1))
in general, as in |P| n iterations of Loop 1 a parti-
tion {A
(1),. .. ,A
(|P|)} = P is generated. Selected
subsets A
(t) 2
N
,t = 1,. .. ,|P| are any of those
where the sum over members i A
(t) of (i, A
(t))-
derivatives W (q(t 1))/q
A
(t)
i
(t 1) is maximal.
Once a block A
(t) is selected, then lines (c) and (d)
make all elements j N\A
(t) redistribute the entire
membership mass currently placed on subsets A
0
2
N
j
with non-empty intersection A
0
A
(t) 6=
/
0 over those
remaining A 2
N
j
such that, conversely, AA
(t) =
/
0.
The redistribution is such that each of these latter gets
a fraction w(A)/
B2
N
j
:BA
(t)=
/
0
w(B) of the newly
freed membership mass
A
0
2
N
j
:A
0
A
(t)6=
/
0
q
A
0
j
(t 1).
The subsequent Loop 2 checks whether the partition
generated by Loop 1 may be improved by exctract-
ing some elements from existing blocks and putting
them in singleton blocks of the final output. In the
limit, set function w may be such that for some el-
ement i N global worth decreases when the ele-
ment joins any subset A 2
N
i
,|A| > 1, that is to say
w(A)w(A\i)w({i}) =
B2
A
\2
A\i
:|B|>1
µ
w
(B) < 0.
Proprosition 13. LOCALSEARCH(W,q) outputs a
local maximizer q
.
Proof. It is plain that the output is a partition P or,
with the notation of corollary 8 above, q
= p. Ac-
cordingly, any element i N is either in a single-
ton block {i} P or else in a block A P,i A
such that |A| > 1. In the former case, any mem-
bership reallocation deviating from p
{i}
i
= 1, given
memberships p
j
, j N\i, yields a cover (fuzzy or
not) where global worth is the same as at p, be-
cause
jB\i
p
B
j
= 0 for all B 2
N
i
\{i} (see exam-
ple 2 above). In the latter case, any membership re-
allocation q
i
deviating from p
A
i
= 1 (given member-
hips p
j
, j N\i) yields a cover which is best seen by
distinguishing between 2
N
i
\A and A. Also recall that
w(A) w(A\i) =
B2
A
\2
A\i
µ
w
(B). Again, all mem-
bership mass
B2
N
i
\A
q
B
i
> 0 simply collapses on sin-
gleton {i} because
jB\i
p
B
j
= 0 for all B 2
N
i
\A.
Hence W (p) W (q
i
|p
i
) = w(A) w({i})+
q
A
i
B2
A
\2
A\i
:|B|>1
µ
w
(B) +
B
0
2
A\i
µ
w
(B
0
)
=
=
p
A
i
q
A
i
B2
A
\2
A\i
:|B|>1
µ
w
(B).
Now assume that q is not a local maximizer, i.e.
W (p) W (q
i
|p
i
) < 0. Since p
A
i
q
A
i
> 0 (because
p
A
i
= 1 and q
i
i
is a deviation from p
i
), then
B2
A
\2
A\i
:|B|>1
µ
w
(B) = w(A) w(A\i) w({i}) < 0.
Hence q cannot be the output of Second Loop.
In local search methods, the chosen initial can-
ditate solution determines what neighborhoods shall
be visited. The range of the objective function in a
neighborhood is a set of real values. In a neighbor-
hood N (p) of a p
N
or partition P only those
AP:|A|>1
|A| elements i A in non-sigleton blocks
A P,|A| > 1 can modify global worth by reallocating
their membership. In view of (the proof of) proposi-
tion 13, the only admissible variations obtain by de-
viating from p
A
i
= 1 with an alternative membership
distribution q
A
i
[0,1), with W (q
i
|p
i
) W (p) equal
to (q
A
i
1)
B2
A
\2
A\i
,|B|>1
µ
w
(B) + (1 q
A
i
)w({i}).
Hence, choosing partitions as initial candidate solu-
tions of LOCALSEARCH is evidently poor. A sen-
sible choice should conversely allow the search to
explore different neighborhoods where the objective
function may range widely. A simplest example of
such an initial candidate solution is q
A
i
= 2
1n
for all
A 2
N
i
and all i N, i.e. the uniform distribution.
On the other hand, the input of local search algo-
rithms is commonly desired to be close to a global
optimum, i.e. a maximizer in the present setting.
ICPRAM 2016 - International Conference on Pattern Recognition Applications and Methods
88
This translates here into the idea of defining the in-
put by means of set function w. In this view, consider
q
A
i
= w(A)/
B2
N
i
w(B), yielding
q
A
i
q
B
i
=
w(A)
w(B)
=
q
A
j
q
B
j
for
all A,B 2
N
i
2
N
j
and all i, j N (see lines (c), (d)).
With a suitable initial candidate solution, the
search may be restricted to explore only a maximum
number of fuzzy partitions, thereby containing the
computational burden. In particular, if q(0) is the
finest partition {{1},...,{n}} or q
{i}
i
(0) = 1 for all
i N, then the search does not explore any neigh-
borhood at all, and such an input coincides with the
output. More reasonably, let A
max
q
= {A
1
,. .. ,A
k
} de-
note the collection of -maximal subsets where input
memberships are strictly positive. That is, q
A
k
0
i
> 0
for all i A
k
0
,1 k
0
k as well as q
B
j
= 0 for all
B 2
N
\
2
A
1
··· 2
A
k
and all j B. Then, the out-
put shall be a partition P each of whose blocks A P
satisfies A A
k
0
for some 1 k
0
k. Hence, by suit-
ably choosing the input q, LOCALSEARCH outputs
a partition with no less than some maximum desired
number k(q) blocks.
3 LOWER-DIMENSIONAL CASE
If F 2
N
, then 2
N
i
F 6=
/
0 for every i N, other-
wise the problem reduces to packing the proper sub-
set N\{i : F 2
N
i
=
/
0} of elements contained in at
least one feasible subset. As outlined in section 1,
without additional notation simply let {
/
0} F 3 {i}
for all i N with null weigths w(
/
0) = 0 = w({i})
if {i} / F . Thus (F ,) is a poset (partially or-
dered set) with bottom element
/
0, and weight func-
tion w : F R
+
has well-defined M
¨
obius inversion
µ
w
: F R (Rota, 1964b). Memberships q
i
distribute
over F
i
= 2
N
i
F = {A
1
,. .. ,A
|F
i
|
}, with lower(|F
i
|)-
dimensional unit simplices
¯
i
=
q
A
1
i
,. .. ,q
A
|F
i
|
i
R
|F
i
|
+
:
1kA
|F
i
|
q
A
k
i
= 1
and corresponding fuzzy covers q
¯
N
= ×
1in
¯
i
.
Note that a fuzzy cover now may maximally con-
sist of |F | 1 points in the unit n-dimensional hy-
percube [0,1]
n
. Accordingly, hypercube [0,1]
n
is re-
placed with C (F ) = co({χ
A
: A F }) [0,1]
n
, i.e.
the convex hull of feasible characteristic functions, re-
garded as n-vectors (Gr
¨
unbaum, 2001). Recursively
(with w(
/
0) = 0), M
¨
obius inversion µ
w
: F R is
µ
w
(A) = w(A)
BF :BA
µ
w
(B),
while the MLE f
w
: C (F ) R of w is
f
w
(q
A
) =
BF 2
A
iB
q
A
i
!
µ
w
(B).
Therefore, every fuzzy cover q
¯
N
has global worth
W (q) =
AF
BF 2
A
iB
q
A
i
!
µ
w
(B).
For all i N,q
i
¯
i
, and q
i
¯
N\i
= ×
jN\i
¯
j
W
i
(q
i
) =
AF
i
"
BF 2
A\i
jB
q
A
j
!
µ
w
(B)
#
+
+
A
0
F \F
i
B
0
F 2
A
0
j
0
B
0
q
A
0
j
0
!
µ
w
(B
0
)
,
W
i
(q
i
|q
i
) =
AF
i
q
A
i
"
BF
i
2
A
jB\i
q
A
j
!
µ
w
(B)
#
,
yielding again
W (q) = W
i
(q
i
|q
i
) +W
i
(q
i
). (6)
From (4) above, w
q
i
: F
i
R now is
w
q
i
(A) =
BF
i
2
A
jB\i
q
A
j
!
µ
w
(B) (7)
for all i N, all A F
i
and all q
i
¯
N\i
.
For each i N, denote by ex(
¯
i
) the set of |F
i
|
extreme points of simplex
¯
i
. Like in the full-
dimensional case, at any fuzzy cover
ˆ
q
¯
N
every
i N such that ˆq
i
6∈ ex(
¯
i
) may deviate by concen-
trating its whole membership on some A F
i
such
that w
ˆ
q
i
(A) w
ˆ
q
i
(B) for all B F
i
. This yields a
non-decreasing variation W (q
i
|
ˆ
q
i
) W (
ˆ
q) in global
worth, with q
i
ex(
¯
i
). When all n elements do
so, one after the other while updating w
q
i
(t)
as in
ROUNDUP above, i.e. t = 0, 1,. .., then the final
q = (q
1
,. .. ,q
n
) satisfies q ×
iN
ex(
¯
i
). Yet cases
F 2
N
and F = 2
N
are different in terms of exact-
ness. Specifically, consider any
/
0 6= A F such that
|{i : q
A
i
= 1}| 6∈ {0,|A|} or A
+
q
= {i : q
A
i
= 1} A, with
f
w
(q
A
) =
BF 2
A
+
q
µ
w
(B). Then, F 2
A
+
q
is likely
to admit no shrinking (see above) yielding an exact
fuzzy cover with same global worth as (non-exact) q.
Proprosition 14. The values taken on exact fuzzy
covers do not saturate the range of W :
¯
N
R
+
.
Continuous Set Packing and Near-Boolean Functions
89
Proof. Consider this example: N = {1, 2,3, 4}
and F = {N,{4}, {1,2}, {1,3}, {2,3}}, with worth
w(N) = 3, w({4}) = 2, w({i, j}) = 1 for 1 i < j 3.
Define q = (q
1
,. .. ,q
4
) by q
{4}
4
= 1 = q
N
i
,i = 1,2,3,
with non-exactness |{i : q
N
i
> 0}| = 3 < 4 = |N|. As
W (q) = w({4}) +
1i< j3
w({i, j}) = 2 +1 + 1 + 1
and A
+
q
= {1,2,3}, for all distributions ˆq
1
, ˆq
2
, ˆq
3
placing membership only over feasible B F 2
A
+
q
global worth is W ( ˆq
1
, ˆq
2
, ˆq
3
,q
4
) < W (q).
This observation simply indicates that an arbitrary
search for optimal fuzzy covers may yield a maxi-
mizer (global or local) which is not reducible to any
feasible solution of the original set packing problem.
On the other hand, such feasible solutions are parti-
tions P all of whose blocks are feasible, and where
singleton blocks with worth 0 are not included in the
packing. In fact, similarly to the full-dimensional
case, fairly simple conditions may be shown to be suf-
ficient for a partition to be a local maximizer.
Definition 15. Any ˆq
i
|
ˆ
q
i
=
ˆ
q
¯
N
is a local maxi-
mizer of W :
¯
N
R
+
if W
i
( ˆq
i
|
ˆ
q
i
) W
i
(q
i
|
ˆ
q
i
) for
all i N and all q
i
¯
i
(see (6) above).
The neighborhood N (q)
¯
N
of q
¯
N
thus is
N (
ˆ
q) =
[
iN
n
q : q = q
i
|
ˆ
q
i
,q
i
¯
i
o
.
Any partition P with A F for each block A P
clearly has associated p such that p ×
iN
ex(
¯
i
)
¯
N
.
Proprosition 16. Any partition P with associated p
such that p
¯
N
is a local maximizer if for all A P
w(A) w({i}) +
ˆ
BF 2
A\i
µ
w
(
ˆ
B).
Proof. Firstly note that for all blocks A P, if any,
such that |A| = 1 there is nothing to prove, as the
summation reduces to w(
/
0) = 0, and thus there only
remains w({i}) w({i}). Accordingly, let A P
and |A| > 1. For every i A, any membership re-
allocation q
i
¯
i
deviating from p
i
(i.e. p
A
i
= 1),
given memberships p
i
of other elements j N\i (i.e.
¯
j
3 p
A
0
j
= 1 for all A
0
P and all j A
0
), yields
q = (q
i
|p
i
)
¯
N
which is best analyzed by distin-
guishing between F
i
\A and A. In particular,
w(A) = w({i}) +
BF
i
2
A
|B|>1
µ
w
(B) +
ˆ
BF 2
A\i
µ
w
(
ˆ
B).
All membership mass
BF
i
\A
q
B
i
> 0 collapses on
singleton {i}, because
i
0
B\i
p
B
i
0
= 0 for all B F
i
\A
by the definition of p
i
(see example 2 above). Thus,
W (p) W (q
i
|p
i
) = w(A) w({i})+
q
A
i
BF
i
2
A
|B|>1
µ
w
(B) +
ˆ
BF 2
A\i
µ
w
(
ˆ
B)
=
=
p
A
i
q
A
i
BF
i
2
A
|B|>1
µ
w
(B).
Now assume that p is not a local maximizer, i.e.
W (p) W(q
i
|p
i
) < 0. Since p
A
i
q
A
i
> 0 because
p
A
i
= 1 and q
i
¯
i
is a deviation from p
i
, then
BF
i
2
A
|B|>1
µ
w
(B) = w(A) w({i})
ˆ
BF 2
A\i
µ
w
(
ˆ
B) < 0
must hold. This contradicts precisely the premise
w(A) w({i}) +
ˆ
BF 2
A\i
µ
w
(
ˆ
B) for all A P and
i A, thus completing the proof.
4 LOCAL SEARCH WITH COST
In order to design a gradient-based local search for
this lower-dimensional case, the only tool still miss-
ing is the derivative, which clearly shall reproduce
definition 11 above with F
i
in place of 2
N
i
. Before
that, as outlined in section 1, let c : F N count the
number c(A) = |{B : B F ,B A 6=
/
0}| of feasible
subsets with which each A F has non-empty inter-
section, itself included, i.e. c(A) {1, .. ., |F |} is the
cost of including A in the packing. Accordingly, the
underlying poset function ˆw : F R
+
now used (still
taking positive values only) incorporates both weights
w(A),A F (used thus far) and costs by means of ra-
tio ˆw(A) =
w(A)
c(A)
. The result is quasi-objective function
ˆ
W :
¯
N
R
+
, obtained via MLE f
ˆw
: C (F ) R
+
of
ˆw, i.e.
ˆ
W (q) =
AF
f
ˆw
(q
A
), and all of the above ap-
plies invariately simply replacing W with
ˆ
W .
Definition 17. The (i,A)-derivative of
ˆ
W at q
¯
N
,
A F
i
, is
ˆ
W (q)/q
A
i
=
ˆ
W (q(i,A))
ˆ
W (q(i,A)) =
=
ˆ
W
i
q
i
(i,A)|q
i
(i,A)
ˆ
W
i
q
i
(i,A)|q
i
(i,A)
,
with q(i,A) =
q
1
(i,A), .. ., q
n
(i,A)
given by
q
B
j
(i,A) =
q
B
j
for all j N\i, B F
j
1 for j = i, B = A
0 for j = i, B 6= A
,
and q(i,A) =
q
1
(i,A), .. ., q
n
(i,A)
given by
q
B
j
(i,A) =
q
B
j
for all j N\i, B F
j
0 for j = i and all B F
i
.
ICPRAM 2016 - International Conference on Pattern Recognition Applications and Methods
90
The (full) gradient of
ˆ
W at q
¯
N
is
ˆ
W (q) =
ˆ
W (q)/q
A
i
: i N, A F
i
R
iN
|F
i
|
as well as the i-gradient
i
ˆ
W (q) R
|F
i
|
of
ˆ
W at
q = (q
i
|q
i
)
¯
N
is poset function
i
ˆ
W (q) : F
i
R
defined by
i
ˆ
W (q)(A) = ˆw
q
i
(A) for all A F
i
, where
ˆw
q
i
is given by (7) with ˆw in place of w. Again, mem-
bership distribution q
i
(i,A) is the null one: its |F
i
|
entries are all 0, hence q
i
(i,A) 6∈
¯
i
.
LS-WITHCOST( ˆw, q)
Initialize: Set t = 0 and q(0) = q, with require-
ment |{i : q
A
i
> 0}| {0,|A|} for all A F , ˆw(A) > 0.
Loop 1: While 0 <
iA
q
A
i
(t) < |A| for a A F ,
set t = t + 1 and:
(a) select a A
(t) F such that
min
iA
(t)
ˆw
q
i
(t1)
(A
(t)) min
jB
ˆw
q
j
(t1)
(B)
for all B 2
N
such that 0 <
iB
q
B
j
(t) < |B|,
(b) for i A
(t) and A F
i
, define
q
A
i
(t) =
1 if A = A
(t),
0 if A 6= A
(t),
(c) for j N\A
(t) and A F
j
with A A
(t) =
/
0,
define q
A
j
(t) = q
A
j
(t 1)+
+
ˆw(A)
BF
j
BA
(t)6=
/
0
q
B
j
(t 1)
B
0
F
j
B
0
A
(t)=
/
0
ˆw(B
0
)
1
(d) for j N\A
(t) and A F
j
with A A
(t) 6=
/
0,
define
q
A
j
(t) = 0.
(e) for A F with AA
(t) =
/
0, update cost function
by c(A) = |{B : B F ,BA 6=
/
0 = BA
(t)}| and
plug it into ˆw.
Loop 2: While q
A
i
(t) = 1, |A| > 1 for a i N and
w(A) < w({i}) +
ˆ
BF 2
A\i
µ
w
(
ˆ
B),
set t = t + 1 and define:
q
ˆ
A
i
(t) =
1 if |
ˆ
A| = 1
0 otherwise
for all
ˆ
A F
i
,
q
B
j
(t) =
1 if B = A\i
0 otherwise
for all j A\i, B F
j
,
q
ˆ
B
j
0
(t) = q
ˆ
B
j
0
(t 1) for all j
0
A
c
,
ˆ
B F
j
0
.
Output: Set q
= q(t).
Both LOCALSEARCH and LS-WITHCOST gen-
erate in |P| n iterations of Loop 1 a partition
{A
(1),. .. ,A
(|P|)} = P. Now selected blocks
A
(t) F , 1 t |P| are any of those feasible
subsets where the minimum over elements i A
(t)
of (i,A
(t))-derivatives
ˆ
W (q(t 1))/q
A
(t)
i
(t 1)
is maximal. The following Loop 2 again checks
whether the partition generated by Loop 1 may be
improved by exctracting some elements from exist-
ing blocks and putting them in singleton blocks of the
final output, which thus allows for the following.
Proprosition 18. LS-WITHCOST(W,q) outputs a lo-
cal maximizer q
.
Proof. Follows from proposition 16 since Loop 2
deals with w, not with ˆw.
Concerning input q = q(0), consider again setting
q
A
i
=
ˆw(A)
BF
i
ˆw(B)
for all A F
i
,i N, which entails
q
A
i
q
B
i
=
w(A)c(B)
w(B)c(A)
=
q
A
j
q
B
j
for all A,B F
i
F
j
,i, j N.
Evidently, Loop 1 may take exactly the same form
as in LOCALSEARCH, that is with selected blocks
A
(t) F ,t = 1,...,|P| of the generated partition P
being any of those feasible subsets where the sum,
rather than the minimum, over elements i A
(t)
of (i,A
(t))-derivatives
ˆ
W (q(t 1))/q
A
(t)
i
(t 1)
is maximal. This possibility may be useful in those
settings where set packing appears in its weighted
version, while using the minimum in place of the
sum may be interesting for k-uniform set packing
problems (see section 1). In fact, for the k-uniform
case M
¨
obius inversion is µ
ˆw
(A) =
1
c(A)
if |A| = k and
µ
ˆw
(A) = 0 if |A| {0, 1} for all A F (recall the con-
vention {
/
0} F 3 {i} for all i N), with the cost
function iteratively updated in line (e). It is also plain
that in k-uniform set packing Loop 2 is ineffective.
5 NEAR-BOOLEAN FUNCTIONS
Boolean functions (Crama and Hammer, 2011) pro-
vide key analytical tools and methods with a variety of
important applications. Beyond set packing problems
that here constitute the main benchmark, this section
further develops the full-dimensional case detailed in
section 2 with the aim to indicate additional oppor-
tunities obtained from expanding the standard frame-
work where pseudo-Boolean models are traditionally
exploited. Recall that Boolean functions of n vari-
ables have form f : {0, 1}
n
{0,1}, and constitute
Continuous Set Packing and Near-Boolean Functions
91
a subclass of pseudo-Boolean functions f : {0,1}
n
R, which in turn admit the unique MLE
ˆ
f : [0,1]
n
R
over the whole n-dimensional unit hypercube exten-
sively employed thus far. The n variables thus range
each in the unit interval [0,1]. Such a setting is here
expanded by letting each variable i = 1,...,n range
in a 2
n1
1-dimensional simplex
i
, with the goal
to evaluate collections of fuzzy subsets of a n-set
through the MLE given by (1) and (2).
Definition 19. Near-Boolean functions of n variables
have form F : ×
1in
ex(
i
) R.
Following (Hammer and Holzman, 1992, p. 4),
denote by N = {1,...,n} the set of indices of vari-
ables (i.e., the ground set in previous sections).
As already observed, any pseudo-Boolean function
has a unique expression as a multilinear polynomial
f (x
1
,. .. ,x
n
) =
AN
α
A
iA
x
i
in n variables (Boros
and Hammer, 2002, p. 162), since α
A
,A 2
N
is in fact
the M
¨
obius inversion (Rota, 1964b) of a unique set
function w : 2
N
R such that w(A) = f (χ
A
), where
χ
A
is the characteristic function defined in section 2.
Definition 20. The MLE
ˆ
F of near-Boolean
functions F has polynomial form
ˆ
F : ×
1in
i
R
given by expression (2) in section 2, that is
ˆ
F(q) =
A2
N
"
BA
iA
q
B
i
!#
µ
w
(A), with (see above)
q = (q
1
,. .. ,q
n
) and q
i
= (q
A
1
i
,. .. ,q
A
2
n1
i
)
i
.
5.1 Bounded-degree LS
Approximations
In line with (Hammer and Holzman, 1992), the issue
of approximating a given near-Boolean function F by
means of the least squares LS criterion amounts to
determine a near-Boolean function F
k
such that
q ×
iN
ex(
i
)
[F(q) F
k
(q)]
2
(8)
attains its minimum over all near-Boolean functions
F
k
with polynomial MLE
ˆ
F
k
of degree k, that is
ˆ
F
k
(q) =
A2
N
|A|≤k
"
BA
iA
q
B
i
!#
µ
w
(A),
or, equivalently stated in terms of the underlying set
function w, such that µ
w
(A) = 0 if |A| > k.
Near-Boolean functions F take their values on n-
product ×
iN
ex(
i
), and |ex(
i
)| = 2
n1
for each i N.
They might thus be regarded as points F R
2
n(n1)
in
a 2
n(n1)
-dimensional vector space. In view of propo-
sition 4 formalizing exactness, this seems conceptu-
ally incorrect and with useless enumerative demand.
Specifically, for every partition P P
N
with associ-
ated p ×
iN
ex(
i
), there clearly exist many non-exact
q ×
iN
ex(
i
) such that F(q) = F(p) (see corollary 8
above). Counting these redundant extreme points of
simplices appears wothless. Hence k-degree approxi-
mation is to be dealt with by replacing expression (8)
with the following, applying to partitions p P only:
p ×
iN
ex(
i
)
with pPP
N
[F(p) F
k
(p)]
2
. (9)
The number |P
N
| of partitions of a n-set is given by
Bell number B
n
(Rota, 1964a; Aigner, 1997). Ac-
cordingly, near-Boolean functions might be regarded
as points F R
B
n
in a B
n
-dimensional vector space.
Still, this also is far too large, as points in such a vec-
tor space correspond in fact to generic partition func-
tions, i.e. with M
¨
obius inversion free to live on every
partition P P
N
. Conversely, near-Boolean functions
factually involve only partition functions h : P
N
R
such that h(P) = h
w
(P) =
AP
w(A) for some set
function w : 2
N
R. The M
¨
obius inversion of these
partition functions lives only on the 2
n
n modular el-
ements (Stanley, 1971) of lattice (P
N
,, ), namely
on those partitions with a number of non-sigleton
blocks 1. When regarded as points in a vector space
(i.e. expressed as a linear combination of a basis, see
above) these functions may be seen as h
w
R
2
n
n
.
This is shown below via recursion through the M
¨
obius
inversion of additively separable partition functions
(Gilboa and Lehrer, 1990; Gilboa and Lehrer, 1991).
It seems crucial emphasizing that while pseudo-
Boolean functions admit a unique set function pro-
viding their best k-degree approximation, 0 k n
(Hammer and Holzman, 1992), every near-Boolean
function admits a continuum of set functions w deter-
mining their unique best k-degree approximation. In
particular, consider first the linear (i.e. k = 1) case:
the issue is to find a best (least squares) approxima-
tion F
1
of any given F. That is, the set function w
determining F
1
has to satisfy w(A) =
iA
w({i}) for
all A 2
N
. Then,
h
w
(P) =
AP
w(A) =
AP
iA
w({i}) = w(N)
for all P P
N
. Thus h
w
is a constant partition func-
tion, or a valuation (Aigner, 1997) of partition lat-
tice (P
N
,, ). Also, any further linear v : 2
N
R
ICPRAM 2016 - International Conference on Pattern Recognition Applications and Methods
92
such that v(N) = w(N) also satisfies h
v
(P) = h
w
(P)
for all P P
N
. In other terms, there is a contin-
uum of equivalent linear v 6= w such that h
w
= h
v
,
obtained each by distributing arbitrarily the whole of
w(N) over the n singletons {i} 2
N
. Cases k > 1
maintain this same feature: consider a set function w
such that µ
w
(A) 6= 0 for one or more (possibly all
n
k
)
subsets A 2
N
such that |A| = k. Now fix arbitrarily n
values v({i}),i N with
iN
w({i}) =
iN
v({i}).
For all A 2
N
,|A| > 1 M
¨
obius inversion µ
v
: 2
N
R
can always be determined uniquely via recursion by
v(A) +
iA
c
v({i}) =
BA
µ
v
(B) +
iA
c
v({i}) =
= w(A) +
iA
c
w({i}) =
BA
µ
w
(B) +
iA
c
w({i}).
If set function w additively separates partition func-
tion h, i.e. h = h
w
, and v
0
= w v is a linear set
function, then v + v
0
also additively separates h, i.e.
h
w
= h
v+v
0
. Hence, there is a continuum of equivalent
set functions w and v + v
0
available for the sought k-
degree approximation F
k
, but still the B
n
values taken
by F
k
(more precisely, the 2
n
n values taken by F
k
on
the modular elements of partition lattice (P
N
,, ))
are unique and independent from the chosen set func-
tion in the continuum of set functions w,v + v
0
avail-
able, each determining an equivalent MLE
ˆ
F of F.
5.2 A Continuum of Polynomials
In this work, two lattices play a central role, namely
the Boolean lattice (2
N
,, ) of subsets of N ordered
by inclusion , and the geometric lattice (P
N
,, )
of partitions of N ordered by coarsening > (Aigner,
1997; Stern, 1999). Both, of course, are posets (par-
tially ordered sets), and M
¨
obius inversion applies to
any (locally finite) poset, provided a bottom element
exists (Rota, 1964b). The bottom subset is
/
0, while
the bottom partition P
= {{1},...,{n}} is the finest
one. For a lattice (L, ,) ordered by > and with
generic elements x,y,z L, any lattice function f :
L R has M
¨
obius inversion µ
f
: L R given by
µ
f
(x) =
x
6y6x
µ
L
(y,x) f (y), where x
is the bottom
element and µ
L
is the M
¨
obius function, defined recur-
sively on ordered pairs (y,x) L × L by µ
L
(y,x) =
y6z<x
µ
L
(z,x) if y < x (i.e. y 6 x and y 6= x) as
well as µ
L
(y,x) = 1 if y = x, while µ
L
(y,x) = 0 if
y 66 x. The M
¨
obius function of the subset lattice im-
plicitly appears since the beginning of this work, and
is µ
2
N
(B,A) = (1)
|A\B|
, with B A. Concerning
the M
¨
obius function of P
N
, given any two partitions
P,Q P
N
, if Q < P = {A
1
,. .. ,A
|P|
}, then for every
block A P there are blocks B
1
,. .. ,B
k
A
Q such
that A = B
1
··· B
k
A
, with k
A
> 1 for at least one
A P. Segment [Q, P] = {P
0
: Q 6 P
0
6 P} is thus
isomorphic to product ×
AP
P (k
A
), where P (k) de-
notes the lattice of partitions of a k-set. Accordingly,
let m
k
= |{A : k
A
= k}| for k = 1,. .. ,n. Then (Rota,
1964b, pp. 359-360),
µ
P
N
(Q,P) = (1)
n+
1kn
m
k
1<k<n
(k!)
m
k+1
.
If a partition function h : P
N
R admits a set func-
tion v : 2
N
R to satisfy h(P) =
AP
v(A) for all
P P
N
, then it may be said to be additively separable
(Gilboa and Lehrer, 1990; Gilboa and Lehrer, 1991),
with the notation h = h
v
. As already outlined, any
such an additively separable partition function h = h
v
has M
¨
obius inversion µ
h
v
that lives only on the mod-
ular elements of the partition lattice, i.e. where only
one block, at most, has cardinality > 1. That is to
say, together with the bottom P
and top P
>
= {N},
all other modular elements are those partitions of the
form {A} P
A
c
for A 2
N
such that 1 < |A| < n,
where P
A
c
is the finest partition of A
c
(Aigner, 1997,
Ex. 13, p. 71). The total number of these modular
partitions thus is 2
n
n. The M
¨
obius inversion of an
additively separable partition function h
v
is detailed
hereafter; see (Gilboa and Lehrer, 1990, Prop. 4.4, p.
138 and Appendix, p. 144) and (Gilboa and Lehrer,
1991, Prop. 3.3, p. 452).
Proprosition 21. If h = h
v
, then h = h
w
for a contin-
uuum of set functions w : 2
N
R,w 6= v.
Proof. Firstly, by direct substitution, for all P P
N
,
µ
h
v
(P) =
AP
BA
v(B)
Q6P:BQ
µ
P
N
(Q,P).
Secondly, if P 6= {B} P
B
c
, then the recursive defini-
tion of M
¨
obius function µ
P
N
yields
Q6P:BQ
µ
P
N
(Q,P) = 0.
M
¨
obius inversion µ
h
v
thus takes non-zero values only
on modular elements, where it obtains recursively by
µ
h
v
(P
) =
iN
v({i}), µ
h
v
(P
>
) = µ
v
(N) as well as
µ
h
v
({A} P
A
c
) = µ
v
(A) for 1 < |A| < n. In other
terms, any w 6= v satisfying
iN
v({i}) =
iN
w({i})
and µ
v
(A) = µ
w
(A) for all A 2
N
,|A| > 1 also addi-
tively separates h, i.e. h
v
= h
w
.
In view of corollary 8, the setting considered in
this work deals precisely with additively separable
partition functions, and thus the polynomial expres-
sion (2) in section 2 is not unique. Specifically, recall
that the degree of a polynomial is the highest degree
of its terms. Hence in (2), for any chosen set function
w additively separating partiton function h = h
w
, the
degree is max{|A| : µ
w
(A) 6= 0}, while every non-zero
Continuous Set Packing and Near-Boolean Functions
93
value of M
¨
obius inversion µ
w
: 2
N
R is a coefficient
of the polynomial. The only degree k such that there
is a unique set function available for polynomial ex-
pression (2) is k = 0, in which case the unique addi-
vitely separating set function w is trivial: w(A) = 0
for all A 2
N
. On the other hand, for any degree
k, 0 < k n there exists a continuum of set func-
tions available for additive separability and such that
max{|A| : µ
w
(A) 6= 0} = k, each defining alternative
but equivalent coefficients of the polynomial.
6 NEAR-BOOLEAN GAMES
In view of the above definition of local maximiz-
ers relying on equilibrium conditions for strategic n-
player games, and having mentioned additive separa-
blity of partition functions or global games (Gilboa
and Lehrer, 1990), it seems now useful to regard vari-
ables as players in near-Boolean coalition formation
games (for pseudo-Boolean functions and coalitional
games see (Hammer and Holzman, 1992, section 3)).
Definition 22. A near-Boolean n-player game is a
triple (N,F,π) such that N = {1,.. ., n} is the player
set and F is a near-Boolean function taking real
values on profiles q = (q
1
,. .. ,q
n
) ×
iN
ex(
i
) of
strategies, while π : ×
iN
ex(
i
) R
n
efficiently as-
signs payoffs π(q) = (π
1
(q),. .. ,π
n
(q)) to players, i.e.
iN
π
i
(q) = F(q) at all q ×
iN
ex(
i
).
Definition 23. A fuzzy near-Boolean n-player game is
a triple (N,
ˆ
F, π) such that N = {1,...,n} is the player
set and
ˆ
F is the MLE of a near-Boolean function tak-
ing values on profiles q = (q
1
,. .. ,q
n
) ×
iN
i
, while
π : ×
iN
i
R
n
efficiently assigns payoffs to players,
i.e.
iN
π
i
(q) =
ˆ
F(q) at all q ×
iN
i
.
In both near-Boolean games and fuzzy ones the
player set is finite. Given this, a main distinction
is between games where players have either finite
or else infinite sets of strategies, with near-Boolean
games in the former class and fuzzy ones in the lat-
ter. In addition, players may play either determinis-
tic (i.e. pure) or else random (i.e. mixed) strategies.
In the latter case equilibrium conditions are stated
in terms of expected payoffs, and by means of fixed
point arguments for upper hemicontinuous correspon-
dences such conditions are commonly fulfilled (Mas-
Colell et al., 1995, p. 260). The sets of deterministic
strategies in fuzzy near-Boolean games are precisely
the sets of random strategies in near-Boolean games.
Nevertheless, the payoffs for the fuzzy setting are not,
in general, expectations.
The main framework where these games seem in-
teresting is coalition formation, which combines both
strategic and cooperative games. A generic strat-
egy profile q ×
iN
ex(
i
) of near-Boolean (non-fuzzy)
games may well fail to be exact (see proposition 4),
but it seems plain that there is a unique partition P
of N with associated p ×
iN
ex(
i
) obtained through
shrinkings of q and such that F(p) = F(q). Let
p(q) be such a unique p. In view of the above dis-
cussion, it is also evident that for every p there are
many (non-exact) q such that p=p(q). In these terms,
near-Boolean games model stategic coalition forma-
tion in a very handy manner, in that they totally by-
pass the need to define a mechanism mapping strategy
profiles into partitions of players or coalition struc-
tures (Slikker, 2001). More precisely, a mechanism
is a mapping M : ×
iN
ex(
i
) P
N
such that when
each player i N specifies a coalition A
i
2
N
i
, then
M(A
1
,. .. ,A
n
) = P is a resulting coalition structure.
If the n specified coalitions A
i
,i N are such that
for some partition P it holds A
i
= A for all i A
and all A P, then M(A
1
,. .. ,A
n
) = P. Otherwise,
the generated partition P
0
= M(A
1
,. .. ,A
n
) depends
on what mechanism is chosen, and generally may be
a rather fine one, i.e. possibly consisting of many
small blocks. Conversely, near-Boolean games do not
need any mechanism, in that even if players’ strate-
gies (q
1
,. .. ,q
n
) = q are such that q does not corre-
spond to a partition, still the global worth F(q) is that
attained at the partition P with corresponding p(q),
i.e. whose blocks A P each include maximal sub-
sets of players choosing the same superset A
0
A.
Given coalitional game v : 2
N
R
+
,v(
/
0) = 0,
with F(p) =
AP
v(A) for all partitions P p, let
payoffs be defined, for i A and A P and p = p(q),
by π
i
(q) =
B2
A
\2
A\i
µ
w
(B)
|B|
for all q ×
jN
ex(
j
),
where P thus is the partition with associated p(q).
Apart from the absence of any coalition structure gen-
eration mechanism (see above), this is in fact a well-
known coalition formation game (Slikker, 2001), with
payoffs given by the Shapley value (Roth, 1988).
Definition 24. A local maximizer of near-Boolean
function F is any q ×
iN
ex(
i
) such that for all i N
and all q
0
i
ex(
i
) inequality F(q) F(q
0
i
|q
i
) holds.
Remark 25. If payoffs are given by π
i
(q) =
ω
i
F(q)
jN
ω
j
for all i N, with ω
1
,. .. ,ω
n
> 0, then near-Boolean
games are (pure) common interest potential games
(Monderer and Shapley, 1996; Bowles, 2004). That
is to say, the set of equilibria of (N,F,π) coincides
ICPRAM 2016 - International Conference on Pattern Recognition Applications and Methods
94
with the set of local maximizers of F, and players’
preferences all agree on the set of strategy profiles.
In artificial intelligence, these games may model
coalition formation in multiagent systems (Rahwan
and Jenning, 2007; Conitzer and Sandholm, 2006).
7 CONCLUSIONS AND FUTURE
WORK
Via polynomial MLE, near-Boolean functions of
n variables take values on the n-product of high-
dimensional unit simplices, thus enabling to approach
discrete optimization problems, namely set pack-
ing/partitioning, through an objective function de-
fined over a continuous domain, and with feasible so-
lutions found at extreme points of the simplices. Least
squares approximations with polynomials of bounded
degree are discussed in terms of additive separabil-
ity of partition functions, while near-Boolean n-player
games flexibly model strategic coalition formation.
Apart from settings in artificial intelligence such
as combinatorial auctions and coalition structure gen-
eration in multiagent systems discussed above, near-
Boolean optimization seems generally interesting for
objective function-based clustering (Rossi, 2015), and
for graph clustering in particular (Schaeffer, 2007).
Specifically, for a given weighted graph, edge weights
and vertices’ weighted degrees may be used (in alter-
native ways) to obtain a quadratic MLE in expression
(2), i.e. a polynomial with degree 2. Then, optimiza-
tion with respect to such an objective function pro-
vides a method for partitioning the vertex set. This de-
serves separate investigation in a forthcoming work.
Along an alternative route, through the core con-
cept in cooperative game theory, it may be interesting
to consider the following problem: for given generic
(i.e. non-additively separable) global game or parti-
tion function h R
B
n
, determine a quadratic MLE in
(2), i.e. a set function w such that µ
w
(A) = 0 for all
A 2
N
,|A| > 2, satisfying h
w
(P) h(P) for all parti-
tions P P
N
of players, where h
w
(P) =
AP
w(A),
and with h
w
(P
>
) = h(P
>
) for the coarsest partition.
The idea behind this problem comes from a novel ap-
proach to the solution of global games h (Gilboa and
Lehrer, 1990), and shall be explored in future work.
REFERENCES
Aigner, M. (1997). Combinatorial Theory. Springer.
Reprint of the 1979 Edition.
Alidaee, B., Kochenberger, G., Lewis, K., Lewis, M., and
Wang, H. (2008). A new approach for modeling and
solving set packing problems. European Journal of
Operational Research, 186(2):504 – 512.
Boros, E. and Hammer, P. (2002). Pseudo-Boolean opti-
mization. Discrete App. Math., 123:155–225.
Bowles, S. (2004). Microeconomics: Behavior, Institutions,
and Evolution. Princeton University Press.
Chandra, B. and Halldorsson, M. M. (2001). Greedy local
improvement and weighted set packing. Journal of
Algorithms, (39):223–240.
Conitzer, V. and Sandholm, T. (2006). Complexity of
constructing solutions in the core based on synergies
among coalitions. Artificial Intel., 170:607–619.
Crama, Y. and Hammer, P. L. (2011). Boolean Functions:
Theory, Algorithms, and Applications. Cambridge
University Press.
Gilboa, I. and Lehrer, E. (1990). Global games. Interna-
tional Journal of Game Theory, (20):120–147.
Gilboa, I. and Lehrer, E. (1991). The value of information -
an axiomatic approach. Journal of Mathematical Eco-
nomics, 20(5):443–459.
Gr
¨
unbaum, B. (2001). Convex Polytopes 2nd ed. Springer.
Hammer, P. and Holzman, R. (1992). Approximations of
pseudo-Boolean functions; applications to game the-
ory. Math. Methods of Op. Res. - ZOR, 36(1):3–21.
Hazan, E., Safra, S., and Schwartz, O. (2006). On the
complexity of approximating k-set packing. Compu-
tational Complexity, (15):20–39.
Korte, B. and Vygen, J. (2002). Combinatorial Optimiza-
tion. Theory and Algorithms. Springer.
Mas-Colell, A., Whinston, M. D., and Green, J. R. (1995).
Microeconomic Theory. Oxford University Press.
Milgrom, P. (2004). Putting Auction Theory to Work. Cam-
bridge University Press.
Monderer, D. and Shapley, L. S. (1996). Potential games.
Games and Economic Behavior, 14(1):124–143.
Papadimitriou, C. (1994). Computational Complexity. Ad-
dison Wesley.
Rahwan, T. and Jenning, N. (2007). An algorithm for dis-
tributing coalitional value calculations among cooper-
ating agents. Artificial Intelligence, 171:535–567.
Rossi, G. (2015). Multilinear objective function-based clus-
tering. In Proc. 7th Int. J. Conf. on Computational
Intelligence, volume 2 (FCTA), pages 141–149.
Rota, G.-C. (1964a). The number of partitions of a set.
American Mathematical Monthly, 71:499–504.
Rota, G.-C. (1964b). On the foundations of combinatorial
theory I: theory of M
¨
obius functions. Z. Wahrschein-
lichkeitsrechnung u. verw. Geb., 2:340–368.
Roth, A. (1988). The Shapley value. Cambridge Univ. Press.
Sandholm, T. (2002). Algorithm for optimal winner deter-
mination in combinatorial auctions. Artificial Intelli-
gence, (135):1–54.
Schaeffer, S. E. (2007). Graph clustering. Computer Sci-
ence Review, 1(1):27–64.
Slikker, M. (2001). Coalition formation and potential
games. Games and Econ. Behavior, 37(2):436 – 448.
Stanley, R. (1971). Modular elements of geometric lattices.
Algebra Universalis, (1):214–217.
Continuous Set Packing and Near-Boolean Functions
95
Stern, M. (1999). Semimodular Lattices. Theory and Appli-
cations. Cambridge University Press.
Trevisan, L. (2001). Non-approximability results for opti-
mization problems on bounded degree instances. In
Proc. 33rd ACM Symp. on Theory of Computing,
pages 453–461.
ICPRAM 2016 - International Conference on Pattern Recognition Applications and Methods
96