RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR
REAL-TIME PLAYER TRACKING IN SPORTS
Nicolai v. Hoyningen-Huene and Michael Beetz
Intelligent Autonomous Systems Group, Technische Universit
¨
at M
¨
unchen, Boltzmannstr. 3, D-85748 Garching, Germany
Keywords:
Multi-target tracking, Particle filter, Rao-Blackwellization, Kalman filter, Resampling.
Abstract:
Tracking multiple targets with similiar appearance is a common task in computer vision applications, espe-
cially in sports games. We propose a Rao-Blackwellized Resampling Particle Filter (RBRPF) as an imple-
mentable real-time continuation of a state-of-the-art multi-target tracking method. Target configurations are
tracked by sampling associations and solving single-target tracking problems by Kalman filters. As an ad-
vantage of the new method the independence assumption between data associations is relaxed to increase
the robustness in the sports domain. Smart resampling and memoization is introduced to equip the tracking
method with real-time capabilities in the first place. The probabilistic framework allows for consideration of
appearance models and the fusion of different sensors. We demonstrate its applicability to real world applica-
tions by tracking soccer players captured by multiple cameras through occlusions in real-time.
1 INTRODUCTION
Tracking multiple targets is needed in a lot of com-
puter vision applications like surveillance or sports
analysis. The sports domain provides a challeng-
ing testbed for concurrent tracking of multiple tar-
gets with similar appearance through frequent occlu-
sions measured from different views. In team sports
the complex coordination of movements of different
players are crucial to the success of a squad. For auto-
mated analysis thereof the correct association of play-
ers to movements is equally important as the recogni-
tion of the movement itself.
To achieve an automatic extraction of athlete posi-
tions during sports games from video streams, beside
camera estimation and player segmentation, a robust
and fast multi-target tracking method is needed. In
sports games the number of players is usually known
and constant. In contrast the number of observations
for each player obtained by measurements from sen-
sors or segmentation for videos varies; it ranges from
zero in case of occlusion and oversight to several mea-
surements in case of hallucination and inaccuracy of
the player extraction. Players usually differ by ap-
pearance from the field to help viewers and referees to
follow the game easily, so the association of players
of one team with their individual name is the bigger
problem.
In this paper we propose a Rao-Blackwellized Re-
sampling particle filter (RBRPF) for real-time track-
ing of multiple targets. Particles are represented as
configurations of all players to result in tracking a
mixture of Gaussians, where the multi-modality is
caused by possible mix-ups of associations and the
Gaussian refers to the uncertainty of dynamics. Sam-
pling of new target configurations is reduced to sam-
pling associations and Rao-Blackwellized by using
the Kalman filter. Taking advantage of the fact that the
number of probable associations for given player po-
sitions and measurements are usually low, the particle
filter focuses on the most likely associations and can
avoid unnecessary computations by smart resampling
and memoization. The Bayesian framework allows
the integration of kinematic and appearance models to
determine the most probable player locations through
occlusions.
Our contributions are the enhancements of a state-
of-the-art theoretical multi target tracking method to-
wards an implementable real-time algorithm that per-
forms well in the demanding sports domain. We relax
the independence assumption of single measurement
associations to suit the original method to the applica-
tion domain and achieve more robustness. Further we
invent a smart resampling procedure that allows real-
time in the first place and adapts to the complexity of
the tracking problem. The proposed memoization of
464
v. Hoyningen-Huene N. and Beetz M. (2009).
RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS.
In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications, pages 465-472
DOI: 10.5220/0001788804650472
Copyright
c
SciTePress
repeatedly computed results additionally improves ef-
ficiency.
The method is developed as part of the ASPOGA-
MO system (Beetz et al., 2006; Beetz et al., 2007),
that aims to extract knowledge from broadcasted soc-
cer games, and is evaluated by applying it to real
soccer games, showing robust real-time performance
over challenging sequences.
The remainder of this paper is organized as fol-
lows. We briefly talk about related work in the next
section. In section 3 we derive the Rao-Blackwellized
Resampling particle filter. Section 4 describes the ex-
periments we conducted. We finish in section 5 with
our conclusions.
2 RELATED WORK
Multiple-target tracking algorithms can be differenti-
ated by their data association methods. Multiple hy-
pothesis tracking (MHT) (Bar-Shalom et al., 2001)
builds a pruned tree of all possible association se-
quences of each measurement with close targets by
the Hungarian algorithm. The assumption of sin-
gle associations and the use of Kalman filtering al-
low computation in polynomial time, but inhibit to
handle multiple or merged associations. Khan et al.,
2006 (Khan et al., 2006) propose a real-time Rao-
Blackwellized MCMC-based particle filter where as-
sociations are sampled by a Markov chain. The
Markov chain allows also sampling of merged mea-
surement assignments but demands computation time
that reduces the number of particles. In their exper-
iments real-time could only be provided for a small
number of particles (less than 6) i.e. the tracker can
cope with three parallel mix-ups of targets max. Inter-
action of targets are modeled as correlations between
target positions which does not hold for many appli-
cations.
The Rao-Blackwellized particle filter approach by
S
¨
arkk
¨
a et al., 2004 (S
¨
arkk
¨
a et al., 2004; S
¨
arkk
¨
a et al.,
2007) samples the associations directly and handles
dependencies between them by data associations pri-
ors. The performance of the method was demon-
strated only on synthetic simulations without state-
ments about computation time. Our approach is an
extension of this method to real world applications
introducing smart resampling and memoization that
leads to real-time tracking in the first place and relax-
ation of the association independence assumption.
Tracking of soccer players is classified by Li et al.,
2005 (Li et al., 2005) in category and identity track-
ing. Category tracking extracts trajectories with team
affiliation where in the other case each single player
is traced with its identity. Barcel
´
o et al., 2005 (Bar-
cel
´
o et al., 2005) and Figueroa et al., 2006 (Figueroa
et al., 2006) label the measurements by nearest neigh-
bor assignment. In Gedikli et al., 2007 (Gedikli et al.,
2007) MHT was applied, but Particle filters constitute
the mostly used method in the literature for category
tracking (i.e. (Yang et al., 2005; A.Dearden and Grau,
2006)). Du et al., 2006 (Du et al., 2006; Du and Pi-
ater, 2007) aim on combining local particle filters to
fuse measurements captured from different views. A
MCMC method for team labelling is proposed by Liu
et al., 2007 (Liu et al., 2007) to link observations of
soccer players over time.
Identity tracking is often performed in a second
stage by consistent labelling of the trajectory graph
generated by category tracking. Huang and Hilton,
2006 (Huang and Hilton, 2006) propose an assign-
ment in batch mode by shortest path algorithm, Nil-
lius et al., 2006 (Nillius et al., 2006) solve the asso-
ciation of the trajectory graph by Bayesian network
inference, and Sullivan and Carlsson, 2006 (Sulli-
van and Carlsson, 2006) combine trajectories of un-
occluded players in a graph structure by clustering.
Barcel
´
o et al., 2005 (Barcel
´
o et al., 2005) resolve col-
lisions of nearest neighbor Kalman tracking by con-
straints in the trajectory graph. To the best of our
knowledge no real-time identity tracking method for
soccer player that allows multiple measurements and
fuses different camera views was proposed in the lit-
erature yet.
3 RAO-BLACKWELLIZED
RESAMPLING PARTICLE
FILTER
A particle filter for complete player configurations
constitutes the base of our algorithm. New particles
are predicted by sampling associations of players with
current measurements considering dependencies be-
tween them. Computation time is spend mostly on
the highly probable configurations and on ambiguous
associations by memoization of precomputed samples
and probabilities. Sampling and weighting is done by
using the Kalman filter for Rao-Blackwellization of
the particle filter.
3.1 Bayesian View of Tracking
The problem of tracking is to recursively estimate a
state x
k
knowing the evolution of the state sequence
x
k
= f
k
(x
k1
,v
k1
) (1)
RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS
465
from measurements
z
k
= h
k
(x
k
,n
k
) (2)
where f
k
is called system or motion model and h
k
is
called measurement model, v
k1
and n
k
denote the
process and measurement noise, respectively. The
tracked state x
k
is represented as the configuration of
all player states
x
k
=
x
j,k
= N
x
y
˙x
˙y
;m
i
j,k
,V
j,k
j = 1,...,T
(3)
where x
j,k
contains the position and velocity of player
j at time k. An individual target sample x
i
j,k
is as-
sumed to be Gaussian with mean m
i
j,k
and correspond-
ing covariance matrix V
i
j,k
.
In a Bayesian framework, the problem of tracking
can be formulated as one of estimating the posterior
probability density function p(x
k
|z
1:k
) for the state x
k
given a sequence of measurements z
1:k
up to time k.
3.2 Particle Filtering
In Sampling Importance Sampling (SIS) particle fil-
tering, the posterior probability density function is ap-
proximated by a weighted sum of random samples x
i
k
also called particles (Arulampalam et al., 2002). The
weights are normalized such that
i
w
i
k
= 1:
p(x
k
|z
1:k
)
i
w
i
k
δ
x
k
x
i
k
. (4)
We draw samples x
i
k
by importance sampling from
a proposal q(.) called an importance density. Doucet,
1998 (Doucet, 1998) showed that the optimal impor-
tance density function that minimizes the variance of
the true weights conditioned on x
i
k1
and z
k
is
q
opt
x
k
|x
i
k1
,z
k
=
p
z
k
|x
k
,x
i
k1
p
x
k
|x
i
k1
p
z
k
|x
i
k1
. (5)
In our case the importance density is the probabil-
ity distribution of data associations, while the actual
sample is deduced from an association by the use of
Kalman filtering.
3.3 Sampling New Configurations
For known associations between measurements z
k
and
players of the sample x
i
k1
the new sampled con-
figuration is Gaussian and can be evaluated analyti-
cally as an optimal fusion between measurements and
predicted player positions. The Kalman filter pro-
vides the method to find the Gaussian that equals both
probabilities in the numerator of equation 5 and thus
maximizes their product. The sampling problem re-
duces therefor to sample associations between mea-
surements and the predicted player configuration and
solving multiple single target tracking problems by
Kalman filtering. The analytical sampling part forms
the Rao-Blackwellization of the particle filter. To
supply an optimal solution the Kalman filter assumes
state and measurement noise to be zero-mean, white
Gaussian and the measurement as well as the motion
model to be linear. If the last assumption does not
hold an extended or unscented Kalman filter could be
applied for a suboptimal solution. Following this ap-
proach the posterior probability density function of
configurations form a mixture of Gaussians, where
the multi-modality originates from ambiguities in the
associations.
3.3.1 Predicting by System Model
We can sample from p
x
0
k
|x
i
k1
analytically by the
Kalman prediction step according to the system dy-
namics of eq. 1. Each player state is predicted in-
dependently using the discretized Wiener velocity
model A
t
(Bar-Shalom et al., 2001) for time differ-
ence t between k 1 and k as a linear motion model:
m
0
j,k
=
x
0
j,k
y
0
j,k
˙x
0
j,k
˙y
0
j,k
=
1 0 t 0
0 1 0 t
0 0 1 0
0 0 0 1
x
i
j,k1
y
i
j,k1
˙x
i
j,k1
˙y
i
j,k1
(6)
The covariance matrix evolves to
V
0
k
= A
t
V
k1
A
T
t
+
t
3
3
0
t
2
2
0
0
t
3
3
0
t
2
2
t
2
2
0 t 0
0
t
2
2
0 t
˜q (7)
with power spectral density ˜q as a constant factor.
3.3.2 Sampling Associations
We introduce associations
J
k
: {1, . . . , |z
k
|} ({1, . . . , T }) (8)
as mappings from all measurements at time k to a
(possibly empty) subset of all targets. We denote
ˆ
J
k
as the inverse mapping from targets to their as-
signed observations for convenience. The space of
data associations equals the finite and discrete set of
all possible associations of measurements to targets
containing 2
|z
k
T
elements. If we restrict the data
associations J
k
to assign a measurement to one tar-
get max, the number of possible associations reduce
VISAPP 2009 - International Conference on Computer Vision Theory and Applications
466
to (T + 1)
|z
k
|
. We can further reduce this number to
min(T,|z
k
|)
i=0
min(T,|z
k
|)
i
max(T,|z
k
|)
min(T,|z
k
|)i
if we
prohibit multiple measurements per target, also called
exclusion principle (MacCormick and Blake, 1999).
Enumerating this set and solving each single target
tracking problem is still intractable even for a small
number of targets and measurements. Fortunately
only a few associations have high probability, but to
sample them efficiently, we have to assume the associ-
ations for single measurements to be independently or
the dependency between them to be determined fast.
Individual Independent Associations. If we look
at sampling an individual association for measure-
ment z z
k
, we can enumerate all possible assign-
ments easily as z can be clutter viz. a false alarm or
assigned to one of the players. Thus the importance
distribution π(z) for an association of a specific mea-
surement z can be evaluated by normalizing the prob-
abilities
ˆ
π(z) for each possible association.
Clutter measurements are assumed to be indepen-
dent from player positions and uniformly distributed
in the measurement space with volume M
ˆ
π
(z) = p(J
k
(z) = |z
k
)
1
M
. (9)
The probability for a data association between target
t and an observation z is up to a constant factor:
ˆ
π
t
(z) = p
t J
k
(z)|z
k
,x
0
k
p
app
(t J
k
(z))N
z;H
z
m
0
t,k
,H
T
z
V
0
t,k
H
z
+ R
z
(10)
with measurement model H
z
=
1 0 0 0
0 1 0 0
and R
z
as measurement noise covariance.
p
app
(t J
k
(z)) denotes the propability of an as-
sociation based on the appearance model only, which
is independent from player and measurement posi-
tions. The Gaussian in the second part refers to the
probability of the association by the kinematic model.
We included the appearance model in difference to
(S
¨
arkk
¨
a et al., 2007) to allow a realistic influence
of additional information from segmentation beside
spatial data only.
Importance Density. Utilizing the independence of
single associations the importance density for a sam-
pled state x
j
k
can be computed as a product over prob-
abilities of assignments for each single measurement
that are given by the normalized importance distribu-
tion π of equations 9 and 10.
q
x
k
|x
i
k1
,z
k
=
z
k
π (11)
Figure 1: Association of a and m
1
increases the probability
of b and m
2
being associated.
Relaxation of Independence. In the underlying
method by S
¨
arkk
¨
a et al., 2004, the measurements are
processed one at a time in sequential fashion based on
the independence assumption of associations of indi-
vidual measurements. This assumption does not al-
ways hold, the order of associations often do matter.
This can be best exemplified by figure 1 assuming that
measurements can be assigned to one target at max: If
target a is assigned to measurement m
1
, the probabil-
ity of the association of m
2
and target b increases.
S
¨
arkk
¨
a et al., 2007 did not consider this problem
at all but proposed the use of an data association prior.
We follow this solution instead of establishing an ad-
ditional Markov Chain as proposed by Khan et al.,
2006 (Khan et al., 2006) in favor of computational ef-
ficiency but change the procedure slightly to improve
robustness against violation of the mentioned assump-
tion. To generate new particles x
j
k
including the whole
player configuration, we repeatedly sample an order-
ing on the measurements of one sweep uniformly at
random, reducing the relevance of the ordering and
the induced dependencies on the tracking result. With
the randomly sampled ordering we draw an associa-
tion for each measurement with the normalized im-
portance distribution π(z) one at a time. If a target
was associated, it is excluded from further associa-
tions with the single detection probability p
sd
and the
importance distribution is renormalized. If the men-
tioned exclusion principle holds i.e. targets can be as-
signed to one measurement at max, p
sd
should be set
to one.
Determination of State from Associations. For
sampled associations J
k
the predicted player positions
x
0
k
can be updated individually by Kalman update with
their observations
x
i
j,k
= x
0
j,k
+V
0
j,k
H
T
HV
0
k
H
T
+ R
1
ˆ
J
k
( j) Hx
0
j,k
,
(12)
with H denoting the linear measurement model (2) as
H
z
stacked |
ˆ
J
k
( j)| times and R as diagonal matrix of
measurement covariances of observations
ˆ
J
k
( j).
RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS
467
3.4 Weighting
For a good performance of the particle filter the com-
putation of the weights of each sampled state is cru-
cial. To approximate p (x
k
|z
1:k
) correctly the weights
w
i
k
have to be defined recursively as
w
i
k
w
i
k1
p
z
k
|x
i
k
p
x
i
k
|x
i
k1
q
x
i
k
|x
i
k1
,z
k
. (13)
The denominator was already computed in the sam-
pling phase and was depicted in equation 11. The
likelihood of the measurements given the sampled
state x
i
k
with known associations and the likelihood of
x
i
k
given the former state x
i
k1
and the dynamics can
be computed for each player and measurement sepa-
rately. The measurement likelihood can be computed
analogously to eq. 10 but substituting x
0
k
by x
i
k
and V
0
k
by V
i
k
, respectively:
p
z
k
|x
i
k
=
z/
ˆ
J
i
k
ˆ
π
(z)
j
p
ˆ
J
i
k
(x
i
j,k
)|x
i
j,k
. (14)
The likelihood of the new sample according to the
motion model can be computed by reusing the already
predicted state x
0
k
of eq. 6
p
x
i
k
|x
i
k1
=
j
N
m
i
j,k
;m
0
j,k
,V
i
j,k
+V
0
j,k
. (15)
3.5 Resampling
SIS particle filters suffer from the so called degener-
acy phenomenon, where only a small amount of all
particles have not negligible weights. This implies
that most of the computation time will be spent on
particles that contribute only marginally to the ap-
proximation of the posterior probability density func-
tion of equation 4. To reduce the degeneracy problem
resampling has been proposed to eliminate particles
with small weights and clone the others according to
their weights. We include the resampling step by sam-
pling w
i
k1
× N
max
associations for particle x
i
k1
. Par-
ticles with larger weights will therefor allocate more
particles in the next time step, while particles with
small weights are dropped.
Sampling several times from the same particle the
number of distinct sampled particles will approach the
number of ambiguities in the associations because a
specific assignment leads to the same sampled con-
figuration. Due to their discreteness there are usually
only a small number of distinct probable associations.
This allows a chance for noticeable improvement in
computation time by smart memoization. Caching
and testing sampled associations for equality can save
computation time considering not only the update to
generate a new state of equation 12 but also the pre-
diction in the next particle filtering step in equation
6.
After resampling the weights are usually reset to
w
k
= 1/N
max
to reflect the equal probability of all par-
ticles. In our case we count the times n
J
k
the same
association J
k
was sampled for a specific particle and
provide only one single particle for the next filtering
step having the weight set to w
k
= n
J
k
/N
max
. Then the
weights are recursively updated as in equation 13 and
normalized at the end of the filtering step. The actual
number of particles can therefor vary between 1 and
N
max
using more particles in situations with high as-
sociation ambiguities. This smart resampling reduces
the computation time and allows real time in the first
place.
3.6 Estimate of the State
An estimate of the player positions at time k i.e. of the
state x
k
can be found by either selecting the particle
with maximum weight or by clustering the particles
and taking the weighted mean of the most probable
cluster. Calculating the weighted mean of all parti-
cles should not be considered here because it can lead
to the so called ghost phenomenon for multi-modal
distributions i.e. it leads to a state estimated as the
mean of two modes that is known to be wrong.
3.7 Implementation
The complete algorithm is depicted in figure 2 fol-
lowing the derivation of the former section. The indi-
vidual importance distributions π as well as
ˆ
π and the
Kalman prediction and updates are cached for reuse in
the next sampling iteration to improve efficiency. The
importance distribution, all probabilities and weights
are calculated in log-space to avoid numerical prob-
lems.
4 EXPERIMENTAL RESULTS
The proposed tracking method is evaluated as part
of the ASPOGAMO system (Beetz et al., 2006; Beetz
et al., 2007), that aims to extract knowledge from
broadcasted soccer games. ASPOGAMO is able to track
multiple dynamic pan-tilt-zoom cameras and segment
the soccer players and referee by a combination of
variance filter and color templates. Segmentation in-
fluences the tracking process as the Kalman filters
smooth assigned measurements, quality evaluation of
the used method can be found in (Beetz et al., 2007).
However segmentation by background subtraction for
VISAPP 2009 - International Conference on Computer Vision Theory and Applications
468
h
x
i
k
,w
i
k
N
k
i=1
i
= RBRPF
h
x
i
k1
,w
i
k1
N
k1
i=1
,z
k
i
N
k
= 0
FOR i = 1 : N
k1
Predict x
0
k
as in 6
C =
FOR j = 1 :
w
i
k1
× N
max
Sample an association J
k
:
τ = {1,.. .,T }
Init J
k
: p τ.
ˆ
J
k
(p) =
Reorder measurements z
k
randomly
FOR l = 1 : |z
k
|
Compute
ˆ
π() as in 9 and 10
π = normalized
ˆ
π
Draw association for lth measurement
with player p τ by π
J
k
(p) = J
k
(p) {l}
IF random(0,1) < p
sd
: τ = τ \ p and
renormalize π
END FOR
IF J
k
not in C
N
k
= N
k
+ 1
n
J
k
= 1
Compute x
N
k
k
by Kalman update if not done
previously as in 12
ˆw
N
k
k
=
1
N
max
Update ˆw
N
k
k
as in 13
C = C {J
k
}
ELSE
n
J
k
= n
J
k
+ 1
ˆw
k
= ˆw
k
n
J
k
n
J
k
1
END IF
END FOR
END FOR
Calculate total weight: t =
N
k
j=1
ˆw
j
k
FOR j = 1 : N
k
Normalize: w
j
k
= t
1
ˆw
j
k
END FOR
Figure 2: Algorithm for one iteration of the proposed Rao-
Blackwellized Resampling particle filter.
static cameras is usually of higher quality. Digital
videos captured by two dynamic cameras with a frame
rate of 25Hz provide the basic raw material. Track-
ing results in both camera perspectives are depicted
in figures 3 and 4 and are presented quantitatively in
table 2. The extracted players spatial measurements
of each camera are fused by the proposed tracking al-
gorithm as different measurement sweeps with same
time stamps.
Player positions have been measured in meters
and were initialized manually in the image with co-
variance V
0
= 2I
4
, initial velocity was set to zero. The
factor for the kinematic process noise ˜q = 0.0008 is
derived from maximal human speed. The probability
of multiple observations for the same target was ob-
tained experimentally to p
sd
= 0.92. A confusion ma-
trix between different categories was used as a sim-
ple appearance model p
app
and is depicted in table
1. The measurement space is determined by the num-
ber of pixels in each camera frame and evaluates to
M = 720 × 576. We used N
max
= 50 particles to track
all of the 22 players and the referee.
There is no ground truth for broadcasted soccer
games because players can be tracked only visually
and camera parameters are unknown. We abandon to
present a spatial error as this is influenced mainly by
camera estimation and segmentation. Instead we tried
to find a error measure that is related with the number
of false associations. A failure was counted when the
projected player position differed from the real player
in the image by more than 10 pixels for longer than 3
frames. In this case the tracker was reset in the failed
player positions and run again on the rest of the se-
quence. We tracked both camera views separately and
also ran the same sequence fusing the measurements
of both perspectives. Because the broadcasted high-
angle camera shows only a part of the field and is pan-
ning and zooming fast, in average only 9.9 players are
visible (with standard deviation of 3.2). We splitted
the number of failures into association errors and as-
signing emerging players (second number) to be com-
parable with the other results. The second row of ta-
ble 2 shows the number of frames that were tracked
in the according experiment. The computation time
was taken for one update step, where all experiments
have been conducted on a 2.2 GHz Dual-core PC. We
think the actual needed time is more significant than
the theoretical complexity of the algorithm since the
input data do not scale but stay in fixed boundaries
(number of players is 22, number of measurements
usually lower than 200). The last row depicts the av-
erage number of particles and the corresponding stan-
dard deviation. Table 2 clearly evidences the real-time
tracking ability of the proposed method with low fail-
ure rate for single cameras. Fusion of different cam-
eras reduces the occurrence of occlusions and there-
with failure rate and number of particles even further.
The fourth experiment states a challenging se-
quence including several fouls and header duels
where kinematic and appearance model have often
been to weak to differentiate between players causing
a higher number of failures. The amount of measure-
ments (lower for the highangle view) correlates obvi-
ously with the number of particles and the computa-
tion time demonstrating the adaptiveness of the pro-
posed method to the complexity of the tracking prob-
lem. Also we observed assignment errors if segmen-
RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS
469
Table 1: Confusion matrix between different categories.
Italy France Referee
Italy 0.6 0.1 0.3
France 0.1 0.8 0.1
Referee 0.3 0.1 0.6
Table 2: Tracking performance on the final of the world cup
2006.
Game Frames Fail Time(ms) Particles
Tactical 1262 13 23.3± 4 43.5±10
Highangle 1262 7+54 8.5± 5 12.1±12
Fused 1262 11 30.2±20 33.3±16
Fused II 3202 98 33.4±18 34.1±14
tation could not extract a specific player for longer
than 20 frames (e.g. fouled player on the ground).
We also implemented the method as proposed by
Khan et al., 2006 (Khan et al., 2006) and tested it on
the World Cup final. We encountered problems of two
kind: low variance in sparse particles and misleading
interaction handling. The real-time requirement al-
lowed only a small number of particles (6 in our case)
which had a low variance because the Markov chain
converged to very similar associations. This misled
the tracker to remember the most probable configu-
ration only which often did not equal the true posi-
tions. Interactions are handled by dependencies in the
positions via symmetric entries in the configuration
covariance matrix. This modeling is inappropriate for
interacting soccer players, where e.g. the player on the
ball shows contrary motion to his competitor. Both
drawbacks resulted in poor tracking performance for
the inspected soccer game sequences.
Figure 3: Tactical camera view of the World Cup final 2006.
Figure 4: Identity tracking of soccer players in the broad-
casted highangle camera view of the World Cup final 2006.
5 CONCLUSIONS
In this article we have proposed a real-time multiple
target tracking method based on Rao-Blackwellized
Resampling particle filtering for tracking soccer
player identities. We presented the necessary exten-
sions of an so far only theoretically evaluated state-
of-the-art multi-target tracking method to handle real
tracking problems being as challenging as in the
sports domain. The first extension comprises the pro-
cessing of measurements of one sweep instead of one
at a time to relax the independence assumption of as-
sociations. Secondly, smart resampling and memoiza-
tion was introduced to equip the tracking method with
real-time capabilities. Experimental results demon-
strate robustness and real-time performance of the
developed method in challenging soccer game se-
quences including increased achievements by fusion
of measurements from different cameras. A compari-
son with another recent multi-target tracking method
explains the supremacy of our approach for the soc-
cer domain. For future research we plan to examine
more complex appearance models for automatic reini-
tialization of the identities especially regarding broad-
casted single view sports videos.
ACKNOWLEDGEMENTS
This work was partially funded by the German Re-
search Foundation DFG.
VISAPP 2009 - International Conference on Computer Vision Theory and Applications
470
REFERENCES
A.Dearden, Y. and Grau, O. (2006). Tracking football
player movment from a single moving camera using
particle filters. In European Conf. on Visual Media
Production (CVMP 2006).
Arulampalam, M. S., Maskell, S., Gordon, N., and Clapp,
T. (2002). A tutorial on particle filters for on-
line nonlinear/non-gaussian bayesian tracking. IEEE
Trans. on Signal Processing, 50(2).
Bar-Shalom, Y., Li, X.-R., and Kirubarajan, T. (2001). Esti-
mation with Applications to Tracking and Navigation.
Wiley Interscience.
Barcel
´
o, L., Binefa, X., and Kender, J. R. (2005). Robust
methods and representations for soccer player track-
ing and collision resolution. In Proc. of Intl. Conf. on
Image and Video Retrieval (CIVR 2005), pages 237–
246.
Beetz, M., Bandouch, J., Gedikli, S., von Hoyningen-
Huene, N., Kirchlechner, B., and Maldonado, A.
(2006). Camera-based observation of football games
for analyzing multi-agent activities. In Proc. of Intl.
Joint Conf. on Autonomous Agents and Multiagent
Systems (AAMAS).
Beetz, M., Gedikli, S., Bandouch, J., Kirchlechner, B., von
Hoyningen-Huene, N., and Perzylo, A. (2007). Visu-
ally tracking football games based on tv broadcasts.
In Proc. of Intl. Joint Conf. on Artificial Intelligence
(IJCAI).
Doucet, A. (1998). On sequential Monte Carlo methods for
Bayesian filtering. Technical report, Dept. End., Univ.
Cambridge, UK.
Du, W., Hayet, J.-B., Piater, J., and Verly, J. (2006). Col-
laborative multi-camera tracking of athletes in team
sports. In Workshop on Computer Vision Based Anal-
ysis in Sport Environments (CVBASE), pages 2–13.
Du, W. and Piater, J. H. (2007). Multi-camera People
Tracking by Collaborative Particle Filters and Princi-
pal Axis-Based Integration. In Asian Conference on
Computer Vision, number 4843 in LNCS, pages 365–
374. Springer.
Figueroa, P. J., Leite, N. J., and Barros, R. M. L. (2006).
Tracking soccer players aiming their kinematical mo-
tion analysis. Computer Vision and Image Under-
standing, 101(2):122–135.
Gedikli, S., Bandouch, J., von Hoyningen-Huene, N.,
Kirchlechner, B., and Beetz, M. (2007). An Adap-
tive Vision System for Tracking Soccer Players from
Variable Camera Settings. In Proc. of Intl. Conf. on
Computer Vision Systems (ICVS).
Huang, P. and Hilton, A. (2006). Football player tracking
for video annotation. In European Conf. on Visual
Media Production.
Khan, Z., Balch, T., and Dellaert, F. (2006). Mcmc data
association and sparse factorization updating for real
time multitarget tracking with merged and multiple
measurements. IEEE Trans. on Pattern Analysis and
Machine Intelligence, 28(12):1960–1972.
Li, Y., Dore, A., and Orwell, J. (2005). Evaluating the per-
formance of systems for tracking football players and
ball. In IEEE Intl. Conf. on Advanced Video and Sig-
nal Based Surveillance.
Liu, J., Tong, X., Li, W., Wang, T., Zhang, Y., Wang, H.,
Yang, B., Sun, L., and Yang, S. (2007). Automatic
player detection, labeling and tracking in broadcast
soccer video. In British Machine Vision Conference.
MacCormick, J. and Blake, A. (1999). A probabilistic
exclusion principle for tracking multiple objects. In
Proc. of Intl. Conf. on Computer Vision (ICCV), pages
572–578.
Nillius, P., Sullivan, J., and Carlsson, S. (2006). Multi-target
tracking - linking identities using bayesian network
inference. In Proc. of Computer Vision and Pattern
Recognition, pages 2187–2194.
S
¨
arkk
¨
a, S., Vehtari, A., and Lampinen, J. (2004). Rao-
blackwellized monte carlo data association for mul-
tiple target tracking. In Proc. of Intl Conf. on Infor-
mation Fusion, volume 7, Stockholm.
S
¨
arkk
¨
a, S., Vehtari, A., and Lampinen, J. (2007). Rao-
blackwellized particle filter for multiple target track-
ing. Information Fusion Journal, 8(1):2–15.
Sullivan, J. and Carlsson, S. (2006). Tracking and labelling
of interacting multiple targets. In Proc. of European
Conf. on Computer Vision, pages 619–632.
Yang, C., Duraiswami, R., and Davis, L. (2005). Fast mul-
tiple object tracking via a hierarchical particle filter.
In Proc. of Intl. Conf. on Computer Vision, volume 1,
pages 212–219.
RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS
471