RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR

REAL-TIME PLAYER TRACKING IN SPORTS

Nicolai v. Hoyningen-Huene and Michael Beetz

Intelligent Autonomous Systems Group, Technische Universit

at M

unchen, Boltzmannstr. 3, D-85748 Garching, Germany

Keywords:

Multi-target tracking, Particle ﬁlter, Rao-Blackwellization, Kalman ﬁlter, Resampling.

Abstract:

Tracking multiple targets with similiar appearance is a common task in computer vision applications, espe-

cially in sports games. We propose a Rao-Blackwellized Resampling Particle Filter (RBRPF) as an imple-

mentable real-time continuation of a state-of-the-art multi-target tracking method. Target conﬁgurations are

tracked by sampling associations and solving single-target tracking problems by Kalman ﬁlters. As an ad-

vantage of the new method the independence assumption between data associations is relaxed to increase

the robustness in the sports domain. Smart resampling and memoization is introduced to equip the tracking

method with real-time capabilities in the ﬁrst place. The probabilistic framework allows for consideration of

appearance models and the fusion of different sensors. We demonstrate its applicability to real world applica-

tions by tracking soccer players captured by multiple cameras through occlusions in real-time.

1 INTRODUCTION

Tracking multiple targets is needed in a lot of com-

puter vision applications like surveillance or sports

analysis. The sports domain provides a challeng-

ing testbed for concurrent tracking of multiple tar-

gets with similar appearance through frequent occlu-

sions measured from different views. In team sports

the complex coordination of movements of different

players are crucial to the success of a squad. For auto-

mated analysis thereof the correct association of play-

ers to movements is equally important as the recogni-

tion of the movement itself.

To achieve an automatic extraction of athlete posi-

tions during sports games from video streams, beside

camera estimation and player segmentation, a robust

and fast multi-target tracking method is needed. In

sports games the number of players is usually known

and constant. In contrast the number of observations

for each player obtained by measurements from sen-

sors or segmentation for videos varies; it ranges from

zero in case of occlusion and oversight to several mea-

surements in case of hallucination and inaccuracy of

the player extraction. Players usually differ by ap-

pearance from the ﬁeld to help viewers and referees to

follow the game easily, so the association of players

of one team with their individual name is the bigger

problem.

In this paper we propose a Rao-Blackwellized Re-

sampling particle ﬁlter (RBRPF) for real-time track-

ing of multiple targets. Particles are represented as

conﬁgurations of all players to result in tracking a

mixture of Gaussians, where the multi-modality is

caused by possible mix-ups of associations and the

Gaussian refers to the uncertainty of dynamics. Sam-

pling of new target conﬁgurations is reduced to sam-

pling associations and Rao-Blackwellized by using

the Kalman ﬁlter. Taking advantage of the fact that the

number of probable associations for given player po-

sitions and measurements are usually low, the particle

ﬁlter focuses on the most likely associations and can

avoid unnecessary computations by smart resampling

and memoization. The Bayesian framework allows

the integration of kinematic and appearance models to

determine the most probable player locations through

occlusions.

Our contributions are the enhancements of a state-

of-the-art theoretical multi target tracking method to-

wards an implementable real-time algorithm that per-

forms well in the demanding sports domain. We relax

the independence assumption of single measurement

associations to suit the original method to the applica-

tion domain and achieve more robustness. Further we

invent a smart resampling procedure that allows real-

time in the ﬁrst place and adapts to the complexity of

the tracking problem. The proposed memoization of

464

v. Hoyningen-Huene N. and Beetz M. (2009).

RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS.

In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications, pages 465-472

DOI: 10.5220/0001788804650472

 SciTePress

repeatedly computed results additionally improves ef-

ﬁciency.

The method is developed as part of the ASPOGA-

MO system (Beetz et al., 2006; Beetz et al., 2007),

that aims to extract knowledge from broadcasted soc-

cer games, and is evaluated by applying it to real

soccer games, showing robust real-time performance

over challenging sequences.

The remainder of this paper is organized as fol-

lows. We brieﬂy talk about related work in the next

section. In section 3 we derive the Rao-Blackwellized

Resampling particle ﬁlter. Section 4 describes the ex-

periments we conducted. We ﬁnish in section 5 with

our conclusions.

2 RELATED WORK

Multiple-target tracking algorithms can be differenti-

ated by their data association methods. Multiple hy-

pothesis tracking (MHT) (Bar-Shalom et al., 2001)

builds a pruned tree of all possible association se-

quences of each measurement with close targets by

the Hungarian algorithm. The assumption of sin-

gle associations and the use of Kalman ﬁltering al-

low computation in polynomial time, but inhibit to

handle multiple or merged associations. Khan et al.,

2006 (Khan et al., 2006) propose a real-time Rao-

Blackwellized MCMC-based particle ﬁlter where as-

sociations are sampled by a Markov chain. The

Markov chain allows also sampling of merged mea-

surement assignments but demands computation time

that reduces the number of particles. In their exper-

iments real-time could only be provided for a small

number of particles (less than 6) i.e. the tracker can

cope with three parallel mix-ups of targets max. Inter-

action of targets are modeled as correlations between

target positions which does not hold for many appli-

cations.

The Rao-Blackwellized particle ﬁlter approach by

arkk

a et al., 2004 (S

arkk

a et al., 2004; S

arkk

a et al.,

2007) samples the associations directly and handles

dependencies between them by data associations pri-

ors. The performance of the method was demon-

strated only on synthetic simulations without state-

ments about computation time. Our approach is an

extension of this method to real world applications

introducing smart resampling and memoization that

leads to real-time tracking in the ﬁrst place and relax-

ation of the association independence assumption.

Tracking of soccer players is classiﬁed by Li et al.,

2005 (Li et al., 2005) in category and identity track-

ing. Category tracking extracts trajectories with team

afﬁliation where in the other case each single player

is traced with its identity. Barcel

o et al., 2005 (Bar-

cel

o et al., 2005) and Figueroa et al., 2006 (Figueroa

et al., 2006) label the measurements by nearest neigh-

bor assignment. In Gedikli et al., 2007 (Gedikli et al.,

2007) MHT was applied, but Particle ﬁlters constitute

the mostly used method in the literature for category

tracking (i.e. (Yang et al., 2005; A.Dearden and Grau,

2006)). Du et al., 2006 (Du et al., 2006; Du and Pi-

ater, 2007) aim on combining local particle ﬁlters to

fuse measurements captured from different views. A

MCMC method for team labelling is proposed by Liu

et al., 2007 (Liu et al., 2007) to link observations of

soccer players over time.

Identity tracking is often performed in a second

stage by consistent labelling of the trajectory graph

generated by category tracking. Huang and Hilton,

2006 (Huang and Hilton, 2006) propose an assign-

ment in batch mode by shortest path algorithm, Nil-

lius et al., 2006 (Nillius et al., 2006) solve the asso-

ciation of the trajectory graph by Bayesian network

inference, and Sullivan and Carlsson, 2006 (Sulli-

van and Carlsson, 2006) combine trajectories of un-

occluded players in a graph structure by clustering.

Barcel

o et al., 2005 (Barcel

o et al., 2005) resolve col-

lisions of nearest neighbor Kalman tracking by con-

straints in the trajectory graph. To the best of our

knowledge no real-time identity tracking method for

soccer player that allows multiple measurements and

fuses different camera views was proposed in the lit-

erature yet.

3 RAO-BLACKWELLIZED

RESAMPLING PARTICLE

FILTER

A particle ﬁlter for complete player conﬁgurations

constitutes the base of our algorithm. New particles

are predicted by sampling associations of players with

current measurements considering dependencies be-

tween them. Computation time is spend mostly on

the highly probable conﬁgurations and on ambiguous

associations by memoization of precomputed samples

and probabilities. Sampling and weighting is done by

using the Kalman ﬁlter for Rao-Blackwellization of

the particle ﬁlter.

3.1 Bayesian View of Tracking

The problem of tracking is to recursively estimate a

state x

knowing the evolution of the state sequence

= f

k−1

) (1)

RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS

465

from measurements

= h

) (2)

where f

is called system or motion model and h

called measurement model, v

k−1

and n

denote the

process and measurement noise, respectively. The

tracked state x

is represented as the conﬁguration of

all player states











j,k

= N













˙x

˙y







j,k

















j = 1,...,T

(3)

where x

j,k

contains the position and velocity of player

j at time k. An individual target sample x

j,k

is as-

sumed to be Gaussian with mean m

j,k

and correspond-

ing covariance matrix V

j,k

In a Bayesian framework, the problem of tracking

can be formulated as one of estimating the posterior

probability density function p(x

1:k

) for the state x

given a sequence of measurements z

1:k

up to time k.

3.2 Particle Filtering

In Sampling Importance Sampling (SIS) particle ﬁl-

tering, the posterior probability density function is ap-

proximated by a weighted sum of random samples x

also called particles (Arulampalam et al., 2002). The

weights are normalized such that

∑

= 1:

p(x

1:k

) ≈

∑



− x



. (4)

We draw samples x

by importance sampling from

a proposal q(.) called an importance density. Doucet,

1998 (Doucet, 1998) showed that the optimal impor-

tance density function that minimizes the variance of

the true weights conditioned on x

k−1

and z

opt



k−1





k−1





k−1





k−1



. (5)

In our case the importance density is the probabil-

ity distribution of data associations, while the actual

sample is deduced from an association by the use of

Kalman ﬁltering.

3.3 Sampling New Conﬁgurations

For known associations between measurements z

and

players of the sample x

k−1

the new sampled con-

ﬁguration is Gaussian and can be evaluated analyti-

cally as an optimal fusion between measurements and

predicted player positions. The Kalman ﬁlter pro-

vides the method to ﬁnd the Gaussian that equals both

probabilities in the numerator of equation 5 and thus

maximizes their product. The sampling problem re-

duces therefor to sample associations between mea-

surements and the predicted player conﬁguration and

solving multiple single target tracking problems by

Kalman ﬁltering. The analytical sampling part forms

the Rao-Blackwellization of the particle ﬁlter. To

supply an optimal solution the Kalman ﬁlter assumes

state and measurement noise to be zero-mean, white

Gaussian and the measurement as well as the motion

model to be linear. If the last assumption does not

hold an extended or unscented Kalman ﬁlter could be

applied for a suboptimal solution. Following this ap-

proach the posterior probability density function of

conﬁgurations form a mixture of Gaussians, where

the multi-modality originates from ambiguities in the

associations.

3.3.1 Predicting by System Model

We can sample from p



k−1



analytically by the

Kalman prediction step according to the system dy-

namics of eq. 1. Each player state is predicted in-

dependently using the discretized Wiener velocity

model A

∆t

(Bar-Shalom et al., 2001) for time differ-

ence ∆t between k −1 and k as a linear motion model:

j,k







j,k

˙x

j,k

˙y

j,k













1 0 ∆t 0

0 1 0 ∆t

0 0 1 0

0 0 0 1













j,k−1

˙x

j,k−1

˙y

j,k−1







(6)

The covariance matrix evolves to

= A

∆t

k−1

∆t







∆t

0 ∆t 0

∆t

0 ∆t







˜q (7)

with power spectral density ˜q as a constant factor.

3.3.2 Sampling Associations

We introduce associations

: {1, . . . , |z

|} → ℘({1, . . . , T }) (8)

as mappings from all measurements at time k to a

(possibly empty) subset of all targets. We denote

as the inverse mapping from targets to their as-

signed observations for convenience. The space of

data associations equals the ﬁnite and discrete set of

all possible associations of measurements to targets

containing 2

|×T

elements. If we restrict the data

associations J

to assign a measurement to one tar-

get max, the number of possible associations reduce

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

466

to (T + 1)

. We can further reduce this number to

min(T,|z

∑

i=0



min(T,|z



max(T,|z

min(T,|z

|)−i

if we

prohibit multiple measurements per target, also called

exclusion principle (MacCormick and Blake, 1999).

Enumerating this set and solving each single target

tracking problem is still intractable even for a small

number of targets and measurements. Fortunately

only a few associations have high probability, but to

sample them efﬁciently, we have to assume the associ-

ations for single measurements to be independently or

the dependency between them to be determined fast.

Individual Independent Associations. If we look

at sampling an individual association for measure-

ment z ∈ z

, we can enumerate all possible assign-

ments easily as z can be clutter viz. a false alarm or

assigned to one of the players. Thus the importance

distribution π(z) for an association of a speciﬁc mea-

surement z can be evaluated by normalizing the prob-

abilities

π(z) for each possible association.

Clutter measurements are assumed to be indepen-

dent from player positions and uniformly distributed

in the measurement space with volume M

∅

(z) = p(J

(z) = ∅|z

) ∼

. (9)

The probability for a data association between target

t and an observation z is up to a constant factor:

(z) = p



t ∈ J

(z)|z



∼ p

app

(t ∈ J

(z))N



z;H

t,k

+ R



(10)

with measurement model H



1 0 0 0

0 1 0 0



and R

as measurement noise covariance.

app

(t ∈ J

(z)) denotes the propability of an as-

sociation based on the appearance model only, which

is independent from player and measurement posi-

tions. The Gaussian in the second part refers to the

probability of the association by the kinematic model.

We included the appearance model in difference to

arkk

a et al., 2007) to allow a realistic inﬂuence

of additional information from segmentation beside

spatial data only.

Importance Density. Utilizing the independence of

single associations the importance density for a sam-

pled state x

can be computed as a product over prob-

abilities of assignments for each single measurement

that are given by the normalized importance distribu-

tion π of equations 9 and 10.



k−1



∏

π (11)

Figure 1: Association of a and m

increases the probability

of b and m

being associated.

Relaxation of Independence. In the underlying

method by S

arkk

a et al., 2004, the measurements are

processed one at a time in sequential fashion based on

the independence assumption of associations of indi-

vidual measurements. This assumption does not al-

ways hold, the order of associations often do matter.

This can be best exempliﬁed by ﬁgure 1 assuming that

measurements can be assigned to one target at max: If

target a is assigned to measurement m

, the probabil-

ity of the association of m

and target b increases.

arkk

a et al., 2007 did not consider this problem

at all but proposed the use of an data association prior.

We follow this solution instead of establishing an ad-

ditional Markov Chain as proposed by Khan et al.,

2006 (Khan et al., 2006) in favor of computational ef-

ﬁciency but change the procedure slightly to improve

robustness against violation of the mentioned assump-

tion. To generate new particles x

including the whole

player conﬁguration, we repeatedly sample an order-

ing on the measurements of one sweep uniformly at

random, reducing the relevance of the ordering and

the induced dependencies on the tracking result. With

the randomly sampled ordering we draw an associa-

tion for each measurement with the normalized im-

portance distribution π(z) one at a time. If a target

was associated, it is excluded from further associa-

tions with the single detection probability p

and the

importance distribution is renormalized. If the men-

tioned exclusion principle holds i.e. targets can be as-

signed to one measurement at max, p

should be set

to one.

Determination of State from Associations. For

sampled associations J

the predicted player positions

can be updated individually by Kalman update with

their observations

j,k

= x

j,k



+ R



−1



( j)− Hx

j,k



(12)

with H denoting the linear measurement model (2) as

stacked |

( j)| times and R as diagonal matrix of

measurement covariances of observations

( j).

RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS

467

3.4 Weighting

For a good performance of the particle ﬁlter the com-

putation of the weights of each sampled state is cru-

cial. To approximate p (x

1:k

) correctly the weights

have to be deﬁned recursively as

∝ w

k−1







k−1





k−1



. (13)

The denominator was already computed in the sam-

pling phase and was depicted in equation 11. The

likelihood of the measurements given the sampled

state x

with known associations and the likelihood of

given the former state x

k−1

and the dynamics can

be computed for each player and measurement sepa-

rately. The measurement likelihood can be computed

analogously to eq. 10 but substituting x

by x

and V

by V

, respectively:





∏

z/∈

∅

(z)

∏



j,k

)|x

j,k



. (14)

The likelihood of the new sample according to the

motion model can be computed by reusing the already

predicted state x

of eq. 6



k−1



∏



j,k



. (15)

3.5 Resampling

SIS particle ﬁlters suffer from the so called degener-

acy phenomenon, where only a small amount of all

particles have not negligible weights. This implies

that most of the computation time will be spent on

particles that contribute only marginally to the ap-

proximation of the posterior probability density func-

tion of equation 4. To reduce the degeneracy problem

resampling has been proposed to eliminate particles

with small weights and clone the others according to

their weights. We include the resampling step by sam-

pling w

k−1

× N

max

associations for particle x

k−1

. Par-

ticles with larger weights will therefor allocate more

particles in the next time step, while particles with

small weights are dropped.

Sampling several times from the same particle the

number of distinct sampled particles will approach the

number of ambiguities in the associations because a

speciﬁc assignment leads to the same sampled con-

ﬁguration. Due to their discreteness there are usually

only a small number of distinct probable associations.

This allows a chance for noticeable improvement in

computation time by smart memoization. Caching

and testing sampled associations for equality can save

computation time considering not only the update to

generate a new state of equation 12 but also the pre-

diction in the next particle ﬁltering step in equation

After resampling the weights are usually reset to

= 1/N

max

to reﬂect the equal probability of all par-

ticles. In our case we count the times n

the same

association J

was sampled for a speciﬁc particle and

provide only one single particle for the next ﬁltering

step having the weight set to w

= n

max

. Then the

weights are recursively updated as in equation 13 and

normalized at the end of the ﬁltering step. The actual

number of particles can therefor vary between 1 and

max

using more particles in situations with high as-

sociation ambiguities. This smart resampling reduces

the computation time and allows real time in the ﬁrst

place.

3.6 Estimate of the State

An estimate of the player positions at time k i.e. of the

state x

can be found by either selecting the particle

with maximum weight or by clustering the particles

and taking the weighted mean of the most probable

cluster. Calculating the weighted mean of all parti-

cles should not be considered here because it can lead

to the so called ghost phenomenon for multi-modal

distributions i.e. it leads to a state estimated as the

mean of two modes that is known to be wrong.

3.7 Implementation

The complete algorithm is depicted in ﬁgure 2 fol-

lowing the derivation of the former section. The indi-

vidual importance distributions π as well as

π and the

Kalman prediction and updates are cached for reuse in

the next sampling iteration to improve efﬁciency. The

importance distribution, all probabilities and weights

are calculated in log-space to avoid numerical prob-

lems.

4 EXPERIMENTAL RESULTS

The proposed tracking method is evaluated as part

of the ASPOGAMO system (Beetz et al., 2006; Beetz

et al., 2007), that aims to extract knowledge from

broadcasted soccer games. ASPOGAMO is able to track

multiple dynamic pan-tilt-zoom cameras and segment

the soccer players and referee by a combination of

variance ﬁlter and color templates. Segmentation in-

ﬂuences the tracking process as the Kalman ﬁlters

smooth assigned measurements, quality evaluation of

the used method can be found in (Beetz et al., 2007).

However segmentation by background subtraction for

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

468





i=1

= RBRPF



k−1



k−1

i=1

= 0

FOR i = 1 : N

k−1

Predict x

as in 6

C = ∅

FOR j = 1 :



k−1

× N

max



Sample an association J

τ = {1,.. .,T }

Init J

: ∀p ∈ τ.

(p) = ∅

Reorder measurements z

randomly

FOR l = 1 : |z

Compute

π() as in 9 and 10

π = normalized

Draw association for lth measurement

with player p ∈ τ by π

(p) = J

(p) ∪ {l}

IF random(0,1) < p

: τ = τ \ p and

renormalize π

END FOR

IF J

not in C

= N

+ 1

= 1

Compute x

by Kalman update if not done

previously as in 12

ˆw

max

Update ˆw

as in 13

C = C ∪ {J

}

ELSE

= n

+ 1

ˆw

= ˆw

−1

END IF

END FOR

Calculate total weight: t =

∑

j=1

ˆw

FOR j = 1 : N

Normalize: w

= t

−1

ˆw

END FOR

Figure 2: Algorithm for one iteration of the proposed Rao-

Blackwellized Resampling particle ﬁlter.

static cameras is usually of higher quality. Digital

videos captured by two dynamic cameras with a frame

rate of 25Hz provide the basic raw material. Track-

ing results in both camera perspectives are depicted

in ﬁgures 3 and 4 and are presented quantitatively in

table 2. The extracted players spatial measurements

of each camera are fused by the proposed tracking al-

gorithm as different measurement sweeps with same

time stamps.

Player positions have been measured in meters

and were initialized manually in the image with co-

variance V

= 2I

, initial velocity was set to zero. The

factor for the kinematic process noise ˜q = 0.0008 is

derived from maximal human speed. The probability

of multiple observations for the same target was ob-

tained experimentally to p

= 0.92. A confusion ma-

trix between different categories was used as a sim-

ple appearance model p

app

and is depicted in table

1. The measurement space is determined by the num-

ber of pixels in each camera frame and evaluates to

M = 720 × 576. We used N

max

= 50 particles to track

all of the 22 players and the referee.

There is no ground truth for broadcasted soccer

games because players can be tracked only visually

and camera parameters are unknown. We abandon to

present a spatial error as this is inﬂuenced mainly by

camera estimation and segmentation. Instead we tried

to ﬁnd a error measure that is related with the number

of false associations. A failure was counted when the

projected player position differed from the real player

in the image by more than 10 pixels for longer than 3

frames. In this case the tracker was reset in the failed

player positions and run again on the rest of the se-

quence. We tracked both camera views separately and

also ran the same sequence fusing the measurements

of both perspectives. Because the broadcasted high-

angle camera shows only a part of the ﬁeld and is pan-

ning and zooming fast, in average only 9.9 players are

visible (with standard deviation of 3.2). We splitted

the number of failures into association errors and as-

signing emerging players (second number) to be com-

parable with the other results. The second row of ta-

ble 2 shows the number of frames that were tracked

in the according experiment. The computation time

was taken for one update step, where all experiments

have been conducted on a 2.2 GHz Dual-core PC. We

think the actual needed time is more signiﬁcant than

the theoretical complexity of the algorithm since the

input data do not scale but stay in ﬁxed boundaries

(number of players is 22, number of measurements

usually lower than 200). The last row depicts the av-

erage number of particles and the corresponding stan-

dard deviation. Table 2 clearly evidences the real-time

tracking ability of the proposed method with low fail-

ure rate for single cameras. Fusion of different cam-

eras reduces the occurrence of occlusions and there-

with failure rate and number of particles even further.

The fourth experiment states a challenging se-

quence including several fouls and header duels

where kinematic and appearance model have often

been to weak to differentiate between players causing

a higher number of failures. The amount of measure-

ments (lower for the highangle view) correlates obvi-

ously with the number of particles and the computa-

tion time demonstrating the adaptiveness of the pro-

posed method to the complexity of the tracking prob-

lem. Also we observed assignment errors if segmen-

RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS

469

Table 1: Confusion matrix between different categories.

Italy France Referee

Italy 0.6 0.1 0.3

France 0.1 0.8 0.1

Referee 0.3 0.1 0.6

Table 2: Tracking performance on the ﬁnal of the world cup

2006.

Game Frames Fail Time(ms) Particles

Tactical 1262 13 23.3± 4 43.5±10

Highangle 1262 7+54 8.5± 5 12.1±12

Fused 1262 11 30.2±20 33.3±16

Fused II 3202 98 33.4±18 34.1±14

tation could not extract a speciﬁc player for longer

than 20 frames (e.g. fouled player on the ground).

We also implemented the method as proposed by

Khan et al., 2006 (Khan et al., 2006) and tested it on

the World Cup ﬁnal. We encountered problems of two

kind: low variance in sparse particles and misleading

interaction handling. The real-time requirement al-

lowed only a small number of particles (6 in our case)

which had a low variance because the Markov chain

converged to very similar associations. This misled

the tracker to remember the most probable conﬁgu-

ration only which often did not equal the true posi-

tions. Interactions are handled by dependencies in the

positions via symmetric entries in the conﬁguration

covariance matrix. This modeling is inappropriate for

interacting soccer players, where e.g. the player on the

ball shows contrary motion to his competitor. Both

drawbacks resulted in poor tracking performance for

the inspected soccer game sequences.

Figure 3: Tactical camera view of the World Cup ﬁnal 2006.

Figure 4: Identity tracking of soccer players in the broad-

casted highangle camera view of the World Cup ﬁnal 2006.

5 CONCLUSIONS

In this article we have proposed a real-time multiple

target tracking method based on Rao-Blackwellized

Resampling particle ﬁltering for tracking soccer

player identities. We presented the necessary exten-

sions of an so far only theoretically evaluated state-

of-the-art multi-target tracking method to handle real

tracking problems being as challenging as in the

sports domain. The ﬁrst extension comprises the pro-

cessing of measurements of one sweep instead of one

at a time to relax the independence assumption of as-

sociations. Secondly, smart resampling and memoiza-

tion was introduced to equip the tracking method with

real-time capabilities. Experimental results demon-

strate robustness and real-time performance of the

developed method in challenging soccer game se-

quences including increased achievements by fusion

of measurements from different cameras. A compari-

son with another recent multi-target tracking method

explains the supremacy of our approach for the soc-

cer domain. For future research we plan to examine

more complex appearance models for automatic reini-

tialization of the identities especially regarding broad-

casted single view sports videos.

ACKNOWLEDGEMENTS

This work was partially funded by the German Re-

search Foundation DFG.

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

470

REFERENCES

A.Dearden, Y. and Grau, O. (2006). Tracking football

player movment from a single moving camera using

particle ﬁlters. In European Conf. on Visual Media

Production (CVMP 2006).

Arulampalam, M. S., Maskell, S., Gordon, N., and Clapp,

T. (2002). A tutorial on particle ﬁlters for on-

line nonlinear/non-gaussian bayesian tracking. IEEE

Trans. on Signal Processing, 50(2).

Bar-Shalom, Y., Li, X.-R., and Kirubarajan, T. (2001). Esti-

mation with Applications to Tracking and Navigation.

Wiley Interscience.

Barcel

o, L., Binefa, X., and Kender, J. R. (2005). Robust

methods and representations for soccer player track-

ing and collision resolution. In Proc. of Intl. Conf. on

Image and Video Retrieval (CIVR 2005), pages 237–

246.

Beetz, M., Bandouch, J., Gedikli, S., von Hoyningen-

Huene, N., Kirchlechner, B., and Maldonado, A.

(2006). Camera-based observation of football games

for analyzing multi-agent activities. In Proc. of Intl.

Joint Conf. on Autonomous Agents and Multiagent

Systems (AAMAS).

Beetz, M., Gedikli, S., Bandouch, J., Kirchlechner, B., von

Hoyningen-Huene, N., and Perzylo, A. (2007). Visu-

ally tracking football games based on tv broadcasts.

In Proc. of Intl. Joint Conf. on Artiﬁcial Intelligence

(IJCAI).

Doucet, A. (1998). On sequential Monte Carlo methods for

Bayesian ﬁltering. Technical report, Dept. End., Univ.

Cambridge, UK.

Du, W., Hayet, J.-B., Piater, J., and Verly, J. (2006). Col-

laborative multi-camera tracking of athletes in team

sports. In Workshop on Computer Vision Based Anal-

ysis in Sport Environments (CVBASE), pages 2–13.

Du, W. and Piater, J. H. (2007). Multi-camera People

Tracking by Collaborative Particle Filters and Princi-

pal Axis-Based Integration. In Asian Conference on

Computer Vision, number 4843 in LNCS, pages 365–

374. Springer.

Figueroa, P. J., Leite, N. J., and Barros, R. M. L. (2006).

Tracking soccer players aiming their kinematical mo-

tion analysis. Computer Vision and Image Under-

standing, 101(2):122–135.

Gedikli, S., Bandouch, J., von Hoyningen-Huene, N.,

Kirchlechner, B., and Beetz, M. (2007). An Adap-

tive Vision System for Tracking Soccer Players from

Variable Camera Settings. In Proc. of Intl. Conf. on

Computer Vision Systems (ICVS).

Huang, P. and Hilton, A. (2006). Football player tracking

for video annotation. In European Conf. on Visual

Media Production.

Khan, Z., Balch, T., and Dellaert, F. (2006). Mcmc data

association and sparse factorization updating for real

time multitarget tracking with merged and multiple

measurements. IEEE Trans. on Pattern Analysis and

Machine Intelligence, 28(12):1960–1972.

Li, Y., Dore, A., and Orwell, J. (2005). Evaluating the per-

formance of systems for tracking football players and

ball. In IEEE Intl. Conf. on Advanced Video and Sig-

nal Based Surveillance.

Liu, J., Tong, X., Li, W., Wang, T., Zhang, Y., Wang, H.,

Yang, B., Sun, L., and Yang, S. (2007). Automatic

player detection, labeling and tracking in broadcast

soccer video. In British Machine Vision Conference.

MacCormick, J. and Blake, A. (1999). A probabilistic

exclusion principle for tracking multiple objects. In

Proc. of Intl. Conf. on Computer Vision (ICCV), pages

572–578.

Nillius, P., Sullivan, J., and Carlsson, S. (2006). Multi-target

tracking - linking identities using bayesian network

inference. In Proc. of Computer Vision and Pattern

Recognition, pages 2187–2194.

arkk

a, S., Vehtari, A., and Lampinen, J. (2004). Rao-

blackwellized monte carlo data association for mul-

tiple target tracking. In Proc. of Intl Conf. on Infor-

mation Fusion, volume 7, Stockholm.

arkk

a, S., Vehtari, A., and Lampinen, J. (2007). Rao-

blackwellized particle ﬁlter for multiple target track-

ing. Information Fusion Journal, 8(1):2–15.

Sullivan, J. and Carlsson, S. (2006). Tracking and labelling

of interacting multiple targets. In Proc. of European

Conf. on Computer Vision, pages 619–632.

Yang, C., Duraiswami, R., and Davis, L. (2005). Fast mul-

tiple object tracking via a hierarchical particle ﬁlter.

In Proc. of Intl. Conf. on Computer Vision, volume 1,

pages 212–219.

RAO-BLACKWELLIZED RESAMPLING PARTICLE FILTER FOR REAL-TIME PLAYER TRACKING IN SPORTS

471