
 
−  We propose a method to cluster the road sections 
based upon the network density statistics. Unlike 
some existing work, this clustering takes into 
account the orientation of the trajectory. Besides, 
this method utilizes the network topology to create 
relevant clusters. 
−  We propose a model to assess the evolution for 
dense route pairs at two consecutive time intervals. 
−  We propose a graph conveying the evolution as a 
mean to describe the information in a synthetic 
manner and to question the evolution of the density 
through the whole network. 
The rest of the paper is structured as follows. We 
describe a few preliminary concepts in section 2. In 
section 3, we present the first step concerning the 
clustering of road sections. The second step related 
to the evolution graph construction is presented in 
section 4. In section 5, we present the result of our 
experimental study. Finally, section 6 concludes this 
paper and sketches some future orientations. 
2  PRELIMINARIES 
The representation of the network is given by the set 
of road sections. The road section is represented 
through a graph NG (N, S). S is the set of directed 
segments, where each one represents the smallest 
unit of road section. N is the set of nodes, where 
each one represents a road junction. 
Besides, knowing the set of trajectories, we 
compute a matrix of transitions for the road network 
at each time interval. This tells how many times the 
junction have been taken for each turning movement 
(i.e. between each pair of adjacent sections), by 
reporting the number of moving objects going from 
one section to another at each time interval. This 
matrix is denoted M and M(i,j) represents the 
number of moving objects passing through S
i
 to 
section  S
j
 within the interval It
n
 (n  ∈  {1,…,k}, k 
stands for the number of time intervals). We also 
denote S
ij
 the transition (or turning movement) from 
S
i
 to S
j
. 
We adopt a symbolic representation of the 
trajectories as in (Du Mouza C. and P. Rigaux, 
2004), (Wan T., K. Zeitouni, 2005). In this model, a 
moving object trajectory tr is described by an 
identifier (tid) and a sequence of symbols where 
each one refers to a road section (S
i
), followed by a 
temporal identifier (t
i
) referring the time of entry of 
the trajectory tid to S
i
: 
tr = (tid , <(S
i1
 t
j1
), (S
i2
 t
j2
), …, (S
ik
 t
jk
)>)  with S
in
 ∈ S 
The order of symbols in the sequence above shows 
the movement direction. 
Concerning the similarity measure adopted in 
this work, we define the similarity (Trans_sim) at 
the level of the network for two adjacent transitions 
S
ij
 and S
jk
 as the difference of their density values: 
While the similarity between nonadjacent 
transitions is null: 
Trans_sim (S
ij
, S
uv
) = 0 if i≠v and j≠u 
(2) 
We define another similarity measure between dense 
routes (Route_sim). It allows comparing the dense 
route. Two routes are considered similar (with a 
similarity equal to 1) if they share at least one road 
section that corresponds to two successive time 
intervals. Otherwise, their similarity is null. 
3  SECTION CLUSTERING  
We call our proposed algorithm NETSCAN. It 
carries out the clustering of dense sections and 
incorporates them by forming dense routes. It is 
inspired from the density based clustering principle 
introduced with DBSCAN algorithm (Ester et al., 
1996), while applying it to road sections. It takes as 
input the set of sections that constitute the road 
network, the spatiotemporal transitions matrix 
associated with each time interval, a density 
threshold α and a similarity threshold ε between the 
transition densities. NETSCAN finds firstly the 
dense transitions, i.e. those having maximum value. 
Afterwards, for each dense transition, it groups the 
connected segments and transitions that have similar 
densities, thus creating dense routes.  
The process begins with the transition having the 
maximal density. Then, it begins searching the 
connected transitions in both ways in order to find 
those with a density ε near to the maximal one. To 
insure the non reuse of transitions that are included 
in dense routes, they are marked at the first 
assignment.  
The extension of a dense route is done in both 
ways if the constraints are verified, i.e., the 
candidate transition is only marked if it respects the 
α and ε thresholds. The obtained segment clusters 
correspond to the densest routes in the network. This 
procedure is performed again for each time interval. 
The dense routes are represented as a sequence of 
segments, the same as with the trajectories. Each 
segment is identified by an associated symbol. 
Trans_sim (S
ij
, S
uv
)= |M(i,j) – M(j,k)|  (1) 
KDIR 2009 - International Conference on Knowledge Discovery and Information Retrieval
320