Optimizing Collision Avoidance in Dynamic Multi-Robot Systems:

A Velocity Obstacle and BB-PSO Approach with Priority Consideration

Luis H. Sanchez-Vaca

, Gildardo Sanchez-Ante

and Hernan Abaunza

∗ c

Tecnologico de Monterrey, Department of Computing, Gral. Ramon Corona 2514, Zapopan, JAL, Mexico

Keywords:

Multi-Robot Systems, Prioritized Navigation, Collision Avoidance, Velocity Obstacles, Particle Swarm

Optimization.

Abstract:

This study proposes integrating Reciprocal Velocity Obstacles (RVO) with Bare Bones Particle Swarm Opti-

mization (BB-PSO) for prioritized motion planning in multi-robot systems. BB-PSO was chosen because it

has fewer parameters to tune, reduced computational complexity, and provides potentially faster convergence

compared to standard PSO. The methodology enables collision avoidance and path planning while allowing

differentiated robot behaviors based on priority levels. Simulations used a two-phase experimental strategy:

ﬁrst, tuning cost function parameters through grid search, and second, evaluating various priority conﬁgu-

rations and random scenarios. Results show that the selected weight conﬁguration (α = 4,β = 2) balances

goal-seeking and obstacle avoidance, enabling high-priority agents to move directly while ensuring overall

group safety. Scenarios with higher average priorities exhibited shorter travel distances and faster completion

times, whereas those with lower or imbalanced priorities led to more conservative behavior and delays. Com-

pared to a greedy baseline, the proposed method signiﬁcantly reduced collisions, achieving an average of 1.0

collision per scenario versus 6.6 with the greedy approach. Some priority conﬁgurations achieved complete

task fulﬁllment without any collisions, highlighting the potential for optimized multi-robot coordination. The

proposed method offers a promising strategy for prioritized motion planning, balancing efﬁciency and safety

based on task importance. Future research includes comparing BB-PSO with other optimization methods,

reducing sample requirements, dynamically adjusting priorities, and extending the model to incorporate task

parameterizations and autonomous priority adaptation.

1 INTRODUCTION

Motion planning in robotics involves determining a

sequence of movements that allow a robot to navi-

gate from an initial position to a desired goal while

avoiding obstacles. This essential aspect of robotics

requires computing a collision-free path through the

robot environment, accounting for kinematics, dy-

namics, and environmental constraints (Latombe,

1991). Even in its simplest form, where a rigid object

navigates among static obstacles, the problem is NP-

hard (Canny, 1988) and its complexity increases in the

presence of dynamic obstacles, deformable bodies, or

multiple robots.

This work addresses the Multi-Robot Motion

Planing Problem (MRMP) (Saha and Isto, 2006),

https://orcid.org/0009-0003-8875-0315

https://orcid.org/0000-0003-3666-0855

https://orcid.org/0000-0001-5325-730X

∗

Corresponding author: habaunza@tec.mx

while prioritizing the motion of certain robots over

others, with the goal of enabling efﬁcient, coordi-

nated navigation where some agents are granted pas-

sage preference owing to task urgency, resource con-

straints, or mission-critical roles.

A central challenge in multi-robot systems is co-

ordinating robot motion to prevent collisions and en-

sure efﬁcient operation. Effective coordination en-

ables robots to move simultaneously without unnec-

essary delays, thereby demanding quick decisions

in dynamic settings. The complexity of MRMP

increases exponentially with the number of robots,

owing to the expansion of the joint conﬁguration

space. Researchers have explored various strategies,

including centralized and decentralized approaches,

sampling-based algorithms, and optimization tech-

niques (Sanchez and Latombe, 2002). Among the

recent trends, heuristic and AI-based methods have

demonstrated improved adaptability and efﬁciency.

Bio-inspired strategies, as reviewed by Banik et

Sanchez-Vaca, L. H., Sanchez-Ante, G. and Abaunza, H.

Optimizing Collision Avoidance in Dynamic Multi-Robot Systems: A Velocity Obstacle and BB-PSO Approach with Priority Consideration.

DOI: 10.5220/0013720000003982

Paper published under CC license (CC BY-NC-ND 4.0)

In Proceedings of the 22nd International Conference on Informatics in Control, Automation and Robotics (ICINCO 2025) - Volume 2, pages 309-316

ISBN: 978-989-758-770-2; ISSN: 2184-2809

309

al. (Banik et al., 2025), highlight the growing inter-

est in hybrid approaches that blend heuristics with AI

to enhance performance in dynamic environments.

Chen et al. (Chen et al., 2024) present a veloc-

ity prediction algorithm combining backpropagation

(BP) neural networks with reciprocal velocity obsta-

cles (PRVO). This method predicts velocities to re-

construct the velocity obstacle region and optimizes

robot navigation in dense and dynamic environments.

Likewise, Jathunga and Rajapaksha (Jathunga and

Rajapaksha, 2025) proposed a hybrid Probabilistic

Roadmap (PRM) method enhanced with Genetic Al-

gorithms (GA), termed PRM-GA. Their approach im-

proves the path quality by reducing the path length

and turn count, outperforming traditional PRMs in

dynamic environments.

Task prioritization is also critical in multi-robot

systems, particularly when resources are limited or

mission objectives vary in urgency. For instance, in

search and rescue missions, it is vital to prioritize

robots that locate survivors over those that clear de-

bris (Banik et al., 2025). Dynamic task prioritization

requires the adaptation of priorities based on real-time

performance metrics, which can be computationally

intensive, but is essential for maintaining system ef-

ﬁciency (Guo et al., 2018). Tian et al. (Tian et al.,

2021) explored real-time path planning using wireless

sensor networks and enhanced AI algorithms, stress-

ing the importance of adaptive prioritization to avoid

delays and ensure continuous operation.

Another challenge lies in learning task priorities

from demonstrations. Extending models to incorpo-

rate task parameterizations and enable autonomous

priority adaptation are essential for enhancing robot

autonomy (Silv

erio et al., 2018). Kumar et al. (Ku-

mar et al., 2023) introduced a hybrid controller that

combines artiﬁcial bee colony optimization with re-

current neural networks, thereby achieving signiﬁcant

improvements in navigation performance in unknown

environments.

Prioritized motion planning assigns priority levels

to robots based on criteria, such as task urgency or

energy constraints. Higher-priority robots plan their

paths ﬁrst, whereas others adjust their trajectories ac-

cordingly. This sequential strategy simpliﬁes compu-

tation compared to centralized methods, but can limit

the solution space for lower-priority robots, poten-

tially leading to suboptimal routes.

The velocity-obstacle (VO) approach is widely

used in mobile robotics for motion planning and col-

lision avoidance. VO deﬁnes sets of velocities that

would result in a collision and represents them geo-

metrically in the velocity space. By selecting veloci-

ties outside these regions, robots can navigate safely

in dynamic environments (Fiorini and Shiller, 1998).

This method extends naturally to multi-robot systems

by treating each robot as a moving obstacle, enabling

decentralized coordination.

An important reﬁnement of VO is the Recipro-

cal Velocity Obstacles (RVO) approach, which as-

sumes mutual collision avoidance between robots.

Van den Berg et al. (Van den Berg et al., 2008) in-

troduced RVO to improve navigation efﬁciency by

considering reciprocal behavior. Further develop-

ments, such as Optimal Reciprocal Collision Avoid-

ance (ORCA) (Alonso-Mora et al., 2013) enhance

RVO by integrating optimization methods, resulting

in smoother and more robust trajectories in dense en-

vironments.

Paikray et al. (Paikray et al., 2021) proposed

an improved version of Particle Swarm Optimiza-

tion (PSO) using sine and cosine algorithms (IPSO-

SCA) to generate optimal deadlock-free paths. Their

method balances exploration and exploitation while

minimizing individual path lengths, underscoring the

robustness of PSO in dynamic and cluttered scenar-

ios.

1.1 Problem Statement

The Velocity Obstacle (VO) approach is an effective

framework for constructing motion plans for multiple

robots. Several VO variants have emerged that focus

on optimal trajectory generation. This study proposes

a novel VO variant guided by Particle Swarm Op-

timization (PSO) to enhance local decision-making

within the VO framework.

PSO is a population-based optimization algorithm

inspired by the collective behavior of swarms such

as bird ﬂocks and ﬁsh schools. Each particle rep-

resents a potential solution, and navigates a multi-

dimensional search space by adjusting its position

based on personal and global bests. PSO’s simplic-

ity, adaptability, and efﬁciency make it suitable for

addressing complex high-dimensional optimization

problems. Besides, PSO has recently being applied

to mobile robots, sometimes in hybrid approaches, as

the one presented in (Najm et al., 2024), where the

authors combine PSO with a Whale Optimization Al-

gorithm (WOA). They found good reductions in path

length with this approach. The versatility of PSO is

such that in (Nasir et al., 2025), the PSO is used to

ﬁne tune the parameters of Fuzzy Logic Controllers.

Its applications span robotics, engineering, and ma-

chine learning, particularly in parameter tuning, path

planning, and feature selection.

ICINCO 2025 - 22nd International Conference on Informatics in Control, Automation and Robotics

310

1.2 Speciﬁc Goals and Scope of the

Study

A key component of sampling-based MRMP is the

tensor roadmap, which combines individual PRMs

for each robot. Recent work introduces the ”stag-

gered grid” sampling scheme, which signiﬁcantly re-

duces the number of required samples while maintain-

ing near-optimality (Banik et al., 2025). This method

demonstrates high-quality solutions with fewer sam-

ples in multi-robot scenarios (Jathunga and Rajapak-

sha, 2025) and outperforms uniform random sam-

pling, particularly in low-sample regimes. Distributed

navigation and obstacle avoidance strategies further

enhance performance in complex, dynamic environ-

ments (Yang et al., 2023).

2 METHODOLOGY

2.1 System Model and Assumptions

Let us consider a set of N mobile robots that operate

in a shared two-dimensional environment. Each robot

, where i = 1,..., N, is modeled as a disc-shaped

holonomic agent with radius r, position x

∈ R

, and

velocity v

∈ R

. Robots can select their velocity at

discrete time steps, t

= k∆t, where ∆t is ﬁxed. Each

robot’s motion follows simple kinematics: straight-

line motion with constant velocity within each time

step, subject to velocity and acceleration constraints.

All robots are assumed to have the same size and ca-

pabilities.

We assume that all agents have access to the posi-

tions and velocities of other robots at all times. This

capability allows reactive navigation and avoidance

behaviors. However, each robot maintains its own pri-

vate navigation goal g

∈ R

. This approach resem-

bles a real-world scenario in which agents can have

sensing capabilities to detect other agents while main-

taining a decentralized navigation system.

Each robot R

is assigned a priority level P

∈ [0, 1].

This scalar value represents the importance of an

agent’s task. Robots with higher priority are expected

to maintain more direct trajectories toward their goals.

Robots with a lower priority are expected to adjust

their paths to avoid interference with higher-priority

agents.

Priority values inﬂuence the weights of the cost

functions used during the optimization. This mecha-

nism enables implicit hierarchical coordination with-

out requiring centralized control or explicit negotia-

tion between robots.

2.2 Reciprocal Velocity Obstacle

Formulation

To ensure safe navigation in a shared environment,

each robot must avoid collisions with the other agents.

We adopted the Velocity Obstacle (VO) framework to

identify the velocities that would lead to a future col-

lision if both agents maintain their current velocities.

The Velocity Obstacle is deﬁned as follows. Let

two robots, R

and R

, with current positions p

and

, velocities v

and v

, and radii r

and r

, the VO

for robot R

induced by robot R

is deﬁned in Eq. 1.

∈ R

− v

∈ CC(p

− p

+ r

)}

(1)

This equation implies that v

lies within a collision

cone (CC) of relative velocities that would result in a

collision with R

, assuming that both agents main-

tain a constant velocity. The cone is centered along

the vector p

− p

, and its angular aperture is deﬁned

as the sum of r

+ r

. Any velocity v

within VO

would eventually bring robot R

into contact with R

However, in VO, avoidance is unilateral; robot R

is responsible for avoiding R

. When both agents try

to avoid each other independently, this can lead to

overly conservative behavior or oscillations. To ad-

dress this, we adopted the Reciprocal Velocity Obsta-

cles (RVO) formulation.

The key idea in RVO is to share the responsibil-

ity for collision avoidance between agents. Instead

of avoiding the entire VO, each agent avoids only the

portion of the velocity space that would lead to col-

lision, assuming that the relative velocity is equally

adjusted. The RVO for robot R

with respect to R

deﬁned in Eq. 2.

RVO

:= {v

∈ R

−

+ v

∈ VO

} (2)

Similarly, R

avoids the VO region centered at

the average velocity of both agents rather than at V

This formulation results in more balanced and natural

agent interactions, reducing unnecessary detours and

avoiding deadlocks in symmetric scenarios.

At each planning step, robot R

generates a dis-

crete set of velocity candidates V

based on its dy-

namic constraints, known as Reachable Velocities

(RV). We then ﬁlter this set by removing all veloci-

ties within the union of the velocity obstacles induced

by neighboring agents, as shown in Eq. 3.

RAV

:= {v ∈ V

|v /∈ U(V O

)} (3)

The resulting set, the Reachable Avoiding Veloci-

ties (RAV), is the feasible domain for the subsequent

optimization step. The following section describes

how a Particle Optimization (PSO) strategy guides the

velocity selection process.

Optimizing Collision Avoidance in Dynamic Multi-Robot Systems: A Velocity Obstacle and BB-PSO Approach with Priority Consideration

311

2.3 Particle Swarm Optimization

Algorithm

To select the optimal velocity from the set of safe

candidates (RAV), we employ the Bare Bones Parti-

cle Swarm Optimization (BB-PSO) algorithm. This

variant of PSO eliminates the use of velocity vectors

and generates new candidate solutions by sampling a

Gaussian distribution deﬁned by the best-known po-

sitions.

Each robot runs an independent instance of BB-

PSO at every time step to select the next velocity. The

swarm comprises particles, each representing a can-

didate velocity, v ∈ R

For each particle p, a new position is generated by

sampling from the normal distribution given in Eq. 4.

best

+ g

best

)

σ = |p

best

− g

best

(t+1)

∼ N(µ

,σ),

(4)

Here, µ

is the mean of the distribution, calculated

as the average of the particle’s best-known position

best

) and the global best position among all parti-

cles (g

best

). σ is the standard deviation, deﬁned as the

absolute difference between p

best

and g

best

. The new

position x

(t+1)

is sampled from a normal distribution

N(µ

,σ).

The new position is considered only if it is a part

of the RAV set. Otherwise, the particle will maintain

the same position. This ensures that only valid ve-

locities are generated for all particles during the op-

timization process. We evaluate each valid candidate

velocity by using the cost function in Eq. 5.

J(v) = α ·

∥

g − (x + v · ∆t)

∥

+ β ·

∑

o∈O



−

tanh(k · (A − d

safe

)) +



(5)

where A = ∥x

−x

′

∥, x denotes the current position of

the robot, g denotes the goal of the robot, v denotes

the candidate velocity, and ∆t denotes the time step.

The predicted position is given by x

′

= x +v · ∆t, The

set O contains all obstacles, each located at x

. Pa-

rameters d

safe

and k control the shape of the penalty

from being close to the obstacle, whereas α and β re-

spectively determine the relative weights of the goal-

seeking and obstacle avoidance terms.

The particle with the lowest cost across all iter-

ations (g

best

) is selected as the robot’s next velocity

command.

The implementation of this algorithm is summa-

rized in the pseudocode shown in Alg. 1.

Algorithm 1: BB-PSO for Velocity Selection.

Initialize particles {v

,... , v

} ← RAV;

Set pbest

← v

, and evaluate J(pbest

);

Set global best gbest ← argmin

J(pbest

);

for k = 1 to N do

for each particle i do

Compute µ ← 0.5(pbest

+ gbest);

Compute σ ← ||pbest

− gbest||;

Sample new position v

′

∼ N (µ,σ);

if v

′

is valid (within RAV) then

Update particle: v

← v

′

;

end

Evaluate J(v

);

if J(v

) < J(pbest

) then

Update personal best:

pbest

← v

;

end

Update global best

gbest ← argmin

J(pbest

);

end

return gbest as the optimal velocity v

∗

2.4 Integration of RVO and PSO

To integrate VO constraints into the BB-PSO frame-

work—and, more importantly, to account for the in-

ﬂuence of robot priorities on the optimization—we

adopted the following procedure. At each planning

step, robot R

computes the set of RAV, which is

deﬁned as the subset of dynamically feasible veloc-

ities that do not fall inside any VO generated by other

agents. This set serves as the search domain for the

BB-PSO algorithm. Particles outside the RAV are re-

jected, and their positions will not be updated.

Each robot is assigned a scalar priority value, P

where a higher value indicates greater importance.

Priorities do not alter the VO constraints directly, but

they modulate the weights in the cost function to en-

courage differentiated behavior.

Speciﬁcally, the weights α and β in the cost func-

tion are adjusted using Eqs. 6.

= max(αP

,0.1)

= max(β(1 − P

),1.0)

(6)

A high-priority robot (P

→ 1) emphasizes reach-

ing a goal and reduces obstacle avoidance penalties.

A low-priority robot (P

→ 0) becomes more con-

servative, placing greater weight on obstacle avoid-

ance.

We ensured that the obstacle avoidance weight

did not fall below the minimum threshold of 1.0.

This helps to prevent agents with the highest pri-

ority from taking too many risks by ignoring other

ICINCO 2025 - 22nd International Conference on Informatics in Control, Automation and Robotics

312

agents, possibly causing collisions in clustered envi-

ronments. Simultaneously, we ensured that the goal-

seeking weight was never below 0.1. This ensures that

even if the agent has the lowest priority, it will still try

to reach its goal.

3 IMPLEMENTATION

3.1 Simulation Setup

To evaluate the proposed approach, we developed

a series of simulations in a controlled 2D environ-

ment using custom-built tools and clearly deﬁned

robot conﬁgurations. The simulation was developed

in Python and executed using a ﬁxed time-step of

dt = 0.1 s. The environment comprises a bounded

two-dimensional (2D) workspace with multiple holo-

nomic robots. All motion-planning logic, includ-

ing VO construction and BB-PSO, was implemented

from scratch using NumPy for numerical operations

and Matplotlib for visualization.

The simulation loop is synchronous; however, the

system is nondeterministic owing to the stochastic

sampling in BB-PSO. Speciﬁcally, new particle posi-

tions are sampled from a normal distribution, making

the outcome of each run sensitive to random initial-

ization and updates.

Each robot is modeled as a circular agent with a

radius r = 0.3 m, capable of holonomic motion. The

robots are constrained by a maximum speed of v

max

0.7 m/s and a maximum acceleration of a

max

= 2.0

m/s². These parameters are based on a robot that will

be used for further testing (Hiwonder, 2025).

The robots are initialized with predeﬁned initial

positions and static goal locations, allowing a consis-

tent running conﬁguration. Each robot was assigned

a ﬁxed priority level that was manually speciﬁed at

the beginning of each simulation. These values inﬂu-

ence the cost function used in the optimization phase.

Higher-priority robots seek faster and direct paths to

their goals, while lower-priority robots seek safer col-

lision avoidance paths.

3.2 Algorithm Implementation

The algorithm integrates RVO constraints with BB-

PSO optimization to compute collision-free velocities

in a decentralized multi-robot setting. Each robot iter-

atively evaluates its reachable velocity space, ﬁlters it

based on dynamic constraints and collision avoidance

conditions, and then applies the BB-PSO to select the

most suitable velocity. This process is repeated at

each time step until all robots converge to their re-

spective goals within a speciﬁed tolerance. The steps

in this loop are summarized in the pseudocode shown

in Alg. 2. For simplicity, the pseudocode updates the

velocities and positions of the robots within the same

loop. However, it is important to note that in the ac-

tual simulation, these updates are applied only after

all robots have independently computed their next ve-

locity. This ensures that each robot makes its decision

without access to the updated actions of the others,

preserving simultaneity in decision-making.

Algorithm 2: RVO with BB-PSO Optimization.

while distance to goal > threshold do

for R

in Robots do

Compute the reciprocal velocity

obstacle of R

Calculate the reachable velocities of

Filter reachable avoidance velocities

of R

Optimize RAV with BB-PSO to get

the best velocity of R

(t + 1)

Update the velocity of R

Update the position of R

end

The simulation used an object-oriented approach

centered on a Robot class, encapsulating all relevant

information and methods required for motion plan-

ning. Each robot instance stores its current position,

velocity, goal, radius, dynamic limits, priority, and in-

ternal buffers for storing RVO and RAV. The class

also includes methods for updating positions, com-

puting the VO geometry, and ﬁltering velocities based

on collision constraints.

This modular design allows each robot to op-

erate autonomously in the simulation loop, thereby

enabling straightforward scaling to a larger number

of agents. Although the current implementation is

single-threaded, the independence of the agents al-

lows parallelization in further research.

4 RESULTS

4.1 Experiment Setup

A two-phase experimental strategy was employed

to evaluate the performance of the proposed motion

planning algorithm.

In the ﬁrst phase, a baseline scenario was deﬁned

Optimizing Collision Avoidance in Dynamic Multi-Robot Systems: A Velocity Obstacle and BB-PSO Approach with Priority Consideration

313

in which four agents were placed at the corners of a

square and tasked with exchanging positions diago-

nally. Two experiments were conducted using this

setup. First, a grid search was performed to identify

the optimal values of the cost function parameters α

and β, assuming equal priorities among agents. Sec-

ond, a set of evaluations was conducted under varying

priority conﬁgurations, using the previously selected

parameters that balanced both efﬁciency and safety.

The same priority scenarios were also tested using a

greedy approach instead of BB-PSO, to enable a fair

performance comparison.

Table 1: Top 5 conﬁgurations by mean arrival time.

α β Arrival time [s] Collision Number

5.0 2.0 15.67 ± 1.61 10.6

5.0 1.0 15.73 ± 0.85 6.4

4.0 1.0 16.52 ± 0.63 1.8

4.0 2.0 16.56 ± 0.23 0.0

3.0 2.0 16.63 ± 0.68 2.2

The second phase involved testing the algorithm

in random scenarios, where both initial and goal posi-

tions were sampled randomly to assess the robustness

of the approach under varying conditions.

Each individual experiment was repeated ﬁve

times to account for the stochastic nature of the opti-

mization process, as BB-PSO relies on sampling from

Gaussian distributions.

The average performance was evaluated using

three metrics: the total travel time, the total travel dis-

tance, and the number of collisions observed. Colli-

sions were evaluated at every time step to detect over-

laps between robots. As a result, a single collision

event could be counted multiple times if the robots

remained in contact across consecutive time steps.

The results of these experiments are presented in

the next section.

4.2 Parameter Tuning and Weight

Selection

To tune the cost function parameters, 25 experiments

were conducted by performing a grid search over

α,β ∈ {1, 2, 3,4,5}, resulting in 5 × 5 combinations.

Table 1 presents the top ﬁve parameter conﬁgura-

tions with the lowest mean arrival times.

4.3 Evaluation Under Priority-Driven

Scenarios

After completing the grid search over α and β, con-

ﬁgurations α = 4 and β = 2 were selected for further

evaluation under varying priority assignments. Al-

though this combination did not yield the lowest aver-

age arrival time, it consistently demonstrated compet-

itive performance with zero collisions. Therefore, the

selected conﬁguration provided a balanced trade-off

between speed and safety.

Table 2: Priority conﬁguration for each scenario (Robot

0–3).

Robot Sc.0 Sc.1 Sc.2 Sc.3 Sc.4 Sc.5 Sc.6 Sc.7 Sc.8 Sc.9

1.00 1.00 0.25 1.00 0.50 1.00 0.80 0.20 1.00 0.30

1.00 0.75 0.50 1.00 0.50 0.00 0.60 0.40 0.30 1.00

1.00 0.50 0.75 0.50 0.50 1.00 0.40 0.60 0.30 0.30

1.00 0.25 1.00 0.50 0.50 0.00 0.20 0.80 0.30 0.30

To evaluate the impact of priority levels on multi-

robot behavior, ten custom scenarios were tested us-

ing the selected conﬁgurations. In each scenario, the

four robots were assigned different static priority val-

ues ranging from 0.0 to 1.0, as shown in Table 2.

Figure 1 presents the trade-off between the av-

erage arrival time and average distance traveled for

all scenarios. Each point is colored according to

the average priority assigned to all the agents in that

scenario. A general trend was observed: scenarios

with higher average priorities (darker red points) re-

sulted in shorter travel distances and shorter comple-

tion times. In contrast, scenarios with lower average

priorities or greater imbalance tend to show increased

path lengths and delays, often owing to more conser-

vative behavior or yielding to high-priority agents.

Almost half of the scenarios resulted in full task

completion without any collisions. In contrast, the

ﬁrst scenario—where all robots had maximum prior-

ity—exhibited the highest number of collisions, but

also achieved the fastest arrival times and the short-

est distances traveled. On average, only one collision

occurred per scenario, highlighting the overall feasi-

bility and efﬁciency of the proposed method.

Table 3 presents a summary of the quantitative re-

sults for the ten evaluated priority scenarios, including

the average arrival time, distance traveled, and num-

Figure 1: Trade-off between average arrival time and dis-

tance traveled for each scenario. Color indicates the aver-

age priority value across all four robots.

ICINCO 2025 - 22nd International Conference on Informatics in Control, Automation and Robotics

314

ber of collisions recorded across simulation runs.

On the other hand, Table 4 presents the results

of running the same set of experiments using a

greedy approach instead of BB-PSO for the veloc-

ity optimization phase. Although the greedy method

achieved better average results in terms of arrival time

and distance traveled, it resulted in a signiﬁcantly

higher number of collisions, with an average of 6.6

per scenario. This suggests that BB-PSO was able to

explore more diverse solutions beyond those initially

included in the RAV set. While the proposed method

prioritized safety, the greedy strategy led to more ag-

gressive decisions in pursuit of goal completion.

Table 3: Average performance metrics for each scenario us-

ing BB-PSO.

Scenario

Arrival

Time (s)

Distance

(m)

Collisions

(avg.)

0 12.82 11.50 3.0

1 18.80 16.07 0.6

2 19.10 16.10 2.6

3 18.02 15.60 1.0

4 16.53 13.21 0.0

5 19.24 16.37 0.6

6 20.99 17.31 0.0

7 21.45 17.68 0.0

8 20.14 16.91 2.2

9 23.15 19.26 0.0

Average 19.02 16.00 1.0

Table 4: Average performance metrics for each scenario us-

ing the greedy approach.

Scenario

Arrival

Time (s)

Distance

(m)

Collisions

(avg.)

0 14.80 12.03 10.0

1 17.03 14.07 19.0

2 19.20 16.19 0.0

3 17.38 14.56 2.0

4 18.40 13.93 4.0

5 20.20 16.66 1.0

6 17.55 14.42 18.0

7 21.65 17.19 4.0

8 21.95 17.73 0.0

9 17.30 13.77 8.0

Average 18.55 15.06 6.6

4.4 Evaluation with Random Initial

Positions and Goals

To evaluate the performance of the proposed method

under varying conditions, ﬁve additional experiments

were conducted using randomly assigned initial po-

sitions and goals, with all agents assigned the same

priority value of 0.5.

The positions were generated under two con-

straints: each agent’s path had to be at least 3 meters

long, and both initial and goal positions were placed

to prevent overlaps that could lead to collisions at the

start or upon arrival.

Table 5: Average performance metrics for scenarios with

random positions and goals.

Scenario

Arrival

Time (s)

Distance

(m)

Collisions

(avg.)

0 5.67 4.58 0.0

1 10.66 8.81 2.0

2 7.55 6.28 0.0

3 7.28 5.57 0.0

4 9.31 7.42 0.0

Average 8.09 6.53 0.4

Figure 2 illustrates one of the randomly generated

scenarios, along with the trajectories followed by the

agents to reach their respective goals. Table 5 summa-

rizes the average performance across multiple runs.

Almost all experiments resulted in zero collisions, ex-

cept for Scenario 1—the case shown in the ﬁgure.

In this particular instance, several path intersections

likely contributed to the observed collisions. Despite

this, the results demonstrate that the proposed method

is robust across diverse scenarios, consistently main-

taining a low collision rate and successfully generat-

ing feasible solutions under varying conﬁgurations.

(a) (b)

Figure 2: Random Scenario 1. (a) shows the initial and goal

positions of the robots, and (b) shows the resulting trajecto-

ries.

5 CONCLUSIONS

The proposed methodology, which integrates Recip-

rocal Velocity Obstacles (RVO) with Bare Bones

Particle Swarm Optimization (BB-PSO), effectively

facilitates collision avoidance and path planning in

multi-robot systems. The incorporation of priority

levels into the cost function signiﬁcantly inﬂuences

the robot’s behavior, enabling differentiated actions

based on the assigned importance. A balance between

goal-seeking and obstacle avoidance was achieved

with the selected weight conﬁguration (α = 4, β = 2),

allowing high-priority agents to move more directly

Optimizing Collision Avoidance in Dynamic Multi-Robot Systems: A Velocity Obstacle and BB-PSO Approach with Priority Consideration

315

while ensuring overall group safety. Scenarios char-

acterized by higher average priorities generally re-

sulted in shorter travel distances and faster comple-

tion times, whereas those with lower or imbalanced

priorities exhibited more conservative behavior and

potential delays. Certain priority conﬁgurations (e.g.,

scenarios 4, 6, 7, and 9) achieved complete task fulﬁll-

ment without any collisions, demonstrating the poten-

tial for optimized multi-robot coordination using this

approach. These ﬁndings suggest that the proposed

method offers a promising strategy for prioritized mo-

tion planning in multi-robot systems, balancing efﬁ-

ciency and safety based on assigned task importance.

Future research directions include an in-depth evalua-

tion of BB-PSO against a broader set of optimization

algorithms, given that the greedy strategy was less ef-

fective in minimizing collisions. Additionally, explor-

ing methods for dynamically adjusting robot priorities

based on real-time performance metrics or changing

mission objectives, along with extending the model

to incorporate task parameterizations, could enhance

robot autonomy.

ACKNOWLEDGEMENTS

This research was supported in part by the Univer-

sity of Alberta-Tecnologico de Monterrey Seed Grant

Program, under project ”Intelligent Distributed Con-

trols for Multi-Agent Autonomous Systems for Safe

Interaction with Humans”.

REFERENCES

Alonso-Mora, J., Breitenmoser, A., Ruﬂi, M., Beardsley,

P., and Siegwart, R. (2013). Optimal reciprocal col-

lision avoidance for multiple non-holonomic robots.

In Distributed autonomous robotic systems: The 10th

international symposium, pages 203–216. Springer.

Banik, S., Banik, S. C., and Mahmud, S. S. (2025). Path

planning approaches in multi-robot system: A review.

Engineering Reports, 7(1):e13035.

Canny, J. (1988). The complexity of robot motion planning.

MIT press.

Chen, Y., Wang, Y., Li, B., and Kamiya, T. (2024). Multi-

robot navigation based on velocity obstacle prediction

in dynamic crowded environments. Industrial Robot:

the international journal of robotics research and ap-

plication, 51(4):607–616.

Fiorini, P. and Shiller, Z. (1998). Motion planning in dy-

namic environments using velocity obstacles. The in-

ternational journal of robotics research, 17(7):760–

772.

Guo, M., Haque, A., Huang, D.-A., Yeung, S., and Fei-Fei,

L. (2018). Dynamic task prioritization for multitask

learning. In Proceedings of the European conference

on computer vision (ECCV), pages 270–287.

Hiwonder (2025). Jetauto ros robot car. https:

//www.hiwonder.com/products/jetauto?variant=

41201592762455. Accessed: 2025-07-28.

Jathunga, T. and Rajapaksha, S. (2025). Improved path

planning for multi-robot systems using a hybrid prob-

abilistic roadmap and genetic algorithm approach.

Journal of Robotics and Control (JRC), 6(2):715–733.

Kumar, S., Parhi, D. R., and Muni, M. K. (2023). Path plan-

ning and obstacle avoidance of multi-robotic system

in static and dynamic environments. Proceedings of

the Institution of Mechanical Engineers, Part B: Jour-

nal of Engineering Manufacture, 237(9):1376–1390.

Latombe, J. (1991). Robot motion planning, volume 124

of The Kluwer international series in engineering and

computer science. Kluwer.

Najm, H. T., Ahmad, N. S., and Al-Araji, A. S. (2024). En-

hanced path planning algorithm via hybrid woa-pso

for differential wheeled mobile robots. Systems Sci-

ence & Control Engineering, 12(1):2334301.

Nasir, N. M., Ghani, N. M. A., Nasir, A. N. K., Ahmad,

M. A., and Tokhi, M. O. (2025). Neuro-modelling

and fuzzy logic control of a two-wheeled wheelchair

system. Journal of Low Frequency Noise, Vibration

and Active Control, 44(1):588–602.

Paikray, H., Das, P., and Panda, S. (2021). Optimal multi-

robot path planning using particle swarm optimiza-

tion algorithm improved by sine and cosine algo-

rithms. Arabian Journal for Science and Engineering,

46(4):3357–3381.

Saha, M. and Isto, P. (2006). Multi-robot motion planning

by incremental coordination. In 2006 IEEE/RSJ In-

ternational Conference on Intelligent Robots and Sys-

tems, pages 5960–5963.

Sanchez, G. and Latombe, J.-C. (2002). Using a prm plan-

ner to compare centralized and decoupled planning for

multi-robot systems. In Proceedings 2002 IEEE In-

ternational Conference on Robotics and Automation

(Cat. No.02CH37292), volume 2, pages 2112–2119

vol.2.

Silv

erio, J., Calinon, S., Rozo, L., and Caldwell, D. G.

(2018). Learning task priorities from demonstrations.

IEEE Transactions on Robotics, 35(1):78–94.

Tian, S., Li, Y., Kang, Y., and Xia, J. (2021). Multi-robot

path planning in wireless sensor networks based on

jump mechanism pso and safety gap obstacle avoid-

ance. Future Generation Computer Systems, 118:37–

47.

Van den Berg, J., Lin, M., and Manocha, D. (2008). Re-

ciprocal velocity obstacles for real-time multi-agent

navigation. In 2008 IEEE international conference on

robotics and automation, pages 1928–1935. Ieee.

Yang, Z., Li, J., Yang, L., Wang, Q., Li, P., and Xia, G.

(2023). Path planning and collision avoidance meth-

ods for distributed multi-robot systems in complex dy-

namic environments. Mathematical Biosciences and

Engineering, 20(1):145–178.

ICINCO 2025 - 22nd International Conference on Informatics in Control, Automation and Robotics

316