
practical questions into consideration and proposes
a solution for multiple alignment of sequences us-
ing a hybrid solution with two tools: KAling (Lass-
mann, 2019) and Clustal Omega (Sievers and Hig-
gins, 2014) together with an optimization refinement
by ant colony combined with chaotic jump in order
to reduce the probability of reaching local maximums
and achieve higher quality results.
The aim of this work is to present a method for
multiple alignment of sequences, using the meta-
heuristic Ant Colony Optimization and seeking to ob-
tain results that are biologically relevant. Thus, hybrid
strategies were employed, such as the combination
of tools for generating a base alignment, refinement
of the alignment through the meta-heuristic cited and
application of the concept of chaotic jump combined
with partial realignment of sequences, if the algorithm
enters a local maximum point.
This work is organized as follows: In Section 2,
we present the related works, In Section 3 we detail
our methodology to develop the approach. In Section
4, we show the results of our method, focusing on
time execution improvement. Finally, in Section 5,
we make our conclusions about the work.
2 RELATED WORK
2.1 Multiple Sequence Alignment Based
on Chaotic PSO
The work of Lei et al. (2009) presents a multi-
sequence alignment approach, combining chaotic
jump with Particle Swarm Optimization. The authors
propose this solution to deal with the problem of pre-
mature convergence, which often means that the al-
gorithm has reached a local maximum or minimum
point (Gomes et al., 2022). Basically, the proposed
method perceives premature convergence by observ-
ing two points, first if the average distance of particles
is less than a threshold and second if the variance of
the particle satisfaction function is less than another
parameter of threshold.
When identifying the situation of premature con-
vergence, the algorithm uses a logistic map function
to generate values in the chaotic interval between 0
and 1. These values are then mapped to entire posi-
tions corresponding to gaps that will be inserted into
the target sequence. In situations where there is al-
ready a gap in the given position, a random posi-
tion is chosen, the previous gap is maintained, and
the new gap is inserted into that position. The algo-
rithm presented better solutions for a set of Ribonu-
clease sequences extracted from the BAliBase dataset,
which are sequences with an identity above 35%. For
other sequences, with smaller identities, such as Cy-
tochrome c, the results are not much better, but still,
the difference between the minimum and maximum
scores is small, which denotes robustness in the solu-
tion.
2.2 Chaotic Step Length Artificial Bee
Colony Algorithms for Protein
Structure Prediction
An important area of bioinformatics, in addition to
pattern recognition and multiple alignment of se-
quences, is the area of prediction of the protein struc-
ture. Originally, nuclear magnetic resonance and X-
ray crystallography techniques were used to deter-
mine the structure of the protein. However, these ap-
proaches require a costly laboratory equipment and
also consume a lot of time.
The work of Saxena et al. (2020) is in the predic-
tion of the protein structure field through a computa-
tional physical model. In general, the physical models
of protein prediction are constructed in two phases:
initially, a model with unknown energy is created and
optimization functions are applied on this model to
minimize protein-free energy.
To build and optimize free energy models, meta-
heuristics are often used, one of these meta-heuristics
is the Artificial Bee Colony. In the work of Saxena
et al. (2020), the hypothesis of applying chaotic func-
tions in a step of the algorithm Artificial Bee Colony
(ABC) is validated for best results. Originally, the
ABC algorithm updates bee velocity using the equa-
tion 1, where Φ is a randomly generated number in the
interval [−1, 1]. Saxena’s proposition is to replace the
random value with a method called Chaotic Length
Separator. The modified function is represented ac-
cording to the equation 2.
υ
i j
= x
i j
+ φ(x
best j
− x
i j
) (1)
υ
i j
= x
i j
+CLS
t
(x
best j
− x
i j
) (2)
To calculate Chaotic Length Separator, ten chaotic
map functions were chosen. The method were now
named Enhance Chaotic Artificial Bee colony and
for each function an indice was assigned. Some of
them are: Chebyshev chaotic (ECABC1), Sinusoidal
chaotic (ECABC9) and Tent chaotic (ECABC10).
After computing the functions, the result is normal-
ized using the equation 3, in the interval [0.2,1
e−10
]
where, t denotes the current iteration and T the maxi-
mum number of iterations.
ICEIS 2025 - 27th International Conference on Enterprise Information Systems
238