On the Application of Bio-Inspired Optimization Algorithms to Fuzzy C-Means Clustering of Time Series

Muhammad Marwan Muhammad Fuad

2015

Abstract

Fuzzy c-means clustering (FCM) is a clustering method which is based on the partial membership concept. As with the other clustering methods, FCM applies a distance to cluster the data. While the Euclidean distance is widely-used to perform the clustering task, other distances have been suggested in the literature. In this paper we study the use of a weighted combination of metrics in FCM clustering of time series where the weights in the combination are the outcome of an optimization process using differential evolution, genetic algorithms, and particle swarm optimization as optimizers. We show how the overfitting phenomenon interferes in the optimization process that the optimal results obtained during the training stage degrade during the testing stage as a result of overfitting.

References

  1. Bustos, B. and Skopal, T., 2006. Dynamic similarity search in multi-metric spaces. Proceedings of the ACM Multimedia, MIR Workshop. ACM Press, New York, NY.
  2. Esling, P., and Agon, C., 2012. Time-series data mining .ACM Comput. Surv.
  3. Feoktistov, V. , 2006. Differential evolution: in search of solutions (Springer Optimization and Its Applications). Secaucus, NJ, USA. Springer- Verlag New York, Inc.
  4. Krzysztof, J.C., Pedrycz, W., Swiniarski, R.W., Kurgan, L.A., 2007. Data mining: a knowledge discovery approach. Springer-Verlag New York, Inc. Secaucus, New Jersey.
  5. Haupt, R.L., Haupt, S. E., 2004. Practical Genetic Algorithms with CD-ROM. Wiley-Interscience.
  6. Keogh, E., Zhu, Q., Hu, B., Hao. Y., Xi, X., Wei, L. & Ratanamahatana C. A.. 2011.: The UCR time series classification/clustering homepage: www.cs.ucr.edu/ eamonn/time_series_data/
  7. Larose, D., 2005. Discovering Knowledge in Data: An Introduction to Data Mining, Wiley, Hoboken, NJ.
  8. Liao, T., 2005. Clustering of time series data-a survey. Pattern Recognition.
  9. Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.,2011. Multiobjective genetic algorithms for clustering: applications in data mining and bioinformatics. Springer.
  10. Mitchell, M., 1996. An Introduction to Genetic Algorithms, MIT Press, Cambridge, MA.
  11. Muhammad Fuad, M.M., 2014a. A synergy of artificial bee colony and genetic algorithms to determine the parameters of the S-Gram distance. In DEXA 2014, Munich, Germany. Lecture Notes in Computer Science Volume 8645, 2014, pp 147-154.
  12. Muhammad Fuad, M.M. , 2012a. Differential evolution versus genetic algorithms: towards symbolic aggregate approximation of non-normalized time series. In IDEAS'12 , Prague, Czech Republic, Published by BytePress/ACM.
  13. Muhammad Fuad, M.M., 2014b. One-step or two-step optimization and the overfitting phenomenon: a case study on time series classification. In ICAART 2014.
  14. Muhammad Fuad, M.M., 2012b. Using differential evolution to set weights to segments with different information content in the piecewise aggregate approximation. In KES 2012, San Sebastian, Spain, (FAIA). IOS Press.
  15. Muhammad Fuad, M.Mm 2013. When optimization is just an illusion. In ADMA 2013, Part I. LNCS, vol. 8346, pp. 121-132. Springer, Heidelberg(2013).
  16. Witten, I.H., Frank, E., 2009. Data mining practical machine learning tools and techniques, second edition, Elsevier.
Download


Paper Citation


in Harvard Style

Muhammad Fuad M. (2015). On the Application of Bio-Inspired Optimization Algorithms to Fuzzy C-Means Clustering of Time Series . In Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-076-5, pages 348-353. DOI: 10.5220/0005276203480353


in Bibtex Style

@conference{icpram15,
author={Muhammad Marwan Muhammad Fuad},
title={On the Application of Bio-Inspired Optimization Algorithms to Fuzzy C-Means Clustering of Time Series},
booktitle={Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2015},
pages={348-353},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005276203480353},
isbn={978-989-758-076-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - On the Application of Bio-Inspired Optimization Algorithms to Fuzzy C-Means Clustering of Time Series
SN - 978-989-758-076-5
AU - Muhammad Fuad M.
PY - 2015
SP - 348
EP - 353
DO - 10.5220/0005276203480353