Violence Recognition in Spanish Words using Data Mining

Adolfo Flores Moreno, Silvia B. González-Brambila, Juan G. Vargas-Rubio

2014

Abstract

Violent behavior in our society has been studied from many points of view, yet many cause-effect relations remain unexplained. Security personnel are normally trained to stay alert and recognize potentially violent behavior, but the monotonous nature of their job keeps them from being 100% effective. This paper presents the first results of a work in progress on detecting violence by analyzing the words used in conversations. We used a set of videos of two-person conversations in Spanish and classified them as violent or nonviolent. The audio of each conversation was extracted and converted to text. We grouped words using the “Ward”, “K-means” and “PAM” techniques (clValid, 2014); a clValid analysis indicated that the hierarchical technique was the best. The frequency percentage of each term was then computed and the SVM technique (Meyer, 2014) was applied, which showed that some terms were unclassifiable. In three of the tests the prediction was erroneous, and in another three we obtained good predictions with respect to the test set.
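
The term-frequency step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' R workflow: the function names, the whitespace tokenization, and the two toy transcripts are assumptions made only for illustration. It computes the percentage of each term in a transcript and aligns the results on a shared vocabulary, which is the kind of numeric vector a classifier such as an SVM could take as input.

```python
from collections import Counter


def term_frequency_percentages(transcript):
    """Return {term: percentage of all tokens in the transcript}.

    Tokenization is a plain lowercase whitespace split (an assumption;
    the paper does not describe its tokenizer here).
    """
    tokens = transcript.lower().split()
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {term: 100.0 * n / total for term, n in counts.items()}


def to_vector(percentages, vocabulary):
    """Align a percentage dict on a fixed vocabulary (0.0 for absent terms)."""
    return [percentages.get(term, 0.0) for term in vocabulary]


# Hypothetical toy transcripts, NOT taken from the paper's data set.
violent = "te voy a golpear te voy a matar"
nonviolent = "hola buenos días como estas"

violent_pct = term_frequency_percentages(violent)
nonviolent_pct = term_frequency_percentages(nonviolent)

# Shared vocabulary so both conversations map to vectors of equal length.
vocabulary = sorted(set(violent_pct) | set(nonviolent_pct))
violent_vec = to_vector(violent_pct, vocabulary)
nonviolent_vec = to_vector(nonviolent_pct, vocabulary)
```

Each vector, paired with its violent or nonviolent label, would form one training row for a classifier; a real pipeline would also need stop-word handling and far more data than these two sentences.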

References

  1. clValid: An R Package for Cluster Validation, [Online], Available at: http://www.jstatsoft.org/v25/i04/paper [Retrieved January 2014]
  2. Meyer, D., “Support Vector Machines: The Interface to libsvm in package e1071”, September 2012, [Online], Available at: http://cran.r-project.org/web/packages/e1071/vignettes/svmdoc.pdf [Retrieved January 2014]
  3. Brun, R. E., Senso, J. A., Minería textual, [Online], Available at: http://www.elprofesionaldelainformacion.com/contenidos/2004/enero/2.pdf [Retrieved January 2014]
  4. Montes-y-Gómez, M., Minería de texto: Un nuevo reto computacional, [Online], Available at: http://ccc.inaoep.mx/mmontesg/publicaciones/2001/MineriaTexto-md01.pdf [Retrieved January 2014]
  5. Villanueva, V. J., Escribano, M., Isorna, M., Pellicer, J., Alapont, L., Pellicer, P., Programa de apoyo al ámbito familiar: Agresividad y violencia, Editorial IES Pablo Serrano. Andorra (Teruel), España, 2007.
  6. Adobe Premiere Pro CS6, [Online], Available at: http://www.adobe.com/mena_en/products/premiere.html [Retrieved January 2014]
  7. Modelos de análisis de voz para Adobe Premiere Pro CS6, [Online], Available at: http://www.adobe.com/es/products/premiere/extend.displayTab3.html [Retrieved January 2014]
  8. RStudio v0.97.551, [Online], Available at: http://www.rstudio.com/ide/download/desktop [Retrieved January 2014]
  9. Support Vector Machines in R, [Online], Available at: http://www.jstatsoft.org/v15/i09/paper [Retrieved February 2014]
  10. An Introduction to R, [Online], Available at: http://cran.r-project.org/doc/manuals/R-intro.pdf [Retrieved January 2014]
  11. R Data Import/Export, [Online], Available at: http://cran.r-project.org/doc/manuals/r-release/R-data.html [Retrieved January 2014]
  12. Grün, B., Hornik, K., “Topicmodels: An R Package for Fitting Topic Models”, 2011, [Online], Available at: http://cran.r-project.org/web/packages/topicmodels/vignettes/topicmodels.pdf [Retrieved January 2014]
  13. Feinerer, I., Hornik, K., “Package 'tm'”, August 2013, [Online], Available at: http://cran.r-project.org/web/packages/tm/tm.pdf [Retrieved January 2014]
  14. Wainschenker, R., Doorn, J., Castro, M., “Medición Cuantitativa de la Velocidad del Habla”, 2002, [Online], Available at: http://www.sepln.org/revistaSEPLN/revista/28/28-Pag99.pdf [Retrieved March 2014]
  15. Zhao, Y., R and Data Mining: Examples and Case Studies, 2013, [Online], Available at: http://cran.r-project.org/doc/contrib/Zhao_R_and_data_mining.pdf [Retrieved March 2014]
  16. Data Preprocessing Techniques for Data Mining, [Online], Available at: http://iasri.res.in/ebook/win_school_aa/notes/Data_Preprocessing.pdf [Retrieved March 2014]
  17. Hastie, T., Tibshirani, R., Friedman, J., The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer, 2001.
  18. Bourel, M., Support Vector Machines, [Online], Available at: http://www.iesta.edu.uy/wiki/images/7/71/SVM SemEstadistica.pdf [Retrieved March 2014]
  19. Derbas, N., Quénot, G., "Joint Audio-Visual Words for Violent Scenes Detection in Movies", International Conference on Multimedia Retrieval ICMR'14, Glasgow, United Kingdom, April 01-04, 2014.
  20. Gong, Y., Wang, W., Jiang, S., Huang, Q., Gao, W., "Detecting violent scenes in movies by auditory and visual cues", Advances in Multimedia Information Processing - PCM 2008, 317-326, Springer Berlin Heidelberg, 2008.
  21. Lam, V., Phan, S., Ngo, T., Le, D., Duong, D., Satoh, S., "Violent Scene Detection Using Mid-level Feature", SoICT'13, Danang, Vietnam, December 5-6, 2013.
  22. Fujii, Y., Yoshimura, T., Ito, T., "Filtering Harmful Sentences based on Three-Word Co-occurrence", CEAS'11, Perth, Western Australia, Australia, September 1-2, 2011.


Paper Citation


in Harvard Style

Flores Moreno A., González-Brambila S. and Vargas-Rubio J. (2014). Violence Recognition in Spanish Words using Data Mining. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014), ISBN 978-989-758-048-2, pages 210-216. DOI: 10.5220/0005072502100216


in Bibtex Style

@conference{kdir14,
author={Adolfo Flores Moreno and Silvia B. González-Brambila and Juan G. Vargas-Rubio},
title={Violence Recognition in Spanish Words using Data Mining},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)},
year={2014},
pages={210-216},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005072502100216},
isbn={978-989-758-048-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)
TI - Violence Recognition in Spanish Words using Data Mining
SN - 978-989-758-048-2
AU - Flores Moreno A.
AU - González-Brambila S.
AU - Vargas-Rubio J.
PY - 2014
SP - 210
EP - 216
DO - 10.5220/0005072502100216