Power to the People! - Meta-Algorithmic Modelling in Applied Data Science

Marco Spruit, Raj Jagesar

Abstract

This position paper first defines the research field of applied data science at the intersection of domain expertise, data mining, and engineering capabilities, with particular attention to analytical applications. We then propose a meta-algorithmic approach for applied data science with societal impact based on activity recipes. Our people-centred motto from an applied data science perspective translates to design science research which focuses on empowering domain experts to sensibly apply data mining techniques through prototypical software implementations supported by meta-algorithmic recipes.

References

  1. Allahyari, H., and N. Lavesson. 2011. “User-Oriented Assessment of Classification Model Understandability,” in 11th Scandinavian Conference on Artifical Intelligence, pp. 11-19.
  2. Brinkkemper, S. 1996. “Method Engineering: Engineering of Information Systems Development Methods and Tools,” Information and Software Technology (38:4), pp. 275-280.
  3. Cortez, P., and M. J. Embrechts. 2013. “Using Sensitivity Analysis and Visualization Techniques to Open Black Box Data Mining Models,” Information Sciences (225), pp. 1-17.
  4. Domingos, P. 2012. “A Few Useful Things to Know about Machine Learning,” Communications of the ACM (55:10), pp. 78-87.
  5. Hall, M., E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. 2009. “The WEKA Data Mining Software: An Update,” ACM SIGKDD Explorations Newsletter (11:1), pp. 10-18.
  6. Hevner, A., S. March, P. Jinsoo, and S. Ram. 2004. “Design Science in Information Systems Research,” MIS Quarterly (28:1), pp. 75-105.
  7. Johansson, U., L. Niklasson, and R. König. 2004. “Accuracy Vs. Comprehensibility in Data Mining Models,” in Proceedings of the Seventh International Conference on Information Fusion Vol. 1, pp. 295-300.
  8. Kamwa, I., S. Samantaray, and G. Joós. 2012. “On the Accuracy Versus Transparency Trade-Off of DataMining Models for Fast-Response PMU-Based Catastrophe Predictors,” IEEE Transactions on Smart Grid (3:1), pp. 152-161.
  9. Kotsiantis, S. B., I. Zaharakis, and P. Pintelas. 2007. “Supervised Machine Learning: A Review of Classification Techniques,” in Emerging Artifical Intelligence Applications in Computer Engineering, pp. 3-24.
  10. Lou, Y., R. Caruana, J. Gehrke, and G. Hooker. 2013. “Accurate Intelligible Models with Pairwise Interactions,” in Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 623-631.
  11. Menger, V., M. Spruit, K. Hagoort, and F. Scheepers. 2016. “Transitioning to a Data Driven Mental Health Practice: Collaborative Expert Sessions for Knowledge and Hypothesis Finding,” Computational and Mathematical Methods in Medicine, Article ID 9089321.
  12. Olson, D. L., D. Delen, and Y. Meng. 2012. “Comparative Analysis of Data Mining Methods for Bankruptcy Prediction,” Decision Support Systems (52:2), pp. 464- 473.
  13. Orloff, M. 2016. “ABC-TRIZ: Introduction to Creative Design Thinking with Modern TRIZ Modelling,” Springer.
  14. Pachidi, S., M. Spruit, and I. van der Weerd. 2014. “Understanding Users' Behavior with Software Operation Data Mining,” Computers in Human Behavior (30), pp. 583-594.
  15. Pachidi, S., and M. Spruit. 2015. “The Performance Mining method: Extracting performance knowledge from software operation data”, International Journal of Business Intelligence Research (6:1), pp. 11-29.
  16. Pritzker, P., and W. May. 2015. NIST Big Data interoperability Framework (NBDIF): Volume 1: Definitions. NIST Special Publication 1500-1. Final Version 1, September 2015.
  17. Setiono, R. 2003. “Techniques for Extracting Classification and Regression Rules from Artificial Neural Networks,” Computational Intelligence: The Experts Speak Piscataway, NJ, USA: IEEE, pp. 99-114.
  18. Simke, S. 2013. “Meta-Algorithmics: Patterns for Robust, Low Cost, High Quality Systems,” Wiley - IEEE.
  19. Spruit, M., and B. Vlug. 2015. “Effective and Efficient Classification of Topically-Enriched Domain-Specific Text Snippets”, International Journal of Strategic Decision Sciences (6:3), pp. 1-17.
  20. Tang, J., S. Alelyani, and H. Liu. 2014. “Feature Selection for Classification: A Review,” Data Classification: Algorithms and Applications Vol. 37, pp. 2 - 29.
  21. Thornton, C., F. Hutter, H. H. Hoos, and K. LeytonBrown. 2013. “Auto-WEKA: Combined Selection and Hyperparameter Optimization of Classification Algorithms,” in Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 847-855.
  22. Tukey, J. W. 1977. “Exploratory Data Analysis,” Addison-Wesley.
  23. van de Weerd, I., and S. Brinkkemper. 2008. “MetaModelling for Situational Analysis and Design Methods,” Handbook of Research on Modern Systems Analysis and Design Technologies and Applications, pp. 35-54.
  24. Vleugel, A., M. Spruit, and A. van Daal. 2010. “Historical data analysis through data mining from an outsourcing perspective: the three-phases method,” International Journal of Business Intelligence Research, (1:3), pp. 42-65.
  25. Yoo, I., P. Alafaireet, M. Marinov, K. Pena-Hernandez, R. Gopidi, J. Chang, and L. Hua. 2012. “Data Mining in Healthcare and Biomedicine: A Survey of the Literature,” Journal of Medical Systems (36:4), pp. 2431-2448.
Download


Paper Citation


in Harvard Style

Spruit M. and Jagesar R. (2016). Power to the People! - Meta-Algorithmic Modelling in Applied Data Science . In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016) ISBN 978-989-758-203-5, pages 400-406. DOI: 10.5220/0006081604000406


in Bibtex Style

@conference{kdir16,
author={Marco Spruit and Raj Jagesar},
title={Power to the People! - Meta-Algorithmic Modelling in Applied Data Science},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)},
year={2016},
pages={400-406},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006081604000406},
isbn={978-989-758-203-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)
TI - Power to the People! - Meta-Algorithmic Modelling in Applied Data Science
SN - 978-989-758-203-5
AU - Spruit M.
AU - Jagesar R.
PY - 2016
SP - 400
EP - 406
DO - 10.5220/0006081604000406