FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION

Dirk Pauli, Jens Feller, Bernhard Mauersberg, Ingo J. Timm

2011

Abstract

In this paper a new method for detecting multiple structural breaks, i.e. undesired changes of signal behavior, is presented and applied to real-world data. It will be shown how Chernoff Bounds can be used for highperformance hypothesis testing after preprocessing arbitrary time series to binary random variables using k-means-clustering. Theoretical results from part one of this paper have been applied to real-world time series from a pharmaceutical wholesaler and show striking improvement in terms of forecast error reduction, thereby greatly improving forecast quality. In order to test the effect of structural break detection on forecast quality, state of the art forecast algorithms have been applied to time series with and without previous application of structural break detection methods.

References

  1. Basseville, M. and Nikiforov, I. (1993). Detection of Abrupt Changes: Theory and Application. Prentice-Hall,Inc.
  2. Basseville, M. and Nikiforov, I. (1993). Detection of Abrupt Changes: Theory and Application. Prentice-Hall,Inc.
  3. Brown, R. (1959). Statistical forecasting for inventory control. McGraw-Hill New York.
  4. Brown, R. (1959). Statistical forecasting for inventory control. McGraw-Hill New York.
  5. Chandola, V., Banerjee, A., and Kumar, V. (2009). Anomaly detection: A survey. ACM Computing Surveys (CSUR), 41(3):1-58.
  6. Chandola, V., Banerjee, A., and Kumar, V. (2009). Anomaly detection: A survey. ACM Computing Surveys (CSUR), 41(3):1-58.
  7. Chernoff, H. (1952). A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the Sum of Observations. The Annals of Mathematical Statistics, 23:493-507.
  8. Chernoff, H. (1952). A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the Sum of Observations. The Annals of Mathematical Statistics, 23:493-507.
  9. Feller, S., Chevalier, R., and Morsili, S. (2010). Parameter Disaggregation for High Dimensional Time Series Data on the Example of a Gas Turbine. In Proceedings of the 38th ESReDA Seminar, Pcs, H, pages 13-26.
  10. Feller, S., Chevalier, R., and Morsili, S. (2010). Parameter Disaggregation for High Dimensional Time Series Data on the Example of a Gas Turbine. In Proceedings of the 38th ESReDA Seminar, Pcs, H, pages 13-26.
  11. Feller, W. (2009). An introduction to probability theory and its applications. Wiley-India.
  12. Feller, W. (2009). An introduction to probability theory and its applications. Wiley-India.
  13. Fujimaki, R., Yairi, T., and Machida, K. (2005). An Approach to Spacecraft Anomaly Detection Problem Using Kernel Feature Space. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pages 401- 410. ACM.
  14. Fujimaki, R., Yairi, T., and Machida, K. (2005). An Approach to Spacecraft Anomaly Detection Problem Using Kernel Feature Space. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pages 401- 410. ACM.
  15. Gardner Jr, E. (1985). Exponential smoothing: The state of the art. Journal of Forecasting, 4(1):1-28.
  16. Gardner Jr, E. (1985). Exponential smoothing: The state of the art. Journal of Forecasting, 4(1):1-28.
  17. Gelper, S., Fried, R., and Croux, C. (2010). Robust forecasting with exponential and Holt-Winters smoothing. Journal of Forecasting, 29(3):285-300.
  18. Gelper, S., Fried, R., and Croux, C. (2010). Robust forecasting with exponential and Holt-Winters smoothing. Journal of Forecasting, 29(3):285-300.
  19. Guralnik, V. and Srivastava, J. (1999). Event Detection from Time Series Data. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 33-42. ACM.
  20. Guralnik, V. and Srivastava, J. (1999). Event Detection from Time Series Data. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 33-42. ACM.
  21. Gustafsson, F. (1998). Estimation and Change Detection of Tire-Road Friction Using the Wheel Slip. IEEE Control System Magazine, 18(4):42-49.
  22. Gustafsson, F. (1998). Estimation and Change Detection of Tire-Road Friction Using the Wheel Slip. IEEE Control System Magazine, 18(4):42-49.
  23. Hartigan, J. and Wong, M. (1979). Algorithm AS 136: A k-means Clustering Algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), 28(1):100-108.
  24. Hartigan, J. and Wong, M. (1979). Algorithm AS 136: A k-means Clustering Algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), 28(1):100-108.
  25. Holt, C. (1957). Forecasting trends and seasonals by exponentially weighted moving averages. ONR Memorandum, 52:1957.
  26. Holt, C. (1957). Forecasting trends and seasonals by exponentially weighted moving averages. ONR Memorandum, 52:1957.
  27. Ibaida, A., Khalil, I., and Sufi, F. (2010). Cardiac abnormalities detection from compressed ECG in wireless telemonitoring using principal components analysis (PCA). In Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP), 2009 5th International Conference on, pages 207-212. IEEE.
  28. Ibaida, A., Khalil, I., and Sufi, F. (2010). Cardiac abnormalities detection from compressed ECG in wireless telemonitoring using principal components analysis (PCA). In Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP), 2009 5th International Conference on, pages 207-212. IEEE.
  29. Ide, T. and Kashima, H. (2004). Eigenspace-based Anomaly Detection in Computer Systems. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 440-449. ACM.
  30. Ide, T. and Kashima, H. (2004). Eigenspace-based Anomaly Detection in Computer Systems. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 440-449. ACM.
  31. Kawahara, Y. and Sugiyama, M. (2009). Change-point Detection in Time Series Data by Direct Density-Ratio Estimation. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM2009), pages 389-400.
  32. Kawahara, Y. and Sugiyama, M. (2009). Change-point Detection in Time Series Data by Direct Density-Ratio Estimation. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM2009), pages 389-400.
  33. Ma, J. and Perkins, S. (2003). Time Series Novelty Detection Using One-class Support Vector Machines. In Proceedings of the International Joint Conference on Neural Networks, volume 3, pages 1741-1745.
  34. Ma, J. and Perkins, S. (2003). Time Series Novelty Detection Using One-class Support Vector Machines. In Proceedings of the International Joint Conference on Neural Networks, volume 3, pages 1741-1745.
  35. Markou, M. and Singh, S. (2003). Novelty Detection: a Review-Part 1: Statistical Approaches. Signal Processing, 83(12):2481-2497.
  36. Markou, M. and Singh, S. (2003). Novelty Detection: a Review-Part 1: Statistical Approaches. Signal Processing, 83(12):2481-2497.
  37. Murad, U. and Pinkas, G. (1999). Unsupervised Profiling for Identifying Superimposed Fraud. Principles of Data Mining and Knowledge Discovery, 1704:251- 261.
  38. Murad, U. and Pinkas, G. (1999). Unsupervised Profiling for Identifying Superimposed Fraud. Principles of Data Mining and Knowledge Discovery, 1704:251- 261.
  39. Ng, T., Skitmore, M., and Wong, K. (2008). Using genetic algorithms and linear regression analysis for private housing demand forecast. Building and Environment, 43(6):1171-1184.
  40. Ng, T., Skitmore, M., and Wong, K. (2008). Using genetic algorithms and linear regression analysis for private housing demand forecast. Building and Environment, 43(6):1171-1184.
  41. Pauli, D., Timm, I., Lorion, Y., and Feller, S. (2011). Using Chernoff's Bounding Method for High-Performance Structural Break Detection. Submitted for publication.
  42. Pauli, D., Timm, I., Lorion, Y., and Feller, S. (2011). Using Chernoff's Bounding Method for High-Performance Structural Break Detection. Submitted for publication.
  43. Perron, P. (2006). Dealing with Structural Breaks. Palgrave handbook of econometrics, 1:278-352.
  44. Perron, P. (2006). Dealing with Structural Breaks. Palgrave handbook of econometrics, 1:278-352.
  45. Pinson, P., Nielsen, H., Madsen, H., and Nielsen, T. (2008). Local linear regression with adaptive orthogonal fitting for the wind power application. Statistics and Computing, 18(1):59-71.
  46. Pinson, P., Nielsen, H., Madsen, H., and Nielsen, T. (2008). Local linear regression with adaptive orthogonal fitting for the wind power application. Statistics and Computing, 18(1):59-71.
  47. Press, W., Teukolsky, S., Vetterling, W., and Flannery, B. (2007). Numerical Recipes: The Art of Scientific Computing. Cambridge University Press.
  48. Press, W., Teukolsky, S., Vetterling, W., and Flannery, B. (2007). Numerical Recipes: The Art of Scientific Computing. Cambridge University Press.
  49. Schwabacher, M., Oza, N., and Matthews, B. (2007). Unsupervised Anomaly Detection for Liquid-Fueled Rocket Propulsion Health Monitoring. In Proceedings of the AIAA Infotech@ Aerospace Conference, Reston, VA: American Institute for Aeronautics and Astronautics, Inc.
  50. Schwabacher, M., Oza, N., and Matthews, B. (2007). Unsupervised Anomaly Detection for Liquid-Fueled Rocket Propulsion Health Monitoring. In Proceedings of the AIAA Infotech@ Aerospace Conference, Reston, VA: American Institute for Aeronautics and Astronautics, Inc.
  51. Strang, G. (1989). Wavelets and dilation equations: A brief introduction. Siam Review, 31(4):614-627.
  52. Strang, G. (1989). Wavelets and dilation equations: A brief introduction. Siam Review, 31(4):614-627.
  53. Taylor, J. (2010). Multi-item sales forecasting with total and split exponential smoothing. Journal of the Operational Research Society.
  54. Taylor, J. (2010). Multi-item sales forecasting with total and split exponential smoothing. Journal of the Operational Research Society.
  55. Wadsworth, H. (1997). Handbook of statistical methods for engineers and scientists. McGraw-Hill Professional.
  56. Wadsworth, H. (1997). Handbook of statistical methods for engineers and scientists. McGraw-Hill Professional.
  57. Xia, B. and Zhao, C. (2009). The Application of Multiple Regression Analysis Forecast in Economical Forecast: The Demand Forecast of Our Country Industry Lavation Machinery in the Year of 2008 and 2009. In Second International Workshop on Knowledge Discovery and Data Mining, 2009. WKDD 2009, pages 405-408.
  58. Xia, B. and Zhao, C. (2009). The Application of Multiple Regression Analysis Forecast in Economical Forecast: The Demand Forecast of Our Country Industry Lavation Machinery in the Year of 2008 and 2009. In Second International Workshop on Knowledge Discovery and Data Mining, 2009. WKDD 2009, pages 405-408.
Download


Paper Citation


in Harvard Style

Pauli D., Feller J., Mauersberg B. and J. Timm I. (2011). FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION . In Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 978-989-8425-74-4, pages 262-271. DOI: 10.5220/0003457202620271


in Harvard Style

Pauli D., Feller J., Mauersberg B. and J. Timm I. (2011). FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION . In Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 978-989-8425-74-4, pages 262-271. DOI: 10.5220/0003457202620271


in Bibtex Style

@conference{icinco11,
author={Dirk Pauli and Jens Feller and Bernhard Mauersberg and Ingo J. Timm},
title={FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION},
booktitle={Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,},
year={2011},
pages={262-271},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003457202620271},
isbn={978-989-8425-74-4},
}


in Bibtex Style

@conference{icinco11,
author={Dirk Pauli and Jens Feller and Bernhard Mauersberg and Ingo J. Timm},
title={FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION},
booktitle={Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,},
year={2011},
pages={262-271},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003457202620271},
isbn={978-989-8425-74-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,
TI - FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION
SN - 978-989-8425-74-4
AU - Pauli D.
AU - Feller J.
AU - Mauersberg B.
AU - J. Timm I.
PY - 2011
SP - 262
EP - 271
DO - 10.5220/0003457202620271


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,
TI - FORECAST ERROR REDUCTION BY PREPROCESSED HIGH-PERFORMANCE STRUCTURAL BREAK DETECTION
SN - 978-989-8425-74-4
AU - Pauli D.
AU - Feller J.
AU - Mauersberg B.
AU - J. Timm I.
PY - 2011
SP - 262
EP - 271
DO - 10.5220/0003457202620271