A CLOUD-BASED SOLUTION FOR DATA QUALITY IMPROVEMENT

Marco Comerio

2012

Abstract

Applying data quality improvement techniques within an organization is traditionally costly because several specialized tools are required. Cloud computing models could offer powerful solutions to reduce these costs. However, their widespread acceptance still faces challenges because they require the sharing of business-critical data. Services for data quality improvement in the cloud should therefore act in compliance with predefined contracts. This paper extends previous work on the specification, selection, and evaluation of service and data contracts. It also proposes a cloud-based architecture for data quality improvement that supports contract-based service selection. Experiments on a real scenario demonstrate the feasibility of the proposed solution.


Paper Citation


in Harvard Style

Comerio M. (2012). A CLOUD-BASED SOLUTION FOR DATA QUALITY IMPROVEMENT. In Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-8565-05-1, pages 222-227. DOI: 10.5220/0003902802220227


in Bibtex Style

@conference{closer12,
author={Marco Comerio},
title={A CLOUD-BASED SOLUTION FOR DATA QUALITY IMPROVEMENT},
booktitle={Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER},
year={2012},
pages={222-227},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003902802220227},
isbn={978-989-8565-05-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER
TI - A CLOUD-BASED SOLUTION FOR DATA QUALITY IMPROVEMENT
SN - 978-989-8565-05-1
AU - Comerio M.
PY - 2012
SP - 222
EP - 227
DO - 10.5220/0003902802220227

