NEW APPROACHES FOR XML DATA COMPRESSION

Márlon A. C. Teixeira, Rodrigo S. Miani, Gean D. Breda, Bruno B. Zarpelão, Leonardo S. Mendes

2012

Abstract

Integrating information systems is essential to organizations, which makes it necessary for different technologies to interoperate. Extensible Markup Language (XML) is often used for data exchange because it is self-descriptive and platform-independent. However, XML is verbose, which can lead to large documents. This work proposes two new approaches for XML data compression and compares them with three existing algorithms: WAP Binary Extensible Markup Language (WBXML), XMill and Efficient XML Interchange (EXI). The comparison is based on compression rate and compression time for files of different sizes.
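
The two comparison metrics named above, compression rate and compression time, can be illustrated with a minimal sketch. It is not the authors' method or test setup: gzip from the Python standard library stands in for the compressors under study, the synthetic XML generator `make_sample_xml` and the helper `measure` are hypothetical names, and compression rate is assumed here to mean the fraction of bytes saved.

```python
import gzip
import time
from typing import Callable, Tuple


def measure(data: bytes, compress: Callable[[bytes], bytes]) -> Tuple[float, float]:
    """Return (compression rate, elapsed seconds) for one compressor."""
    start = time.perf_counter()
    compressed = compress(data)
    elapsed = time.perf_counter() - start
    # Assumption: rate is the fraction of bytes saved relative to the input.
    rate = 1.0 - len(compressed) / len(data)
    return rate, elapsed


def make_sample_xml(records: int) -> bytes:
    """Build a synthetic, repetitive XML document (hypothetical test data)."""
    rows = "".join(
        f"<record><id>{i}</id><name>item{i}</name></record>"
        for i in range(records)
    )
    return f'<?xml version="1.0"?><records>{rows}</records>'.encode("utf-8")


if __name__ == "__main__":
    # Vary the document size, as the comparison in the abstract does.
    for n in (100, 1_000, 10_000):
        xml = make_sample_xml(n)
        rate, seconds = measure(xml, gzip.compress)
        print(f"{len(xml):>8} bytes  rate={rate:.2%}  time={seconds * 1000:.2f} ms")
```

Any other compressor exposed as a bytes-to-bytes function could be passed to `measure` in place of `gzip.compress`, which keeps the two metrics comparable across algorithms.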



Paper Citation


in Harvard Style

Teixeira, M. A. C., Miani, R. S., Breda, G. D., Zarpelão, B. B. and Mendes, L. S. (2012). NEW APPROACHES FOR XML DATA COMPRESSION. In Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8565-08-2, pages 233-237. DOI: 10.5220/0003896202330237


in Bibtex Style

@conference{webist12,
author={Márlon A. C. Teixeira and Rodrigo S. Miani and Gean D. Breda and Bruno B. Zarpelão and Leonardo S. Mendes},
title={NEW APPROACHES FOR XML DATA COMPRESSION},
booktitle={Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2012},
pages={233-237},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003896202330237},
isbn={978-989-8565-08-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - NEW APPROACHES FOR XML DATA COMPRESSION
SN - 978-989-8565-08-2
AU - Teixeira, Márlon A. C.
AU - Miani, Rodrigo S.
AU - Breda, Gean D.
AU - Zarpelão, Bruno B.
AU - Mendes, Leonardo S.
PY - 2012
SP - 233
EP - 237
DO - 10.5220/0003896202330237

