loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Jaime Salvador-Meneses 1 ; Zoila Ruiz-Chavez 1 and Jose Garcia-Rodriguez 2

Affiliations: 1 Universidad Central del Ecuador, Ciudadela Universitaria, Quito and Ecuador ; 2 Universidad de Alicante, Ap. 99. 03080, Alicante and Spain

Keyword(s): Big Data, Data Compression, Categorical Data, Encoding.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Data Reduction and Quality Assessment ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Pre-Processing and Post-Processing for Data Mining ; Soft Computing ; Symbolic Systems

Abstract: In the last years, some specialized algorithms have been developed to work with categorical information, however the performance of these algorithms has two important factors to consider: the processing technique (algorithm) and the representation of information used. Many of the machine learning algorithms depend on whether the information is stored in memory, local or distributed, prior to processing. Many of the current compression techniques do not achieve an adequate balance between the compression ratio and the decompression speed. In this work we propose a mechanism for storing and processing categorical information by compression at the bit level, the method proposes a compression and decompression by blocks, with which the process of compressed information resembles the process of the original information. The proposed method allows to keep the compressed data in memory, which drastically reduces the memory consumption. The experimental results obtained show a high compressi on ratio, while the block decompression is very efficient. Both factors contribute to build a system with good performance. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.141.8.247

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Salvador-Meneses, J.; Ruiz-Chavez, Z. and Garcia-Rodriguez, J. (2018). Low Level Big Data Compression. In Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR; ISBN 978-989-758-330-8; ISSN 2184-3228, SciTePress, pages 353-358. DOI: 10.5220/0007228003530358

@conference{kdir18,
author={Jaime Salvador{-}Meneses. and Zoila Ruiz{-}Chavez. and Jose Garcia{-}Rodriguez.},
title={Low Level Big Data Compression},
booktitle={Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR},
year={2018},
pages={353-358},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007228003530358},
isbn={978-989-758-330-8},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2018) - KDIR
TI - Low Level Big Data Compression
SN - 978-989-758-330-8
IS - 2184-3228
AU - Salvador-Meneses, J.
AU - Ruiz-Chavez, Z.
AU - Garcia-Rodriguez, J.
PY - 2018
SP - 353
EP - 358
DO - 10.5220/0007228003530358
PB - SciTePress