
1
2
3
1 2 3
Compression Rank
Error Rank
TACO
TSXor
ATCS
Figure 3: Compression ratio vs. reconstruction error com-
parison.
compression ration in select cases, often resulted
in excessive reconstruction errors, with some com-
pressed series exceeding their original size. ATSC
achieved the highest compression for the remaining
two datasets but introduced significant reconstruction
errors and length mismatches in decompressed time
series.
5 CONCLUSION
The growth of time series data generated necessitates
efficient compression techniques to mitigate storage,
bandwidth, and computational challenges. While ex-
isting methods offer various trade-offs, they often suf-
fer from restrictive assumptions, high computational
costs, or limited flexibility. To address these short-
comings, we introduced TACO, a lightweight, tree-
based approximate compression method that oper-
ates without strong statistical assumptions, requires
no training, and supports selective decompression.
Our experimental evaluation on five diverse datasets
demonstrates that TACO achieves high compression
rates while maintaining low reconstruction errors,
outperforming state-of-the-art approaches three of the
five datasets. These results highlight TACO’s poten-
tial as a practical and efficient solution for real-world
time series compression.
ACKNOWLEDGEMENTS
This work was funded by the Deutsche Forschungs-
gemeinschaft (DFG, German Research Foundation) –
510552229.
REFERENCES
Bauer, A., Z
¨
ufle, M., Eismann, S., Grohmann, J., Herbst,
N., and Kounev, S. (2021). Libra: A benchmark for
time series forecasting methods. In Proceedings of
the ACM/SPEC International Conference on Perfor-
mance Engineering, pages 189–200.
Bruno, A., Nardini, F. M., Pibiri, G. E., Trani, R., and Ven-
turini, R. (2021). Tsxor: A simple time series com-
pression algorithm. In String Processing and Informa-
tion Retrieval: 28th International Symposium, SPIRE
2021, Lille, France, October 4–6, 2021, Proceedings
28, pages 217–223. Springer.
Cantor, G. (1878). Ein beitrag zur mannigfaltigkeitslehre.
Journal f
¨
ur die reine und angewandte Mathematik
(Crelles Journal), 1878(84):242–258.
Chen, X. (2023). Joint symbolic aggregate approximation
of time series. arXiv preprint arXiv:2401.00109.
Chiarot, G. and Silvestri, C. (2023). Time series compres-
sion survey. ACM Computing Surveys, 55(10):1–32.
Chiarot, G., Vascon, S., Silvestri, C., and Ochoa, I. (2025).
Rat-cc: a recurrent autoencoder for time-series com-
pression and classification. IEEE Access.
City of New York (2025). Tlc trip record data. https://www.
nyc.gov/site/tlc/about/tlc-trip-record-data.page.
Dau, H. A., Keogh, E., Kamgar, K., Yeh, C.-C. M., Zhu,
Y., Gharghabi, S., Ratanamahatana, C. A., Yanping,
Hu, B., Begum, N., Bagnall, A., Mueen, A., and
Batista, G. (2018). The ucr time series classifica-
tion archive. https://www.cs.ucr.edu/
∼
eamonn/time
series data 2018/.
Enderton, H. B. (1977). Elements of set theory. Gulf Pro-
fessional Publishing.
G
¨
odel, K. (1931).
¨
Uber formal unentscheidbare s
¨
atze
der principia mathematica und verwandter systeme i.
Monatshefte f
¨
ur mathematik und physik, 38:173–198.
Guerra, A., Vinciguerra, G., Boffa, A., and Ferrag-
ina, P. (2024). Learned compression of nonlin-
ear time series with random access. arXiv preprint
arXiv:2412.16266.
Hopcroft, J. E., Motwani, R., and Ullman, J. D. (2001). In-
troduction to automata theory, languages, and compu-
tation. Acm Sigact News, 32(1):60–65.
Lin, J., Keogh, E., Wei, L., and Lonardi, S. (2007). Ex-
periencing sax: a novel symbolic representation of
time series. Data Mining and knowledge discovery,
15:107–144.
Liu, J., Djukic, P., Kulhandjian, M., and Kantarci, B.
(2024). Deep dict: Deep learning-based lossy time
series compressor for iot data. In ICC 2024-IEEE
International Conference on Communications, pages
4245–4250. IEEE.
Liu, X. and Wang, W. (2024). Deep time series forecast-
ing models: A comprehensive survey. Mathematics,
12(10):1504.
Makridakis, S. and Hibon, M. (2000). The m3-competition:
results, conclusions and implications. International
journal of forecasting, 16(4):451–476.
Makridakis, S., Spiliotis, E., and Assimakopoulos, V.
(2020). The m4 competition: 100,000 time series
and 61 forecasting methods. International Journal of
Forecasting, 36(1):54–74.
Malinowski, S., Guyet, T., Quiniou, R., and Tavenard, R.
(2013). 1d-sax: A novel symbolic representation for
time series. In International Symposium on Intelligent
Data Analysis, pages 273–284. Springer.
TACO: A Lightweight Tree-Based Approximate Compression Method for Time Series
189