Location-Awareness in Time Series Compression

Xu Teng*, Andreas Züfle, Goce Trajcevski, Diego Klabjan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present our initial findings regarding the problem of the impact that time series compression may have on similarity-queries, in the settings in which the elements of the dataset are accompanied with additional contexts. Broadly, the main objective of any data compression approach is to provide a more compact (i.e., smaller size) representation of a given original dataset. However, as has been observed in the large body of works on compression of spatial data, applying a particular algorithm “blindly” may yield outcomes that defy the intuitive expectations – e.g., distorting certain topological relationships that exist in the “raw” data [7]. In this study, we quantify this distortion by defining a measure of similarity distortion based on Kendall’s T. We evaluate this measure, and the correspondingly achieved compression ratio for the five most commonly used time series compression algorithms and the three most common time series similarity measures. We report some of our observations here, along with the discussion of the possible broader impacts and the challenges that we plan to address in the future.

Original languageEnglish (US)
Title of host publicationAdvances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings
EditorsAndras Benczur, Tomas Horvath, Bernhard Thalheim
PublisherSpringer Verlag
Pages82-95
Number of pages14
ISBN (Print)9783319983974
DOIs
StatePublished - Jan 1 2018
Event22nd East-European Conference on Advances in Databases and Information Systems, ADBIS 2018 - Budapest, Hungary
Duration: Sep 2 2018Sep 5 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11019 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other22nd East-European Conference on Advances in Databases and Information Systems, ADBIS 2018
CountryHungary
CityBudapest
Period9/2/189/5/18

Fingerprint

Location Awareness
Time series
Compression
Data compression
Data Compression
Spatial Data
Similarity Measure
Intuitive
Quantify
Query
Evaluate

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Teng, X., Züfle, A., Trajcevski, G., & Klabjan, D. (2018). Location-Awareness in Time Series Compression. In A. Benczur, T. Horvath, & B. Thalheim (Eds.), Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings (pp. 82-95). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11019 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-98398-1_6
Teng, Xu ; Züfle, Andreas ; Trajcevski, Goce ; Klabjan, Diego. / Location-Awareness in Time Series Compression. Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings. editor / Andras Benczur ; Tomas Horvath ; Bernhard Thalheim. Springer Verlag, 2018. pp. 82-95 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{dfd6fdc538c349d8a8d058dd08a29662,
title = "Location-Awareness in Time Series Compression",
abstract = "We present our initial findings regarding the problem of the impact that time series compression may have on similarity-queries, in the settings in which the elements of the dataset are accompanied with additional contexts. Broadly, the main objective of any data compression approach is to provide a more compact (i.e., smaller size) representation of a given original dataset. However, as has been observed in the large body of works on compression of spatial data, applying a particular algorithm “blindly” may yield outcomes that defy the intuitive expectations – e.g., distorting certain{\^A} topological relationships that exist in the “raw” data [7]. In this study, we quantify this distortion by defining a measure of similarity distortion based on Kendall’s T. We evaluate this measure, and the correspondingly achieved compression ratio for the five most commonly used time series compression algorithms and the three most common time series similarity measures. We report some of our observations here, along with the discussion of the possible broader impacts and the challenges that we plan to address in the future.",
author = "Xu Teng and Andreas Z{\"u}fle and Goce Trajcevski and Diego Klabjan",
year = "2018",
month = "1",
day = "1",
doi = "10.1007/978-3-319-98398-1_6",
language = "English (US)",
isbn = "9783319983974",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "82--95",
editor = "Andras Benczur and Tomas Horvath and Bernhard Thalheim",
booktitle = "Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings",
address = "Germany",

}

Teng, X, Züfle, A, Trajcevski, G & Klabjan, D 2018, Location-Awareness in Time Series Compression. in A Benczur, T Horvath & B Thalheim (eds), Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11019 LNCS, Springer Verlag, pp. 82-95, 22nd East-European Conference on Advances in Databases and Information Systems, ADBIS 2018, Budapest, Hungary, 9/2/18. https://doi.org/10.1007/978-3-319-98398-1_6

Location-Awareness in Time Series Compression. / Teng, Xu; Züfle, Andreas; Trajcevski, Goce; Klabjan, Diego.

Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings. ed. / Andras Benczur; Tomas Horvath; Bernhard Thalheim. Springer Verlag, 2018. p. 82-95 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11019 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Location-Awareness in Time Series Compression

AU - Teng, Xu

AU - Züfle, Andreas

AU - Trajcevski, Goce

AU - Klabjan, Diego

PY - 2018/1/1

Y1 - 2018/1/1

N2 - We present our initial findings regarding the problem of the impact that time series compression may have on similarity-queries, in the settings in which the elements of the dataset are accompanied with additional contexts. Broadly, the main objective of any data compression approach is to provide a more compact (i.e., smaller size) representation of a given original dataset. However, as has been observed in the large body of works on compression of spatial data, applying a particular algorithm “blindly” may yield outcomes that defy the intuitive expectations – e.g., distorting certain topological relationships that exist in the “raw” data [7]. In this study, we quantify this distortion by defining a measure of similarity distortion based on Kendall’s T. We evaluate this measure, and the correspondingly achieved compression ratio for the five most commonly used time series compression algorithms and the three most common time series similarity measures. We report some of our observations here, along with the discussion of the possible broader impacts and the challenges that we plan to address in the future.

AB - We present our initial findings regarding the problem of the impact that time series compression may have on similarity-queries, in the settings in which the elements of the dataset are accompanied with additional contexts. Broadly, the main objective of any data compression approach is to provide a more compact (i.e., smaller size) representation of a given original dataset. However, as has been observed in the large body of works on compression of spatial data, applying a particular algorithm “blindly” may yield outcomes that defy the intuitive expectations – e.g., distorting certain topological relationships that exist in the “raw” data [7]. In this study, we quantify this distortion by defining a measure of similarity distortion based on Kendall’s T. We evaluate this measure, and the correspondingly achieved compression ratio for the five most commonly used time series compression algorithms and the three most common time series similarity measures. We report some of our observations here, along with the discussion of the possible broader impacts and the challenges that we plan to address in the future.

UR - http://www.scopus.com/inward/record.url?scp=85051090085&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85051090085&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-98398-1_6

DO - 10.1007/978-3-319-98398-1_6

M3 - Conference contribution

SN - 9783319983974

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 82

EP - 95

BT - Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings

A2 - Benczur, Andras

A2 - Horvath, Tomas

A2 - Thalheim, Bernhard

PB - Springer Verlag

ER -

Teng X, Züfle A, Trajcevski G, Klabjan D. Location-Awareness in Time Series Compression. In Benczur A, Horvath T, Thalheim B, editors, Advances in Databases and Information Systems - 22nd European Conference, ADBIS 2018, Proceedings. Springer Verlag. 2018. p. 82-95. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-98398-1_6