Expanding Statistical Similarity Based Data Reduction to Capture Diverse Patterns

We propose a new class of lossy compression based on a locally exchangeable measure that captures the distribution of repeating data blocks while preserving unique patterns. The technique has been demonstrated to reduce data volume by more than 100-fold on power grid monitoring data, where a large number of data blocks can be characterized as following stationary probability distributions. To capture data with more diverse patterns, we propose two techniques that transform non-stationary time series into locally stationary blocks. We also propose a strategy for handling values in bounded ranges, such as the phase angles of alternating current. These new ideas are incorporated into a software package named IDEALEM. In experiments, IDEALEM reduces non-stationary data volume by up to 100-fold. Compared with state-of-the-art lossy compression methods such as SZ, IDEALEM can produce more compact output overall.
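The core idea of the abstract, replacing a block with a reference to an earlier block that follows a similar distribution while storing unique blocks verbatim, can be illustrated with a minimal sketch. This is not the IDEALEM implementation. The function names (`ks_statistic`, `encode`, `decode`), the token format, the block size, and the threshold are all hypothetical, and the two-sample Kolmogorov-Smirnov statistic is used here as one plausible similarity measure, which the abstract does not specify.

```python
def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap
    between the empirical CDFs of samples a and b."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        # Advance both samples past every value <= x.
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


def encode(series, block_size=4, threshold=0.1):
    """Assumed scheme: emit ("new", block) for a block unlike any stored
    one, ("ref", index) for a block statistically similar to a stored
    one, and ("tail", block) for a trailing partial block."""
    dictionary = []  # representative blocks seen so far
    tokens = []
    for start in range(0, len(series), block_size):
        block = series[start:start + block_size]
        if len(block) < block_size:
            tokens.append(("tail", block))
            break
        match = None
        for idx, ref in enumerate(dictionary):
            if ks_statistic(block, ref) <= threshold:
                match = idx
                break
        if match is None:
            dictionary.append(block)
            tokens.append(("new", block))
        else:
            tokens.append(("ref", match))
    return tokens


def decode(tokens):
    """Rebuild a series; a "ref" token is replaced by the stored block,
    so the values match in distribution, not necessarily in order."""
    dictionary = []
    out = []
    for kind, payload in tokens:
        if kind == "new":
            dictionary.append(payload)
            out.extend(payload)
        elif kind == "ref":
            out.extend(dictionary[payload])
        else:  # "tail"
            out.extend(payload)
    return out
```

In this sketch the loss comes from the `"ref"` tokens: a block such as `[3, 2, 1, 0]` is judged equivalent to a stored `[0, 1, 2, 3]` because both have the same empirical distribution, and the decoder reproduces the stored block rather than the original ordering. Blocks that match nothing in the dictionary are kept exactly, which mirrors the abstract's goal of preserving unique patterns.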
Publisher
IEEE
Issue Date
2017-04-04
Language
English
Citation
2017 Data Compression Conference, DCC 2017
DOI
10.1109/DCC.2017.77
URI
http://hdl.handle.net/10203/269624
