Improving statistical similarity based data reduction for non-stationary data

Cited 0 time in webofscience Cited 2 time in scopus
  • Hit : 149
  • Download : 0
We propose a new class of lossy compression based on locally exchangeable measure that captures the distribution of repeating data blocks while preserving unique patterns. The technique has been demonstrated to reduce data volume by more than 100-fold on power grid monitoring data where a large number of data blocks can be characterized as following stationary probability distributions. To capture data with more diverse patterns, we propose two techniques to transform non-stationary time series into locally stationary blocks. We also propose a strategy to work with values in bounded ranges such as phase angles of alternating current. These new ideas are incorporated into a software package named IDEALEM. In experiments, IDEALEM reduces non-stationary data volume up to 100-fold. Compared with the state-of-the-art lossy compression methods such as SZ, IDEALEM can produce more compact output overall.
Publisher
International Conference on Scientific and Statistical Database Management
Issue Date
2017-06-27
Language
English
Citation

29th International Conference on Scientific and Statistical Database Management, SSDBM 2017, pp.1 - 6

DOI
10.1145/3085504.3085583
URI
http://hdl.handle.net/10203/269612
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0