Towards block-based compression of genomic data with random access functionality

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 42
  • Download : 0
Current algorithms for compressing genomic data mostly focus on achieving high levels of effectiveness and reasonable levels of efficiency, ignoring the need for features such as random access and stream processing. Therefore, in this paper, we introduce a novel framework for compressing genomic data, with the aim of allowing for a better trade-off between effectiveness, efficiency and functionality. To that end, we draw upon concepts taken from the area of media data processing. In particular, we propose to compress genomic data as small blocks of data, using encoding tools that predict the nucleotides and that correct the prediction made by storing a residue. We also propose two techniques that facilitate random access. Our experimental results demonstrate that the compression effectiveness of the proposed approach is up to 1.91 bits per nucleotide, which is significantly better than binary encoding (3 bits per nucleotide) and Huffman coding (2.21 bits per nucleotide).
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2014-12
Language
English
Citation

2014 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2014, pp.1360 - 1363

DOI
10.1109/GlobalSIP.2014.7032347
URI
http://hdl.handle.net/10203/313374
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0