AQUa: an adaptive framework for compression of sequencing quality scores with random access functionality

| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Paridaens, Tom | ko |
| dc.contributor.author | VanWallendael, Glenn | ko |
| dc.contributor.author | De Neve, Wesley | ko |
| dc.contributor.author | Lambert, Peter | ko |
| dc.date.accessioned | 2018-02-21T06:38:41Z | - |
| dc.date.available | 2018-02-21T06:38:41Z | - |
| dc.date.created | 2018-02-19 | - |
| dc.date.issued | 2018-02 | - |
| dc.identifier.citation | BIOINFORMATICS, v.34, no.3, pp.425 - 433 | - |
| dc.identifier.issn | 1367-4803 | - |
| dc.identifier.uri | http://hdl.handle.net/10203/240375 | - |
| dc.description.abstract | Motivation: The past decade has seen the introduction of new technologies that significantly lowered the cost of genome sequencing. As a result, the amount of genomic data that must be stored and transmitted is increasing exponentially. To mitigate storage and transmission issues, we introduce a framework for lossless compression of quality scores. Results: This article proposes AQUa, an adaptive framework for lossless compression of quality scores. To compress these quality scores, AQUa makes use of a configurable set of coding tools, extended with a Context-Adaptive Binary Arithmetic Coding scheme. When benchmarking AQUa against generic single-pass compressors, file sizes are reduced by up to 38.49% when comparing with GNU Gzip and by up to 6.48% when comparing with 7-Zip at the Ultra Setting, while still providing support for random access. When comparing AQUa with the purpose-built, single-pass, and state-of-the-art compressor SCALCE, which does not support random access, file sizes are reduced by up to 21.14%. When comparing AQUa with the purpose-built, dual-pass, and state-of-the-art compressor QVZ, which does not support random access, file sizes are larger by 6.42-33.47%. However, for one test file, the file size is 0.38% smaller, illustrating the strength of our single-pass compression framework. This work has been spurred by the current activity on genomic information representation (MPEG-G) within the ISO/IEC SC29/WG11 technical committee. | - |
| dc.language | English | - |
| dc.publisher | OXFORD UNIV PRESS | - |
| dc.subject | LOSSY COMPRESSION | - |
| dc.subject | CABAC | - |
| dc.title | AQUa: an adaptive framework for compression of sequencing quality scores with random access functionality | - |
| dc.type | Article | - |
| dc.identifier.wosid | 000423978700009 | - |
| dc.identifier.scopusid | 2-s2.0-85041396490 | - |
| dc.type.rims | ART | - |
| dc.citation.volume | 34 | - |
| dc.citation.issue | 3 | - |
| dc.citation.beginningpage | 425 | - |
| dc.citation.endingpage | 433 | - |
| dc.citation.publicationname | BIOINFORMATICS | - |
| dc.identifier.doi | 10.1093/bioinformatics/btx607 | - |
| dc.contributor.nonIdAuthor | Paridaens, Tom | - |
| dc.contributor.nonIdAuthor | VanWallendael, Glenn | - |
| dc.contributor.nonIdAuthor | Lambert, Peter | - |
| dc.description.isOpenAccess | N | - |
| dc.type.journalArticle | Article | - |
| dc.subject.keywordPlus | LOSSY COMPRESSION | - |
| dc.subject.keywordPlus | CABAC | - |