DC Field | Value | Language |
---|---|---|
dc.contributor.author | Paridaens, Tom | ko |
dc.contributor.author | VanWallendael, Glenn | ko |
dc.contributor.author | De Neve, Wesley | ko |
dc.contributor.author | Lambert, Peter | ko |
dc.date.accessioned | 2018-02-21T06:38:41Z | - |
dc.date.available | 2018-02-21T06:38:41Z | - |
dc.date.created | 2018-02-19 | - |
dc.date.created | 2018-02-19 | - |
dc.date.issued | 2018-02 | - |
dc.identifier.citation | BIOINFORMATICS, v.34, no.3, pp.425 - 433 | - |
dc.identifier.issn | 1367-4803 | - |
dc.identifier.uri | http://hdl.handle.net/10203/240375 | - |
dc.description.abstract | Motivation: The past decade has seen the introduction of new technologies that significantly lowered the cost of genome sequencing. As a result, the amount of genomic data that must be stored and transmitted is increasing exponentially. To mitigate storage and transmission issues, we introduce a framework for lossless compression of quality scores. Results: This article proposes AQUa, an adaptive framework for lossless compression of quality scores. To compress these quality scores, AQUa makes use of a configurable set of coding tools, extended with a Context-Adaptive Binary Arithmetic Coding scheme. When benchmarking AQUa against generic single-pass compressors, file sizes are reduced by up to 38.49% when comparing with GNU Gzip and by up to 6.48% when comparing with 7-Zip at the Ultra Setting, while still providing support for random access. When comparing AQUa with the purpose-built, single-pass, and state-of-the-art compressor SCALCE, which does not support random access, file sizes are reduced by up to 21.14%. When comparing AQUa with the purpose-built, dual-pass, and state-of-the-art compressor QVZ, which does not support random access, file sizes are larger by 6.42-33.47%. However, for one test file, the file size is 0.38% smaller, illustrating the strength of our single-pass compression framework. This work has been spurred by the current activity on genomic information representation (MPEG-G) within the ISO/IEC SC29/WG11 technical committee. | - |
dc.language | English | - |
dc.publisher | OXFORD UNIV PRESS | - |
dc.subject | LOSSY COMPRESSION | - |
dc.subject | CABAC | - |
dc.title | AQUa: an adaptive framework for compression of sequencing quality scores with random access functionality | - |
dc.type | Article | - |
dc.identifier.wosid | 000423978700009 | - |
dc.identifier.scopusid | 2-s2.0-85041396490 | - |
dc.type.rims | ART | - |
dc.citation.volume | 34 | - |
dc.citation.issue | 3 | - |
dc.citation.beginningpage | 425 | - |
dc.citation.endingpage | 433 | - |
dc.citation.publicationname | BIOINFORMATICS | - |
dc.identifier.doi | 10.1093/bioinformatics/btx607 | - |
dc.contributor.nonIdAuthor | Paridaens, Tom | - |
dc.contributor.nonIdAuthor | VanWallendael, Glenn | - |
dc.contributor.nonIdAuthor | Lambert, Peter | - |
dc.description.isOpenAccess | N | - |
dc.type.journalArticle | Article | - |
dc.subject.keywordPlus | LOSSY COMPRESSION | - |
dc.subject.keywordPlus | CABAC | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.