SoloDel: a probabilistic model for detecting low-frequent somatic deletions from unmatched sequencing data

Cited 2 time in webofscience Cited 3 time in scopus
  • Hit : 336
  • Download : 0
Motivation: Finding somatic mutations from massively parallel sequencing data is becoming a standard process in genome-based biomedical studies. There are a number of robust methods developed for detecting somatic single nucleotide variations However, detection of somatic copy number alteration has been substantially less explored and remains vulnerable to frequently raised sampling issues: low frequency in cell population and absence of the matched control samples. Results: We developed a novel computational method SoloDel that accurately classifies low-frequent somatic deletions from germline ones with or without matched control samples. We first constructed a probabilistic, somatic mutation progression model that describes the occurrence and propagation of the event in the cellular lineage of the sample. We then built a Gaussian mixture model to represent the mixed population of somatic and germline deletions. Parameters of the mixture model could be estimated using the expectation-maximization algorithm with the observed distribution of read-depth ratios at the points of discordant-read based initial deletion calls. Combined with conventional structural variation caller, SoloDel greatly increased the accuracy in classifying somatic mutations. Even without control, SoloDel maintained a comparable performance in a wide range of mutated subpopulation size (10-70%). SoloDel could also successfully recall experimentally validated somatic deletions from previously reported neuropsychiatric whole-genome sequencing data.
Publisher
OXFORD UNIV PRESS
Issue Date
2015-10
Language
English
Article Type
Article
Keywords

COPY-NUMBER ALTERATIONS; HUMAN CANCERS; ACCURATE DETECTION; CLONAL EVOLUTION; WHOLE-GENOME; GENE FUSIONS; MUTATIONS; RETROTRANSPOSITION; DISCOVERY; BRAIN

Citation

BIOINFORMATICS, v.31, no.19, pp.3105 - 3113

ISSN
1367-4803
DOI
10.1093/bioinformatics/btv358
URI
http://hdl.handle.net/10203/203679
Appears in Collection
BiS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0