Google Scholar

An error correction algorithm for NGS data

M Kchouk, JF Gibrat, M Elloumi - 2017 28th International …, 2017 - ieeexplore.ieee.org

2017 28th International Workshop on Database and Expert Systems …, 2017•ieeexplore.ieee.org

The Oxford Nanopore and Pacbio SMRT sequencing technologies has revolutionized the Next-Generation Sequencing (NGS) environment by producing long reads that exceed 60 kbp and helped to the completion of many biological projects. But, long reads are characterized by a high error rate which increases the difficulty of biological problems like the genome assembly problem. Error correction of long reads has become a challenge for bioinformaticians, which motivates the development of new approaches for error correction adapted to NGS technologies. In this paper, we present a new denovo self-error correction algorithm using only long reads. Our algorithm operates in two steps: First, we use a fast hashing method which allows to find alignments between the longest reads and other reads in a set of long reads. Next, we use the longest reads as seeds to obtain the final alignment of long reads by using a dynamic programming algorithm in a band of width w. Our error correction algorithm does not require high quality reads, in contrast to existing hybrid error correction ones.

ieeexplore.ieee.org

Show moreShow less

Save Cite Cited by 1 Related articles All 5 versions

Showing the best result for this search. See all results

Cite

Advanced search

Saved to My library

An error correction algorithm for NGS data