Improving genome assemblies using multi-platform sequence data
View/ Open
Access
info:eu-repo/semantics/openAccessDate
2016Author
Kavak, PınarErgüner, Bekir
Üstek, Duran
Yüksel, Bayram
Sağıroglu, Mahmut Şamil
Güngör, Tunga
Alkan, Can
Metadata
Show full item recordCitation
Kavak, P., Ergüner, B., Üstek, D., Yüksel, B., Saǧıroǧlu, M.Ş., Güngör, T. ve Alkan, C. (2016) Improving genome assemblies using multi-platform sequence data. Journal of Lecture Notes in Computer Science, 9874 LNCS, 220-232. https://dx.doi.org/10.1007/978-3-319-44332-4_17Abstract
Accurate de novo assembly using short reads generated by next generation sequencing technologies is still an open problem. Although there are several assembly algorithms developed for data generated with different sequencing technologies, and some that can make use of hybrid data, the assemblies are still far from being perfect. There is still a need for computational approaches to improve draft assemblies. Here we propose a new method to correct assembly mistakes when there are multiple types of data generated using different sequencing technologies that have different strengths and biases. We exploit the assembly of highly accurate short reads to correct the contigs obtained from less accurate long reads. We apply our method to Illumina, 454, and Ion Torrent data, and also compare our results with existing hybrid assemblers, Celera and Masurca.
Source
Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)Volume
9874 LNCSCollections
- Makale Koleksiyonu [3649]
- Scopus İndeksli Yayınlar Koleksiyonu [6283]