Abstract
Nanopore sequencing has introduced the ability to sequence long stretches of DNA, enabling the resolution of repeating segments, or paired SNPs across long stretches of DNA. Unfortunately, significant error rates >15%, introduced through systematic and random noise inhibit downstream analysis. We propose a novel method, using unsupervised learning, to correct biologically amplified reads before downstream analysis proceeds. We also demonstrate that our method has performance comparable to existing techniques without limiting the detection of repeats, or the length of the input sequence.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of 2017 6th International Conference on Bioinformatics and Biomedical Science, ICBBS 2017 |
Publisher | Association for Computing Machinery |
Pages | 12-16 |
Number of pages | 5 |
Volume | Part F130950 |
ISBN (Electronic) | 9781450352222 |
DOIs | |
State | Published - Jun 22 2017 |
Event | 6th International Conference on Bioinformatics and Biomedical Science, ICBBS 2017 - Singapore, Singapore Duration: Jun 22 2017 → Jun 24 2017 |
Other
Other | 6th International Conference on Bioinformatics and Biomedical Science, ICBBS 2017 |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 6/22/17 → 6/24/17 |
Keywords
- Accuracy improvement
- DNA thumbprint
- Nanopore sequencing
ASJC Scopus subject areas
- Human-Computer Interaction
- Computer Networks and Communications
- Computer Vision and Pattern Recognition
- Software