RECOGNIZING IRREGULARITIES IN STACKED NANOPORE SIGNALS FROM IN SILICO PERMUTED SEQUENCING DATA
Summary
We present results of two attempts to improve irregularity (epigenetic modifications or damage) recognition in nanopore sequenced DNA signals. This was done on a limited dataset for proof of concept purposes. The first approach uses bayesian optimization to improve current algorithms used, the second approach uses a random forest with curated features to recognize irregularities in the sequencing signal. Only the random forest approach showed some promise, with positive results for in silico generated mutations of the sequencing data for a limited genetic context.