Implementation of an existing motif discovery algorithm in SeqAn.
The goal of motif finding is the detection of novel, unknown signals in a set of sequences [1]. For example you might want to detect transcription factor binding sites in a genome, but do not know how they look like.
However, one can be quite certain, that such motifs are never conserved exactly but only approximately. This fact makes the problem difficult.
The SeqAn library contains an implementation of basic, text-book motif finding problem. However, there are recent advances in motif finding that would be a useful state-of-the-art addition to the baseline implementation.
The work by Marschal and Rahman in [2] is one example of a current algorithm.
The aim of this topic is to reimplement the method in [2,3] using the SeqAn library and data structures therein.