MaMF

Source: Wikipedia, the free encyclopedia.

MaMF, or Mammalian Motif Finder, is an algorithm for identifying

transcription factors bind.[1]

The algorithm takes as input a set of

promoter
sequences, and a motif width(w), and as output, produces a ranked list of 30 predicted motifs(each motif is defined by a set of N sequences, where N is a parameter).

The algorithm firstly

n-mer, and each sequence forms an ungapped alignment
with a substring of length w from the string of length 2w around the match, with a score exceeding a cut-off.

The pairs of sequences are then scored. The scoring function favours pairs which are very similar, but disfavours sequences which are very common in the target genome. The 1000 highest scoring pairs are kept, and the others are discarded. Each of these 1000 'seed' motifs are then used to search iteratively search for further sequences of length which maximise the score(a greedy algorithm), until N sequences for that motif are reached.

Very similar motifs are discarded, and the 30 highest scoring motifs are returned as output.

References

This page is based on the copyrighted Wikipedia article: MaMF. Articles is available under the CC BY-SA 3.0 license; additional terms may apply.Privacy Policy