loading...
A Greedy Two-stage Gibbs Sampling Method for Motif Discovery in Biological Sequences
May 27-May 30
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/BMEI.2008.1112008 International Conference on BioM ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
For the motif discovery problem of DNA sequences, a greedy two-stage Gibbs sampling algorithm is presented, and the related software package is called Greedy MotifSAM. Based on position weight matrix (PWM) motif model, a greedy strategy for choosing the initial parameters of PWM is employed. Two sampling methods, site sampler and motif sampler, are used. Site sampler is used to find one occurrence per sequence of the motif in the dataset. Motif sampler is used to find zero or more non-overlapping occurrences of the motif in each sequence. The algorithm is capable of discovering several different motifs with differing numbers of occurrences in a single dataset. We use the binding sites (motif) information of eukaryotic transcription factors stored in TRANSFAC database to test our methods. The prediction accuracy, scalability and reliability are compared to several other methods.
Index Terms:
Motif discovery, Gibbs sampling, Binding sites, Transcription factors
Citation:
Li-fang Liu, Li-cheng Jiao, Hong-wei Huo, "A Greedy Two-stage Gibbs Sampling Method for Motif Discovery in Biological Sequences," bmei, vol. 1, pp.13-17, 2008 International Conference on BioMedical Engineering and Informatics, 2008
Usage of this product signifies your acceptance of the Terms of Use.