loading...
Visualization of K -Tuple Distribution in Procaryote Complete Genomes and Their Randomized Counterparts
Stanford, California August 14-August 16
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSB.2002.1039327IEEE Computer Society Bioinformatics ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Huimin Xie, Suzhou University
Bailin Hao, T-Life Research Center

A few years ago we developed a simple scheme to visualize the string composition of long DNA sequences in terms of two- and one-dimensional (2D and 1D) histograms. While the patterns in the 2D histograms have been well understood, the structure of the 1D histograms has not been analyzed in details. It turns out that the structure of the 1D histograms of the genomic sequences and their randomized counterparts varies significantly depending on the g+c content of the genomes. In particular, the 1D histograms of some randomized sequences may show rich structure, a seemingly anti-intuitive result.

Three approaches are used to explain the phenomenon: (1) Monte Carlo simulation, (2) exact computation by using the Goulden-Jackson cluster method, and (3) a Poisson approximation method. The multi-modal phenomena in K-histograms are well elucidated by the last approach.

Citation:
Huimin Xie, Bailin Hao, "Visualization of K -Tuple Distribution in Procaryote Complete Genomes and Their Randomized Counterparts," csb, pp.31, IEEE Computer Society Bioinformatics Conference (CSB'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.