loading...
Shannon Information in Complete Genomes
Stanford, California August 16-August 19
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSB.2004.13324132004 IEEE Computational Systems Bioin ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Chang-Heng Chang, National Central University
Li-Ching Hsieh, National Central University
Ta-Yuan Chen, National Central University
Hong-Da Chen, National Central University
Liaofu Luo, Inner Mongolia University
Hoong-Chien Lee, National Central University and National Center for Theoretical Sciences
Shannon information in the genomes of all completely sequenced prokaryotes and eukaryotes are measured in word lengths of two to ten letters. It is found that in a scale-dependent way, the Shannon information in complete genomes are much greater than that in matching random sequences - thousands of times greater in the case of short words. Furthermore, with the exception of the 14 chromosomes of Plasmodium falciparum, the Shannon information in all available complete genomes belong to a universality class given by an extremely simple formula. The data are consistent with a model for genome growth composed of two main ingredients: random segmental duplications that increase the Shannon information in a scale-independent way, and random point mutations that preferentially reduces the larger-scale Shannon information. The inference drawn from the present study is that the large-scale and coarse-grained growth of genomes was selectively neutral and this suggests an independent corroboration of Kimura?s neutral theory of evolution.
Citation:
Chang-Heng Chang, Li-Ching Hsieh, Ta-Yuan Chen, Hong-Da Chen, Liaofu Luo, Hoong-Chien Lee, "Shannon Information in Complete Genomes," csb, pp.20-30, 2004 IEEE Computational Systems Bioinformatics Conference (CSB'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.