loading...
Enriching Reverse Engineering with Semantic Clustering
Pittsburgh, Pennsylvania November 07-November 11
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WCRE.2005.1612th Working Conference on Reverse En ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Adrian Kuhn, University of Berne
Stéphane Ducasse, University of Berne
Tudor Gîrba, University of Berne
Understanding a software system by just analyzing the structure of the system reveals only half of the picture, since the structure tells us only how the code is working but not what the code is about. What the code is about can be found in the semantics of the source code: names of identifiers, comments etc. In this paper, we analyze how these terms are spread over the source artifacts using Latent Semantic Indexing, an information retrieval technique. We use the assumption that parts of the system that use similar terms are related. We cluster artifacts that use similar terms, and we reveal the most relevant terms for the computed clusters. Our approach works at the level of the source code which makes it language independent. Nevertheless, we correlated the semantics with structural information and we applied it at different levels of abstraction (e.g. classes, methods). We applied our approach on three large case studies and we report the results we obtained.
Index Terms:
semantic analysis, clustering, concept location, reverse engineering
Citation:
Adrian Kuhn, Stéphane Ducasse, Tudor Gîrba, "Enriching Reverse Engineering with Semantic Clustering," wcre, pp.133-142, 12th Working Conference on Reverse Engineering (WCRE 2005), 2005
Usage of this product signifies your acceptance of the Terms of Use.