loading...
Comparing Approaches to Mining Source Code for Call-Usage Patterns
Minneapolis, Minnesota May 20-May 26
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MSR.2007.3Fourth International Workshop on Mini ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Huzefa Kagdi, Kent State University, USA
Michael L. Collard, Ashland University, USA
Jonathan I. Maletic, Kent State University, USA
Two approaches for mining function-call usage patterns from source code are compared. The first approach, itemset mining, has recently been applied to this problem. The other approach, sequential-pattern mining, has not been previously applied to this problem. Here, a call-usage pattern is a composition of function calls that occur in a function definition. Both approaches look for frequently occurring patterns that represent standard usage of functions and identify possible errors. Itemset mining produces unordered patterns, i.e., sets of function calls, whereas, sequential-pattern mining produces partially ordered patterns, i.e., sequences of function calls. The trade-off between the additional ordering context given by sequential-pattern mining and the efficiency of itemset mining is investigated. The two approaches are applied to the Linux kernel v2.6.14 and results show that mining ordered patterns is worth the additional cost.
Citation:
Huzefa Kagdi, Michael L. Collard, Jonathan I. Maletic, "Comparing Approaches to Mining Source Code for Call-Usage Patterns," msr, pp.20, Fourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.