loading...
Sampling Open Source Projects from Portals: Some Preliminary Investigations
Como, Italy September 19-September 22
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/METRICS.2005.4111th IEEE International Software Metr ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Austen Rainer, University of Hertfordshire
Stephen Gale, Middlesex University
In this paper, we provide a preliminary evaluation of the quality and quantity of data on 50 000 open source (OS) projects hosted at the SourceForge.net portal. Using several indicators of project activity, we identify one sample from the entire dataset: the "most-broadly-active" OS projects. The number of projects that are active across all of our main indicators of activity account for less than 1% of the projects on the portal. 75% of the projects currently hosted on the SourceForge.net portal are not, and have never really been, active on the portal. Furthermore, whilst there has been a substantial increase in the number of projects being added to SourceForge.net over time, the number of projects being added that then go on to become most-broadly-active projects seems to be decreasing over time. Finally, we recognise that care needs to be taken in defining samples, such as the most-broadly-active projects, as these definitions raise implications for the conclusions that one makes and the generalisations that one should draw.
Citation:
Austen Rainer, Stephen Gale, "Sampling Open Source Projects from Portals: Some Preliminary Investigations," metrics, pp.27, 11th IEEE International Software Metrics Symposium (METRICS'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.