PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Analysis of Linux Evolution Using Aligned Source Code Segments
Antti Rasinen, Jaakko Hollmen and Heikki Mannila
In: Ninth International Conference on Discovery Science, 7 - 10 Oct 2006, Barcelona, Spain.

Abstract

The Linux operating system embodies a development history of 15 years and community effort of hundreds of voluntary developers. We examine the structure and evolution of the Linux kernel by considering the source code of the kernel as ordinary text without any regard to its semantics. After selecting three functionally central modules to study, we identified code segments using local alignments of source code from a reduced set of file comparisons. The further stages of the analyses take advantage of these identified alignments. We build module-specific visualizations, or descendant graphs, to visualize the overall code migration between versions and files. More detailed view can be achieved with chain graphs which show the time evolution of alignments between selected files. The methods used here may also prove useful in studying large collections of legacy code, whose original maintainers are not available.

PDF - PASCAL Members only - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Natural Language Processing
Information Retrieval & Textual Information Access
ID Code:2440
Deposited By:Jaakko Hollmen
Deposited On:22 November 2006