1. How to analyze git repositories with command line tools.
- Author
-
Spinellis, Diomidis and Gousios, Georgios
- Subjects
SOFTWARE engineering ,DATA extraction ,FEATURE selection ,SOFTWARE development tools ,DATA analytics - Abstract
Git repositories are an important source of empirical software engineering product and process data. Running the Git command-line tool and processing its output with other Unix tools allows the incremental construction of sophisticated data processing pipelines. Git data analytics on the command-line can be systematically presented through a pattern that involves fetching, selection, processing, summarization, and reporting. For each part of the processing pipeline, we examine the tools and techniques that can be most effectively used to perform the task at hand. The presented techniques can be easily applied, first to get a feeling of version control repository data at hand and then also for extracting empirical results. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF