Automatic Real-Time Mining Software Process Activities From SVN Logs Using a Naive Bayes Classifier
- Source :
- IEEE Access, Vol 7, Pp 146403-146415 (2019)
- Publication Year :
- 2019
- Publisher :
- IEEE, 2019.
Abstract
- The abundance of event data in current software configuration management (SCM) systems makes it possible to discover software process models automatically from actual observed behavior. However, traditional process mining algorithms cannot be applied to event logs recorded in SCM systems such as SVN, because these logs lack activity attributes. To address this problem, a software process activity classifier is proposed that builds event-activity mappings from software development event streams, revealing activity attributes and associating each activity with the original SVN log. The proposed approach extracts activities from the SVN log based on semantic features and introduces a novel technique based on a naive Bayes approach to associate events with activities dynamically. The approach has been applied to two real-world software development process logs, ArgoUML and jEdit, consisting of more than 80,000 events and covering development information from 1998 to 2015. Applied to such data, the approach extracts activities from event logs and constructs a classifier that adds activity attributes to new events. The classification results are evaluated in terms of precision rate, recall rate, and F-measure. Overall, the experimental results on both logs show that the approach can mine software process activities from SVN log events automatically and in real time.
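The idea summarized in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a multinomial naive Bayes model with Laplace smoothing over the words of commit messages, updated one event at a time, and it scores predictions with precision, recall, and F-measure. The activity labels (`bug_fixing`, `testing`, `development`), the sample messages, and all function and class names are hypothetical and chosen only for illustration; the record does not list the paper's actual activity categories or features.

```python
import math
from collections import Counter, defaultdict


def tokenize(message):
    """Lowercase a commit message and split it into alphabetic word tokens."""
    cleaned = "".join(c.lower() if c.isalpha() else " " for c in message)
    return cleaned.split()


class NaiveBayesActivityClassifier:
    """Hypothetical multinomial naive Bayes over commit-message words."""

    def __init__(self):
        self.class_counts = Counter()            # events seen per activity
        self.word_counts = defaultdict(Counter)  # word frequencies per activity
        self.vocab = set()                       # all words seen so far

    def update(self, message, activity):
        """Add one labelled event; can be called as new events arrive."""
        self.class_counts[activity] += 1
        for word in tokenize(message):
            self.word_counts[activity][word] += 1
            self.vocab.add(word)

    def predict(self, message):
        """Return the activity with the highest log posterior score."""
        tokens = tokenize(message)
        total_events = sum(self.class_counts.values())
        best_activity, best_score = None, -math.inf
        for activity, count in self.class_counts.items():
            score = math.log(count / total_events)  # log prior
            # Laplace smoothing: add 1 to every word count.
            denom = sum(self.word_counts[activity].values()) + len(self.vocab)
            for word in tokens:
                score += math.log((self.word_counts[activity][word] + 1) / denom)
            if score > best_score:
                best_activity, best_score = activity, score
        return best_activity


def precision_recall_f1(y_true, y_pred, label):
    """Per-label precision, recall, and F-measure (harmonic mean)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical labelled SVN log messages (not taken from ArgoUML or jEdit).
TRAINING_EVENTS = [
    ("Fixed null pointer bug in parser", "bug_fixing"),
    ("fix crash when opening file", "bug_fixing"),
    ("Added unit tests for diagram export", "testing"),
    ("add test case for buffer", "testing"),
    ("Implemented new search dialog feature", "development"),
    ("add new feature: syntax highlighting", "development"),
]

classifier = NaiveBayesActivityClassifier()
for msg, act in TRAINING_EVENTS:
    classifier.update(msg, act)

new_event_activity = classifier.predict("fix crash bug")
```

Because `update` only increments counts, a newly labelled event can be folded into the model without retraining from scratch, which is one plausible way to read the "real-time" aspect described above.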
- Subjects :
- General Computer Science
Computer science
Process mining
software process activity
02 engineering and technology
computer.software_genre
Software development process
Activity classifier
Naive Bayes classifier
Software
020204 information systems
SVN log
0202 electrical engineering, electronic engineering, information engineering
General Materials Science
Software configuration management
business.industry
General Engineering
Software development
020207 software engineering
Capability Maturity Model
machine learning
Data mining
lcsh:Electrical engineering. Electronics. Nuclear engineering
business
computer
Classifier (UML)
lcsh:TK1-9971
Details
- Language :
- English
- ISSN :
- 2169-3536
- Volume :
- 7
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....53a038066fc17e455146fdb37ca680c8