usenix conference policies
Who Writes What Checkers?—Learning from Bug Repositories
Takeshi Yoshimura and Kenji Kono, Keio University
Static code checkers have been useful for finding bugs in large-scale C code. Domain-specific checkers are particularly effective in finding deep/subtle bugs because they can make use of domain-specific knowledge. To develop domain-specific checkers, however, typical bug patterns in certain domains must first be extracted. This paper explores the use of machine learning to help extract bug patterns from bug repositories. We used natural language processing to analyze over 370,000 bug descriptions of Linux and classified them into 66 clusters. Our preliminary work with this approach is encouraging: by investigating one of the 66 clusters, we were able to identify typical bug patterns in PCI device drivers and developed static code checkers to find them. When applied to the latest version of Linux, the developed checkers found two unknown bugs.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Takeshi Yoshimura and Kenji Kono},
title = {Who Writes What {Checkers?{\textemdash}Learning} from Bug Repositories},
booktitle = {10th Workshop on Hot Topics in System Dependability (HotDep 14)},
year = {2014},
address = {Broomfield, CO},
url = {https://www.usenix.org/conference/hotdep14/workshop-program/presentation/yoshimura},
publisher = {USENIX Association},
month = oct
}
connect with us