Computer Science and Engineering Faculty Publications

Code Execution Capability as a Metric for Machine Learning–Assisted Software Vulnerability Detection Models

Daniel Grahn, Wright State University - Main CampusFollow
Lingwei Chen, Wright State University - Main CampusFollow
Junjie Zhang, Wright State University - Main CampusFollow

Document Type

Article

Publication Date

2023

Abstract

In this paper, we consider how the ability to learn Code Execution Tasks affects a model’s accuracy on software vulnerability detection (SVD) benchmark datasets. We initially find that models can achieve near state-of-the-art accuracy on SVD benchmarks regardless of their ability to learn Code Execution Tasks. However, these models fail to generalize well across SVD benchmarks. The results indicate a bias in the datasets that allows models to predict non- SVD signals. Under the theory that different collection methods will reduce biases, we investigate combining the SVD datasets. When trained on combined datasets, SVD accuracy is reduced but correlation with Code Execution Task accuracy improves. Our contributions are (1) using a reversed curriculum learning to evaluate model capabilities, (2) demonstrating the criticality of code execution understanding to machine learning– assisted software vulnerability detection, (3) evidence that improved diversity of SVD datasets will lead to improved accuracy and generalizability, (4) and benchmarks of recent models across multiple SVD datasets.

Comments

This article was presented at the

Repository Citation

Grahn, D., Chen, L., & Zhang, J. (2023). Code Execution Capability as a Metric for Machine Learning–Assisted Software Vulnerability Detection Models. .
https://corescholar.libraries.wright.edu/cse/659

Download

Included in

Computer Sciences Commons

COinS

Computer Science and Engineering Faculty Publications

Code Execution Capability as a Metric for Machine Learning–Assisted Software Vulnerability Detection Models

Document Type

Publication Date

Abstract

Comments

Repository Citation

Included in

Search

Browse

About

SelectedWorks Sites

Computer Science and Engineering Faculty Publications

Code Execution Capability as a Metric for Machine Learning–Assisted Software Vulnerability Detection Models

Authors

Document Type

Publication Date

Abstract

Comments

Repository Citation

Included in

Share

Search

Browse

About

SelectedWorks Sites