Human-Understandable Explanation for Software Vulnerability Prediction

56 Pages Posted: 2 Nov 2024

See all articles by Hong Quy Nguyen

Hong Quy Nguyen

University of Wollongong

Thong Hoang

Government of the Commonwealth of Australia - Data61

Hoa Khanh Dam

University of Wollongong

Guoxin Su

University of Wollongong

Zhenchang Xing

affiliation not provided to SSRN

Qinghua Lu

Government of the Commonwealth of Australia - Data61

Jiamou Sun

affiliation not provided to SSRN

Abstract

Recent advances in deep learning have significantly improved the performance of software vulnerability prediction (SVP).  To enhance trustworthiness, the SVP highlights predicted lines of code (LoC) that may be vulnerable.  However, providing LoC alone is often insufficient for software practitioners, as it lacks detailed information about the nature of the vulnerability.  This paper introduces a novel framework that is built on SVP by offering additional explanatory information based on the suggested LoC.  Similar to security reports, our framework comprehensively explains the vulnerability aspects, such as Root Cause, Impact, Attack Vector, and Vulnerability Type.  The proposed framework is powered by transformer architectures. Specifically, we leverage pre-trained language models for code to fine-tune on two practical datasets: BigVul and VKA, ensuring our framework's applicability to real-world scenarios.  Experiments using the ROUGE and BLEU scores as evaluation metrics show that our framework achieves better performance with CodeT5+, statistically outperforming a baseline study in generating key vulnerability aspects. Additionally, we conducted a small-scale user study with experienced software practitioners to assess the effectiveness of the framework.  The results show that 72\% of the participants found our framework helpful in accepting the SVP results, and 68\% rated the additional explanations as moderately to extremely useful.

Suggested Citation

Nguyen, Hong Quy and Hoang, Thong and Dam, Hoa Khanh and Su, Guoxin and Xing, Zhenchang and Lu, Qinghua and Sun, Jiamou, Human-Understandable Explanation for Software Vulnerability Prediction. Available at SSRN: https://ssrn.com/abstract=5007903 or http://dx.doi.org/10.2139/ssrn.5007903

Hong Quy Nguyen (Contact Author)

University of Wollongong ( email )

Northfields Avenue
Wollongong, 2522
Australia

Thong Hoang

Government of the Commonwealth of Australia - Data61 ( email )

Brisbane
Australia

Hoa Khanh Dam

University of Wollongong ( email )

Guoxin Su

University of Wollongong ( email )

Zhenchang Xing

affiliation not provided to SSRN ( email )

Qinghua Lu

Government of the Commonwealth of Australia - Data61 ( email )

Brisbane
Australia

Jiamou Sun

affiliation not provided to SSRN ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
54
Abstract Views
171
Rank
830,820
PlumX Metrics