Human-Understandable Explanation for Software Vulnerability Prediction

Nguyen, Hong Quy; Hoang, Thong; Dam, Hoa  Khanh; Su, Guoxin; Xing, Zhenchang; Lu, Qinghua; Sun, Jiamou

doi:10.2139/ssrn.5007903

Download This Paper

Open PDF in Browser

Add Paper to My Library

Human-Understandable Explanation for Software Vulnerability Prediction

56 Pages Posted: 2 Nov 2024

See all articles by Hong Quy Nguyen

Thong Hoang

Government of the Commonwealth of Australia - Data61

Qinghua Lu

Government of the Commonwealth of Australia - Data61

Recent advances in deep learning have significantly improved the performance of software vulnerability prediction (SVP). To enhance trustworthiness, the SVP highlights predicted lines of code (LoC) that may be vulnerable. However, providing LoC alone is often insufficient for software practitioners, as it lacks detailed information about the nature of the vulnerability. This paper introduces a novel framework that is built on SVP by offering additional explanatory information based on the suggested LoC. Similar to security reports, our framework comprehensively explains the vulnerability aspects, such as Root Cause, Impact, Attack Vector, and Vulnerability Type. The proposed framework is powered by transformer architectures. Specifically, we leverage pre-trained language models for code to fine-tune on two practical datasets: BigVul and VKA, ensuring our framework's applicability to real-world scenarios. Experiments using the ROUGE and BLEU scores as evaluation metrics show that our framework achieves better performance with CodeT5+, statistically outperforming a baseline study in generating key vulnerability aspects. Additionally, we conducted a small-scale user study with experienced software practitioners to assess the effectiveness of the framework. The results show that 72\% of the participants found our framework helpful in accepting the SVP results, and 68\% rated the additional explanations as moderately to extremely useful.

Suggested Citation: Suggested Citation

Nguyen, Hong Quy and Hoang, Thong and Dam, Hoa Khanh and Su, Guoxin and Xing, Zhenchang and Lu, Qinghua and Sun, Jiamou, Human-Understandable Explanation for Software Vulnerability Prediction. Available at SSRN: https://ssrn.com/abstract=5007903 or http://dx.doi.org/10.2139/ssrn.5007903