TY - JOUR
T1 - Enhancing Vulnerability Reports with Automated and Augmented Description Summarization
AU - Althebeiti, Hattan
AU - Alkinoon, Mohammed
AU - Mohaisen, Manar
AU - Salem, Saeed
AU - Nyang, Dae Hun
AU - Mohaisen, David
N1 - Publisher Copyright:
© 2015 IEEE. All rights reserved.
PY - 2025
Y1 - 2025
N2 - Public vulnerability databases, such as the National Vulnerability Database (NVD), document vulnerabilities and facilitate threat information sharing. However, they often suffer from short descriptions and outdated or insufficient information. In this paper, we introduce Zad, a system designed to enrich NVD vulnerability descriptions by leveraging external resources. Zad consists of two pipelines: one collects and filters supplementary data using two encoders to build a detailed dataset, while the other fine-tunes a pre-trained model on this dataset to generate enriched descriptions. By addressing brevity and improving content quality, Zad produces more comprehensive and cohesive vulnerability descriptions. We evaluate Zad using standard summarization metrics and human assessments, demonstrating its effectiveness in enhancing vulnerability information.
AB - Public vulnerability databases, such as the National Vulnerability Database (NVD), document vulnerabilities and facilitate threat information sharing. However, they often suffer from short descriptions and outdated or insufficient information. In this paper, we introduce Zad, a system designed to enrich NVD vulnerability descriptions by leveraging external resources. Zad consists of two pipelines: one collects and filters supplementary data using two encoders to build a detailed dataset, while the other fine-tunes a pre-trained model on this dataset to generate enriched descriptions. By addressing brevity and improving content quality, Zad produces more comprehensive and cohesive vulnerability descriptions. We evaluate Zad using standard summarization metrics and human assessments, demonstrating its effectiveness in enhancing vulnerability information.
KW - National Vulnerability Database
KW - Natural Language Processing Summarization
KW - Transformer
KW - Vulnerability
UR - http://www.scopus.com/inward/record.url?scp=105004905140&partnerID=8YFLogxK
U2 - 10.1109/TBDATA.2025.3566618
DO - 10.1109/TBDATA.2025.3566618
M3 - Article
AN - SCOPUS:105004905140
SN - 2332-7790
JO - IEEE Transactions on Big Data
JF - IEEE Transactions on Big Data
ER -