Exploitation of Vulnerabilities: A Topic-Based Machine Learning Framework for Explaining and Predicting Exploitation

Konstantinos Charmanas

Nikolaos Mittas and Lefteris Angelis

Resumen

Security vulnerabilities constitute one of the most important weaknesses of hardware and software security that can cause severe damage to systems, applications, and users. As a result, software vendors should prioritize the most dangerous and impactful security vulnerabilities by developing appropriate countermeasures. As we acknowledge the importance of vulnerability prioritization, in the present study, we propose a framework that maps newly disclosed vulnerabilities with topic distributions, via word clustering, and further predicts whether this new entry will be associated with a potential exploit Proof Of Concept (POC" role="presentation">POCPOC P O C ). We also provide insights on the current most exploitable weaknesses and products through a Generalized Linear Model (GLM" role="presentation">GLMGLM G L M ) that links the topic memberships of vulnerabilities with exploit indicators, thus distinguishing five topics that are associated with relatively frequent recent exploits. Our experiments show that the proposed framework can outperform two baseline topic modeling algorithms in terms of topic coherence by improving LDA" role="presentation">LDALDA L D A models by up to 55%. In terms of classification performance, the conducted experiments?on a quite balanced dataset (57% negative observations, 43% positive observations)?indicate that the vulnerability descriptions can be used as exclusive features in assessing the exploitability of vulnerabilities, as the ?best? model achieves accuracy close to 87%. Overall, our study contributes to enabling the prioritization of vulnerabilities by providing guidelines on the relations between the textual details of a weakness and the potential application/system exploits.

Palabras claves

text mining - exploits - fuzzy clustering - topic extraction - security vulnerabilities - machine learning

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 14 Parte: 7 (2023)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Applied Sciences
Information

DOI