How to Train your Antivirus: RL-based Hardening through the Problem-Space

Tsingenopoulos, Ilias; Cortellazzi, Jacopo; Bošanský, Branislav; Aonzo, Simone; Preuveneers, Davy; Joosen, Wouter; Pierazzi, Fabio; Cavallaro, Lorenzo

doi:10.1145/3678890.3678912

arXiv:2402.19027v2 (cs)

[Submitted on 29 Feb 2024 (v1), last revised 5 Sep 2024 (this version, v2)]

Title: How to Train your Antivirus: RL-based Hardening through the Problem-Space

Authors: Ilias Tsingenopoulos, Jacopo Cortellazzi, Branislav Bošanský, Simone Aonzo, Davy Preuveneers, Wouter Joosen, Fabio Pierazzi, Lorenzo Cavallaro

Abstract: ML-based malware detection on dynamic analysis reports is vulnerable to both evasion and spurious correlations. In this work, we investigate a specific ML architecture employed in the pipeline of a widely-known commercial antivirus company, with the goal of hardening it against adversarial malware. Adversarial training, the sole defensive technique that can confer empirical robustness, is not applicable out of the box in this domain, for the principal reason that gradient-based perturbations rarely map back to feasible problem-space programs. We introduce a novel Reinforcement Learning approach for constructing adversarial examples, a constituent part of adversarially training a model against evasion. Our approach comes with multiple advantages. It performs modifications that are feasible in the problem-space, and only those; thus it circumvents the inverse mapping problem. It also makes it possible to provide theoretical guarantees on the robustness of the model against a particular set of adversarial capabilities. Our empirical exploration validates our theoretical insights: we consistently reach 0% Attack Success Rate after a few adversarial retraining iterations.
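
The adversarial retraining loop described in the abstract can be illustrated with a toy sketch. This is not the authors' method or architecture. It assumes a linear perceptron as the detector, binary feature vectors standing in for dynamic analysis reports, and an attacker whose only capability is adding behaviors from a fixed `addable` set (a hypothetical stand-in for feasible problem-space modifications). An exhaustive search replaces the RL agent, which is only viable because the toy action space is tiny. All names below are illustrative.

```python
from itertools import combinations
from typing import List, Optional, Sequence, Tuple

Features = Tuple[int, ...]         # toy binary behavior indicators
Model = Tuple[List[float], float]  # (weights, bias) of a linear detector


def score(model: Model, x: Features) -> float:
    w, b = model
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def is_detected(model: Model, x: Features) -> bool:
    return score(model, x) > 0


def train_perceptron(data: Sequence[Features], labels: Sequence[int],
                     max_epochs: int = 100) -> Model:
    """Plain perceptron, retrained from scratch; label 1 = malware, 0 = benign."""
    w, b = [0.0] * len(data[0]), 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in zip(data, labels):
            err = y - int(score((w, b), x) > 0)
            if err:
                w = [wi + err * xi for wi, xi in zip(w, x)]
                b += err
                mistakes += 1
        if mistakes == 0:
            break
    return w, b


def attack(model: Model, x: Features, addable: Sequence[int]) -> Optional[Features]:
    """Try every non-empty set of allowed additions (assumed feasible by
    construction); return the first variant that evades, else None."""
    for r in range(1, len(addable) + 1):
        for subset in combinations(addable, r):
            variant = tuple(1 if i in subset else xi for i, xi in enumerate(x))
            if not is_detected(model, variant):
                return variant
    return None


def harden(malware: Sequence[Features], benign: Sequence[Features],
           addable: Sequence[int], max_iters: int = 10):
    """Alternate attack and retraining; record the attack success rate (ASR)."""
    data = list(malware) + list(benign)
    labels = [1] * len(malware) + [0] * len(benign)
    model = train_perceptron(data, labels)
    history = []
    for _ in range(max_iters):
        found = [attack(model, x, addable) for x in malware]
        evasions = [v for v in found if v is not None]
        history.append(len(evasions) / len(malware))
        if not evasions:
            break  # no allowed modification evades the current model
        data += evasions                  # adversarial examples keep the malware label
        labels += [1] * len(evasions)
        model = train_perceptron(data, labels)
    return model, history
```

With one malware sample `(1, 0, 0)`, benign samples `(0, 1, 0)`, `(0, 0, 1)`, `(0, 1, 1)`, and `addable=[1, 2]`, the first model learns a negative weight on feature 1 (a spurious correlation) and is evaded, while the retrained model is not. Because the search is exhaustive, an ASR of 0 here holds only for this model, this sample, and this capability set.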

Comments:	20 pages, 4 figures
Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.19027 [cs.CR]
	(or arXiv:2402.19027v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2402.19027
Related DOI:	https://doi.org/10.1145/3678890.3678912

Submission history

From: Ilias Tsingenopoulos
[v1] Thu, 29 Feb 2024 10:38:56 UTC (1,023 KB)
[v2] Thu, 5 Sep 2024 17:07:23 UTC (894 KB)
