Assessing LLMs in malicious code deobfuscation of real-world malware campaigns

Patsakis, C; Casino, F; Lykousas, N

doi:10.1016/j.eswa.2024.124912

Datos identificativos

Identificador: imarina:9379058

Handle: https://hdl.handle.net/20.500.11797/imarina9379058

Autores: Patsakis, C; Casino, F; Lykousas, N

Resumen:
The integration of large language models (LLMs) into various cybersecurity pipelines has become increasingly prevalent, enabling the automation of numerous manual tasks and often surpassing human performance. Recognising this potential, cybersecurity researchers and practitioners are actively investigating the application of LLMs to process vast volumes of heterogeneous data for anomaly detection, potential bypass identification, attack mitigation, and fraud prevention. Moreover, LLMs' advanced capabilities in generating functional code, interpreting code context, and code summarisation present significant opportunities for reverse engineering and malware deobfuscation. In this work, we comprehensively examine the deobfuscation capabilities of state-of-the-art LLMs. Specifically, we conducted a detailed evaluation of four prominent LLMs using real-world malicious scripts from the notorious Emotet malware campaign. Our findings reveal that while current LLMs are not yet perfectly accurate, they demonstrate substantial potential in efficiently deobfuscating payloads. This study highlights the importance of fine-tuning LLMs for specialised tasks, suggesting that such optimisation could pave the way for future AI-powered threat intelligence pipelines to combat obfuscated malware. Our contributions include a thorough analysis of LLM performance in malware deobfuscation, identifying strengths and limitations, and discussing the potential for integrating LLMs into cybersecurity frameworks for enhanced threat detection and mitigation. Our experiments illustrate that LLMs can automatically and accurately extract the necessary indicators of compromise from a real-world campaign with an accuracy of 69.56% and 88.78% for the URLs and the corresponding domains of the droppers, respectively.
Otros:

Enlace a la fuente original: https://www.sciencedirect.com/science/article/pii/S0957417424017792?via%3Dihub
Referencia de l'ítem segons les normes APA: Patsakis, C; Casino, F; Lykousas, N (2024). Assessing LLMs in malicious code deobfuscation of real-world malware campaigns. EXPERT SYSTEMS WITH APPLICATIONS, 256(), 124912-. DOI: 10.1016/j.eswa.2024.124912
Referencia al articulo segun fuente origial: EXPERT SYSTEMS WITH APPLICATIONS. 256 124912-
DOI del artículo: 10.1016/j.eswa.2024.124912
Año de publicación de la revista: 2024-12-05
Entidad: Universitat Rovira i Virgili
Versión del articulo depositado: info:eu-repo/semantics/publishedVersion
Fecha de alta del registro: 2026-05-09
Autor/es de la URV: Casino Cembellín, Francisco José
Departamento: Enginyeria Informàtica i Matemàtiques
URL Documento de licencia: https://repositori.urv.cat/ca/proteccio-de-dades/
Tipo de publicación: Journal Publications
Autor según el artículo: Patsakis, C; Casino, F; Lykousas, N
Acceso a la licencia de uso: https://creativecommons.org/licenses/by/3.0/es/
Áreas temáticas: Operations research & management science, General engineering, Engineering, electrical & electronic, Engineering (miscellaneous), Engineering (all), Computer science, artificial intelligence, Computer science applications, Ciencias sociales, Ciência da computação, Artificial intelligence, Administração, ciências contábeis e turismo, Administração pública e de empresas, ciências contábeis e turismo
Direcció de correo del autor: franciscojose.casino@urv.cat

Palabras clave:

Malware analysis
Large language models
Cybersecurity
Cybersecurit
Code deobfuscation
Artificial Intelligence
Computer Science Applications
Computer Science
Engineering (Miscellaneous)
Engineering
Electrical & Electronic
Operations Research & Management Science
General engineering
Engineering (all)
Ciencias sociales
Ciência da computação
Administração
ciências contábeis e turismo
Administração pública e de empresas
Documentos:

DocumentPrincipal
Cerca a google

Assessing LLMs in malicious code deobfuscation of real-world malware campaigns

Datos identificativos

Otros:

Palabras clave:

Documentos:

Cerca a google