In this work, we propose Malware Collection Booster (McBoost), a fast statistical malware detection tool that is intended to improve the scalability of existing malware collection and analysis approaches. Given a large collection of binaries that may contain both hitherto unknown malware and benign executables, McBoost reduces the overall time of analysis by classifying and filtering out the least suspicious binaries and passing only the most suspicious ones to a detailed binary analysis process for signature extraction. The McBoost framework consists of a classifier specialized in detecting whether an executable is packed or not, a universal unpacker based on dynamic binary analysis, and a classifier specialized in distinguishing between malicious or benign code. We developed a proof-of-concept version of McBoost and evaluated it on 5,586 malware and 2,258 benign programs. McBoost has an accuracy of 87.3%, and an Area Under the ROC curve (AUC) equal to 0.977. Our evaluation also shows that McBoost reduces the overall time of analysis to only a fraction (e.g., 13.4%) of the computation time that would otherwise be required to analyze large sets of mixed malicious and benign executables.

McBoost: Boosting scalability in malware collection and analysis using statistical classification of executables / R. Perdisci, A. Lanzi, W. Lee - In: Computer Security Applications Conference, 2008. ACSAC 2008. Annual[s.l] : IEEE, 2008. - ISBN 9780769534473. - pp. 301-310 (( Intervento presentato al 24. convegno ACSAC tenutosi a Anaheim nel 2008.

McBoost: Boosting scalability in malware collection and analysis using statistical classification of executables

A. Lanzi
Secondo
;
2008

Abstract

In this work, we propose Malware Collection Booster (McBoost), a fast statistical malware detection tool that is intended to improve the scalability of existing malware collection and analysis approaches. Given a large collection of binaries that may contain both hitherto unknown malware and benign executables, McBoost reduces the overall time of analysis by classifying and filtering out the least suspicious binaries and passing only the most suspicious ones to a detailed binary analysis process for signature extraction. The McBoost framework consists of a classifier specialized in detecting whether an executable is packed or not, a universal unpacker based on dynamic binary analysis, and a classifier specialized in distinguishing between malicious or benign code. We developed a proof-of-concept version of McBoost and evaluated it on 5,586 malware and 2,258 benign programs. McBoost has an accuracy of 87.3%, and an Area Under the ROC curve (AUC) equal to 0.977. Our evaluation also shows that McBoost reduces the overall time of analysis to only a fraction (e.g., 13.4%) of the computation time that would otherwise be required to analyze large sets of mixed malicious and benign executables.
Computer Networks and Communications; Software; Safety, Risk, Reliability and Quality
Settore INF/01 - Informatica
2008
Applied Computer Security Associates
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
04721567.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 314.76 kB
Formato Adobe PDF
314.76 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/455527
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 116
  • ???jsp.display-item.citation.isi??? 67
social impact