1. MAGMA: Network Behavior Classifier for Malware Traffic
- Author
-
Enrico Bocchi, Gaspar Modelo-Howard, Luigi Grimaudo, Elena Baralis, Stanislav Miskovic, Sung-Ju Lee, Sabyasachi Saha, Marco Mellia, Politecnico di Torino = Polytechnic of Turin (Polito), Laboratory of Information, Network and Communication Sciences (LINCS), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de Recherche en Informatique et en Automatique (Inria)-Institut Mines-Télécom [Paris] (IMT), Réseaux, Mobilité et Services (RMS), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom [Paris] (IMT)-Télécom Paris-Institut Mines-Télécom [Paris] (IMT)-Télécom Paris, Département Informatique et Réseaux (INFRES), Télécom ParisTech, Symantec Corporation, Department of Computer Science [KAIST] (CS), and Korea Advanced Institute of Science and Technology (KAIST)
- Subjects
Malware characterization ,Computer science ,Computer Networks and Communications ,Network traffic modeling ,Big data ,02 engineering and technology ,computer.software_genre ,Oracle ,[INFO.INFO-NI]Computer Science [cs]/Networking and Internet Architecture [cs.NI] ,Robustness (computer science) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,False positive paradox ,business.industry ,Malicious behaviors detection ,Malware ,020201 artificial intelligence & image processing ,The Internet ,Graph networks ,Automatic classification ,Data mining ,business ,Communications protocol ,computer ,Classifier (UML) - Abstract
International audience; Malware is a major threat to security and privacy of network users. A large variety of malware is typically spread over the Internet, hiding in benign traffic. New types of malware appear every day, challenging both the research community and security companies to improve malware identification techniques. In this paper we present MAGMA, MultilAyer Graphs for MAlware detection, a novel malware behavioral classifier. Our system is based on a Big Data methodology, driven by real-world data obtained from traffic traces collected in an operational network. The methodology we propose automatically extracts patterns related to a specific input event, i.e., a seed, from the enormous amount of events the network carries. By correlating such activities over (i) time, (ii) space, and (iii) network protocols, we build a Network Connectivity Graph that captures the overall “network behavior” of the seed. We next extract features from the Connectivity Graph and design a supervised classifier. We run MAGMA on a large dataset collected from a commercial Internet Provider where 20,000 Internet users generated more than 330 million events. Only 42,000 are flagged as malicious by a commercial IDS, which we consider as an oracle. Using this dataset, we experimentally evaluate MAGMA accuracy and robustness to parameter settings. Results indicate that MAGMA reaches 95% accuracy, with limited false positives. Furthermore, MAGMA proves able to identify suspicious network events that the IDS ignored.
- Published
- 2016
- Full Text
- View/download PDF