This is a non-IMPACT record, meaning that access to the data is not controlled by IMPACT. For access, see the directions below.

This Resource is offered and provided outside of the IMPACT mediation framework. IMPACT and the IMPACT Coordination Council/Blackfire Technology, Inc. expressly disclaim all conditions, representations and warranties including but not limited to Resource availability, quality, accuracy, non-infringement, and non-interference. All Resource information and access is controlled by entities and under terms that are external to the IMPACT legal framework.


Malware Training Sets
External Dataset
External Data Source
56 (lowest rank is 56)

Category & Restrictions

malware, cyber attack


Aim of the project is to provide an useful and classified dataset to researchers who want to investigate deeper in malware analysis by using Machine Learning techniques.

One of the most challenging tasks during Machine Learning processing is to define a great training (and possible dynamic) dataset. Assuming a well known learning algorithm and a periodic learning supervised process what you need is a classified dataset to best train your machine. Thousands of training datasets are available out there, but no great classified datasets for malware analyses exist. This dataset was created to share with the scientific community (and everybody interested on it) in order to give to everyone a base point to start with Machine Learning for Malware Analysis.

The collected dataset is composed by the following samples:

APT1 292 Samples
Crypto 2024 Samples
Locker 434 Samples
Zeus 2014 Samples

Additional Details

malware, training, 1263, malware training sets, sets, corporation, external, inferlink corporation, inferlink, external data source, source, learning, dataset, machine, classified, analysis, aim, investigate, provide, techniques, project, researchers, deeper, samples, datasets, 2014, assuming, supervised, composed, 292, start, define, share, process, base, scientific, periodic, algorithm, tasks, dynamic, 2024, 434, locker, challenging, collected, train, created, other, processing, analyses, zeus, exist, crypto, community, thousands, apt1