SOREL-20M

Malware detector dataset

A large-scale dataset and codebase for training machine learning models to detect malicious software

Sophos-ReversingLabs 20 million sample dataset

GitHub

646 stars
32 watching
133 forks
Language: Python
last commit: over 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
marcoramilli/malwaretrainingsets Provides machine learning datasets for malware analysis 227
13o-bbr-bbq/machine_learning_security An open-source project that explores the intersection of machine learning and security to develop tools for detecting vulnerabilities in web applications. 1,987
telekom-security/malware_analysis An analysis repository providing scripts, signatures, and IOCs for detecting and analyzing malware. 110
ethz-spylab/rlhf_trojan_competition Detecting backdoors in language models to prevent malicious AI usage 109
gosecure/malware-ioc Provides a set of standardized indicators to help detect and assess malware presence 10
sapphirex00/threat-hunting A collection of threat intelligence resources and tools for analyzing APT malware 257
neo23x0/rules A centralized repository of Yara rules for detecting malware and other malicious activities. 10
alexander-h-liu/malconv-pytorch An implementation of MalConv for malware detection using PyTorch 71
rew-sploit/rew-sploit Analyzes and dissects malware and obfuscated code from various attack frameworks like Metasploit and Cobalt Strike 139
withsecurelabs/snake A centralized storage solution for malicious samples to support malware investigation and analysis 217
airbnb/binaryalert Real-time malware detection and alert system for AWS S3 files 1,415
diogo-fernan/malsub A Python framework that provides an API interface to multiple online services for analyzing malware and threat intelligence 368
philipperemy/yolo-9000 Real-time object detection using deep learning and a large dataset of classes 1,184
exeinfoasl/asl An executable file detector software that identifies packers, protectors, compilers, .NET obfuscators, and other types of malware or unwanted code. 772
ayoolaolafenwa/pixellib A deep learning library for image segmentation and object detection using PyTorch. 1,054