warc-safe

Content scanner

A tool for detecting viruses and NSFW material in archived web content

A tool for detecting viruses and NSFW material in WARC files

GitHub

10 stars
4 watching
0 forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list

antivirusnsfw-classifierwarcwarc-safewebarchiving

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
internetarchive/warctools Tools for working with archived web content 152
nccgroup/shocker A tool to identify and exploit vulnerable servers using Python 333
fabasoad/nsfw-detection-action Detects NSFW content in committed files using various providers. 16
nla/httrack2warc Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs 30
n0tan3rd/node-warc A tool for parsing and generating Web Archive files in JavaScript using Node.js 94
hannob/snallygaster A tool that scans HTTP servers for secret files and security vulnerabilities. 2,076
nlnwa/gowarcserver A tool for indexing and serving contents of WARC files. 14
kapejod/rtpnatscan A command line tool to scan RTP proxies for vulnerabilities to NAT stealing attacks 24
florents-tselai/warcdb A library for storing and querying web crawl data in a compact, easily sharable format. 394
webrecorder/warcio A fast streaming library for working with WARC format web archival data 385
belane/linux-soft-exploit-suggester A script to identify vulnerabilities in software packages on Linux systems 222
ikreymer/webarchive-indexing Tools for bulk indexing of WARC/ARC files to create a shared url index 42
nccgroup/argumentinjectionhammer An extension that identifies argument injection vulnerabilities in web applications using payloads and detection techniques 118
chfoo/warcat Tool for handling Web Archive files 150
nccgroup/sobelow A tool for detecting security vulnerabilities in Elixir and Phoenix applications 1,688