warc-safe

Content scanner

A tool for detecting viruses and NSFW material in archived web content

A tool for detecting viruses and NSFW material in WARC files

GitHub

11 stars
4 watching
0 forks
Language: Python
last commit: 5 months ago
Linked from 1 awesome list

antivirusnsfw-classifierwarcwarc-safewebarchiving

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
internetarchive/warctools Tools for working with archived web content 153
nccgroup/shocker A tool to identify and exploit vulnerable servers using Python 333
fabasoad/nsfw-detection-action Detects NSFW content in committed files using various providers. 17
nla/httrack2warc Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs 32
n0tan3rd/node-warc A tool for parsing and generating Web Archive files in JavaScript using Node.js 95
hannob/snallygaster A tool that scans HTTP servers for secret files and security vulnerabilities. 2,077
nlnwa/gowarcserver A tool for indexing and serving contents of WARC files. 15
kapejod/rtpnatscan A command line tool to scan RTP proxies for vulnerabilities to NAT stealing attacks 24
florents-tselai/warcdb A library for storing and querying web crawl data in a compact, easily sharable format. 397
webrecorder/warcio A fast streaming library for working with WARC format web archival data 391
belane/linux-soft-exploit-suggester A script to identify vulnerabilities in software packages on Linux systems 222
ikreymer/webarchive-indexing Tools for bulk indexing of WARC/ARC files to create a shared url index 43
nccgroup/argumentinjectionhammer An extension that identifies argument injection vulnerabilities in web applications using payloads and detection techniques 118
chfoo/warcat Tool for handling Web Archive files 152
nccgroup/sobelow A tool for detecting security vulnerabilities in Elixir and Phoenix applications 1,692