unwarcit

Unzipping tool

A command line tool to unzip WARC and WACZ files

GitHub

10 stars
5 watching
0 forks
Language: Python
last commit: about 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
chfoo/warcat Tool for handling Web Archive files 152
steffenfritz/html2warc Converts offline data into a standard archival format 18
natliblux/warc-safe A tool for detecting viruses and NSFW material in archived web content 11
unt-libraries/py-wasapi-client Downloads WARC files from a WASAPI access point. 15
warhub/wham A CLI tool and library for managing wargame data files, converting formats between different systems. 21
earldouglas/sbt-war An sbt plugin for packaging and running .war files. 382
webrecorder/warcio A fast streaming library for working with WARC format web archival data 391
joepvd/grep2awk Tool to transform grep commands into awk commands 27
internetarchive/warctools Tools for working with archived web content 153
gedex/unzipall A command-line utility that recursively unzips all zip files within a directory and its subdirectories to another specified directory. 2
n0tan3rd/node-warc A tool for parsing and generating Web Archive files in JavaScript using Node.js 95
vivkin/nozip A C library for reading ZIP files by parsing and decoding the ZIP format to provide direct access to the stored data 13
nla/httrack2warc Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs 32
archiveteam/grab-site A web crawler designed to backup websites by recursively crawling and writing WARC files. 1,406
ikreymer/webarchive-indexing Tools for bulk indexing of WARC/ARC files to create a shared url index 43