unwarcit

Unzipping tool

A command-line tool to unzip WARC and WACZ files.
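To make "unzipping a WACZ" concrete, here is a minimal sketch that uses only the Python standard library. It is not unwarcit's implementation or interface. It assumes the usual WACZ layout, a ZIP container whose WARC files sit under an `archive/` directory, and the function name `list_warc_members` and the sample file names are hypothetical, chosen for illustration.

```python
import io
import zipfile


def list_warc_members(wacz_file):
    """Return the names of WARC members stored under archive/ in a WACZ.

    Assumption: the WACZ is a ZIP container that keeps its WARC files
    in an archive/ directory. wacz_file may be a path or a file-like object.
    """
    with zipfile.ZipFile(wacz_file) as zf:
        return [
            name
            for name in zf.namelist()
            if name.startswith("archive/")
            and name.endswith((".warc", ".warc.gz"))
        ]


if __name__ == "__main__":
    # Build a tiny in-memory stand-in for a WACZ (hypothetical contents).
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("datapackage.json", "{}")
        zf.writestr("archive/data.warc.gz", b"")
        zf.writestr("pages/pages.jsonl", "")
    buf.seek(0)
    print(list_warc_members(buf))
```

Listing the members only gets as far as the WARC files themselves. Turning the records inside them into individual files on disk needs a WARC parser, which this sketch leaves out.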

8 stars
5 watching
0 forks
Language: Python
Last commit: almost 3 years ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
chfoo/warcat | Tool for handling Web Archive files | 150
steffenfritz/html2warc | Converts offline data into a standard archival format | 18
natliblux/warc-safe | A tool for detecting viruses and NSFW material in archived web content | 10
unt-libraries/py-wasapi-client | Downloads WARC files from a WASAPI access point | 14
warhub/wham | A CLI tool and library for managing wargame data files and converting formats between different systems | 21
earldouglas/sbt-war | Sbt plugin for packaging and running Java EE web applications in a local container | 382
webrecorder/warcio | A fast streaming library for working with WARC format web archival data | 385
joepvd/grep2awk | A tool to convert grep commands into awk commands with minimal user interaction | 27
internetarchive/warctools | Tools for working with archived web content | 152
gedex/unzipall | A command-line utility that recursively unzips all zip files within a directory and its subdirectories to another specified directory | 2
n0tan3rd/node-warc | A tool for parsing and generating Web Archive files in JavaScript using Node.js | 94
vivkin/nozip | A C library for reading ZIP files by parsing and decoding the ZIP format to provide direct access to the stored data | 13
nla/httrack2warc | Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 30
archiveteam/grab-site | A web crawler designed to back up websites by recursively crawling and writing WARC files | 1,402
ikreymer/webarchive-indexing | Tools for bulk indexing of WARC/ARC files to create a shared URL index | 42