har2warc
Converter
Converts HTTP Archive format to Web Archive format
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
46 stars
7 watching
4 forks
Language: Python
last commit: about 6 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
webrecorder/warcio | A fast streaming library for working with WARC format web archival data | 385 |
steffenfritz/html2warc | Converts offline data into a standard archival format | 18 |
nla/httrack2warc | Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 30 |
internetarchive/warctools | Tools for working with archived web content | 152 |
internetarchive/warcprox | An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections. | 381 |
chfoo/warcat | Tool for handling Web Archive files | 150 |
peterk/warcworker | A web archiving tool that archives websites with high-fidelity preservation capabilities. | 55 |
turicas/crau | A command-line tool for archiving and playing back websites in WARC format | 57 |
webrecorder/archiveweb.page | A high-fidelity web archiving system for storing and replaying interactive web pages in browsers. | 862 |
iipc/warc2html | Converts WARC files to static HTML with relative link rewriting and renaming | 39 |
webrecorder/pywb | A toolkit for archiving and replaying web content accurately and efficiently | 1,407 |
svenskaspel/har2locust | Automatically converts browser recordings (.har files) into locust scripts. | 160 |
richardlehane/webarchive | Provides tools for reading and parsing web archive formats used in digital preservation. | 20 |
helgeho/warcpartitioner | Tool for partitioning and merging Web archive files by MIME type and year | 1 |
ikreymer/webarchive-indexing | Tools for bulk indexing of WARC/ARC files to create a shared url index | 42 |