pywb

Web archiver

A toolkit for archiving and replaying web content accurately and efficiently

Core Python Web Archiving Toolkit for replay and recording of web archives

GitHub

1k stars
61 watching
218 forks
Language: JavaScript
last commit: 10 days ago
Linked from 2 awesome lists

pythonpywbwaybackweb-archivesweb-archiving

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
webrecorder/archiveweb.page A high-fidelity web archiving system for storing and replaying interactive web pages in browsers. 862
peterk/warcworker A web archiving tool that archives websites with high-fidelity preservation capabilities. 55
oduwsdl/ipwb A system for dispersing and replaying archived web content using peer-to-peer technology. 617
wabarc/wayback A tool for capturing and preserving web content and making it accessible in the future. 1,818
machawk1/wail A graphical user interface layer for preserving and replaying web pages using multiple archiving tools. 350
webrecorder/warcio A fast streaming library for working with WARC format web archival data 385
wabarc/cairn A tool for archiving web pages as single HTML files 43
wabarc/rivet A tool for archiving webpages to IPFS 12
oduwsdl/archivenow A tool to automate archiving of web resources into public archives. 410
iipc/openwayback A Java-based tool for recording and replaying web pages from archives. 486
webrecorder/har2warc Converts HTTP Archive format to Web Archive format 46
jarofghosts/memento-client Provides a simple JavaScript interface to access historical web pages via the Wayback Machine 14
mementoweb/py-memento-client Provides access to archived web pages from various TimeGates. 25
wabarc/playback Replays archived webpages from the Wayback Machine 6
bellingcat/auto-archiver Automates archiving of online content from various sources into local storage or cloud services 570