webarchive
Web archive parser
Provides tools for reading and parsing web archive formats used in digital preservation.
golang readers for ARC and WARC webarchive formats
20 stars
7 watching
2 forks
Language: Go
last commit: almost 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| Tools for indexing and discovering archived web content | 117 |
| A high-fidelity web archiving system for storing and replaying interactive web pages in browsers. | 903 |
| A web-based archive service that allows users to store and manage web pages in various formats. | 115 |
| Tool for partitioning and merging Web archive files by MIME type and year | 1 |
| A web archiving tool that archives websites with high-fidelity preservation capabilities. | 57 |
| Archives a web page as a single HTML file with embedded resources. | 267 |
| A command-line tool for archiving and playing back websites in WARC format | 59 |
| A tool for parsing and generating Web Archive files in JavaScript using Node.js | 95 |
| A tool for archiving webpages to IPFS | 12 |
| Tools for working with archived web content | 153 |
| Tools for bulk indexing of WARC/ARC files to create a shared url index | 43 |
| A tool for capturing and preserving web content and making it accessible in the future. | 1,839 |
| A graphical user interface layer for preserving and replaying web pages using multiple archiving tools. | 353 |
| Provides a simple JavaScript interface to access historical web pages via the Wayback Machine | 14 |
| Converts HTTP Archive format to Web Archive format | 48 |