solrwayback
Web archiver
A search interface and archival tool for browsing historical web pages
A search interface and wayback machine for the UKWA Solr based warc-indexer framework.
102 stars
24 watching
21 forks
Language: Java
last commit: 3 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A tool for capturing and preserving web content and making it accessible in the future. | 1,839 |
| Tools for indexing and discovering archived web content | 117 |
| An API interface and command-line tool for interacting with the Wayback Machine's web archiving service | 489 |
| Provides a simple JavaScript interface to access historical web pages via the Wayback Machine | 14 |
| A system for dispersing and replaying archived web content using peer-to-peer technology. | 617 |
| Replays archived webpages from the Wayback Machine | 8 |
| A RocksDB-based server for managing and replicating capture indexes used in web archiving | 33 |
| A Java-based tool for recording and replaying web pages from archives. | 487 |
| A tool for archiving web pages as single HTML files | 45 |
| A web archive exploration UI built on top of the Solr search engine and warc-discovery indexer. | 43 |
| A graphical user interface layer for preserving and replaying web pages using multiple archiving tools. | 353 |
| Generates a sitemap of a website using Wayback Machine | 225 |
| A toolkit for analyzing and extracting data from legacy web archives in a structured format suitable for further analysis or reuse | 3 |
| Tools for bulk indexing of WARC/ARC files to create a shared url index | 43 |
| Provides tools for reading and parsing web archive formats used in digital preservation. | 20 |