outbackcdx

Web archive server

A RocksDB-based server for managing and replicating capture indexes used in web archiving

Web archive index server based on RocksDB

GitHub

33 stars
23 watching
20 forks
Language: Java
last commit: about 2 months ago
Linked from 1 awesome list

waybackweb-archiving

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ukwa/webarchive-discovery Tools for indexing and discovering archived web content 117
netarchivesuite/solrwayback A search interface and archival tool for browsing historical web pages 102
jarofghosts/memento-client Provides a simple JavaScript interface to access historical web pages via the Wayback Machine 14
oduwsdl/ipwb A system for dispersing and replaying archived web content using peer-to-peer technology. 617
internetarchive/arch A distributed compute analysis system for web archive collections 15
iipc/openwayback A Java-based tool for recording and replaying web pages from archives. 487
derfenix/webarchive A web-based archive service that allows users to store and manage web pages in various formats. 115
richardlehane/webarchive Provides tools for reading and parsing web archive formats used in digital preservation. 20
akamhy/waybackpy An API interface and command-line tool for interacting with the Wayback Machine's web archiving service 489
jjjake/internetarchive A command-line and Python interface to access Archive.org's services 1,643
florents-tselai/warcdb A library for storing and querying web crawl data in a compact, easily sharable format. 397
wabarc/cairn A tool for archiving web pages as single HTML files 45
wabarc/wayback A tool for capturing and preserving web content and making it accessible in the future. 1,839
keeweb/kdbxweb A high-performance JavaScript library for reading and writing KeePass v2 databases. 420
webrecorder/pywb A toolkit for archiving and replaying web content accurately and efficiently 1,418