outbackcdx

Web archive server

A RocksDB-based server for managing and replicating capture indexes used in web archiving

Web archive index server based on RocksDB

GitHub

32 stars
23 watching
20 forks
Language: Java
last commit: 14 days ago
Linked from 1 awesome list

waybackweb-archiving

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ukwa/webarchive-discovery Tools for indexing and discovering archived web content 117
netarchivesuite/solrwayback A web-based search interface and Wayback machine for browsing archived web pages using an index of WARC files. 102
jarofghosts/memento-client Provides a simple JavaScript interface to access historical web pages via the Wayback Machine 14
oduwsdl/ipwb A system for dispersing and replaying archived web content using peer-to-peer technology. 616
internetarchive/arch A distributed compute analysis system for web archive collections 15
iipc/openwayback A Java-based tool for recording and replaying web pages from archives. 486
derfenix/webarchive A web-based archive service that allows users to store and manage web pages in various formats. 113
richardlehane/webarchive Provides tools for reading and parsing web archive formats used in digital preservation. 20
akamhy/waybackpy An API interface and command-line tool for interacting with the Wayback Machine's web archiving service 484
jjjake/internetarchive A command-line and Python interface to access Archive.org's services 1,638
florents-tselai/warcdb A library for storing and querying web crawl data in a compact, easily sharable format. 394
wabarc/cairn A tool for archiving web pages as single HTML files 43
wabarc/wayback A tool for capturing and preserving web content and making it accessible in the future. 1,824
keeweb/kdbxweb A high-performance JavaScript library for reading and writing KeePass v2 databases. 416
webrecorder/pywb A toolkit for archiving and replaying web content accurately and efficiently 1,417