internetarchive
Archive portal client
A command-line and Python interface to access Archive.org's services
A Python and Command-Line Interface to Archive.org
2k stars
56 watching
219 forks
Language: Python
last commit: 8 days ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
mementoweb/py-memento-client | Provides access to archived web pages from various TimeGates. | 25 |
akamhy/waybackpy | An API interface and command-line tool for interacting with the Wayback Machine's web archiving service | 484 |
jarofghosts/memento-client | Provides a simple JavaScript interface to access historical web pages via the Wayback Machine | 14 |
richardlehane/webarchive | Provides tools for reading and parsing web archive formats used in digital preservation. | 20 |
internetarchive/arch | A distributed compute analysis system for web archive collections | 15 |
jiiks/asar.net | A .NET implementation of the Atom Asar archive format, allowing extraction and manipulation of archived files. | 35 |
jcgregorio/httplib2 | An HTTP client library for Python | 383 |
internetarchive/warctools | Tools for working with archived web content | 152 |
nla/outbackcdx | A RocksDB-based server for managing and replicating capture indexes used in web archiving | 32 |
netarchivesuite/jwat-tools | An extension of utility libraries with command-line tools for archiving and compression tasks. | 5 |
archivesspace/archivesspace | An archives management tool with features for managing and providing web access to archival collections, including metadata management and digital object storage. | 354 |
internetarchive/bookreader | A JavaScript-based web application for displaying and reading digital books from the Internet Archive. | 994 |
chatnoir-eu/chatnoir-resiliparse | A toolkit for processing and analyzing web archive data | 84 |
netarchivesuite/jwat | A toolkit for analyzing and extracting data from legacy web archives in a structured format suitable for further analysis or reuse | 3 |
internetarchive/warcprox | An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections. | 381 |