internetarchive
Archive portal client
A command-line and Python interface to access Archive.org's services
A Python and Command-Line Interface to Archive.org
2k stars
56 watching
220 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
mementoweb/py-memento-client | Provides access to archived web pages from various TimeGates. | 25 |
akamhy/waybackpy | An API interface and command-line tool for interacting with the Wayback Machine's web archiving service | 489 |
jarofghosts/memento-client | Provides a simple JavaScript interface to access historical web pages via the Wayback Machine | 14 |
richardlehane/webarchive | Provides tools for reading and parsing web archive formats used in digital preservation. | 20 |
internetarchive/arch | A distributed compute analysis system for web archive collections | 15 |
jiiks/asar.net | A .NET implementation of the Atom Asar archive format, allowing extraction and manipulation of archived files. | 36 |
jcgregorio/httplib2 | An HTTP client library for Python | 383 |
internetarchive/warctools | Tools for working with archived web content | 153 |
nla/outbackcdx | A RocksDB-based server for managing and replicating capture indexes used in web archiving | 33 |
netarchivesuite/jwat-tools | An extension of utility libraries with command-line tools for archiving and compression tasks. | 5 |
archivesspace/archivesspace | A web-based application for managing and providing access to archives and cultural heritage collections | 355 |
internetarchive/bookreader | A JavaScript-based web application for displaying and reading digital books from the Internet Archive. | 1,003 |
chatnoir-eu/chatnoir-resiliparse | A toolkit for processing and analyzing web archive data | 89 |
netarchivesuite/jwat | A toolkit for analyzing and extracting data from legacy web archives in a structured format suitable for further analysis or reuse | 3 |
internetarchive/warcprox | An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections. | 389 |