internetarchive

Archive portal client

A command-line and Python interface to access Archive.org's services

A Python and Command-Line Interface to Archive.org

GitHub

2k stars
56 watching
219 forks
Language: Python
last commit: 8 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mementoweb/py-memento-client Provides access to archived web pages from various TimeGates. 25
akamhy/waybackpy An API interface and command-line tool for interacting with the Wayback Machine's web archiving service 484
jarofghosts/memento-client Provides a simple JavaScript interface to access historical web pages via the Wayback Machine 14
richardlehane/webarchive Provides tools for reading and parsing web archive formats used in digital preservation. 20
internetarchive/arch A distributed compute analysis system for web archive collections 15
jiiks/asar.net A .NET implementation of the Atom Asar archive format, allowing extraction and manipulation of archived files. 35
jcgregorio/httplib2 An HTTP client library for Python 383
internetarchive/warctools Tools for working with archived web content 152
nla/outbackcdx A RocksDB-based server for managing and replicating capture indexes used in web archiving 32
netarchivesuite/jwat-tools An extension of utility libraries with command-line tools for archiving and compression tasks. 5
archivesspace/archivesspace An archives management tool with features for managing and providing web access to archival collections, including metadata management and digital object storage. 354
internetarchive/bookreader A JavaScript-based web application for displaying and reading digital books from the Internet Archive. 994
chatnoir-eu/chatnoir-resiliparse A toolkit for processing and analyzing web archive data 84
netarchivesuite/jwat A toolkit for analyzing and extracting data from legacy web archives in a structured format suitable for further analysis or reuse 3
internetarchive/warcprox An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections. 381