py-wasapi-client

WARC client

Downloads WARC files from a WASAPI access point.

A client for the Archive-It And Webrecorder WASAPI Data Transfer API

GitHub

14 stars
5 watching
5 forks
Language: Python
last commit: about 5 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
webrecorder/warcio A fast streaming library for working with WARC format web archival data 385
sul-dlss/wasapi-downloader An application to download archives of web archiving projects 6
internetarchive/warctools Tools for working with archived web content 152
nla/httrack2warc Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs 30
chfoo/warcat Tool for handling Web Archive files 150
webrecorder/har2warc Converts HTTP Archive format to Web Archive format 46
internetarchive/warcprox An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections. 381
ambianic/peerjs-python Enables peer-to-peer communication between web applications and edge devices using WebRTC protocol. 89
iipc/jwarc A Java library for reading and writing WARC files with a typed API 47
mementoweb/py-memento-client Provides access to archived web pages from various TimeGates. 25
ripe-ncc/ripe-atlas-cousteau A Python library that provides access to the RIPE ATLAS API. 65
turicas/crau A command-line tool for archiving and playing back websites in WARC format 57
ikreymer/webarchive-indexing Tools for bulk indexing of WARC/ARC files to create a shared url index 42
woocommerce/wc-api-python A Python wrapper for interacting with WooCommerce's REST API. 213
commoncrawl/whirlwind-python Tours using Common Crawl's WARC format data to demonstrate its structure and contents 12