py-wasapi-client

WARC client

Downloads WARC files from a WASAPI access point.

A client for the Archive-It and Webrecorder WASAPI Data Transfer API.
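To illustrate what downloading from a WASAPI access point involves, here is a minimal sketch of walking a paginated file listing and collecting one download URL per file. It is not the code of py-wasapi-client. The response field names (`files`, `next`, `filename`, `locations`, `checksums`) are assumed from the WASAPI data transfer specification as commonly described, and the `example.org` URLs, `FAKE_PAGES`, and both function names are hypothetical. Network access is replaced by an in-memory stand-in so the sketch runs offline.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Optional


def iter_file_entries(fetch_page: Callable[[str], dict], start_url: str) -> Iterator[dict]:
    """Yield every file entry, following the assumed `next` link between pages."""
    url: Optional[str] = start_url
    while url:
        page = fetch_page(url)
        yield from page.get("files", [])
        url = page.get("next")  # assumed to be None on the last page


def build_download_plan(entries: Iterable[dict]) -> List[Dict[str, object]]:
    """Keep one download URL and the checksums for each file that has a location."""
    plan = []
    for entry in entries:
        locations = entry.get("locations") or []
        if not locations:
            continue  # nothing to download for this entry
        plan.append({
            "filename": entry["filename"],
            "url": locations[0],
            "checksums": entry.get("checksums", {}),
        })
    return plan


# Hypothetical two-page listing standing in for real HTTP responses.
FAKE_PAGES = {
    "https://example.org/wasapi/v1/webdata": {
        "next": "https://example.org/wasapi/v1/webdata?page=2",
        "files": [
            {"filename": "a.warc.gz",
             "locations": ["https://example.org/files/a.warc.gz"],
             "checksums": {"sha1": "aaa"}},
            {"filename": "b.warc.gz", "locations": [], "checksums": {}},
        ],
    },
    "https://example.org/wasapi/v1/webdata?page=2": {
        "next": None,
        "files": [
            {"filename": "c.warc.gz",
             "locations": ["https://example.org/files/c.warc.gz"],
             "checksums": {"md5": "ccc"}},
        ],
    },
}

plan = build_download_plan(
    iter_file_entries(FAKE_PAGES.__getitem__, "https://example.org/wasapi/v1/webdata")
)
for item in plan:
    print(item["filename"], item["url"])
```

In real use, `fetch_page` would be a function that performs an authenticated HTTP GET and returns the parsed JSON, and each downloaded file would be verified against its listed checksum.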

GitHub

15 stars
5 watching
5 forks
Language: Python
Last commit: about 5 years ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
webrecorder/warcio | A fast streaming library for working with WARC format web archival data | 391
sul-dlss/wasapi-downloader | An application to download archives of web archiving projects | 6
internetarchive/warctools | Tools for working with archived web content | 153
nla/httrack2warc | Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 32
chfoo/warcat | Tool for handling Web Archive files | 152
webrecorder/har2warc | Converts HTTP Archive format to Web Archive format | 48
internetarchive/warcprox | An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections | 389
ambianic/peerjs-python | Enables peer-to-peer communication between web applications and edge devices using the WebRTC protocol | 90
iipc/jwarc | A Java library for reading and writing WARC files with a typed API | 48
mementoweb/py-memento-client | Provides access to archived web pages from various TimeGates | 25
ripe-ncc/ripe-atlas-cousteau | A Python library that provides access to the RIPE ATLAS API | 65
turicas/crau | A command-line tool for archiving and playing back websites in WARC format | 59
ikreymer/webarchive-indexing | Tools for bulk indexing of WARC/ARC files to create a shared URL index | 43
woocommerce/wc-api-python | A Python wrapper for interacting with WooCommerce's REST API | 216
commoncrawl/whirlwind-python | A tour of Common Crawl's WARC format data that demonstrates its structure and contents | 14