py-wasapi-client
WARC client
Downloads WARC files from a WASAPI access point.
A client for the Archive-It and Webrecorder WASAPI Data Transfer API
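As a rough illustration of the kind of data a WASAPI client works with, the sketch below pulls download locations out of one page of query results. It is not py-wasapi-client's own code. The field names (`count`, `next`, `files`, `filename`, `locations`, `checksums`) are assumed from the general WASAPI data transfer specification, and the helper `iter_file_locations`, the sample page, and the example URL are hypothetical.

```python
from typing import Any, Dict, Iterator, List


def iter_file_locations(page: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield one record per downloadable file in a WASAPI-style result page."""
    for entry in page.get("files", []):
        locations: List[str] = entry.get("locations", [])
        if not locations:
            continue  # skip entries that list no download location
        yield {
            "filename": entry.get("filename"),
            "url": locations[0],  # use the first listed location
            "checksums": entry.get("checksums", {}),
        }


# Hypothetical sample page; the URL and values are made up for illustration.
sample_page = {
    "count": 2,
    "next": None,
    "files": [
        {
            "filename": "example-00000.warc.gz",
            "locations": ["https://example.org/webdatafile/example-00000.warc.gz"],
            "checksums": {"sha1": "0" * 40},
        },
        {"filename": "no-location.warc.gz", "locations": []},
    ],
}

downloads = list(iter_file_locations(sample_page))
for item in downloads:
    print(item["filename"], item["url"])
```

A real client would also follow the `next` link until it is empty and verify each downloaded file against its listed checksum.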
14 stars
5 watching
5 forks
Language: Python
Last commit: about 5 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
webrecorder/warcio | A fast streaming library for working with WARC format web archival data | 385 |
sul-dlss/wasapi-downloader | An application to download archives of web archiving projects | 6 |
internetarchive/warctools | Tools for working with archived web content | 152 |
nla/httrack2warc | Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 30 |
chfoo/warcat | Tool for handling Web Archive files | 150 |
webrecorder/har2warc | Converts HTTP Archive format to Web Archive format | 46 |
internetarchive/warcprox | An HTTP proxy designed to capture and archive web traffic, including encrypted HTTPS connections. | 381 |
ambianic/peerjs-python | Enables peer-to-peer communication between web applications and edge devices using WebRTC protocol. | 89 |
iipc/jwarc | A Java library for reading and writing WARC files with a typed API | 47 |
mementoweb/py-memento-client | Provides access to archived web pages from various TimeGates. | 25 |
ripe-ncc/ripe-atlas-cousteau | A Python library that provides access to the RIPE ATLAS API. | 65 |
turicas/crau | A command-line tool for archiving and playing back websites in WARC format | 57 |
ikreymer/webarchive-indexing | Tools for bulk indexing of WARC/ARC files to create a shared url index | 42 |
woocommerce/wc-api-python | A Python wrapper for interacting with WooCommerce's REST API. | 213 |
commoncrawl/whirlwind-python | A tour of Common Crawl's WARC format data that demonstrates its structure and contents | 12 |