webarchive-indexing

Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.

GitHub

41 stars
9 watching
10 forks
Language: Python
last commit: almost 7 years ago
Linked from 1 awesome list


Backlinks from these awesome lists: