webarchive-indexing
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR, or a local file system.
41 stars
9 watching
10 forks
Language: Python
Last commit: almost 7 years ago
Linked from 1 awesome list