warc2html
WARC file converter
Converts WARC files to static HTML with relative link rewriting and renaming
Converts WARC files to static HTML
41 stars
10 watching
4 forks
Language: Java
last commit: 9 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A Java library for reading and writing WARC files with a typed API | 48 |
| Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 32 |
| Converts offline data into a standard archival format | 18 |
| Converts HTTP Archive format to Web Archive format | 48 |
| Tools for working with archived web content | 153 |
| Converts PDF documents to HTML files without losing text or format. | 89 |
| Converts CSV files to HTML tables with customizable formatting options. | 74 |
| A tool for parsing and generating Web Archive files in JavaScript using Node.js | 95 |
| Tool for partitioning and merging Web archive files by MIME type and year | 1 |
| Tools to identify and convert duplicate records in archived web content | 6 |
| Generates HTML documentation from RAML files using JavaScript and Nunjucks templates. | 1,135 |
| Converts HTML to plain text that can be easily read and formatted as Markdown. | 1,862 |
| A fast streaming library for working with WARC format web archival data | 391 |
| Converts HTML to different formats | 4 |
| Converts images to the WebP format using various quality and compression settings. | 118 |