warc2html
WARC file converter
Converts WARC files to static HTML with relative link rewriting and renaming
Converts WARC files to static HTML
39 stars
10 watching
3 forks
Language: Java
last commit: 5 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
iipc/jwarc | A Java library for reading and writing WARC files with a typed API | 47 |
nla/httrack2warc | Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 30 |
steffenfritz/html2warc | Converts offline data into a standard archival format | 18 |
webrecorder/har2warc | Converts HTTP Archive format to Web Archive format | 46 |
internetarchive/warctools | Tools for working with archived web content | 152 |
ricn/pdf2htmlex | Converts PDF documents to HTML files without losing text or format. | 88 |
dbohdan/csv2html | Converts CSV files to HTML tables with customizable formatting options. | 74 |
n0tan3rd/node-warc | A tool for parsing and generating Web Archive files in JavaScript using Node.js | 94 |
helgeho/warcpartitioner | Tool for partitioning and merging Web archive files by MIME type and year | 1 |
arcalex/warcrefs | Tools to identify and convert duplicate records in archived web content | 6 |
raml2html/raml2html | Generates HTML documentation from RAML files using JavaScript and Nunjucks templates. | 1,134 |
alir3z4/html2text | Converts HTML to plain text that can be easily read and formatted as Markdown. | 1,845 |
webrecorder/warcio | A fast streaming library for working with WARC format web archival data | 385 |
deedy5/html2text_rs | Converts HTML to different formats | 2 |
somerandomdude/grunt-webp | Converts images to the WebP format using various quality and compression settings. | 118 |