warc2html

WARC file converter

Converts WARC files to static HTML with relative link rewriting and renaming

Converts WARC files to static HTML

GitHub

39 stars
10 watching
3 forks
Language: Java
last commit: 5 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
iipc/jwarc A Java library for reading and writing WARC files with a typed API 47
nla/httrack2warc Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs 30
steffenfritz/html2warc Converts offline data into a standard archival format 18
webrecorder/har2warc Converts HTTP Archive format to Web Archive format 46
internetarchive/warctools Tools for working with archived web content 152
ricn/pdf2htmlex Converts PDF documents to HTML files without losing text or format. 88
dbohdan/csv2html Converts CSV files to HTML tables with customizable formatting options. 74
n0tan3rd/node-warc A tool for parsing and generating Web Archive files in JavaScript using Node.js 94
helgeho/warcpartitioner Tool for partitioning and merging Web archive files by MIME type and year 1
arcalex/warcrefs Tools to identify and convert duplicate records in archived web content 6
raml2html/raml2html Generates HTML documentation from RAML files using JavaScript and Nunjucks templates. 1,134
alir3z4/html2text Converts HTML to plain text that can be easily read and formatted as Markdown. 1,845
webrecorder/warcio A fast streaming library for working with WARC format web archival data 385
deedy5/html2text_rs Converts HTML to different formats 2
somerandomdude/grunt-webp Converts images to the WebP format using various quality and compression settings. 118