warc2html

WARC file converter

Converts WARC files to static HTML with relative link rewriting and renaming

GitHub

41 stars
10 watching
4 forks
Language: Java
Last commit: 7 months ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
|---|---|---|
| iipc/jwarc | A Java library for reading and writing WARC files with a typed API | 48 |
| nla/httrack2warc | Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 32 |
| steffenfritz/html2warc | Converts offline data into a standard archival format | 18 |
| webrecorder/har2warc | Converts HTTP Archive format to Web Archive format | 48 |
| internetarchive/warctools | Tools for working with archived web content | 153 |
| ricn/pdf2htmlex | Converts PDF documents to HTML files without losing text or format | 89 |
| dbohdan/csv2html | Converts CSV files to HTML tables with customizable formatting options | 74 |
| n0tan3rd/node-warc | A tool for parsing and generating Web Archive files in JavaScript using Node.js | 95 |
| helgeho/warcpartitioner | Tool for partitioning and merging Web archive files by MIME type and year | 1 |
| arcalex/warcrefs | Tools to identify and convert duplicate records in archived web content | 6 |
| raml2html/raml2html | Generates HTML documentation from RAML files using JavaScript and Nunjucks templates | 1,135 |
| alir3z4/html2text | Converts HTML to plain text that can be easily read and formatted as Markdown | 1,862 |
| webrecorder/warcio | A fast streaming library for working with WARC format web archival data | 391 |
| deedy5/html2text_rs | Converts HTML to different formats | 4 |
| somerandomdude/grunt-webp | Converts images to the WebP format using various quality and compression settings | 118 |