 ArchiveBox
 ArchiveBox 
 Preservation tool
 Automated preservation of internet content in durable formats
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
23k stars
 174 watching
 1k forks
 
Language: Python 
last commit: 11 months ago 
Linked from   1 awesome list  
  archiveboxbackupsbookmark-archiverbrowser-bookmarkschromiumdigipresfirefoxheadless-browserinternet-archivingpinboardpocketpythonrssself-hostedsinglefilewarcwayback-machineweb-archivingwgetyoutube-dl 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A browser that runs on a remote server and provides isolated access to web content for security, compliance, and other purposes. | 3,486 | 
|  | Automates archiving of online content from various sources into local storage or cloud services | 585 | 
|  | A tool to automate archiving of web resources into public archives. | 409 | 
|  | Archives a web page as a single HTML file with embedded resources. | 267 | 
|  | A simple Rails application for archiving websites | 27 | 
|  | A graphical user interface layer for preserving and replaying web pages using multiple archiving tools. | 353 | 
|  | A tool for capturing and preserving web content and making it accessible in the future. | 1,839 | 
|  | A tool to organize and search archived YouTube videos | 5,381 | 
|  | A suite of tools and strategies for efficiently caching and serving web assets | 12,416 | 
|  | A web archiving tool that archives websites with high-fidelity preservation capabilities. | 57 | 
|  | A command-line and Python interface to access Archive.org's services | 1,643 | 
|  | A multi-format archive utility and Go library that provides a generic replacement for platform-specific or format-specific archive utilities. | 4,442 | 
|  | A tool to collect and manage links to websites and other online resources for long-term archiving. | 2,684 | 
|  | A web-based archive service that allows users to store and manage web pages in various formats. | 115 | 
|  | A Java library for working with PDF documents. | 2,700 |