wikiteam

Wiki archiver

A set of tools for archiving and preserving wikis from various sources.

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to the tiniest wikis. As of 2024, WikiTeam has preserved more than 600,000 wikis.

GitHub

729 stars
40 watching
149 forks
Language: Python
Last commit: 3 months ago
Linked from 1 awesome list

Topics: archive-wikis, backup, digital-preservation, dump, export, mediawiki, python, wiki, wikipedia, wikiteam, xml

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| wikidata/wikidata-toolkit | A Java library providing access to Wikibase data and tools for data extraction and analysis | 375 |
| archiveteam/wpull | Downloads and crawls web pages, allowing for the archiving of websites | 556 |
| zaataylor/wikiref | An extension that extracts and edits Wikipedia references with ease | 2 |
| wabarc/rivet | A tool for archiving webpages to IPFS | 12 |
| turicas/crau | A command-line tool for archiving and playing back websites in WARC format | 57 |
| peterk/warcworker | A web archiving tool that archives websites with high-fidelity preservation capabilities | 55 |
| wikipendium/wikipendium.no | A wiki-based project for collecting and organizing technical knowledge in various programming languages | 36 |
| archivesunleashed/notebooks | Provides tools and examples for working with web archives using the Archives Unleashed Toolkit | 22 |
| webrecorder/pywb | A toolkit for archiving and replaying web content accurately and efficiently | 1,407 |
| wackowiki/wackowiki | A multilingual wiki engine with various features and compatibility options | 42 |
| ukwa/webarchive-discovery | Tools for indexing and discovering archived web content | 116 |
| archiveteam/grab-site | A web crawler designed to back up websites by recursively crawling and writing WARC files | 1,402 |
| bellingcat/auto-archiver | Automates archiving of online content from various sources into local storage or cloud services | 570 |
| machawk1/wail | A graphical user interface layer for preserving and replaying web pages using multiple archiving tools | 350 |
| jjjake/internetarchive | A command-line and Python interface to access Archive.org's services | 1,625 |