wikiteam
Wiki archiver
A set of tools for archiving and preserving wikis from various sources.
Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to the tiniest wikis. As of 2024, WikiTeam has preserved more than 600,000 wikis.
729 stars
40 watching
149 forks
Language: Python
Last commit: 3 months ago
Linked from 1 awesome list
Topics: archive-wikis, backup, digital-preservation, dump, export, mediawiki, python, wiki, wikipedia, wikiteam, xml
Related projects:
Repository | Description | Stars |
---|---|---|
wikidata/wikidata-toolkit | A Java library providing access to Wikibase data and tools for data extraction and analysis. | 375 |
archiveteam/wpull | Downloads and crawls web pages, allowing for the archiving of websites. | 556 |
zaataylor/wikiref | An extension that extracts and edits Wikipedia references with ease | 2 |
wabarc/rivet | A tool for archiving webpages to IPFS | 12 |
turicas/crau | A command-line tool for archiving and playing back websites in WARC format | 57 |
peterk/warcworker | A web archiving tool that archives websites with high-fidelity preservation capabilities. | 55 |
wikipendium/wikipendium.no | A wiki-based project for collecting and organizing technical knowledge in various programming languages. | 36 |
archivesunleashed/notebooks | Provides tools and examples for working with web archives using the Archives Unleashed Toolkit | 22 |
webrecorder/pywb | A toolkit for archiving and replaying web content accurately and efficiently | 1,407 |
wackowiki/wackowiki | A multilingual Wiki-engine with various features and compatibility options. | 42 |
ukwa/webarchive-discovery | Tools for indexing and discovering archived web content | 116 |
archiveteam/grab-site | A web crawler designed to backup websites by recursively crawling and writing WARC files. | 1,402 |
bellingcat/auto-archiver | Automates archiving of online content from various sources into local storage or cloud services | 570 |
machawk1/wail | A graphical user interface layer for preserving and replaying web pages using multiple archiving tools. | 350 |
jjjake/internetarchive | A command-line and Python interface to access Archive.org's services | 1,625 |