twarc
Data archiver
A command-line tool and Python library for archiving Twitter JSON data via the Twitter API
1k stars
35 watching
255 forks
Language: Python
Last commit: over 1 year ago
Linked from 2 awesome lists
Related projects:
| Description | Stars |
|---|---|
| Analyzes line-oriented JSON data from Twitter APIs using Apache Spark | 9 |
| A tool to back up a Twitter user's tweets to a JSON file | 4 |
| Collects and processes tweets from the Twitter API using Academic access | 20 |
| A Python library that provides a simple interface to stream information from Twitter's Full-Archive Search Endpoint. | 12 |
| A tool for collecting tweets from Twitter's search API and storing them in a MongoDB database | 80 |
| A collection of input formats and utilities for working with compressed data files in various formats. | 1,137 |
| Converts HTTrack crawls to WARC files by reconstructing requests and responses from logs | 32 |
| A web archiving tool that archives websites with high-fidelity preservation capabilities. | 57 |
| A tool to retrieve and display Twitter account statistics. | 4 |
| A Node.js application allowing users to collect and aggregate Twitter data through its v2 API | 7 |
| An archival crawler built on top of Chrome or Chromium to preserve the web in a high-fidelity, user-scriptable manner | 170 |
| Provides access to Twitter data and functionality via a Python interface | 1,854 |
| A web crawler designed to back up websites by recursively crawling and writing WARC files. | 1,406 |
| Converts HTTP Archive format to Web Archive format | 48 |
| Tool for handling Web Archive files | 152 |