grimoirelab-perceval
Repository scraper
A tool to gather data from various software repositories
Send Sir Perceval on a quest to retrieve and gather data from software repositories.
293 stars
28 watching
177 forks
Language: Python
Last commit: about 1 month ago
Linked from 1 awesome list
data-fetching, data-mining, data-sources, grimoirelab, hacktoberfest, perceval, python, software-analytics
Related projects:
Repository | Description | Stars |
---|---|---|
needmorecowbell/giggity | A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. | 127 |
nerolation/ethereum-datafarm | A tool to harvest event data from Ethereum contracts without requiring an archive or node. | 64 |
malfrats/xeuledoc | A tool to fetch information about public Google documents from various services | 856 |
harisekhon/lib | A comprehensive utility library in Perl | 18 |
emersonelectricco/boomerang | A tool designed to safely capture off-network web resources for network defense and security analysis | 38 |
slotix/dataflowkit | A framework for extracting structured data from web pages using CSS selectors. | 667 |
fimad/scalpel | A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages | 325 |
muchirijane/learning-code-through-github-repos | A collection of GitHub repositories providing resources and projects to learn web development skills | 130 |
projectdiscovery/chaos-client | An open-source tool for querying the Chaos DB API to enumerate subdomains of given domains | 657 |
meilisearch/docs-scraper | Automates scraping and indexing of documentation content into a search engine | 297 |
gushonorato/mechanize | A web scraping and automation tool for Elixir. | 30 |
glebarez/cero | A tool that extracts domain names from SSL certificates of arbitrary hosts during TLS handshakes | 623 |
holgerd77/django-dynamic-scraper | An app that allows you to manage Scrapy spiders through a Django admin interface. | 1,155 |
recrm/archivetools | A collection of tools for extracting and analyzing data from web archives | 71 |
gregorut/vgchartzscrape | A Python script that captures data from vgchartz.com and saves it to a CSV file | 80 |