grimoirelab-perceval

Repository scraper

A tool to gather data from various software repositories

Send Sir Perceval on a quest to retrieve and gather data from software repositories.

GitHub

290 stars
28 watching
177 forks
Language: Python
Last commit: 11 days ago
Linked from 1 awesome list

Topics: data-fetching, data-mining, data-sources, grimoirelab, hacktoberfest, perceval, python, software-analytics

Related projects:

Repository | Description | Stars
needmorecowbell/giggity | A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. | 126
nerolation/ethereum-datafarm | A tool to harvest event data from Ethereum contracts without requiring an archive or node. | 63
malfrats/xeuledoc | A tool to fetch information about public Google documents from various services | 846
harisekhon/lib | A comprehensive utility library in Perl | 18
emersonelectricco/boomerang | A tool designed to safely capture off-network web resources for network defense and security analysis | 37
slotix/dataflowkit | A framework for extracting structured data from web pages using CSS selectors. | 662
fimad/scalpel | A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages | 323
muchirijane/learning-code-through-github-repos | A collection of GitHub repositories providing resources and projects to learn web development skills | 129
projectdiscovery/chaos-client | An open-source tool for querying the Chaos DB API to enumerate subdomains of given domains | 641
meilisearch/docs-scraper | Automates scraping and indexing of documentation content into a search engine | 290
gushonorato/mechanize | A web scraping and automation tool for Elixir. | 30
glebarez/cero | A tool that extracts domain names from SSL certificates of arbitrary hosts during TLS handshakes | 620
holgerd77/django-dynamic-scraper | An app that allows you to manage Scrapy spiders through a Django admin interface. | 1,153
recrm/archivetools | A collection of tools for extracting and analyzing data from web archives | 69
gregorut/vgchartzscrape | A Python script that captures data from vgchartz.com and saves it to a CSV file | 79