GitMiner

Code scraper

Automated tool for gathering code information from Github repositories

Tool for advanced mining for content on Github

GitHub

2k stars
108 watching
426 forks
Language: Python
last commit: about 4 years ago
Linked from 2 awesome lists

git-mining-toolgitminerinformation-gathering-tool

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
needmorecowbell/giggity A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. 126
hisxo/gitgraber Automated tool to monitor GitHub repositories for sensitive data in real-time 2,034
jetbrains-research/astminer A tool for mining path-based representations of code and data from various programming languages 282
medialab/minet A command line tool and Python library for extracting data from various web sources. 286
laramies/metagoofil Extracts metadata from public documents available on websites 1,028
kwpolska/pkgbuilder A command-line application for building and managing Arch Linux packages from the AUR. 71
murmele/gittyup A graphical Git client designed to help users understand and manage their source code history 1,548
jetbrains-research/psiminer A tool that processes code syntax trees to create datasets for machine learning pipelines 58
digininja/githunter A tool for searching a Git repository for interesting content 95
eyurtsev/kor Extracts structured data from unstructured text using large language models 1,629
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
jdalrymple/gitbeaker A comprehensive GitLab SDK for various environments and languages. 1,567
chenjiandongx/github-spider A Python-based web crawler for scraping Github user and repository data. 264
ishepard/pydriller Analyzes Git repositories to extract information about commits, developers, and modified files 840
hackerlist/glassdoor An API to extract data from Glassdoor's website using Python. 81