GitMiner
Code scraper
Automated tool for gathering code information from Github repositories
Tool for advanced mining for content on Github
2k stars
108 watching
426 forks
Language: Python
last commit: about 4 years ago
Linked from 2 awesome lists
git-mining-toolgitminerinformation-gathering-tool
Related projects:
Repository | Description | Stars |
---|---|---|
needmorecowbell/giggity | A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. | 126 |
hisxo/gitgraber | Automated tool to monitor GitHub repositories for sensitive data in real-time | 2,034 |
jetbrains-research/astminer | A tool for mining path-based representations of code and data from various programming languages | 282 |
medialab/minet | A command line tool and Python library for extracting data from various web sources. | 286 |
laramies/metagoofil | Extracts metadata from public documents available on websites | 1,028 |
kwpolska/pkgbuilder | A command-line application for building and managing Arch Linux packages from the AUR. | 71 |
murmele/gittyup | A graphical Git client designed to help users understand and manage their source code history | 1,548 |
jetbrains-research/psiminer | A tool that processes code syntax trees to create datasets for machine learning pipelines | 58 |
digininja/githunter | A tool for searching a Git repository for interesting content | 95 |
eyurtsev/kor | Extracts structured data from unstructured text using large language models | 1,629 |
felipecsl/wombat | A Ruby-based web crawler and data extraction tool with an elegant DSL. | 1,315 |
jdalrymple/gitbeaker | A comprehensive GitLab SDK for various environments and languages. | 1,567 |
chenjiandongx/github-spider | A Python-based web crawler for scraping Github user and repository data. | 264 |
ishepard/pydriller | Analyzes Git repositories to extract information about commits, developers, and modified files | 840 |
hackerlist/glassdoor | An API to extract data from Glassdoor's website using Python. | 81 |