GitMiner

Code scraper

Automated tool for gathering code information from Github repositories

Tool for advanced mining for content on Github

GitHub

2k stars
107 watching
426 forks
Language: Python
last commit: over 4 years ago
Linked from 2 awesome lists

git-mining-toolgitminerinformation-gathering-tool

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
needmorecowbell/giggity A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. 127
hisxo/gitgraber Automated tool to monitor GitHub repositories for sensitive data in real-time 2,044
jetbrains-research/astminer A tool for mining path-based representations of code and data from various programming languages 284
medialab/minet A command line tool and Python library for extracting data from various web sources. 293
laramies/metagoofil Extracts metadata from public documents found on websites, useful for brute-force attacks. 1,050
kwpolska/pkgbuilder An AUR helper and library that automates the process of building and installing Arch Linux packages from source. 71
murmele/gittyup A graphical Git client designed to help users understand and manage their source code history 1,574
jetbrains-research/psiminer A tool that processes code syntax trees to create datasets for machine learning pipelines 58
digininja/githunter A tool for searching a Git repository for interesting content 97
eyurtsev/kor An open-source wrapper around LLMs to extract structured data from text 1,638
felipecsl/wombat A Ruby-based web crawler and data extraction tool with an elegant DSL. 1,315
jdalrymple/gitbeaker A comprehensive GitLab SDK for various environments and languages. 1,575
chenjiandongx/github-spider A Python-based web crawler for scraping Github user and repository data. 264
ishepard/pydriller Analyzes Git repositories to extract information about commits, developers, and modified files 847
hackerlist/glassdoor An API to extract data from Glassdoor's website using Python. 81