metagoofil
Document Extractor
Extracts metadata from public documents found on websites; recovered details such as author usernames can be useful for brute-force attacks.
Metadata harvester
1k stars
58 watching
209 forks
Language: Python
last commit: 10 months ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
gomoob/php-metadata-extractor | A PHP wrapper to call the Java metadata-extractor library. | 9 |
meilisearch/docs-scraper | Automates scraping and indexing of documentation content into a search engine | 297 |
jaimeiniesta/metainspector | A Ruby gem for web scraping and extracting metadata from web pages. | 1,038 |
needmorecowbell/giggity | A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. | 127 |
erikriver/opengraph | A Python module to extract and parse metadata from web pages using the Open Graph Protocol. | 230 |
unkl4b/gitminer | Automated tool for gathering code information from GitHub repositories | 2,093 |
neon-jungle/wagtail-metadata | A tool to help with metadata for search engines and social media platforms. | 117 |
barasher/go-exiftool | A Go wrapper around ExifTool to extract metadata from various file types. | 255 |
davemolk/gogetjs | Tools for extracting and analyzing JavaScript files from web pages | 41 |
pachterlab/ffq | A tool to fetch and display metadata from various public databases | 556 |
jkongie/mobi | A Ruby gem to extract metadata from MOBI files | 38 |
michaelhelmick/lassie | Library for retrieving basic content from websites | 615 |
jgomezdans/get_modis | Tools to download MODIS data from the USGS repository using Python | 62 |
aantron/lambdasoup | A functional HTML scraping and manipulation library in OCaml | 384 |
holgerd77/django-dynamic-scraper | An app that allows you to manage Scrapy spiders through a Django admin interface. | 1,155 |