metagoofil

Document Extractor

Extracts metadata from public documents found on websites. The extracted details, such as usernames, are useful for preparing brute-force attacks.

Metadata harvester

GitHub

1k stars
58 watching
209 forks
Language: Python
Last commit: 10 months ago
Linked from 2 awesome lists


Related projects:

Repository | Description | Stars
gomoob/php-metadata-extractor | A PHP wrapper to call the Java metadata-extractor library. | 9
meilisearch/docs-scraper | Automates scraping and indexing of documentation content into a search engine. | 297
jaimeiniesta/metainspector | A Ruby gem for web scraping and extracting metadata from web pages. | 1,038
needmorecowbell/giggity | A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. | 127
erikriver/opengraph | A Python module to extract and parse metadata from web pages using the Open Graph Protocol. | 230
unkl4b/gitminer | Automated tool for gathering code information from GitHub repositories. | 2,093
neon-jungle/wagtail-metadata | A tool to help with metadata for search engines and social media platforms. | 117
barasher/go-exiftool | A Go wrapper around ExifTool to extract metadata from various file types. | 255
davemolk/gogetjs | Tools for extracting and analyzing JavaScript files from web pages. | 41
pachterlab/ffq | A tool to fetch and display metadata from various public databases. | 556
jkongie/mobi | A Ruby gem to extract metadata from MOBI files. | 38
michaelhelmick/lassie | Library for retrieving basic content from websites. | 615
jgomezdans/get_modis | Tools to download MODIS data from the USGS repository using Python. | 62
aantron/lambdasoup | A functional HTML scraping and manipulation library in OCaml. | 384
holgerd77/django-dynamic-scraper | An app that allows you to manage Scrapy spiders through a Django admin interface. | 1,155