metagoofil

Document Extractor

Extracts metadata from public documents found on websites, useful for brute-force attacks.

Metadata harvester

1k stars

58 watching

209 forks

Language: Python

last commit: over 2 years ago

Linked from 2 awesome lists

Backlinks from these awesome lists:

Related projects:

Repository	Description	Stars
gomoob/php-metadata-extractor	A PHP wrapper to call the Java metadata-extractor library.	9
meilisearch/docs-scraper	Automates scraping and indexing of documentation content into a search engine	297
jaimeiniesta/metainspector	A Ruby gem for web scraping and extracting metadata from web pages.	1,038
needmorecowbell/giggity	A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories.	127
erikriver/opengraph	A Python module to extract and parse metadata from web pages using the Open Graph Protocol.	230
unkl4b/gitminer	Automated tool for gathering code information from Github repositories	2,093
neon-jungle/wagtail-metadata	A tool to help with metadata for search engines and social media platforms.	117
barasher/go-exiftool	A Go wrapper around ExifTool to extract metadata from various file types.	255
davemolk/gogetjs	Tools for extracting and analyzing JavaScript files from web pages	41
pachterlab/ffq	A tool to fetch and display metadata from various public databases	556
jkongie/mobi	An Ruby Gem to extract metadata from MOBI files	38
michaelhelmick/lassie	Library for retrieving basic content from websites	615
jgomezdans/get_modis	Tools to download MODIS data from the USGS repository using Python	62
aantron/lambdasoup	A functional HTML scraping and manipulation library in OCaml	384
holgerd77/django-dynamic-scraper	An app that allows you to manage Scrapy spiders through a Django admin interface.	1,155