Photon

Crawler

A fast and flexible web crawler designed to gather information from the internet

Incredibly fast crawler designed for OSINT.

GitHub

11k stars

323 watching

2k forks

Language: Python

last commit: almost 2 years ago

Linked from 2 awesome lists

crawlerinformation-gatheringosintpythonspider

Backlinks from these awesome lists:

Related projects:

Repository	Description	Stars
unclecode/crawl4ai	A web crawling tool designed to extract structured data from the web for use in AI applications	18,541
gocolly/colly	A framework for extracting structured data from websites in a fast and elegant way	23,444
hakluke/hakrawler	A tool for automatically discovering and crawling web application endpoints and assets	4,528
apify/crawlee	A tool for building reliable web scraping and browser automation pipelines in Node.js.	16,081
yujiosaka/headless-chrome-crawler	A distributed crawling framework that leverages Headless Chrome to scrape dynamic websites	5,534
internetarchive/heritrix3	A web crawler designed to collect and preserve digital artifacts while respecting site policies and load constraints.	2,857
dedsecinside/torbot	An OSINT tool for exploring and analyzing dark web sites using Tor network	3,016
matthewmueller/x-ray	A flexible web scraping framework for extracting data from websites with customizable selectors and pagination support.	5,883
jaeles-project/gospider	A tool for web crawling and exploitation written in Go.	2,598
cobrateam/splinter	A Python test framework for automating web applications using Selenium and other tools.	2,726
spatie/crawler	A powerful web crawler written in PHP that can execute JavaScript and crawl multiple URLs concurrently.	2,552
finic-ai/finic	Provides cloud-hosted browsers for automation and scraping tasks to avoid detection by websites.	2,311
nabla-c0d3/sslyze	An SSL/TLS scanning tool and Python library for assessing server security configurations	3,290
smicallef/spiderfoot	Automates information gathering and analysis from various data sources to support threat intelligence and cybersecurity efforts	13,364
geziyor/geziyor	A fast and flexible web crawling and scraping framework for extracting structured data from websites.	2,646