haul
Image crawler
A tool to extract images from web pages and URLs
An Extensible Image Crawler
158 stars
11 watching
38 forks
Language: Python
last commit: about 8 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A web crawler designed to backup websites by recursively crawling and writing WARC files. | 1,406 |
| Downloads images from a webpage in parallel using multiple threads and saves them to a specified directory | 763 |
| A containerized browser-based crawler system for capturing web content in a high-fidelity and customizable manner. | 677 |
| A web crawler designed to efficiently collect and prioritize relevant content from the web | 459 |
| A tool to search and add stock images to the Wagtail content management system. | 10 |
| A high-performance web crawling and scraping solution with customizable settings and worker pooling. | 945 |
| A flexible web crawler that follows robots.txt policies and crawl delays. | 787 |
| A Pythonic framework for building high-speed web crawlers with flexible data extraction and storage options. | 188 |
| A tool to extract hidden data from images by detecting embedded files and strings. | 116 |
| An end-to-end pipeline for extracting features from aerial and satellite imagery using convolutional neural networks | 2,027 |
| A distributed web crawler that fetches and extracts links from websites using a real browser. | 678 |
| Downloads and crawls web pages, allowing for the archiving of websites. | 556 |
| A tool to extract AppImage release data from web pages | 11 |
| A Python web crawling framework utilizing asyncio and aiohttp for efficient data extraction from websites. | 2,037 |
| A flexible web crawler that can be used to extract data from websites in a scalable and efficient manner | 226 |