PHPScraper
Web scraper
A web scraping utility for PHP that simplifies the process of extracting information from websites.
A universal web-util for PHP.
542 stars
18 watching
74 forks
Language: PHP
Last commit: 8 months ago
Linked from 1 awesome list
beautifulsoup, chromium, headless-chrome, php, php-crawler, php-scraper, php-spider, php-spiders, puppeteer, pyppeteer, scraper, scraping, scraping-websites, scrapy, web-scraper, web-scraping
Related projects:
Repository | Description | Stars |
---|---|---|
spider-rs/spider | A web crawler and scraper written in Rust, designed to extract data from the web in a flexible and configurable manner. | 1,185 |
the-markup/blacklight-collector | A tool for scraping website content and analyzing browser behavior | 204 |
jakopako/goskyr | A tool to simplify web scraping of list-like structured data from web pages | 36 |
benibela/xidel | A tool to extract data from web pages using various query languages and selectors. | 687 |
slotix/dataflowkit | A framework for extracting structured data from web pages using CSS selectors. | 662 |
joseconstela/webparsy | A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions | 44 |
propublica/upton | A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval | 1,611 |
oscarotero/embed | A PHP library to extract metadata and embeddable code from any web page using various protocols and scraping techniques. | 2,095 |
postmodern/spidr | A Ruby web crawling library that provides flexible and customizable methods to crawl websites | 808 |
fimad/scalpel | A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages | 323 |
scrapy/scrapely | A pure-Python library for extracting structured data from HTML pages. | 1,864 |
rivermont/spidy | A simple command-line web crawler that automatically extracts links from web pages and can be run in parallel for efficient crawling | 341 |
skallwar/suckit | A Rust-based web scraping tool that recursively visits and downloads websites to disk. | 749 |
miyagawa/web-scraper | A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. | 104 |
zhuyingda/webster | A framework for automating web scraping and crawling tasks using Node.js | 518 |