PHPScraper

Web scraper

A web scraping utility for PHP that simplifies the process of extracting information from websites.

A universal web-util for PHP.

GitHub

542 stars
18 watching
74 forks
Language: PHP
last commit: 8 months ago
Linked from 1 awesome list

beautifulsoupchromiumheadless-chromephpphp-crawlerphp-scraperphp-spiderphp-spiderspuppeteerpyppeteerscraperscrapingscraping-websitesscrapyweb-scraperweb-scraping

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
spider-rs/spider A web crawler and scraper built on top of Rust, designed to extract data from the web in a flexible and configurable manner. 1,185
the-markup/blacklight-collector A tool for scraping website content and analyzing browser behavior 204
jakopako/goskyr A tool to simplify web scraping of list-like structured data from web pages 36
benibela/xidel A tool to extract data from web pages using various query languages and selectors. 687
slotix/dataflowkit A framework for extracting structured data from web pages using CSS selectors. 662
joseconstela/webparsy A Node.js library and CLI for scraping websites using Puppeteer and YAML definitions 44
propublica/upton A web scraping framework that simplifies the process by handling repetitive tasks and provides options for efficient data retrieval 1,611
oscarotero/embed A PHP library to extract metadata and embeddable code from any web page using various protocols and scraping techniques. 2,095
postmodern/spidr A Ruby web crawling library that provides flexible and customizable methods to crawl websites 808
fimad/scalpel A web scraping library providing a declarative interface on top of an HTML parsing library to extract data from HTML pages 323
scrapy/scrapely A pure-python library for extracting structured data from HTML pages. 1,864
rivermont/spidy A simple command-line web crawler that automatically extracts links from web pages and can be run in parallel for efficient crawling 341
skallwar/suckit A Rust-based web scraping tool that recursively visits and downloads websites to disk. 749
miyagawa/web-scraper A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. 104
zhuyingda/webster A framework for automating web scraping and crawling tasks using Node.js 518