WebCollector

Web crawler library

A Java-based framework for building web crawlers with automatic URL detection and content extraction capabilities.

WebCollector is an open source web crawler framework based on Java. It provides simple interfaces for crawling the Web; you can set up a multi-threaded web crawler in less than 5 minutes.
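
To make the idea of a multi-threaded crawler with automatic URL detection concrete, here is a minimal sketch that uses only the JDK. It does not use WebCollector's API; the class `MiniCrawler`, its `fetcher` hook, and the `href` regular expression are all hypothetical names chosen for illustration. It shows the general pattern of fetching each level of pages in parallel on a thread pool, extracting links, and de-duplicating them before the next level.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only: NOT WebCollector's API.
class MiniCrawler {
    // Naive link detection: captures the value of href="..." attributes.
    private static final Pattern HREF = Pattern.compile("href=\"([^\"]+)\"");

    private final Function<String, String> fetcher; // maps a URL to its HTML, or null
    private final int threads;

    MiniCrawler(Function<String, String> fetcher, int threads) {
        this.fetcher = fetcher;
        this.threads = threads;
    }

    static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1));
        }
        return links;
    }

    // Breadth-first crawl: fetches at most maxDepth levels starting from seed
    // and returns the URLs that were fetched, in fetch order.
    List<String> crawl(String seed, int maxDepth) {
        Set<String> seen = new HashSet<>();
        List<String> fetched = new ArrayList<>();
        List<String> frontier = new ArrayList<>();
        frontier.add(seed);
        seen.add(seed);
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            for (int depth = 0; depth < maxDepth && !frontier.isEmpty(); depth++) {
                // One task per URL in the current level; tasks run in parallel.
                List<Callable<List<String>>> tasks = new ArrayList<>();
                for (String url : frontier) {
                    tasks.add(() -> {
                        String html = fetcher.apply(url);
                        return html == null ? List.<String>of() : extractLinks(html);
                    });
                }
                fetched.addAll(frontier);
                List<String> next = new ArrayList<>();
                // invokeAll returns futures in task order, so results are deterministic.
                for (Future<List<String>> result : pool.invokeAll(tasks)) {
                    try {
                        for (String link : result.get()) {
                            if (seen.add(link)) { // true only for URLs not seen before
                                next.add(link);
                            }
                        }
                    } catch (ExecutionException e) {
                        // A failed page is skipped; the rest of the level continues.
                    }
                }
                frontier = next;
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt and stop early
        } finally {
            pool.shutdown();
        }
        return fetched;
    }
}
```

For a real crawl, the `fetcher` would wrap an HTTP client and the links would need to be resolved against the page URL; passing a lookup such as `pages::get` over an in-memory `Map<String, String>` is enough to try the sketch offline.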

GitHub

3k stars
328 watching
1k forks
Language: Java
Last commit: 10 months ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
|---|---|---|
| pallets/quart | An asynchronous Python framework for building web applications using ASGI and asyncio. | 3,076 |
| mtytel/helm | A software platform for creating and manipulating sound in real time using multiple oscillators, filters, and effects. | 2,387 |
| ruanyf/weekly | A platform for sharing and discussing technology-related content on a weekly basis. | 49,688 |
| yogeshojha/rengine | Automated reconnaissance framework for web applications with customizable workflow and data correlation. | 7,588 |
| amyreese/zsh-take | Replication of the "take" functionality from Oh My Zsh, allowing users to create and navigate directories with a single command. | 3 |
| joaomdmoura/machinery | A lightweight Elixir library for creating and managing state machines. | 538 |
| kdomanski/iso9660 | A package for reading and creating ISO9660 images. | 265 |
| jxub/barrel_ex | An Elixir client for interacting with the BarrelDB database system. | 0 |
| batate/elixir-pipes | Provides macros to extend Elixir's pipe operator for more flexible function composition and error handling. | 325 |
| fabiokiatkowski/exercism.plugin.zsh | A plugin to enhance the Oh My Zsh framework with features and functionality from Exercism.io for learning and improving shell scripting skills. | 10 |
| jwhiteman/a-little-elixir-goes-a-long-way | Port of The Little Schemer to Elixir with exercises and algorithms in Scheme, along with unit tests and comparisons. | 349 |
| hemanth/bangalore-startups | An extensive list of Bangalore-based startups organized by name. | 31 |
| parroty/oauth2ex | An OAuth 2.0 client library for Elixir. | 56 |
| antonmi/espec_phoenix | Provides an Elixir wrapper around ESpec to enable Behavior-Driven Development (BDD) for the Phoenix web framework. | 138 |
| veelenga/aasm.cr | A simple finite state machine library for Crystal classes. | 52 |