PigPen

Map-reduce framework

A map-reduce framework for Clojure that compiles to Apache Pig or Cascading without requiring prior knowledge of those systems.

Map-Reduce for Clojure

GitHub

567 stars
473 watching
54 forks
Language: Clojure
Last commit: almost 2 years ago
Linked from 3 awesome lists


Related projects:

Repository | Description | Stars
apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682
alexeyco/pig | A pgx wrapper that simplifies executing and scanning query results in PostgreSQL databases. | 16
nathanmarz/cascalog | A library for data processing and querying on large datasets without the need for Hadoop expertise. | 1,375
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics. | 1,164
schollz/cowyo | A simple wiki webserver with minimalistic features such as versioning, page locking and encryption. | 925
bauplanlabs/quack-reduce | A playground for running DuckDB as a stateless query engine over a data lake. | 178
scicloj/tablecloth | A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 308
sameeragarwal/blinkdb | A system designed to process large datasets efficiently by answering queries with approximate results and error bars. | 660
apache/tinkerpop | Provides a framework for graph computing and processing. | 1,985
netcarver/asy_jpcache | A caching plugin for Textpattern that optimizes page loading by storing full pages in memory. | 8
netflix-skunkworks/policyuniverse | A Python package for parsing and processing AWS IAM policies and statements. | 427
rinuboney/clatern | A Clojure-based machine learning library providing tools for data preprocessing and modeling various algorithms. | 67
cakephp/queue | A queueing library for CakePHP that allows tasks to be processed asynchronously. | 36
alienrobotwizard/varaha | A set of Apache Pig scripts and UDFs for machine learning and natural language processing. | 53
apache/opennlp | Provides a toolkit for natural language text processing tasks using machine learning algorithms in Java. | 1,449