PigPen

Map-reducer

A map-reduce framework for Clojure that compiles to Apache Pig or Cascading without requiring extensive knowledge of those systems.

Map-Reduce for Clojure

GitHub

567 stars
474 watching
55 forks
Language: Clojure
Last commit: over 1 year ago
Linked from 3 awesome lists



Related projects:

Repository | Description | Stars
apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks | 681
alexeyco/pig | A pgx wrapper that simplifies executing and scanning query results in PostgreSQL databases | 16
nathanmarz/cascalog | A library for data processing and querying on large datasets without the need for Hadoop expertise | 1,376
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,151
schollz/cowyo | A simple wiki webserver with minimalistic features such as versioning, page locking and encryption | 926
bauplanlabs/quack-reduce | A playground for running DuckDB as a stateless query engine over a data lake | 170
scicloj/tablecloth | A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis | 303
sameeragarwal/blinkdb | A system designed to process large datasets efficiently by answering queries with approximate results and error bars | 660
apache/tinkerpop | Provides a framework for graph computing and processing | 1,975
netcarver/asy_jpcache | A caching plugin for Textpattern that optimizes page loading by storing full pages in memory | 8
netflix-skunkworks/policyuniverse | A Python package for parsing and processing AWS IAM policies and statements | 428
rinuboney/clatern | A Clojure-based machine learning library providing tools for data preprocessing and modeling various algorithms | 67
cakephp/queue | A Queueing library for CakePHP that allows tasks to be processed asynchronously | 37
alienrobotwizard/varaha | A set of Apache Pig scripts and UDFs for machine learning and natural language processing | 53
apache/opennlp | A machine learning-based toolkit for text processing and analysis | 1,447