PigPen
Map-reducer
A map-reduce framework for Clojure that compiles to Apache Pig or Cascading without requiring extensive knowledge of those systems.
Map-Reduce for Clojure
567 stars
474 watching
55 forks
Language: Clojure
last commit: over 1 year ago
Linked from 3 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 681 |
alexeyco/pig | A pgx wrapper that simplifies executing and scanning query results in PostgreSQL databases | 16 |
nathanmarz/cascalog | A library for data processing and querying on large datasets without the need for Hadoop expertise | 1,376 |
apache/impala | A high-performance query engine designed to handle large-scale data processing and analytics | 1,151 |
schollz/cowyo | A simple wiki webserver with minimalistic features such as versioning, page locking and encryption. | 926 |
bauplanlabs/quack-reduce | A playground for running DuckDB as a stateless query engine over a data lake. | 170 |
scicloj/tablecloth | A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 303 |
sameeragarwal/blinkdb | A system designed to process large datasets efficiently by answering queries with approximate results and error bars. | 660 |
apache/tinkerpop | Provides a framework for graph computing and processing | 1,975 |
netcarver/asy_jpcache | A caching plugin for Textpattern that optimizes page loading by storing full pages in memory | 8 |
netflix-skunkworks/policyuniverse | A Python package for parsing and processing AWS IAM policies and statements. | 428 |
rinuboney/clatern | A Clojure-based machine learning library providing tools for data preprocessing and modeling various algorithms. | 67 |
cakephp/queue | A Queueing library for CakePHP that allows tasks to be processed asynchronously | 37 |
alienrobotwizard/varaha | A set of Apache Pig scripts and UDFs for machine learning and natural language processing | 53 |
apache/opennlp | A machine learning-based toolkit for text processing and analysis | 1,447 |