PeelAndSlice.Java
Data processor
A Java implementation of a self-contained, serverless, and zero-configuration data processing framework
1 star
5 watching
3 forks
Language: Java
Last commit: about 23 hours ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
danielstjules/pjs | A tool for filtering, mapping, and reducing data in JavaScript from the command line. | 420 |
sodiumjoe/lobar | A command-line wrapper around lodash's chain method for functional data processing. | 28 |
michalmuskala/jason | A high-performance JSON parser and generator written in Elixir. | 1,629 |
j-easy/easy-batch | A simple framework for automating repetitive data processing tasks by abstracting away common boilerplate code. | 616 |
scicloj/tablecloth | A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 308 |
pyjanitor-devs/pyjanitor | A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. | 1,371 |
apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682 |
ileriayo/ileriayo | A software project focused on developing a system for managing and processing complex data flows. | 52 |
castagna/jena-grande | A collection of utilities and examples for processing RDF data using various big-data technologies. | 24 |
jondot/crunch | A toolkit for extracting insights from large datasets by parsing and processing semi-structured data. | 214 |
emorynlp/nlp4j | Provides tools and APIs for text processing and analysis on Java-based platforms. | 148 |
line/decaton | A framework for high-throughput, concurrent task processing on Apache Kafka. | 337 |
loosechainsaw/slack | A lazy functional JavaScript library providing methods to manipulate and query data in an array-like structure. | 3 |
apache/samza | A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees. | 817 |
spreads/spreads | A high-performance library for real-time data processing and time series manipulation. | 430 |