PeelAndSlice.Java

Data processor

A Java implementation of a self-contained, serverless, and zero-configuration data processing framework

GitHub

1 stars
5 watching
3 forks
Language: Java
last commit: about 23 hours ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
danielstjules/pjs A tool for filtering, mapping, and reducing data in JavaScript from the command line. 420
sodiumjoe/lobar A command-line wrapper around lodash's chain method for functional data processing 28
michalmuskala/jason A high-performance JSON parser and generator written in Elixir. 1,629
j-easy/easy-batch A simple framework for automating repetitive data processing tasks by abstracting away common boilerplate code 616
scicloj/tablecloth A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. 308
pyjanitor-devs/pyjanitor A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. 1,371
apache/pig Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. 682
ileriayo/ileriayo A software project focused on developing a system for managing and processing complex data flows. 52
castagna/jena-grande A collection of utilities and examples for processing RDF data using various big-data technologies. 24
jondot/crunch A toolkit for extracting insights from large datasets by parsing and processing semi-structured data 214
emorynlp/nlp4j Provides tools and APIs for text processing and analysis on Java-based platforms. 148
line/decaton A framework for high-throughput, concurrent task processing on Apache Kafka 337
loosechainsaw/slack A lazy functional JavaScript library providing methods to manipulate and query data in an array-like structure. 3
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 817
spreads/spreads A high-performance library for real-time data processing and time series manipulation 430