 datasketches-java
 datasketches-java 
 Data Processing Library
 A software library of stochastic streaming algorithms, providing efficient data processing and analysis tools
A software library of stochastic streaming algorithms, a.k.a. sketches.
899 stars
 58 watching
 209 forks
 
Language: Java 
last commit: 10 months ago 
Linked from   1 awesome list  
  datasketches 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A toolbox for industrial data analytics and stream processing | 614 | 
|  | An end-to-end data science platform that integrates data integration, machine learning model training, and deployment | 1,038 | 
|  | An analytics engine designed to handle large-scale data processing and analysis | 40,170 | 
|  | A tool to abstract storage details and automate common data access patterns for developers working with relational technologies | 209 | 
|  | A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets | 262 | 
|  | A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees | 817 | 
|  | A JavaScript library that enables the creation of stable, decentralized data streams using hypercore | 28 | 
|  | An API that provides random data from various nerdy franchises. | 109 | 
|  | A library that enables integration between Apache Spark and Apache Cassandra for fast data processing and analysis. | 1,944 | 
|  | A collection of Java implementations of various data structures and algorithms used in computer science | 146 | 
|  | A collection of well-known Algebraic Data Types and their associated helper functions for functional programming in JavaScript. | 1,592 | 
|  | A Java implementation of a self-contained, serverless, and zero-configuration data processing framework | 1 | 
|  | A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482 | 
|  | A high-performance real-time analytics database for fast queries and ingest | 13,548 | 
|  | A programming language and runtime environment for creating data-driven programs with a focus on Linked Data and RDF data sources | 101 |