datafu
Hadoop data processing library
A collection of libraries for working with large-scale data in Hadoop, providing incremental processing capabilities and user-defined functions.
Hadoop library for large-scale data processing, now an Apache Incubator project
583 stars
75 watching
133 forks
Language: Java
last commit: over 10 years ago
Related projects:
| Description | Stars |
|---|---|
| A collection of libraries for data mining and statistics in large-scale Hadoop environments | 119 |
| A Java framework that simplifies Hadoop's MapReduce API to build efficient data processing pipelines | 57 |
| A flexible library for enabling rapid development of typeahead search functionality | 565 |
| A utility package wrapping set implementations on document lists with compression and set operation support. | 22 |
| A Java library providing efficient, functional data structures with customizable equality semantics and high performance. | 968 |
| A system that enables flexible data processing pipelines using a low-level engine for higher-level frameworks | 482 |
| A JavaScript library for working with RDF data in various formats and querying RDF stores | 567 |
| A programming language and library for describing dataflow-based digital hardware in a high-level, object-oriented way | 82 |
| A modern javascript library for creating interactive and editable tables on the web | 1,050 |
| A Scala library for specifying and executing MapReduce jobs in Hadoop | 3,506 |
| A utility library providing common functions for working with data structures like slices and maps in Go. | 148 |
| A distributed object store designed to efficiently store and serve large media objects in web applications. | 1,749 |
| Maps RDF data into HBase for scalable storage and processing of Linked Data | 17 |
| A framework for building Linked Data applications using PHP | 32 |
| A library that enables the definition of complex data pipelines in a functional, typesafe, and efficient way using a declarative syntax | 139 |