 crunch
 Data processor
 A toolkit for extracting insights from large datasets by parsing and processing semi-structured data
A fast-to-develop, fast-to-run, Go-based toolkit for ETL and feature extraction on Hadoop.
214 stars
 18 watching
 16 forks
 
Language: Go
Last commit: almost 11 years ago
Linked from 2 awesome lists
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A toolset for working with signals and files from GOES satellites | 375 | 
|  | An analytics engine designed to handle large-scale data processing and analysis | 40,170 | 
|  | A real-time data processing pipeline that transforms and sends data to a storage system | 14,293 | 
|  | A JSON query processor with a custom syntax that simplifies complex queries by breaking them down into step-by-step operations. | 895 | 
|  | A Go library for fast and simple feature engineering and machine learning data preprocessing | 121 | 
|  | A Java implementation of a self-contained, serverless, and zero-configuration data processing framework | 1 | 
|  | A framework for handling and transforming streaming data in a consistent and efficient way | 903 | 
|  | A utility for working with nested data structures | 190 | 
|  | A toolbox for processing earth observation data with Python. | 14 | 
|  | An iterator implementation providing map and reduce functionalities for data processing in Go. | 16 | 
|  | A package for describing, loading, and processing data in a declarative way | 1,015 | 
|  | A fast image processing and resizing library for Go. | 1,321 | 
|  | A Haskell-based framework for processing and distributing large datasets across multiple nodes in parallel. | 116 | 
|  | A comprehensive set of utilities to handle JSON data in Go. | 11 | 
|  | A Go package for working with nested data structures like maps and slices | 20 |