yato

Data processor

An orchestrator for DuckDB databases that automates data transformation and integration with other tools.

The smallest DuckDB SQL orchestrator on Earth.
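
The listing does not say how yato works internally. As a rough illustration of what a SQL orchestrator does in general, the sketch below orders a set of SQL transformations so that each one runs after the tables it reads from. It is not yato's API: the model names, the `MODELS` mapping, and the regex-based dependency detection are all assumptions made for the example, and it uses only the Python standard library rather than DuckDB itself.

```python
import re
from graphlib import TopologicalSorter

# Hypothetical transformations: model name -> SQL that builds it.
# These names and queries are invented for illustration only.
MODELS = {
    "revenue_report": "SELECT * FROM daily_revenue JOIN orders_clean USING (day)",
    "daily_revenue": "SELECT day, SUM(amount) AS revenue FROM orders_clean GROUP BY day",
    "orders_clean": "SELECT * FROM raw_orders WHERE amount > 0",
}


def referenced_tables(sql):
    """Naive dependency detection: identifiers that follow FROM or JOIN.

    A real tool would use a SQL parser; a regex is an assumption that
    keeps this sketch short and misses subqueries, CTEs, and quoting.
    """
    pattern = r"\b(?:FROM|JOIN)\s+([A-Za-z_][A-Za-z0-9_]*)"
    return set(re.findall(pattern, sql, flags=re.IGNORECASE))


def execution_order(models):
    """Return model names sorted so dependencies come first."""
    graph = {
        name: {dep for dep in referenced_tables(sql) if dep in models and dep != name}
        for name, sql in models.items()
    }
    # TopologicalSorter raises CycleError if models depend on each other.
    return list(TopologicalSorter(graph).static_order())


order = execution_order(MODELS)
print(order)  # ['orders_clean', 'daily_revenue', 'revenue_report']
```

In a real pipeline, each statement would then be executed in this order against the database, for example by wrapping it in `CREATE OR REPLACE TABLE <name> AS ...`.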

GitHub

182 stars
5 watching
3 forks
Language: Python
Last commit: 3 months ago

Related projects:

| Repository | Description | Stars |
|---|---|---|
| kapolos/pramda | A PHP implementation of functional programming concepts to simplify data processing and analysis. | 245 |
| h2oai/datatable | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821 |
| reubano/meza | A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417 |
| bauplanlabs/quack-reduce | A playground for running DuckDB as a stateless query engine over a data lake. | 178 |
| yaa110/goterator | An iterator implementation providing map and reduce functionalities for data processing in Go. | 16 |
| markroddy/duckdb-pytables | An extension for DuckDB that allows running SQL queries on arbitrary data sources using Python functions. | 84 |
| mahmoud/glom | Provides a declarative way to handle nested data structures in Python. | 1,920 |
| blooddy/blooddy_crypto | A set of algorithms and data processing tools for binary data. | 91 |
| dodger487/dplython | A Python implementation of data manipulation functions inspired by the R package Dplyr. | 764 |
| snoyberg/conduit | A framework for handling and transforming streaming data in a consistent and efficient way. | 903 |
| alanmarazzi/panthera | A Clojure-based library for working with dataframes and numerical computations using Python libraries. | 189 |
| datonic/datadex | A platform for collaborative open data management and analysis. | 264 |
| apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682 |
| cube2222/jql | A JSON query processor with a custom syntax that simplifies complex queries by breaking them down into step-by-step operations. | 895 |
| nysol/mcmd | A set of commands for high-speed processing of large-scale CSV data. | 33 |