awesome-duckdb
DuckDB toolkit
A curated collection of resources and libraries for using DuckDB in software development projects
🦆 A curated list of awesome DuckDB resources
1k stars
39 watching
109 forks
last commit: about 21 hours ago
Linked from 1 awesome list
awesomeawesome-list
Awesome DuckDB / Resources | |||
Official Documentation | Official DuckDB documentation | ||
Awesome DuckDB / Resources / Official Documentation | |||
Official Blog | Official DuckDB blog | ||
DuckDB Clients | Client APIs for DuckDB | ||
DuckDB Documentation PDF | The DuckDB documentation as a single PDF file | ||
Awesome DuckDB / Resources | |||
docker-duckdb | 22 | 6 months ago | Docker image for DuckDB CLI |
DuckDB setup | GitHub Action to install DuckDB in CI | ||
Serverless DuckDB over S3 | 170 | 11 months ago | Running DuckDB over a data lake on S3 using lambda |
DuckDB snippets | Collection of snippets curated by MotherDuck | ||
DuckDB tldr page | DuckDB's entry in , available in CLI via the command | ||
DuckDB AWS Lambda layer | 97 | 10 days ago | Run DuckDB in AWS Lambda functions |
Compatible DuckDB Extensions for AWS Lambda | Extensions specifically compiled for the AWS Lambda runtime (GLIBC 2.26) | ||
Serverless DuckDB as API | 120 | 10 days ago | Use DuckDB as API with Amazon API Gateway and AWS Lambda |
Serverless Parquet Repartitioner | 29 | 10 days ago | Use DuckDB to repartition data in S3-based Data Lakes |
Observable notebooks | Notebooks using DuckDB on the Observable data visualization platform | ||
duckdb-nf | 2 | 10 months ago | Example uses of DuckDB with Nextflow |
DuckDB version manager (duckman) | 14 | 16 days ago | – Cross-platform installer and version manager for DuckDB |
DuckERD CLI | 70 | 2 months ago | – A CLI tool to create an ER Diagram from DuckDB database files |
Awesome DuckDB / Client APIs | |||
C | |||
C++ | |||
CLI | |||
Go | 725 | 7 days ago | |
Julia | |||
Node.js | |||
Python | |||
R | |||
Ruby | 159 | 4 days ago | |
Rust | 505 | about 1 month ago | |
Swift | |||
TypeScript | |||
Wasm | |||
ADBC | |||
ODBC | |||
.NET | 437 | 7 days ago | |
Common Lisp | 36 | 16 days ago | |
PowerShell | 42 | 2 months ago | |
Dart | 35 | 9 days ago | |
Awesome DuckDB / Tools Powered by DuckDB | |||
Rill Developer | 1,723 | 6 days ago | Tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL |
Ibis Project | A DataFrame API for interacting with DuckDB (and other compute engines) | ||
MotherDuck | Serverless data warehouse powered by DuckDB | ||
Boiling Data | Serverless data analytics overlay on top of S3 Data Lakes | ||
Hex Dataframe SQL | Hex's Dataframe SQL cells are powered by DuckDB | ||
Mode | Mode uses DuckDB for their in-memory data engine | ||
VulcanSQL | DuckDB can be used as a caching layer or a data connector in VulcanSQL, a Data API framework for data folks to create REST APIs by writing SQL templates | ||
Tad | A fast, free, cross-platform tabular data viewer application powered by DuckDB | ||
Honeycomb Maps | A browser-based geospatial analysis tool leveraging DuckDB Wasm | ||
Bauplan | A serverless data transformation platform for data lakes | ||
Malloy | Malloy is an experimental language for describing data relationships and transformations. Malloy connects to BigQuery, Snowflake, Trino, and Postgres, and natively supports DuckDB | ||
Evidence | Generate reports using SQL and markdown. The DuckDB connector allows querying across DuckDB, csv, parquet and json | ||
Latitude | Latitude uses DuckDB to power data snapshots. Drop a CSV file and query it with SQL at the speed of light | ||
Census | Census's dataset diffing for incremental syncs is powered by DuckDB | ||
Huey | 260 | 4 days ago | Blazing-fast & intuitive pivot tables on .parquet, .csv, .json files and .duckdb tables in the browser based on DuckDB WASM. Open source (MIT). Zero install! |
Parquet Explorer | Visual Studio Code extension for exploring Parquet files with SQL, powered by DuckDB | ||
DQOps | Data quality platform for data engineers, data quality teams and data operations | ||
DatalakeStudio | 76 | 2 months ago | Load, explore, transform your datasets and expose them via API. Integration with external APIs, S3, PostgreSQL and ChatGPT |
Spice.ai | 1,918 | 4 days ago | A unified SQL query interface and portable runtime to locally materialize (using an embedded DuckDB), accelerate, and query datasets from any database, data warehouse, or data lake |
Definite | Definite pulls all your data into a single place for analytics and dashboards. No engineering or SQL required. Get a managed data warehouse (DuckDB), ELT, data modeling / transformations and BI in a single platform | ||
Amphi ETL | 904 | 6 days ago | Low-code data pipelines for structured and unstructured data. SQL transformations are powered by DuckDB |
Quackpipe | 183 | 13 days ago | Serverless OLAP API/UI built on top of DuckDB with basic ClickHouse API compatibility and Motherduck support |
ParadeDB | 6,218 | 6 days ago | Postgres for Search and Analytics, powered by DuckDB-embedded-in-Postgres |
Crunchy Bridge for Analytics | Fully managed DBaaS based in Postgres integrated with DuckDB | ||
UniverSQL | 98 | 8 days ago | An implementation of Snowflake API, enables running queries on Snowflake tables locally with DuckDB without a running warehouse |
Whereabouts | 20 | 21 days ago | Fast, accurate, open-source geocoding in Python, using DuckDB |
Phoenix Analytics | 270 | about 1 month ago | Plug and play analytics for Phoenix applications, powered by DuckDB |
sqlglot | 6,745 | 4 days ago | Python transpiler that translates between 23 different SQL dialects including DuckDB |
yato | 174 | 2 months ago | The smallest DuckDB SQL orchestrator on Earth |
SQLMesh | 1,825 | 2 days ago | A next-generation data transformation and modeling framework with support for DuckDB connections for state, transformations & running unit tests locally |
Awesome DuckDB / Tools Powered by DuckDB / Web Clients | |||
Online DuckDB Shell | Online DuckDB shell powered by WebAssembly | ||
SQL Workbench | DuckDB-WASM based SQL Workbench for running queries on local or remote data, being able to show data as tables or visually as graphs, and sharing queries via URLs | ||
Sekuel Playground | Query your local parquet, csv, json. Your data will not be sent out of the device you are using | ||
CSVFiddle | Free tool to explore and share insights from CSV files using SQL. Import data, write SQL, then instantly share it with anyone | ||
Codapi | Embed executable code snippets directly into your product documentation, online course or blog post | ||
QuackDB | Open-source online DuckDB SQL playground and editor | ||
WhatTheDuck | WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV files, store them in tables, and perform SQL queries on the data | ||
Sidequery | Sidequery is a privacy-preserving DuckDB-powered query editor & data exploration tool for local & remote data | ||
Awesome DuckDB / Tools Powered by DuckDB / Backends | |||
DuckDB API | 40 | 11 days ago | a TypeScript-based Docker image containing DuckDB, and a Hono framework REST API with JSON or streaming Arrow responses |
Mosaic DuckDB Server | A Python-based server that runs a local DuckDB instance and support queries over Web Sockets or HTTP, returning data in either Apache Arrow or JSON format | ||
duckdb-server | A Rust-based server that runs a local DuckDB instance and support queries over Web Sockets or HTTP/HTTPS, returning data in either Apache Arrow or JSON format | ||
Awesome DuckDB / Libraries Powered by DuckDB | |||
Mosaic | An extensible framework for linking databases and interactive views | ||
Fugue | 2,013 | 2 months ago | A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites |
Splink | A free Python library for fast, accurate data deduplication and record linkage | ||
Simple-data-analysis | 252 | 6 days ago | Easy-to-use and high-performance JavaScript library for data analysis |
pg_analytics | 380 | 18 days ago | PostgreSQL extension embedding DuckDB-in-Postgres for fast on-disk and remote object storage analytics from Postgres. Built as a Foreign Data Wrapper with full query pushdown to DuckDB |
duckdb_fdw | 340 | 18 days ago | DuckDB Foreign Data Wrapper for PostgreSQL |
@jetblack/duckdb-react | 8 | 9 months ago | A context manager for React and duckdb-wasm |
QuackOSM | 207 | 17 days ago | A Python library for downloading and transforming raw OpenStreetMap data into GeoParquet files |
PyGWalker | A Pyhon library that turns your dataframe into an interactive UI for data visualization | ||
https://github.com/DataZooDE/flapi] | [ API Framework heavily relies on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of BigQuery or Snowflake for AI Agents and Data Apps | ||
Awesome DuckDB / SQL Clients and IDE that Support DuckDB | |||
Harlequin | The DuckDB IDE for your terminal. ( ) | ||
qStudio | A free SQL tool specialized for data analysts. It runs on every operating system and allows easy browsing of tables and charting of results | ||
DuckDB SQL Tools | Free DuckDB SQL Tools for VS Code IDE. with advanced features | ||
VSCode SQLTools (Free) | Free open-source VSCode extension to query and explore your DuckDB databases with latest DuckDB support | ||
DBeaver | DBeaver is a universal database access and development tool that can be used to connect almost any type of database | ||
DataGrip | Paid SQL IDE by Jetbrains that supports many different database technologies, including DuckDB | ||
Duckling | 241 | 20 days ago | A fast viewer for CSV/Parquet files and DuckDB/SQLite, based on Tauri |
rsql | 127 | 4 days ago | CLI for DuckDB, LibSQL, MariaDB, MySQL, PostgreSQL, SQLite3 and SQL Server |
jsqltranspiler | 29 | 10 days ago | Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL |
jOOQ | Type safe querying of DuckDB (and many other RDBMS) from Java. A from and to DuckDB is also available | ||
SQL DATA LENS | A lightweight, commercial SQL IDE that supports different DBMS, including DuckDB. The focus on performance and special DBMS features | ||
Dataflare | Simple easy-to-use database manager, supports DuckDB, PostgreSQL, MySQL, SQL Server, SQLite etc | ||
manifold-sql (DuckDB for Java) | 2,412 | 8 days ago | Use native DuckDB SQL of any complexity directly & type-safely in Java source with comprehensive IntelliJ support |
Termux DuckB package) | DuckDB CLI client for the Termux Android terminal emulator | ||
Awesome DuckDB / Projects Powered by DuckDB | |||
NBA Monte Carlo | 451 | 5 days ago | Monte Carlo simulation of the NBA season, leveraging Meltano, dbt, DuckDB and Evidence |
Datadex | 260 | 13 days ago | Open source and local friendly data platform to collaborate on Open Data using DuckDB, Dagster, dbt, and Quarto |
endoflife.date database | Daily dumps of endoflife.date data | ||
transfermarkt-datasets | 247 | 6 days ago | Curated football datasets from |
duckDB-embedding-search | 106 | 22 days ago | A search engine for DuckDB that uses embedding vectors to find similar documents |
DuckDB PyPI stats live dashboard | ( ) - Live dashboard of PyPI downloads using DuckDB, dbt, Evidence and MotherDuck with code source to build your own | ||
Specter | 5 | about 1 month ago | Specter is a CLI tool to search and monitor Databricks audit logs |
Awesome DuckDB / Integrations | |||
dbt-duckdb | 920 | 17 days ago | DuckDB dbt adapter |
data load tool - DuckDB destination | Extract and load data from APIs to DuckDB using dlt | ||
target-duckdb | Load data to DuckDB based on Singer spec | ||
Airbyte DuckDB destination | Load data to DuckDB with Airbyte | ||
Kestra DuckDB plugin | Run queries with DuckDB to schedule data transformations and process automations, and run event-driven anomaly detection pipelines | ||
SQLFlite - Arrow Flight SQL Server | 205 | about 2 months ago | An example implementation of the that runs in a client-server setup with DuckDB or SQLite as backends |
SQLFlow | 69 | 7 months ago | Enables SQL-based stream-processing, powered by DuckDB |
nf-sqldb | 26 | 16 days ago | This plugin provides support for interacting with SQL databases in Nextflow scripts |
MindsDB | 26,793 | 6 days ago | The platform for customizing AI from enterprise data. , making data from DuckDB accessible to a diverse range of AI/ML models |
Sqlite2Duckdb | 9 | 6 months ago | A CLI tool to convert SQLite database to DuckDB |
nodbi | NoSQL Database Connector for R, providing a common API across Elasticsearch, CouchDB, MongoDB, SQLite, PostgreSQL, and DuckDB | ||
duckplyr | Drop-in replacement for dplyr in R that uses DuckDB for performance | ||
kwack | 79 | 6 days ago | In-Memory Analytics for Kafka using DuckDB |
PSDuckDB | 42 | 2 months ago | A PowerShell module for DuckDB integration |
duckdb-tableau-connector | 6 | 5 months ago | DuckDB Tableau connector |
duckdb-power-query-connector | 45 | about 1 month ago | DuckDB Power Query Custom Connector |
metabase_duckdb_driver | 56 | 9 days ago | Metabase DuckDB Driver shipped as 3rd party plugin |
Awesome DuckDB / Extensions / Official Extensions | |||
arrow | 34 | 4 months ago | A zero-copy data integration between Apache Arrow and DuckDB |
aws | 42 | 20 days ago | For handling AWS credentials |
azure | 50 | 24 days ago | For using the Azure Blob storage |
delta | 136 | 7 days ago | For Delta Lake support |
fts | To support full text search | ||
iceberg | 158 | about 1 month ago | For reading Iceberg tables |
inet | 1 | about 1 month ago | For storing and handling IPv4 and IPv6 Internet addresses |
mysql | To read from and write to MySQL databases | ||
postgres | To read from and write to PostgreSQL databases | ||
spatial | Enables geospatial processing | ||
sqlite | To read from and write to SQLite databases | ||
vss | Add support for vector similarity search | ||
Awesome DuckDB / Extensions / Community Extensions | |||
chsql | 40 | 14 days ago | ClickHouse SQL Dialect macros for DuckDB |
crypto | 11 | about 2 months ago | Cryptographic hash functions and HMAC |
duckpgq | 80 | 6 days ago | Graph workloads that supports the SQL/PGQ standard |
evalexpr_rhai | 14 | 9 days ago | Evaluates the scripting language as part of SQL |
fuzzycomplete | 7 | about 2 months ago | Performs fuzzy string matching for autocompletion |
h3 | 171 | 17 days ago | Adds support for the H3 discrete global grid system |
lindel | 32 | 10 days ago | Linearization/Delinearization, Z-Order, Hilbert and Morton Curves |
prql | 260 | 3 months ago | Run PRQL commands directly within DuckDB |
scrooge | 129 | 2 months ago | A set of aggregation functions and data scanners on financial data |
shellfs | 56 | about 2 months ago | Allows shell commands to be used for input and output |
ulid | 14 | 5 months ago | ULID data type for DuckDB. A ULID is similar to a UUID except that it also contains a timestamp component |
gsheets | 129 | 5 days ago | Read and write Google Sheets using SQL |
httpserver | 116 | 5 days ago | DuckDB HTTP API Server and Query Interface |
Awesome DuckDB / Extensions / Other Extensions | |||
DuckDB Extension Radar | 82 | 5 days ago | Repository that contains DuckDB extensions on GitHub. Refreshed daily |
duckdb-bigquery | 61 | 4 days ago | Enables seamless integration and querying of BigQuery datasets within DuckDB |
duckdb-engine | 355 | 6 days ago | SQLAlchemy driver for DuckDB |
duckdb-extension-template-zig | 22 | 8 months ago | A Zig & Nix toolkit template for building extensions against multiple versions of DuckDB using Zig, C or C++ |
duckdb-jfr-extension | 5 | about 1 month ago | DuckDB extension to read JFR (Java Flight Recorder) files directly |
duckdb-protobuf | 17 | 2 months ago | Plugin for querying encoded protobuf messages (both sequences and individual messages per file) |
duckdb-pytables | 83 | 7 months ago | DuckDB extension to allow running SQL on arbitrary data sources |
ERPL | DuckDB SAP connector using RFC, ODP, or BICS | ||
Kùzu | 1,406 | 5 days ago | Scan DuckDB tables in Kùzu, an embeddable property graph database management system |
Lance | 3,956 | 5 days ago | Integrate Lance (modern columnar data format for ML implemented in Rust) with DuckDB |
ODBC Scanner DuckDB Extension | 79 | about 1 year ago | DuckDB extension to read data directly from databases supporting the ODBC interface |
QDuckDB | Plugin for reading DuckDB spatial tables in QGIS software | ||
uc_catalog | 52 | 16 days ago | Proof-of-concept extension combining the extension with Unity Catalog |
duckdb-flockmtl | 65 | 5 days ago | Integrate language model (LLM) capabilities directly into your queries and workflows |
erpl-web | 8 | 21 days ago | ERPL WEB is a DuckDB extension that connects API-based ecosystems via standard interfaces like OData, GraphQL, and REST |
Awesome DuckDB / Media / Talks | |||
DuckDB: Crunching data anywhere from laptops to servers @ GOTO Amsterdam 2024 | Gábor Szárnyas | ||
DuckDB – Overview and latest developments @ DuckCon #5 | Hannes Mühleisen and Mark Raasveldt | ||
DuckCon #5 playlist | |||
DuckCon #4 playlist | |||
DuckCon #3 playlist | |||
In-Process Analytical Data Management with DuckDB @ PyData Amsterdam | Hannes Mühleisen | ||
DuckDB: The Power of a Data Warehouse in your Python Process @ PyData Yerevan | Gábor Szárnyas | ||
DuckDB: Bringing analytical SQL directly to your Python shell @ EuroPython | Pedro Holanda | ||
DuckDB keynote @ Data + AI Summit 2023 | Hannes Mühleisen | ||
DuckDB: Bringing Analytical SQL Directly To Your Python Shell @ FOSDEM | Pedro Holanda | ||
State of the Duck @ DuckCon #2 | Hannes Mühleisen & Mark Raasveldt | ||
DuckDB Extensions @ DuckCon | Pedro Holanda & Sam Ansmink | ||
Developing Systems in Academia: The Good, the Bad, and the not-so-Ugly Duckling @ CIDR | Hannes Mühleisen | ||
DuckDB An Embeddable Analytical Database @ FOSDEM | Hannes Mühleisen | ||
DuckDB tutorials playlist by Learn Data with Mark | Mark Needham | ||
DuckDB tutorials playlist by MotherDuck | Mehdi Ouazza | ||
Nextflow and database uses: powering data engineering, exploring DuckDB, and beyond | Edmund Miller | ||
Why should you care about DuckDB? @ Dublin DuckDB meetup | Mihai Bojin | ||
Exploring Monte Carlo Simulations With DuckDB @ Dublin DuckDB meetup | James McNeill | ||
DuckDB and recommenders : a lightning fast synergy @ Dublin DuckDB meetup | Khalil Muhammad | ||
Awesome DuckDB / Media / Podcasts | |||
Developer Voices: Implementing Hardware-Friendly Databases | Hannes Mühleisen | ||
The Geek Narrator: DuckDB Internals | Mark Raasveldt | ||
Software Engineering Daily: DuckDB | Hannes Mühleisen | ||
Data Engineering Podcast: Move Your Database To The Data And Speed Up Your Analytics With DuckDB | Hannes Mühleisen | ||
The Analytics Engineering Podcast: The Personal Data Warehouse | Jordan Tigani | ||
Awesome DuckDB / Media / Blog Posts | |||
Modern Data Stack in a Box | Fast, free, and open-source Modern Data Stack deployed on a laptop using the combination of DuckDB, Meltano, dbt, and Apache Superset | ||
How to use DuckDB, Motherduck and Kestra for ETL | How DuckDB can transform data, mask sensitive PII information, detect anomalies in event-driven workflows, and streamline reporting use cases | ||
DuckDB vs. MotherDuck — how do they compare | What are key differences between them, and when to choose each of these options | ||
Building DuckDB Extensions with Zig and Nix | For Nix users and Zig developers familiar with DuckDB looking to extend its capabilities with custom extensions | ||
Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit | Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts | ||
DuckDB: The Rising Star in the Big Data Landscape | |||
How to Make a DuckDB Extension for a Table Function? | How to make a DuckDB extension to fetch data from external sources | ||
Putting DuckDB in Postgres to Query Iceberg | How ParadeDB embedded DuckDB in Postgres to achieve fast analytics and Apache Iceberg compatibility from Postgres | ||
Awesome DuckDB / Media / Books | |||
DuckDB in Action | DuckDB in Action will show you how to quickly get your hands dirty with DuckDB | ||
Getting Started with DuckDB | A practical guide for accelerating your data science, data analytics, and data engineering workflows |
More related projects:
- lawrencecchen/postgres-bun
- duckdb/duckdb-swift
- pandas-dev/pandas
- kipdata/fncksql
- awesome-spectral-indices/spyndex
- pganalyze/libpg_query
- ibireme/yyjson
- clarkcga/udef-arp
- multiprocessio/dsq
- climate-strike/license
- bookingcom/perl-net-kafka
- xstevens/pg_kafka
- icemachined/kafka-kotlin-native
- lancedb/lancedb