awesome-duckdb
🦆 A curated list of awesome DuckDB resources
1k stars
38 watching
106 forks
last commit: 17 days ago
Linked from 1 awesome list
awesomeawesome-list
Awesome DuckDB / Resources | |||
Official Documentation | Official DuckDB documentation | ||
Awesome DuckDB / Resources / Official Documentation | |||
Official Blog | Official DuckDB blog | ||
DuckDB Clients | Client APIs for DuckDB | ||
DuckDB Documentation PDF | The DuckDB documentation as a single PDF file | ||
Awesome DuckDB / Resources | |||
docker-duckdb | 17 | 5 months ago | Docker image for DuckDB CLI |
DuckDB setup | GitHub Action to install DuckDB in CI | ||
Serverless DuckDB over S3 | 159 | 9 months ago | Running DuckDB over a data lake on S3 using lambda |
DuckDB snippets | Collection of snippets curated by MotherDuck | ||
DuckDB tldr page | DuckDB's entry in , available in CLI via the command | ||
DuckDB AWS Lambda layer | 86 | 18 days ago | Run DuckDB in AWS Lambda functions |
Compatible DuckDB Extensions for AWS Lambda | Extensions specifically compiled for the AWS Lambda runtime (GLIBC 2.26) | ||
Serverless DuckDB as API | 103 | 4 months ago | Use DuckDB as API with Amazon API Gateway and AWS Lambda |
Serverless Parquet Repartitioner | 26 | 12 months ago | Use DuckDB to repartition data in S3-based Data Lakes |
Observable notebooks | Notebooks using DuckDB on the Observable data visualization platform | ||
duckdb-nf | 2 | 8 months ago | Example uses of DuckDB with Nextflow |
DuckDB version manager (duckman) | 13 | 18 days ago | – Cross-platform installer and version manager for DuckDB |
DuckERD CLI | 69 | 22 days ago | – A CLI tool to create an ER Diagram from DuckDB database files |
Awesome DuckDB / Client APIs | |||
C | |||
C++ | |||
CLI | |||
Go | 660 | 11 days ago | |
Julia | |||
Node.js | |||
Python | |||
R | |||
Ruby | 138 | 13 days ago | |
Rust | 472 | about 1 month ago | |
Swift | |||
TypeScript | |||
Wasm | |||
ADBC | |||
ODBC | |||
.NET | 407 | 4 days ago | |
Common Lisp | 35 | 11 days ago | |
PowerShell | 39 | 26 days ago | |
Awesome DuckDB / Tools Powered by DuckDB | |||
Rill Developer | 1,642 | 11 days ago | Tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL |
Ibis Project | A DataFrame API for interacting with DuckDB (and other compute engines) | ||
MotherDuck | Serverless data warehouse powered by DuckDB | ||
Boiling Data | Serverless data analytics overlay on top of S3 Data Lakes | ||
Hex Dataframe SQL | Hex's Dataframe SQL cells are powered by DuckDB | ||
Mode | Mode uses DuckDB for their in-memory data engine | ||
VulcanSQL | DuckDB can be used as a caching layer or a data connector in VulcanSQL, a Data API framework for data folks to create REST APIs by writing SQL templates | ||
Tad | A fast, free, cross-platform tabular data viewer application powered by DuckDB | ||
Honeycomb Maps | A browser-based geospatial analysis tool leveraging DuckDB Wasm | ||
Bauplan | A serverless data transformation platform for data lakes | ||
Malloy | Malloy is an experimental language for describing data relationships and transformations. Malloy connects to BigQuery, Snowflake, Trino, and Postgres, and natively supports DuckDB | ||
Evidence | Generate reports using SQL and markdown. The DuckDB connector allows querying across DuckDB, csv, parquet and json | ||
Latitude | Latitude uses DuckDB to power data snapshots. Drop a CSV file and query it with SQL at the speed of light | ||
Census | Census's dataset diffing for incremental syncs is powered by DuckDB | ||
Huey | 212 | 12 days ago | Blazing-fast & intuitive pivot tables on .parquet, .csv, .json files and .duckdb tables in the browser based on DuckDB WASM. Open source (MIT). Zero install! |
Parquet Explorer | Visual Studio Code extension for exploring Parquet files with SQL, powered by DuckDB | ||
DQOps | Data quality platform for data engineers, data quality teams and data operations | ||
DatalakeStudio | 68 | 27 days ago | Load, explore, transform your datasets and expose them via API. Integration with external APIs, S3, PostgreSQL and ChatGPT |
Spice.ai | 1,846 | 11 days ago | A unified SQL query interface and portable runtime to locally materialize (using an embedded DuckDB), accelerate, and query datasets from any database, data warehouse, or data lake |
Definite | pulls all your data into a single place for analytics and dashboards. No engineering or SQL required. Get a managed data warehouse (DuckDB), ELT, data modeling / transformations and BI in a single platform | ||
Amphi ETL | 791 | 8 days ago | Low-code data pipelines for structured and unstructured data. SQL transformations are powered by DuckDB |
Quackpipe | 152 | 14 days ago | Serverless OLAP API/UI built on top of DuckDB with basic ClickHouse API compatibility and Motherduck support |
ParadeDB | 5,922 | 3 days ago | Postgres for Search and Analytics, powered by DuckDB-embedded-in-Postgres |
Crunchy Bridge for Analytics | Fully managed DBaaS based in Postgres integrated with DuckDB | ||
UniverSQL | 65 | about 1 month ago | An implementation of Snowflake API, enables running queries on Snowflake tables locally with DuckDB without a running warehouse |
Whereabouts | 17 | 9 days ago | Fast, accurate, open-source geocoding in Python, using DuckDB |
Phoenix Analytics | 217 | 11 days ago | Plug and play analytics for Phoenix applications, powered by DuckDB |
Awesome DuckDB / Tools Powered by DuckDB / Web Clients | |||
Online DuckDB Shell | Online DuckDB shell powered by WebAssembly | ||
SQL Workbench | DuckDB-WASM based SQL Workbench for running queries on local or remote data, being able to show data as tables or visually as graphs, and sharing queries via URLs | ||
Sekuel Playground | Query your local parquet, csv, json. Your data will not be sent out of the device you are using | ||
CSVFiddle | Free tool to explore and share insights from CSV files using SQL. Import data, write SQL, then instantly share it with anyone | ||
Codapi | Embed executable code snippets directly into your product documentation, online course or blog post | ||
QuackDB | Open-source online DuckDB SQL playground and editor | ||
WhatTheDuck | WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV files, store them in tables, and perform SQL queries on the data | ||
Awesome DuckDB / Libraries Powered by DuckDB | |||
Mosaic | An extensible framework for linking databases and interactive views | ||
Fugue | 1,974 | 14 days ago | A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites |
Splink | A free Python library for fast, accurate data deduplication and record linkage | ||
Simple-data-analysis | 237 | 12 days ago | Easy-to-use and high-performance JavaScript library for data analysis |
pg_analytics | 140 | 12 days ago | PostgreSQL extension embedding DuckDB-in-Postgres for fast on-disk and remote object storage analytics from Postgres. Built as a Foreign Data Wrapper with full query pushdown to DuckDB |
duckdb_fdw | 330 | 5 days ago | DuckDB Foreign Data Wrapper for PostgreSQL |
@jetblack/duckdb-react | 8 | 7 months ago | A context manager for React and duckdb-wasm |
QuackOSM | 179 | 11 days ago | A Python library for downloading and transforming raw OpenStreetMap data into GeoParquet files |
PyGWalker | A Pyhon library that turns your dataframe into an interactive UI for data visualization | ||
Awesome DuckDB / SQL Clients and IDE that Support DuckDB | |||
Harlequin | The DuckDB IDE for your terminal. ( ) | ||
qStudio | A free SQL tool specialized for data analysts. It runs on every operating system and allows easy browsing of tables and charting of results | ||
DuckDB SQL Tools | Free DuckDB SQL Tools for VS Code IDE. with advanced features | ||
VSCode SQLTools (Free) | Free open-source VSCode extension to query and explore your DuckDB databases with latest DuckDB support | ||
DBeaver | DBeaver is a universal database access and development tool that can be used to connect almost any type of database | ||
DataGrip | Paid SQL IDE by Jetbrains that supports many different database technologies, including DuckDB | ||
Duckling | 203 | 22 days ago | A fast viewer for CSV/Parquet files and DuckDB/SQLite, based on Tauri |
rsql | 93 | about 1 month ago | CLI for DuckDB, LibSQL, MariaDB, MySQL, PostgreSQL, SQLite3 and SQL Server |
jsqltranspiler | 28 | 25 days ago | Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL |
jOOQ | Type safe querying of DuckDB (and many other RDBMS) from Java. A from and to DuckDB is also available | ||
SQL DATA LENS | A lightweight, commercial SQL IDE that supports different DBMS, including DuckDB. The focus on performance and special DBMS features | ||
Dataflare | Simple easy-to-use database manager, supports DuckDB, PostgreSQL, MySQL, SQL Server, SQLite etc | ||
manifold-sql (DuckDB for Java) | 2,380 | 5 days ago | Use native DuckDB SQL of any complexity directly & type-safely in Java source with comprehensive IntelliJ support |
Awesome DuckDB / Projects Powered by DuckDB | |||
NBA Monte Carlo | 422 | 12 days ago | Monte Carlo simulation of the NBA season, leveraging Meltano, dbt, DuckDB and Evidence |
Datadex | 240 | 26 days ago | Open source and local friendly data platform to collaborate on Open Data using DuckDB, Dagster, dbt, and Quarto |
endoflife.date database | Daily dumps of endoflife.date data | ||
transfermarkt-datasets | 223 | 12 days ago | Curated football datasets from |
duckDB-embedding-search | 70 | 7 months ago | A search engine for DuckDB that uses embedding vectors to find similar documents |
DuckDB PyPI stats live dashboard | ( ) - Live dashboard of PyPI downloads using DuckDB, dbt, Evidence and MotherDuck with code source to build your own | ||
Awesome DuckDB / Integrations | |||
dbt-duckdb | 870 | 12 days ago | DuckDB dbt adapter |
data load tool - DuckDB destination | Extract and load data from APIs to DuckDB using dlt | ||
target-duckdb | Load data to DuckDB based on Singer spec | ||
Airbyte DuckDB destination | Load data to DuckDB with Airbyte | ||
Kestra DuckDB plugin | Run queries with DuckDB to schedule data transformations and process automations, and run event-driven anomaly detection pipelines | ||
Arrow Flight SQL Server | 191 | 26 days ago | An example implementation of the what runs in a client-server setup with DuckDB or SQLite as backends |
SQLFlow | 62 | 6 months ago | Enables SQL-based stream-processing, powered by DuckDB |
nf-sqldb | 26 | 10 months ago | This plugin provides support for interacting with SQL databases in Nextflow scripts |
MindsDB | 26,362 | 4 days ago | The platform for customizing AI from enterprise data. , making data from DuckDB accessible to a diverse range of AI/ML models |
Sqlite2Duckdb | 7 | 4 months ago | A CLI tool to convert SQLite database to DuckDB |
nodbi | NoSQL Database Connector for R, providing a common API across Elasticsearch, CouchDB, MongoDB, SQLite, PostgreSQL, and DuckDB | ||
duckplyr | Drop-in replacement for dplyr in R that uses DuckDB for performance | ||
kwack | 64 | 13 days ago | In-Memory Analytics for Kafka using DuckDB |
PSDuckDB | 39 | 26 days ago | A PowerShell module for DuckDB integration |
duckdb-tableau-connector | 4 | 4 months ago | DuckDB Tableau connector |
duckdb-power-query-connector | 38 | 4 months ago | DuckDB Power Query Custom Connector |
metabase_duckdb_driver | 41 | 15 days ago | Metabase DuckDB Driver shipped as 3rd party plugin |
Awesome DuckDB / Extensions / Official Extensions | |||
arrow | 33 | 3 months ago | A zero-copy data integration between Apache Arrow and DuckDB |
aws | 40 | about 2 months ago | For handling AWS credentials |
azure | 47 | about 1 month ago | For using the Azure Blob storage |
delta | 120 | 11 days ago | For Delta Lake support |
fts | To support full text search | ||
iceberg | 132 | about 2 months ago | For reading Iceberg tables |
inet | 1 | 3 months ago | For storing and handling IPv4 and IPv6 Internet addresses |
mysql | To read from and write to MySQL databases | ||
postgres | To read from and write to PostgreSQL databases | ||
spatial | Enables geospatial processing | ||
sqlite | To read from and write to SQLite databases | ||
vss | Add support for vector similarity search | ||
Awesome DuckDB / Extensions / Community Extensions | |||
chsql | 26 | 12 days ago | ClickHouse SQL Dialect macros for DuckDB |
crypto | 9 | 11 days ago | Cryptographic hash functions and HMAC |
duckpgq | 60 | 18 days ago | Graph workloads that supports the SQL/PGQ standard |
evalexpr_rhai | 12 | 11 days ago | Evaluates the scripting language as part of SQL |
fuzzycomplete | 5 | 11 days ago | Performs fuzzy string matching for autocompletion |
h3 | 159 | 11 days ago | Adds support for the H3 discrete global grid system |
lindel | 23 | 11 days ago | Linearization/Delinearization, Z-Order, Hilbert and Morton Curves |
prql | 252 | about 1 month ago | Run PRQL commands directly within DuckDB |
scrooge | 128 | 20 days ago | A set of aggregation functions and data scanners on financial data |
shellfs | 51 | 11 days ago | Allows shell commands to be used for input and output |
ulid | 11 | 3 months ago | ULID data type for DuckDB. A ULID is similar to a UUID except that it also contains a timestamp component |
Awesome DuckDB / Extensions / Other Extensions | |||
DuckDB Extension Radar | 75 | 12 days ago | Repository that contains DuckDB extensions on GitHub. Refreshed daily |
duckdb-bigquery | 28 | 12 days ago | Enables seamless integration and querying of BigQuery datasets within DuckDB |
duckdb-engine | 337 | 12 days ago | SQLAlchemy driver for DuckDB |
duckdb-extension-template-zig | 20 | 7 months ago | A Zig & Nix toolkit template for building extensions against multiple versions of DuckDB using Zig, C or C++ |
duckdb-jfr-extension | 5 | 10 months ago | DuckDB extension to read JFR (Java Flight Recorder) files directly |
duckdb-protobuf | 15 | 24 days ago | Plugin for querying encoded protobuf messages (both sequences and individual messages per file) |
duckdb-pytables | 80 | 5 months ago | DuckDB extension to allow running SQL on arbitrary data sources |
ERPL | DuckDB SAP connector using RFC, ODP, or BICS | ||
Kùzu | 1,288 | 11 days ago | Scan DuckDB tables in Kùzu, an embeddable property graph database management system |
Lance | 3,807 | 11 days ago | Integrate Lance (modern columnar data format for ML implemented in Rust) with DuckDB |
ODBC Scanner DuckDB Extension | 75 | about 1 year ago | DuckDB extension to read data directly from databases supporting the ODBC interface |
QDuckDB | Plugin for reading DuckDB spatial tables in QGIS software | ||
uc_catalog | 42 | 11 days ago | Proof-of-concept extension combining the extension with Unity Catalog |
Awesome DuckDB / Media / Talks | |||
DuckCon #4 playlist | |||
State of the Duck @ DuckCon #4 | Hannes Mühleisen and Mark Raasveldt | ||
In-Process Analytical Data Management with DuckDB @ PyData Amsterdam | Hannes Mühleisen | ||
DuckDB: The Power of a Data Warehouse in your Python Process @ PyData Yerevan | Gábor Szárnyas | ||
DuckDB: Bringing analytical SQL directly to your Python shell @ EuroPython | Pedro Holanda | ||
DuckDB keynote @ Data + AI Summit 2023 | Hannes Mühleisen | ||
State of the Duck @ DuckCon #3 | Hannes Mühleisen and Mark Raasveldt | ||
DuckCon #3 playlist | |||
DuckDB: Bringing Analytical SQL Directly To Your Python Shell @ FOSDEM | Pedro Holanda | ||
State of the Duck @ DuckCon #2 | Hannes Mühleisen & Mark Raasveldt | ||
DuckDB Extensions @ DuckCon | Pedro Holanda & Sam Ansmink | ||
Developing Systems in Academia: The Good, the Bad, and the not-so-Ugly Duckling @ CIDR | Hannes Mühleisen | ||
DuckDB An Embeddable Analytical Database @ FOSDEM | Hannes Mühleisen | ||
DuckDB tutorials playlist by Learn Data with Mark | Mark Needham | ||
DuckDB tutorials playlist by MotherDuck | Mehdi Ouazza (MotherDuck) | ||
Nextflow and database uses: powering data engineering, exploring DuckDB, and beyond | Edmund Miller | ||
Why should you care about DuckDB? @ Dublin DuckDB meetup | Mihai Bojin | ||
Exploring Monte Carlo Simulations With DuckDB @ Dublin DuckDB meetup | James McNeill | ||
DuckDB and recommenders : a lightning fast synergy @ Dublin DuckDB meetup | Khalil Muhammad | ||
Awesome DuckDB / Media / Podcasts | |||
Developer Voices: Implementing Hardware-Friendly Databases | Hannes Mühleisen | ||
The Geek Narrator: DuckDB Internals | Mark Raasveldt | ||
Software Engineering Daily: DuckDB | Hannes Mühleisen | ||
Data Engineering Podcast: Move Your Database To The Data And Speed Up Your Analytics With DuckDB | Hannes Mühleisen | ||
The Analytics Engineering Podcast: The Personal Data Warehouse | Jordan Tigani | ||
Awesome DuckDB / Media / Blog Posts | |||
Modern Data Stack in a Box | Fast, free, and open-source Modern Data Stack deployed on a laptop using the combination of DuckDB, Meltano, dbt, and Apache Superset | ||
How to use DuckDB, Motherduck and Kestra for ETL | How DuckDB can transform data, mask sensitive PII information, detect anomalies in event-driven workflows, and streamline reporting use cases | ||
DuckDB vs. MotherDuck — how do they compare | What are key differences between them, and when to choose each of these options | ||
Building DuckDB Extensions with Zig and Nix | For Nix users and Zig developers familiar with DuckDB looking to extend its capabilities with custom extensions | ||
Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit | Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts | ||
DuckDB: The Rising Star in the Big Data Landscape | |||
How to Make a DuckDB Extension for a Table Function? | How to make a DuckDB extension to fetch data from external sources | ||
Putting DuckDB in Postgres to Query Iceberg | How ParadeDB embedded DuckDB in Postgres to achieve fast analytics and Apache Iceberg compatibility from Postgres | ||
Awesome DuckDB / Media / Books | |||
DuckDB in Action | DuckDB in Action will show you how to quickly get your hands dirty with DuckDB | ||
Getting Started with DuckDB | A practical guide for accelerating your data science, data analytics, and data engineering workflows |