datahub

Data Management Platform

A platform for managing and discovering data across an organization's data stack

The Metadata Platform for your Data and AI Stack

GitHub

10k stars
254 watching
3k forks
Language: Java
last commit: about 1 month ago
Linked from 4 awesome lists

data-catalog, data-discovery, data-governance, datahub, metadata

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| metabase/metabase | A platform that allows users to ask questions and learn from data without needing advanced technical skills | 39,103 |
| awesomedata/apd-core | Provides core metadata and management for a collection of publicly available datasets | 358 |
| datopian/datahub | A platform for building rich data portals with a modern frontend approach, decoupled from backend services via APIs | 2,209 |
| opendatacube/datacube-core | A Python-based platform for integrated gridded data analysis from decades of Earth observation satellite data | 518 |
| openlab/ogdi-datalab | A cloud-based platform providing access to government data through APIs and a web interface | 58 |
| simonw/datasette | An interactive platform for exploring and publishing data in various formats | 9,639 |
| multiprocessio/datastation | An all-in-one application for querying, scripting, and visualizing data from various sources | 2,907 |
| iterative/datachain | A Python-based framework for transforming and analyzing unstructured data from various formats like images, audio, videos, text, and PDFs | 2,088 |
| oxinabox/datadeps.jl | Provides tools and infrastructure for setting up and managing reproducible data science projects | 152 |
| ropensci/hddtools | A collection of R functions and tools to access and manipulate hydrological data from various online sources | 47 |
| datonic/datadex | A platform for collaborative open data management and analysis | 264 |
| linkedin/databus | A distributed system to capture changes from primary data stores and route them through complex data pipelines | 3,643 |
| feathr-ai/feathr | A unified data and AI engineering platform for enterprise | 1,985 |
| databendlabs/databend | A high-performance, scalable data warehouse built on Rust, offering blazing-fast query execution and real-time analytics capabilities | 7,978 |
| kedro-org/kedro | A toolbox for production-ready data science pipelines with software engineering best practices for reproducibility and modularity | 10,050 |