datahub

Data Management Platform

A platform for managing and discovering data across an organization's data stack

The Metadata Platform for your Data and AI Stack

GitHub

10k stars
251 watching
3k forks
Language: Java
Last commit: 4 days ago
Linked from 4 awesome lists

data-catalog, data-discovery, data-governance, datahub, metadata

Related projects:

| Repository | Description | Stars |
|---|---|---|
| metabase/metabase | A platform that allows users to ask questions and learn from data without needing advanced technical skills | 38,770 |
| awesomedata/apd-core | Provides metadata and core functionality for a curated collection of public datasets | 358 |
| datopian/datahub | A platform for building rich data portals with a modern frontend approach, decoupled from backend services via APIs | 2,204 |
| opendatacube/datacube-core | A Python-based platform for integrated gridded data analysis from decades of Earth observation satellite data | 514 |
| openlab/ogdi-datalab | A cloud-based platform providing access to government data through APIs and a web interface | 58 |
| simonw/datasette | An interactive platform for exploring and publishing data in various formats | 9,562 |
| multiprocessio/datastation | An all-in-one application for querying, scripting, and visualizing data from various sources | 2,903 |
| iterative/datachain | An AI-data warehouse that transforms and analyzes unstructured data from various formats | 1,935 |
| oxinabox/datadeps.jl | Provides tools and infrastructure for setting up and managing reproducible data science projects | 151 |
| ropensci/hddtools | A collection of R functions and tools to access and manipulate hydrological data from various online sources | 45 |
| datonic/datadex | A platform for collaborative open data management and analysis | 260 |
| linkedin/databus | A distributed system to capture changes from primary data stores and route them through complex data pipelines | 3,641 |
| feathr-ai/feathr | A unified data and AI engineering platform for enterprise | 1,986 |
| databendlabs/databend | An open-source cloud-based data warehouse built on Rust with a focus on high-performance analytics and scalable storage | 7,856 |
| kedro-org/kedro | A toolbox for production-ready data science pipelines with software engineering best practices for reproducibility and modularity | 10,004 |