bigdata-ecosystem

Big Data Projects

A curated collection of big data related projects and resources

BigData Ecosystem Dataset

GitHub

576 stars
98 watching
175 forks
Language: HTML
last commit: almost 3 years ago
Linked from 4 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
dat-ecosystem-archive/docs Preserves and serves documentation resources for a decentralized data storage platform. 530
groda/big_data A collection of interactive tutorials and demonstrations on Big Data technologies such as Hadoop, Spark, and MapReduce. 68
jhorey/ferry A tool for setting up and managing big data applications on various platforms using Docker 252
projectnessie/nessie Transactional catalog for data lakes with Git-like semantics 1,038
understandlingbv/tuktu An integrated suite of tools for big data science tasks 60
gopherdata/resources A collection of Go-based resources and tools for data science tasks 876
noaa-edab/ecodata Provides ecosystem data for reporting on the Northeast Continental Shelf's ecosystem status and trends 31
intel-bigdata/hibench A set of benchmarking tools to evaluate big data frameworks' performance and resource utilization 1,458
googleapis/python-bigtable A Python client for interacting with Google's NoSQL Big Data database service. 68
kdmayer/pointer A LiDAR-derived point cloud dataset of one million English buildings linked to energy characteristics 13
wikidata/wikidata-toolkit A Java library providing access to Wikibase data and tools for data extraction and analysis. 375
yildizdb/yildiz A high-performance graph database layer on top of Google Bigtable 26
nodefluent/bigquery-kafka-connect A Node.js library that enables data transfer between Kafka and BigQuery using Google Cloud services 17
hafenkran/duckdb-bigquery An extension that integrates DuckDB with Google BigQuery for direct querying and management of datasets 61
cidree/forestdata A package providing easy access to forestry and land use datasets. 13