bigdata-ecosystem

Big Data Projects

A curated collection of big-data-related projects and resources

BigData Ecosystem Dataset

GitHub

575 stars
98 watching
175 forks
Language: HTML
Last commit: about 3 years ago
Linked from 4 awesome lists

Related projects:

Repository | Description | Stars
dat-ecosystem-archive/docs | Preserves and serves documentation resources for a decentralized data storage platform | 530
groda/big_data | A collection of interactive tutorials and demonstrations on Big Data technologies such as Hadoop, Spark, and MapReduce | 68
jhorey/ferry | A tool for setting up and managing big data applications on various platforms using Docker | 252
projectnessie/nessie | Transactional catalog for data lakes with Git-like semantics | 1,064
understandlingbv/tuktu | An integrated suite of tools for big data science tasks | 60
gopherdata/resources | A collection of Go-based resources and tools for data science tasks | 879
noaa-edab/ecodata | Provides ecosystem data for reporting on the Northeast Continental Shelf's ecosystem status and trends | 31
intel-bigdata/hibench | A set of benchmarking tools to evaluate big data frameworks' performance and resource utilization | 1,463
googleapis/python-bigtable | Provides a Python interface to interact with Google Cloud Bigtable NoSQL database service | 68
kdmayer/pointer | A LiDAR-derived point cloud dataset of one million English buildings linked to energy characteristics | 13
wikidata/wikidata-toolkit | A Java library providing access to Wikibase data and tools for data extraction and analysis | 376
yildizdb/yildiz | A high-performance graph database layer on top of Google Bigtable | 26
nodefluent/bigquery-kafka-connect | A Node.js library that enables data transfer between Kafka and BigQuery using Google Cloud services | 17
hafenkran/duckdb-bigquery | An extension that integrates DuckDB with Google BigQuery for direct querying and management of datasets | 77
cidree/forestdata | A package providing easy access to forestry and land use datasets | 13