big_data

Big Data tutorials

A collection of interactive tutorials and demonstrations on Big Data technologies such as Hadoop, Spark, and MapReduce.

Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.

GitHub

68 stars
4 watching
25 forks
Language: Jupyter Notebook
last commit: 16 days ago
Linked from 1 awesome list

apache-sedonaapache-sparkbig-databigdatabigtopdockergutenberg-ebookshadoophadoop-clusterhadoop-hdfshadoop-mapreducejupyter-notebookmapreducemapreduce-bashmrjobpysparksparkspark-sqltestdfsio

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
yannael/bigdataanalytics_infoh515 A collection of Jupyter notebooks teaching Big Data Analytics with Spark and machine learning concepts 59
britefury/deep-learning-tutorial-pydata A tutorial project providing guidance on building and training deep learning models using PyData 85
nborwankar/learndatascience A collection of data science learning materials in the form of IPython Notebooks covering various techniques such as regression, classification, and clustering. 2,962
zenkay/bigdata-ecosystem A curated collection of big data related projects and resources 577
pangeo-data/pangeo-tutorial Interactive computing environment and tutorial materials for big data analysis in the geosciences using Python 91
andyshep/coredataplaygrounds A set of playgrounds for exploring the Core Data framework in bite-sized increments. 152
getredash/redash Enables users to connect to various data sources, visualize and share their data, making it easy to explore insights and drive business decisions. 26,445
dfreniche/modern-core-data-playground An introduction to Core Data using a Swift Playground project that tests and demonstrates its usage 35
googleapis/python-bigtable A Python client for interacting with Google's NoSQL Big Data database service. 68
jhorey/ferry A tool for setting up and managing big data applications on various platforms using Docker 252
catboost/tutorials A collection of tutorials and guides on using the CatBoost machine learning library for various tasks 1,033
strat0sphere/spark-euca Provides scripts to deploy multiple big data tools in a managed environment using Eucalyptus and Amazon AWS 1
intel-bigdata/hibench A set of benchmarking tools to evaluate big data frameworks' performance and resource utilization 1,458
infochimps-labs/big_data_for_chimps A comprehensive guide to Big Data Analytics using Hadoop, written in Ruby 169
jadianes/data-science-your-way An introduction to data science concepts and applications in R and Python using hands-on tutorials 596