big_data
Big Data tutorials
A collection of interactive tutorials and demonstrations on Big Data technologies such as Hadoop, Spark, and MapReduce.
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
68 stars
4 watching
25 forks
Language: Jupyter Notebook
Last commit: 16 days ago
Linked from 1 awesome list
apache-sedona, apache-spark, big-data, bigdata, bigtop, docker, gutenberg-ebooks, hadoop, hadoop-cluster, hadoop-hdfs, hadoop-mapreduce, jupyter-notebook, mapreduce, mapreduce-bash, mrjob, pyspark, spark, spark-sql, testdfsio
Related projects:
Repository | Description | Stars |
---|---|---|
yannael/bigdataanalytics_infoh515 | A collection of Jupyter notebooks teaching Big Data Analytics with Spark and machine learning concepts | 59 |
britefury/deep-learning-tutorial-pydata | A tutorial project providing guidance on building and training deep learning models using PyData | 85 |
nborwankar/learndatascience | A collection of data science learning materials in the form of IPython Notebooks covering various techniques such as regression, classification, and clustering. | 2,962 |
zenkay/bigdata-ecosystem | A curated collection of big data related projects and resources | 577 |
pangeo-data/pangeo-tutorial | Interactive computing environment and tutorial materials for big data analysis in the geosciences using Python | 91 |
andyshep/coredataplaygrounds | A set of playgrounds for exploring the Core Data framework in bite-sized increments. | 152 |
getredash/redash | Lets users connect to various data sources, then visualize and share their data to explore insights and inform business decisions. | 26,445 |
dfreniche/modern-core-data-playground | An introduction to Core Data using a Swift Playground project that tests and demonstrates its usage | 35 |
googleapis/python-bigtable | A Python client for Google Cloud Bigtable, Google's NoSQL Big Data database service. | 68 |
jhorey/ferry | A tool for setting up and managing big data applications on various platforms using Docker | 252 |
catboost/tutorials | A collection of tutorials and guides on using the CatBoost machine learning library for various tasks | 1,033 |
strat0sphere/spark-euca | Provides scripts to deploy multiple big data tools in a managed environment using Eucalyptus and Amazon AWS | 1 |
intel-bigdata/hibench | A set of benchmarking tools to evaluate big data frameworks' performance and resource utilization | 1,458 |
infochimps-labs/big_data_for_chimps | A comprehensive guide to Big Data Analytics using Hadoop, written in Ruby | 169 |
jadianes/data-science-your-way | An introduction to data science concepts and applications in R and Python using hands-on tutorials | 596 |