awesome-learn-datascience

Data Science primer

A curated collection of resources to help beginners learn data science with Python and statistics basics.

chart_with_upwards_trend Curated list of resources to help you get started with Data Science

GitHub

656 stars
30 watching
78 forks
last commit: 6 months ago
Linked from 3 awesome lists

awesomeawesome-listdata-sciencelistsmachine-learning

Data Science Tutorials & Resources for Beginners / What is Data Science?

'What is Data Science?' on Quora
Explanation of important vocabulary Differentiation of Big Data, Machine Learning, Data Science
Data Science for Business (Book) An introduction to Data Science and its use as a business asset
Data Science Process: A Beginner’s Comprehensive Guide Technical Skills for the Data Science: This emphasizes the practical skills needed throughout the data science process

Data Science Tutorials & Resources for Beginners / Common Algorithms and Procedures

Supervised vs unsupervised learning The two most common types of Machine Learning algorithms
9 important Data Science algorithms and their implementation
Cross validation Evaluate the performance of your algorithm/model
Feature engineering Modifying the data to better model predictions
Scientific introduction to 10 important Data Science algorithms
Model ensemble: Explanation Combine multiple models into one for better performance

Data Science Tutorials & Resources for Beginners / Data Science using Python / General

O'Reilly Data Science from Scratch (Book) Data processing, implementation, and visualization with example code
Coursera Applied Data Science Online Course using Python that covers most of the relevant toolkits

Data Science Tutorials & Resources for Beginners / Data Science using Python / Learning Python

YouTube tutorial series by sentdex
Interactive Python tutorial website

Data Science Tutorials & Resources for Beginners / Data Science using Python / numpy

Numpy tutorial on DataCamp

Data Science Tutorials & Resources for Beginners / Data Science using Python / pandas

Introduction to pandas
DataCamp pandas foundations Paid course, but 30 free days upon account creation (enough to complete course)
Pandas cheatsheet 43,807 4 days ago Quick overview over the most important functions

Data Science Tutorials & Resources for Beginners / Data Science using Python / scikit-learn

Introduction and first model application
Rough guide for choosing estimators
Scikit-learn complete user guide
Model ensemble: Implementation in Python

Data Science Tutorials & Resources for Beginners / Data Science using Python / Jupyter Notebook

Downloading and running first Jupyter notebook
Example notebook for data exploration
Seaborn data visualization tutorial Plot library that works great with Jupyter

Data Science Tutorials & Resources for Beginners / Data Science using Python / Various other helpful tools and resources

Template folder structure for organizing Data Science projects 8,349 3 months ago
Anaconda Python distribution Contains most of the important Python packages for Data Science
Spacy Open source toolkit for working with text-based data
LightGBM gradient boosting framework 16,694 6 days ago Successfully used in many Kaggle challenges
Amazon AWS Rent cloud servers for more timeconsuming calculations (r4.xlarge server is a good place to start)

Data Science Tutorials & Resources for Beginners / Data Science Challenges for Beginners

Walkthrough: House prices challenge Walkthrough through a simple challenge on house prices
Blood Donation Challenge Predict if a donor will donate again
Titanic Challenge Predict survival on the Titanic
Water Pump Challenge Predict the operating condition of water pumps in Africa

Data Science Tutorials & Resources for Beginners / More advanced resources and lists

Awesome Data Science 25,157 14 days ago
Data Science Python 5,274 8 months ago
Machine Learning Tutorials 15,585 5 months ago

Backlinks from these awesome lists:

More related projects: