awesome-learn-datascience
Data Science primer
A curated collection of resources to help beginners learn data science with Python and statistics basics.
Curated list of resources to help you get started with Data Science
656 stars
30 watching
78 forks
last commit: 6 months ago
Linked from 3 awesome lists
awesomeawesome-listdata-sciencelistsmachine-learning
Data Science Tutorials & Resources for Beginners / What is Data Science? | |||
'What is Data Science?' on Quora | |||
Explanation of important vocabulary | Differentiation of Big Data, Machine Learning, Data Science | ||
Data Science for Business (Book) | An introduction to Data Science and its use as a business asset | ||
Data Science Process: A Beginner’s Comprehensive Guide | Technical Skills for the Data Science: This emphasizes the practical skills needed throughout the data science process | ||
Data Science Tutorials & Resources for Beginners / Common Algorithms and Procedures | |||
Supervised vs unsupervised learning | The two most common types of Machine Learning algorithms | ||
9 important Data Science algorithms and their implementation | |||
Cross validation | Evaluate the performance of your algorithm/model | ||
Feature engineering | Modifying the data to better model predictions | ||
Scientific introduction to 10 important Data Science algorithms | |||
Model ensemble: Explanation | Combine multiple models into one for better performance | ||
Data Science Tutorials & Resources for Beginners / Data Science using Python / General | |||
O'Reilly Data Science from Scratch (Book) | Data processing, implementation, and visualization with example code | ||
Coursera Applied Data Science | Online Course using Python that covers most of the relevant toolkits | ||
Data Science Tutorials & Resources for Beginners / Data Science using Python / Learning Python | |||
YouTube tutorial series by sentdex | |||
Interactive Python tutorial website | |||
Data Science Tutorials & Resources for Beginners / Data Science using Python / numpy | |||
Numpy tutorial on DataCamp | |||
Data Science Tutorials & Resources for Beginners / Data Science using Python / pandas | |||
Introduction to pandas | |||
DataCamp pandas foundations | Paid course, but 30 free days upon account creation (enough to complete course) | ||
Pandas cheatsheet | 43,807 | 4 days ago | Quick overview over the most important functions |
Data Science Tutorials & Resources for Beginners / Data Science using Python / scikit-learn | |||
Introduction and first model application | |||
Rough guide for choosing estimators | |||
Scikit-learn complete user guide | |||
Model ensemble: Implementation in Python | |||
Data Science Tutorials & Resources for Beginners / Data Science using Python / Jupyter Notebook | |||
Downloading and running first Jupyter notebook | |||
Example notebook for data exploration | |||
Seaborn data visualization tutorial | Plot library that works great with Jupyter | ||
Data Science Tutorials & Resources for Beginners / Data Science using Python / Various other helpful tools and resources | |||
Template folder structure for organizing Data Science projects | 8,349 | 3 months ago | |
Anaconda Python distribution | Contains most of the important Python packages for Data Science | ||
Spacy | Open source toolkit for working with text-based data | ||
LightGBM gradient boosting framework | 16,694 | 6 days ago | Successfully used in many Kaggle challenges |
Amazon AWS | Rent cloud servers for more timeconsuming calculations (r4.xlarge server is a good place to start) | ||
Data Science Tutorials & Resources for Beginners / Data Science Challenges for Beginners | |||
Walkthrough: House prices challenge | Walkthrough through a simple challenge on house prices | ||
Blood Donation Challenge | Predict if a donor will donate again | ||
Titanic Challenge | Predict survival on the Titanic | ||
Water Pump Challenge | Predict the operating condition of water pumps in Africa | ||
Data Science Tutorials & Resources for Beginners / More advanced resources and lists | |||
Awesome Data Science | 25,157 | 14 days ago | |
Data Science Python | 5,274 | 8 months ago | |
Machine Learning Tutorials | 15,585 | 5 months ago |