Free Resources
A collection of free resources designed to aid your journey in data engineering, analytics, and AI.
Data Engineering Training
- IBM Data Engineering Basics for Everyone on edX - An introductory course to data engineering principles offered by IBM on edX.
- Google Cloud Data Engineering Path - A learning path designed to master data engineering on the Google Cloud Platform.
- Meta Database Engineer Professional Certificate on Coursera - A professional certification program by Meta on Coursera, focusing on database engineering.
- Big Data Specialization by UC San Diego on Coursera - A series of courses by UC San Diego on Coursera, covering big data analysis techniques and tools.
- Data Engineering Zoomcamp on GitHub - A free, self-paced data engineering bootcamp hosted on GitHub, covering a range of tools and practices.
Data Engineering Books
- Fundamentals of Data Engineering - A guide by Redpanda to the basics of data engineering, covering essential concepts and practices.
- Database Technology Overview - An overview of database technology fundamentals.
- Erwin in Database Design - Insights into using Erwin for database design.
- Modern Data Engineering Playbook - ThoughtWorks presents strategies for modern data engineering, focusing on scalable, efficient solutions.
- The Data Engineering Cookbook - A comprehensive guide for data engineering practices.
- Data Engineering Design Patterns (DEDP) - An exploration of convergent evolution in data engineering, emphasizing design patterns for data systems.
SQL Tutorials and Courses
- W3Schools SQL Tutorial - A comprehensive tutorial for SQL beginners.
- SQLBolt - Practice with real-world datasets and challenging SQL problems to deepen your skills.
Interactive SQL Learning Games
- SQL Murder Mystery - Solve a captivating murder case using your SQL skills in this immersive game-like environment. Suitable for beginners.
- SQL Island - Navigate through an adventure on SQL Island to learn SQL commands. Remember to change the language to English via the hamburger icon.
- SchemaVerse - A space-based strategy game where you use SQL commands to control your fleet and conquer the universe.
- Lost at SQL - The SQL Learning Game - Enhance your SQL skills through this engaging learning game.
SQL Practice and Challenges
- 8 Week SQL Challenge - Intermediate/Advanced SQL challenges through interactive projects.
- HackerRank SQL Challenges - Test your mettle against others in coding challenges that push your SQL boundaries. Suitable for Intermediate-Advanced learners.
- Codewars SQL Kata - Hone your understanding with kata-style SQL exercises, practicing diverse concepts.
- SQLZoo - Practice with a wide range of exercises at different difficulty levels, mastering SQL fundamentals.
Spreadsheets for Data Analysis
Excel Basics and Advanced Techniques
- Essential Spreadsheets Book 1 - University of York - Introduction to Excel.
- Essential Spreadsheets Book 2 - University of York - Advanced Excel techniques.
- Excel Video Training - Microsoft - Official tutorials from Microsoft.
Google Sheets for Analysis
- Google Sheets Training & Help - Official Google Sheets guide.
- Data Analysis in Google Sheets - Measure School - Analyzing data using Google Sheets.
- How to Analyze Data in Google Sheets - Data analysis techniques in Google Sheets.
Comprehensive Data Analysis with Excel
- Data Analysis Excel - Simplilearn - Simplilearn’s guide on Excel data analysis.
- Excel Data Analysis - TutorialsPoint - Detailed tutorials on Excel data analysis.
Learn Python
Python is a versatile and powerful programming language that’s great for beginners and professionals alike. Here are some resources to get started or enhance your Python skills:
- Python Official Getting Started Guide - Official Python documentation and guide on getting started with Python.
- Learnpython.org - A free interactive Python tutorial for people who want to learn Python, starting from the basics.
- OpenClassrooms - Learn Programming with Python - A course designed to introduce you to programming using Python.
- Principles of Computation with Python - Carnegie Mellon University - An open & free course by CMU focusing on computational principles using Python.
- Codecademy Python Course - Interactive Python programming courses for all levels.
- MIT OpenCourseWare - Introduction to Computer Science and Programming in Python - A course by MIT that introduces the fundamental ideas of computing using Python.
- Harvard University - Python Programming - Various Python programming courses offered by Harvard University.
- freeCodeCamp - Learn Python - A collection of free Python courses for beginners to help learn Python programming.
- Google Cloud - Python on Google Cloud - Learn how to use Python with Google Cloud services for building and deploying applications.
Learn R
R is a programming language and environment commonly used for statistical computing and graphics. The following resources provide comprehensive guides, tutorials, and courses for beginners to advanced users interested in learning R:
- Codecademy - Learn R - An interactive platform offering a course designed to get you started with R programming.
- W3Schools R Tutorial - Provides a quick and easy understanding of R, covering basics to advanced topics.
- RStudio Education - Learn R - Beginner resources compiled by RStudio, aiming to make learning R easier and more effective.
- Programiz - Learn R Programming - Offers R tutorials for beginners to learn R programming online.
- Codecademy - R Catalog - Discover more R courses offered by Codecademy to deepen your understanding and skills.
- Swirl - Learn R, in R - A platform that offers interactive R programming lessons directly within the R console.
- Hands-On Programming with R - Teaches how to perform data analysis with R through practical examples, covering basics to more advanced topics.
Web Scraping with Python
Web scraping is a method used to extract data from websites. Python offers several libraries and tools for web scraping. Here are some essential resources to get started or enhance your web scraping skills:
- Beautiful Soup Documentation - Official documentation for Beautiful Soup, a Python library designed for quick turnaround projects like screen-scraping.
- Python Requests Library - Official documentation for Requests, a simple HTTP library for Python, used to send HTTP requests easily.
- Scrape Quotes - A practice website designed for scraping quotes from famous authors.
- Scrape This Site - A website that offers lessons and challenges for web scraping practices.
- freeCodeCamp Web Scraping Python Tutorial - A comprehensive guide on how to scrape data from a website using Python.
- GeeksforGeeks Python Web Scraping Tutorial - Offers a tutorial on web scraping using Python, covering basics to advanced topics.
- Beautiful Soup Web Scraper Python - Real Python - A tutorial that explains how to use Beautiful Soup for web scraping effectively.
- Python Web Scraping - A Practical Introduction - Real Python - Provides a practical introduction to web scraping using Python, including setting up your environment and parsing HTML.
- Fullstack Python - Provides free tutorials and guides on how to do both front end and back end in Python.
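As a rough illustration of the extraction step these resources teach, here is a minimal sketch that pulls text out of HTML. It uses only Python's standard-library `html.parser`, not Beautiful Soup or Requests, so it runs without installing anything. The sample HTML, the `quote`/`text` class names, and the helper names `QuoteParser` and `extract_quotes` are made up for illustration; a real scraper would first download the page and would usually rely on one of the libraries listed above.

```python
from html.parser import HTMLParser

# Hypothetical sample page; a real scraper would download this HTML first.
SAMPLE_HTML = """
<html><body>
  <div class="quote"><span class="text">Simple is better than complex.</span></div>
  <div class="quote"><span class="text">Readability counts.</span></div>
  <p class="footer">Not a quote</p>
</body></html>
"""


class QuoteParser(HTMLParser):
    """Collects the text inside every <span class="text"> element."""

    def __init__(self):
        super().__init__()
        self.quotes = []
        self._in_quote = False

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs.
        if tag == "span" and ("class", "text") in attrs:
            self._in_quote = True

    def handle_endtag(self, tag):
        if tag == "span":
            self._in_quote = False

    def handle_data(self, data):
        # Keep only text that appears while inside a matching span.
        if self._in_quote:
            self.quotes.append(data.strip())


def extract_quotes(html):
    """Return the list of quote strings found in the given HTML."""
    parser = QuoteParser()
    parser.feed(html)
    return parser.quotes


print(extract_quotes(SAMPLE_HTML))
```

The practice sites listed above (Scrape Quotes, Scrape This Site) are good places to try the same idea against real pages.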
Database and Data Processing Technologies
Enhance your knowledge in various database technologies and data processing frameworks through these tutorials and courses.
- MySQL Tutorial - Comprehensive tutorials for beginners to advanced users of MySQL.
- MongoDB Python Developer Learning Path - A structured learning path for Python developers focusing on MongoDB.
- PySpark Tutorial - Tutorials to learn PySpark for big data processing.
- Apache NiFi Tutorial - Learn about Apache NiFi for data routing and transformation.
- Debezium Tutorial - Introduction and advanced concepts for using Debezium for data capture.
Streaming and Integration
Courses that focus on streaming data platforms and event streaming architectures.
- Confluent Kafka Fundamentals - Free courses on Apache Kafka fundamentals provided by Confluent.
Data Warehousing
Courses and tutorials to deepen understanding of data warehousing concepts and technologies.
- Data Warehousing Tutorial - Basics to advanced concepts in data warehousing.
- Introduction to Snowflake Cloud Data Warehouse - A beginner’s course on using Snowflake for data warehousing and analytics.
- dbt Fundamentals - Learn how to transform data in your warehouse using dbt (data build tool).
DevOps and CI/CD
Learn about continuous integration and continuous deployment, crucial for modern software development practices.
- CI/CD Pipeline Tutorial - An overview of CI/CD concepts and how to implement them.
Foundational Data Science Courses
- The Open Source Data Science Masters - A comprehensive curriculum for self-study in data science.
- Data Science Essentials - Microsoft - Foundation course on edX covering essential data science concepts.
- Harvard CS109 Data Science - In-depth course covering data science methodologies and Python.
- Data Science Fundamentals – IBM - Covers fundamentals of data science through hands-on practice on the Cognitive Class platform.
Introduction to Data Science
- Introduction to Data Science by Jeff Hammerbacher at UC Berkeley - Lectures and materials on the introduction to data science.
- Introduction to Data Science - Offers a broad introduction to data science through Coursera.
- Introduction to Data Science - Another Coursera course providing foundational knowledge in data science.
- Data Science for Beginners by Microsoft - Beginner-friendly data science materials hosted on GitHub.
Specialized Topics in Data Science
- Learning from Data – California Institute of Technology & UC Berkeley - Focus on machine learning and data analysis.
- Process Mining - Data Science in Action - Specialized course on process mining techniques.
- Pattern Discovery in Data Mining - Course on discovering patterns in datasets.
- Introduction to Data Mining - Lecture notes and materials on data mining concepts.
- Mining Massive Datasets - Advanced course on dealing with massive datasets.
Statistical Analysis and Data Wrangling
- Statistical Thinking and Data Analysis - Courses focused on statistical thinking and data analysis techniques.
- OpenIntro Statistics - Provides a solid introduction to statistics.
- Regression Analysis - Learn approaches for analyzing multivariate data sets, emphasizing analysis of variance, linear regression, and logistic regression.
- Introduction to Data Wrangling at the School of Data - Focuses on data cleaning and preparation techniques.
- Elements of Statistical Learning by Hastie, Tibshirani, and Friedman
- Pattern Recognition and Machine Learning by Bishop
- Mathematics for Machine Learning by Deisenroth, Faisal, Soon Ong
- Machine Learning - a Probabilistic Perspective by Murphy
- Probabilistic Machine Learning - An Introduction by Murphy
Programming for Data Science
- Python for Data Science - Course by Cognitive Class on using Python for data science tasks.
- Data Science - R Basics - Harvard University - Introduction to using R for data science.
Online Learning Platforms and Resources
- Cognitive Class - Interactive platform offering courses on data science and AI.
- Jovian - For sharing and collaborating on Jupyter notebooks.
- DataPen - Interactive tutorials on data science and machine learning.
- Data Analytics Educational Resources - Collection of learning materials for data analytics.
- Executive Levels Data Software Training - Offers training on various data software tools.
- SQLZOO - Interactive SQL tutorials for hands-on learning.
YouTube Learning Playlists
Here are a few playlists, most of which were contributed by Jemar and Sandy Cabanes.
Statistics and Data Analysis
- CrashCourse Statistics - Quick statistics overview.
- StatQuest - Statistics Fundamentals - Fundamental statistics concepts.
- Statistics Course for Data Science - Statistics tutorials tailored for data science.
- Tile Stats - Detailed statistical analysis and education.
- Tina Huang - Data science tutorials, including R and Python programming.
- Data Analyst Bootcamp by Alex - Comprehensive bootcamp for becoming a data analyst.
Data Visualization and Business Intelligence
- Tableau Dashboard - Tutorials on creating dashboards using Tableau.
- Power BI - Training sessions on using Power BI for data analysis and visualization.
- Mo Chen - Business intelligence and analytics techniques.
Professional Development and Storytelling
- Presentation and Storytelling - Improve your presentation skills and learn the art of storytelling.
- Create a Portfolio Website with AlexTheAnalyst - A comprehensive tutorial on creating and hosting a portfolio using GitHub Pages.
Google Analytics
- Google Analytics for Beginners - Learn how to track your portfolio or website visitors.
- Advanced Google Analytics - Dive deeper into Google Analytics for complex tracking and analysis.
Data Engineering
- YouTube Playlist on Data Engineering Projects - A curated list of video tutorials and project ideas for aspiring data engineers.
- Data Professor - Insights and tutorials on data engineering practices.
- TechTFQ - Technology-focused tutorials including data engineering topics.
Programming and Software Tutorials
- Kevin Stratvert - Guides and tips on using software tools effectively.
- Tech with Tim - Programming tutorials, especially in Python and web development.
- Socratica - Educational videos on programming and computer science.
- Harvard CS50’s Introduction to Programming with Python - Full university course on programming with Python.
Miscellaneous
- MS Fabric - Explore Microsoft Fabric, Microsoft's unified data analytics platform.
Additional Learning Resources
- Learn Git Branching - An interactive way to learn Git branching.
- GitHub - Platform for code sharing and collaboration.
- Pluralsight - Offers video courses on a wide range of tech topics, including data science.
- HackerRank - Platform for practicing coding and data science skills.
- World Quant - Provides resources for quantitative analysis.
- Real World Data Science Use Cases - A showcase for data science in action.
- SeattleDataGuy’s Newsletter - Learn about end-to-end data flows (data engineering, MLOps, and data science).
- Git Expert in 4 Hours - A concise course to become proficient in Git.
- Azure Storage Blog - Updates and articles on Azure storage solutions.
- Databricks Free Training - Free training resources for learning Databricks Unified Analytics Platform.
Training and Internships
- The Sparks Foundation - A remote one-month internship in Data Science and Business Analytics.
- For The Women Foundation - FTW is a nonprofit organization providing free data science and technology training for women.
Online Learning Platforms
- Dataquest - An interactive learning platform focusing on data science and analytics skills.
- Datacamp - Offers hands-on courses on data science, Python, R, and SQL among others.
- Codecademy - Provides interactive programming courses across many different technology areas.
- SoloLearn - A mobile-first platform offering courses on a wide array of programming languages.
- W3Schools - A comprehensive resource for learning web development technologies and languages.
- Khan Academy - Free online courses in various subjects, including computer programming.
- Coursera - Online courses, specializations, and degrees from universities and educational institutions.
- edX - Access to online university-level courses in a wide range of disciplines.
- freeCodeCamp - An open-source community providing free coding bootcamp and certifications.
- Udacity - Specializes in technology courses that offer Nanodegrees and certifications.
- The Odin Project - A free coding curriculum that provides a complete path to web development.
- Mozilla Developer Network (MDN) - Documentation and learning resources for web developers by Mozilla.
- Kaggle - A platform for predictive modelling and analytics competitions and datasets.
Cloud Resources
- AWS Training and Certification - Training and certification resources for Amazon Web Services.
- Google Cloud Training - Training resources for Google Cloud Platform.
- Microsoft Learn - Learning resources for Microsoft technologies, including Azure.
- Cloud Free Tier Comparison - Articles comparing free tier offers from AWS, Azure, GCP, and Oracle Cloud.
Open Courseware
- Data Analysis with R - Comprehensive R course for data analysis.
- Data Engineering Zoomcamp - Free, self-paced online course on data engineering.
- Data Science in a Box w/ R - Curriculum for teaching and learning data science using R.
- The Open Source Data Science Masters - Curriculum for a self-taught education in data science.
- Fast.ai Courses - Practical deep learning for coders, taught by fast.ai.
- Dive into Deep Learning - An interactive deep learning book with code, math, and discussions.
- GIS Programming Roadmap on GitHub - A roadmap for learning GIS programming.
- Intel Machine Learning Course - This course provides an overview of machine learning fundamentals on modern Intel architecture.
- Intel Deep Learning Course - This course provides an introduction to deep learning on modern Intel architecture.
Open Books
Data Science and R Programming
- R for Data Science - Learn data science with R.
- Data Science - R Basics - Foundation of data science in R.
- R Graphics Cookbook - Data visualization with ggplot2 in R.
- Geocomputation with R - Spatial data analysis with R.
- Exploratory Data Analysis with R - Techniques for EDA using R.
- The Art of Data Science - The process of data analysis.
- R for Data Science (Second Edition) - Updated guide for data science with R.
- R Packages - Creating R packages.
- R datasciencebook - R datasciencebook.
Python Programming
- Python for Everybody - Introduction to programming using Python.
- Modern Polars - Using Polars library in Python for data manipulation.
- Python datasciencebook - Python datasciencebook.
Visualization and Web Scraping
- Data Visualization - A Practical Introduction - Introduction to data visualization.
- Web Scraping with R - Techniques for web scraping using R.
- Interactive Data Visualization - Learning interactive data visualization.
Advanced Topics and Specialized Areas
- Telling Stories with Data - Narrative techniques in data communication.
- Spatial Data Science - Spatial data analysis.
- Hands-On Programming with R - Practical programming with R.
- Python Geospatial Analysis - Geospatial analysis with Python.
- Time Series Analysis with R - Time series analysis techniques.
- Forecasting: Principles and Practice with R - This textbook is intended to provide a comprehensive introduction to forecasting methods and to present enough information about each method for readers to be able to use them sensibly.
- Cookbook for R Polars - Using Polars package in R for data frames.
- NFL Analytics with R - Analyzing NFL data with R.
- Data Management in R - Strategies for data management with R.
- mlr3book - Machine learning in R with mlr3.
- Python for Geocomputation - Geocomputational analysis with Python.
- RAPs with R - Building reproducible analytical pipelines (RAPs) with R.
Miscellaneous
- Open Books by Open UMN - Free educational textbooks, including data science and statistics.
- Pandas for Everyone - Comprehensive guide to using Pandas for data analysis.
- Saylor Academy - Free and open online courses for people everywhere.
- Goalkicker - Programming Notes for Professionals - Free programming books on various topics.
- Python Programming Tutorials - Comprehensive resource for learning Python.
- Scratch - Imagine, Program, Share - Create stories, games, and animations.
- Waggle Dance - An interactive Python tutorial.
- Invent with Python - Books for learning Python with a focus on making things.
- Open Source Society University - Computer Science - Path to a free self-taught education in Computer Science.
- Learn Python the Hard Way - Free eBook Download - A book for learning Python programming.
- Data Analysis with Python - Spring 2020 - University of Helsinki’s course on Python for data analysis.
- Exercism - Code practice and mentorship for everyone.
- Git and GitHub Tutorial for Beginners - Introduction to Git and GitHub.
- RStudio - Books - Resources for learning R programming.
- Awesome Courses - This list is an attempt to bring to light those awesome CS courses which make their high-quality material i.e. assignments, lectures, notes, readings & examinations available online for free.
- Books - Learn Anything is collecting material with a focus on Computer Science, but you’ll find other topics ranging from neuroscience to philosophy.
- Learn to Program - This list aims to be a curated set of high-quality educational resources with a focus on the foundations of web development.
- awesome-dataviz - A curated list of awesome data visualization libraries and resources.
=======
Data Engineering Training
- IBM Data Engineering Basics for Everyone on edX - An introductory course to data engineering principles offered by IBM on edX.
- Google Cloud Data Engineering Path - A learning path designed to master data engineering on the Google Cloud Platform.
- Meta Database Engineer Professional Certificate on Coursera - A professional certification program by Meta on Coursera, focusing on database engineering.
- Big Data Specialization by UC San Diego on Coursera - A series of courses by UC San Diego on Coursera, covering big data analysis techniques and tools.
- Data Engineering Zoomcamp on GitHub - A free, self-paced data engineering bootcamp hosted on GitHub, covering a range of tools and practices.
Data Engineering Books
- Fundamentals of Data Engineering - A guide by Redpanda to the basics of data engineering, covering essential concepts and practices.
- Database Technology Overview - An overview of database technology fundamentals.
- Erwin in Database Design - Insights into using Erwin for database design.
- Modern Data Engineering Playbook - ThoughtWorks presents strategies for modern data engineering, focusing on scalable, efficient solutions.
- The Data Engineering Cookbook - A comprehensive guide for data engineering practices.
- Data Engineering Design Patterns (DEDP) - An exploration of convergent evolution in data engineering, emphasizing design patterns for data systems.
SQL Tutorials and Courses
- W3 Schools SQL Tutorial - A comprehensive tutorial for SQL beginners.
- SQLBolt - Practice with real-world datasets and challenging SQL problems to deepen your skills.
Interactive SQL Learning Games
- SQL Murder Mystery - Solve a captivating murder case using your SQL skills in this immersive game-like environment. Suitable for beginners.
- SQL Island - Navigate through an adventure on SQL Island to learn SQL commands. Remember to change the language to English via the hamburger icon.
- SchemaVerse - A space-based strategy game where you use SQL commands to control your fleet and conquer the universe.
- Lost at SQL - The SQL Learning Game - Enhance your SQL skills through this engaging learning game.
SQL Practice and Challenges
- 8 Week SQL Challenge - Intermediate/Advanced SQL challenges through interactive projects.
- HackerRank SQL Challenges - Test your mettle against others in coding challenges that push your SQL boundaries. Suitable for Intermediate-Advanced learners.
- Codewars SQL Kata - Hone your understanding with kata-style SQL exercises, practicing diverse concepts.
- SQLZoo - Practice with a wide range of exercises at different difficulty levels, mastering SQL fundamentals.
Spreadsheets for Data Analysis
Excel Basics and Advanced Techniques
- Essential Spreadsheets Book 1 - University of York - Introduction to Excel.
- Essential Spreadsheets Book 2 - University of York - Advanced Excel techniques.
- Excel Video Training - Microsoft - Official tutorials from Microsoft.
Google Sheets for Analysis
- Google Sheets Training & Help - Official Google Sheets guide.
- Data Analysis in Google Sheets - Measure School - Analyzing data using Google Sheets.
- How to Analyze Data in Google Sheets - Data analysis techniques in Google Sheets.
Comprehensive Data Analysis with Excel
- Data Analysis Excel - Simplilearn - Simplilearn’s guide on Excel data analysis.
- Excel Data Analysis - TutorialsPoint - Detailed tutorials on Excel data analysis.
Learn Python
Python is a versatile and powerful programming language that’s great for beginners and professionals alike. Here are some resources to get started or enhance your Python skills:
- Python Official Getting Started Guide - Official Python documentation and guide on getting started with Python.
- Learnpython.org - A free interactive Python tutorial for people who want to learn Python, starting from the basics.
- OpenClassrooms - Learn Programming with Python - A course designed to introduce you to programming using Python.
- Principles of Computation with Python - Carnegie Mellon University - An open & free course by CMU focusing on computational principles using Python.
- Codecademy Python Course - Interactive Python programming courses for all levels.
- MIT OpenCourseWare - Introduction to Computer Science and Programming in Python - A course by MIT that introduces the fundamental ideas of computing using Python.
- Harvard University - Python Programming - Various Python programming courses offered by Harvard University.
- freeCodeCamp - Learn Python - A collection of free Python courses for beginners to help learn Python programming.
- Google Cloud - Python on Google Cloud - Learn how to use Python with Google Cloud services for building and deploying applications.
Learn R
R is a programming language and environment commonly used for statistical computing and graphics. The following resources provide comprehensive guides, tutorials, and courses for beginners to advanced users interested in learning R:
- Codecademy - Learn R - An interactive platform offering a course designed to get you started with R programming.
- W3Schools R Tutorial - Provides a quick and easy understanding of R, covering basics to advanced topics.
- RStudio Education - Learn R - Beginner resources compiled by RStudio, aiming to make learning R easier and more effective.
- Programiz - Learn R Programming - Offers R tutorials for beginners to learn R programming online.
- Codecademy - R Catalog - Discover more R courses offered by Codecademy to deepen your understanding and skills.
- Swirl - Learn R, in R - A platform that offers interactive R programming lessons directly within the R console.
- Hands-On Programming with R - Teaches how to perform data analysis with R through practical examples, covering basics to more advanced topics.
Web Scraping with Python
Web scraping is a method used to extract data from websites. Python offers several libraries and tools for web scraping. Here are some essential resources to get started or enhance your web scraping skills:
- Beautiful Soup Documentation - Official documentation for Beautiful Soup, a Python library designed for quick turnaround projects like screen-scraping.
- Python Requests Library - Official documentation for Requests, a simple HTTP library for Python, used to send HTTP requests easily.
- Scrape Quotes - A practice website designed for scraping quotes from famous authors.
- Scrape This Site - A website that offers lessons and challenges for web scraping practices.
- freeCodeCamp Web Scraping Python Tutorial - A comprehensive guide on how to scrape data from a website using Python.
- GeeksforGeeks Python Web Scraping Tutorial - Offers a tutorial on web scraping using Python, covering basics to advanced topics.
- Beautiful Soup Web Scraper Python - Real Python - A tutorial that explains how to use Beautiful Soup for web scraping effectively.
- Python Web Scraping - A Practical Introduction - Real Python - Provides a practical introduction to web scraping using Python, including setting up your environment and parsing HTML.
- Fullstack Python - Provides free tutorials and guides on how to do both front end and back end in Python.
Database and Data Processing Technologies
Enhance your knowledge in various database technologies and data processing frameworks through these tutorials and courses.
- MySQL Tutorial - Comprehensive tutorials for beginners to advanced users of MySQL.
- MongoDB Python Developer Learning Path - A structured learning path for Python developers focusing on MongoDB.
- PySpark Tutorial - Tutorials to learn PySpark for big data processing.
- Apache NiFi Tutorial - Learn about Apache NiFi for data routing and transformation.
- Debezium Tutorial - Introduction and advanced concepts for using Debezium for data capture.
Streaming and Integration
Courses that focus on streaming data platforms and event streaming architectures.
- Confluent Kafka Fundamentals - Free courses on Apache Kafka fundamentals provided by Confluent.
Data Warehousing
Courses and tutorials to deepen understanding of data warehousing concepts and technologies.
- Data Warehousing Tutorial - Basics to advanced concepts in data warehousing.
- Introduction to Snowflake Cloud Data Warehouse - A beginner’s course on using Snowflake for data warehousing and analytics.
- DBT Fundamentals - Learn how to transform data in your warehouse using dbt (data build tool).
DevOps and CI/CD
Learn about continuous integration and continuous deployment, crucial for modern software development practices.
- CI/CD Pipeline Tutorial - An overview of CI/CD concepts and how to implement them.
Foundational Data Science Courses
- The Open Source Data Science Masters - A comprehensive curriculum for self-study in data science.
- Data Science Essentials- Microsoft - Foundation course on edX covering essential data science concepts.
- Harvard CS109 Data Science - In-depth course covering data science methodologies and Python.
- Data Science Fundamentals – IBM - Covers fundamentals of data science through hands-on practice on the Cognitive Class platform.
Introduction to Data Science
- Introduction to Data Science by Jeff Hammerbacher at UC, Berkeley - Lectures and materials on the introduction to data science.
- Introduction to Data Science - Offers a broad introduction to data science through Coursera.
- Introduction to Data Science - Another Coursera course providing foundational knowledge in data science.
- Data Science for Beginners by Microsoft - Beginner-friendly data science materials hosted on GitHub.
Specialized Topics in Data Science
- Learning from Data – California Institute of Technology & UCBerkeley - Focus on machine learning and data analysis.
- Process Mining - Data Science in Action - Specialized course on process mining techniques.
- Pattern Discovery in Data Mining - Course on discovering patterns in datasets.
- Introduction to Data Mining - Lecture notes and materials on data mining concepts.
- Mining Massive Datasets - Advanced course on dealing with massive datasets.
Statistical Analysis and Data Wrangling
- Statistical Thinking and Data Analysis - Courses focused on statistical thinking and data analysis techniques.
- Open intro to Statistics - Provides a solid introduction to statistics.
- Regression Analysis - Learn approaches for analyzing multivariate data sets, emphasizing analysis of variance, linear regression, and logistic regression.
- Introduction to Data Wrangling at the School of Data - Focuses on data cleaning and preparation techniques.
- Elements of Statistical Learning by Hastie, Tibshirani, and Friedman
- Pattern Recognition and Machine Learning by Bishop
- Mathematics for Machine Learning by Deisenroth, Faisal, Soon Ong
- Machine Learning - a Probabilistic Perspective by Murphy
- Probabilistic Machine Learning - An Introduction by Murphy
Programming for Data Science
- Python for Data Science - Course by Cognitive Class on using Python for data science tasks.
- Data Science - R Basics - Harvard University - Introduction to using R for data science.
Online Learning Platforms and Resources
- Cognitive Class - Interactive platform offering courses on data science and AI.
- Jovian - For sharing and collaborating on Jupyter notebooks.
- DataPen - Interactive tutorials on data science and machine learning.
- Data Analytics Educational Resources - Collection of learning materials for data analytics.
- Executive Levels Data Software Training - Offers training on various data software tools.
- SQLZOO - Interactive SQL tutorials for hands-on learning.
YouTube Learning Playlists
Here are a few playlists, most of which were contributed by Jemar and Sandy Cabanes.
Statistics and Data Analysis
- CrashCourse Statistics - Quick statistics overview.
- StatQuest - Statistics Fundamentals - Fundamental statistics concepts.
- Statistics Course for Data Science - Statistics tutorials tailored for data science.
- Tile Stats - Detailed statistical analysis and education.
- Tina Huang - Data science tutorials, including R and Python programming.
- Data Analyst Bootcamp by Alex - Comprehensive bootcamp for becoming a data analyst.
Data Visualization and Business Intelligence
- Tableau Dashboard - Tutorials on creating dashboards using Tableau.
- Power BI - Training sessions on using Power BI for data analysis and visualization.
- Mo Chen - Business intelligence and analytics techniques.
Professional Development and Storytelling
- Presentation and Storytelling - Improve your presentation skills and learn the art of storytelling.
- Create a Portfolio Website with AlexTheAnalyst - A comprehensive tutorial on creating and hosting a portfolio using GitHub Pages.
Google Analytics
- Google Analytics for Beginners - Learn how to track your portfolio or website visitors.
- Advanced Google Analytics - Dive deeper into Google Analytics for complex tracking and analysis.
Data Engineering
- YouTube Playlist on Data Engineering Projects - A curated list of video tutorials and project ideas for aspiring data engineers.
- Data Professor - Insights and tutorials on data engineering practices.
- TechTFQ - Technology-focused tutorials including data engineering topics.
Programming and Software Tutorials
- Kevin Stratvert - Guides and tips on using software tools effectively.
- Tech with Tim - Programming tutorials, especially in Python and web development.
- Socratica - Educational videos on programming and computer science.
- Harvard CS50’s Introduction to Programming with Python - Full university course on programming with Python.
Miscellaneous
- MS Fabric - Explore Microsoft’s design framework.
Additional Learning Resources
- Learn Git Branching - An interactive way to learn Git branching.
- GitHub - Platform for code sharing and collaboration.
- Pluralsight - Offers video courses on a wide range of tech topics, including data science.
- HackerRank - Platform for practicing coding and data science skills.
- WorldQuant - Provides resources for quantitative analysis.
- Real World Data Science Use Cases - A showcase of data science in action.
- SeattleDataGuy’s Newsletter - Learn About End-To-End Data Flows (Data Engineering, MLOps, and Data Science)
- Git Expert in 4 Hours - A concise course to become proficient in Git.
- Azure Storage Blog - Updates and articles on Azure storage solutions.
- Databricks Free Training - Free training resources for learning Databricks Unified Analytics Platform.
Training and Internships
- The Sparks Foundation - A remote one-month internship in Data Science and Business Analytics.
- For The Women Foundation - FTW is a nonprofit organization providing free data science and technology training for women.
Online Learning Platforms
- Dataquest - An interactive learning platform focusing on data science and analytics skills.
- DataCamp - Offers hands-on courses on data science, Python, R, and SQL, among others.
- Codecademy - Provides interactive programming courses across many different technology areas.
- SoloLearn - A mobile-first platform offering courses on a wide array of programming languages.
- W3Schools - A comprehensive resource for learning web development technologies and languages.
- Khan Academy - Free online courses in various subjects, including computer programming.
- Coursera - Online courses, specializations, and degrees from universities and educational institutions.
- edX - Access to online university-level courses in a wide range of disciplines.
- freeCodeCamp - An open-source community providing free coding bootcamp and certifications.
- Udacity - Specializes in technology courses that offer Nanodegrees and certifications.
- The Odin Project - A free coding curriculum that provides a complete path to web development.
- Mozilla Developer Network (MDN) - Documentation and learning resources for web developers by Mozilla.
- Kaggle - A platform for predictive modelling and analytics competitions and datasets.
Cloud Resources
- AWS Training and Certification - Training and certification resources for Amazon Web Services.
- Google Cloud Training - Training resources for Google Cloud Platform.
- Microsoft Learn - Learning resources for Microsoft technologies, including Azure.
- Cloud Free Tier Comparison - Articles comparing free tier offers from AWS, Azure, GCP, and Oracle Cloud.
Open Courseware
- Data Analysis with R - Comprehensive R course for data analysis.
- Data Engineering Zoomcamp - Free, self-paced online course on data engineering.
- Data Science in a Box w/ R - Curriculum for teaching and learning data science using R.
- The Open Source Data Science Masters - Curriculum for a self-taught education in data science.
- Fast.ai Courses - Practical deep learning for coders, taught by fast.ai.
- Dive into Deep Learning - An interactive deep learning book with code, math, and discussions.
- GIS Programming Roadmap on GitHub - A roadmap for learning GIS programming.
- Intel Machine Learning Course - This course provides an overview of machine learning fundamentals on modern Intel architecture.
- Intel Deep Learning Course - This course provides an introduction to deep learning on modern Intel architecture.
Open Books
Data Science and R Programming
- R for Data Science - Learn data science with R.
- Data Science - R Basics - Foundation of data science in R.
- R Graphics Cookbook - Data visualization with ggplot2 in R.
- Geocomputation with R - Spatial data analysis with R.
- Exploratory Data Analysis with R - Techniques for EDA using R.
- The Art of Data Science - The process of data analysis.
- R for Data Science (Second Edition) - Updated guide for data science with R.
- R Packages - Creating R packages.
- R datasciencebook - R datasciencebook.
Python Programming
- Python for Everybody - Introduction to programming using Python.
- Modern Polars - Using Polars library in Python for data manipulation.
- Python datasciencebook - Python datasciencebook.
Visualization and Web Scraping
- Data Visualization - A Practical Introduction - Introduction to data visualization.
- Web Scraping with R - Techniques for web scraping using R.
- Interactive Data Visualization - Learning interactive data visualization.
Advanced Topics and Specialized Areas
- Telling Stories with Data - Narrative techniques in data communication.
- Spatial Data Science - Spatial data analysis.
- Hands-On Programming with R - Practical programming with R.
- Python Geospatial Analysis - Geospatial analysis with Python.
- Time Series Analysis with R - Time series analysis techniques.
- Forecasting: Principles and Practice with R - This textbook is intended to provide a comprehensive introduction to forecasting methods and to present enough information about each method for readers to be able to use them sensibly.
- Cookbook for R Polars - Using Polars package in R for data frames.
- NFL Analytics with R - Analyzing NFL data with R.
- Data Management in R - Strategies for data management with R.
- mlr3book - Machine learning in R with mlr3.
- Python for Geocomputation - Geocomputational analysis with Python.
- Raps with R - Building reproducible analytical pipelines (RAPs) with R.
Miscellaneous
- Open Books by Open UMN - Free educational textbooks, including data science and statistics.
- Pandas for Everyone - Comprehensive guide to using Pandas for data analysis.
- Saylor Academy - Free and open online courses for people everywhere.
- Goalkicker - Programming Notes for Professionals - Free programming books on various topics.
- Python Programming Tutorials - Comprehensive resource for learning Python.
- Scratch - Imagine, Program, Share - Create stories, games, and animations.
- Waggle Dance - An interactive Python tutorial.
- Invent with Python - Books for learning Python with a focus on making things.
- Open Source Society University - Computer Science - Path to a free self-taught education in Computer Science.
- Learn Python the Hard Way - Free eBook Download - A book for learning Python programming.
- Data Analysis with Python - Spring 2020 - University of Helsinki’s course on Python for data analysis.
- Exercism - Code practice and mentorship for everyone.
- Git and GitHub Tutorial for Beginners - Introduction to Git and GitHub.
- RStudio - Books - Resources for learning R programming.
- Awesome Courses - A list that highlights CS courses whose high-quality materials (assignments, lectures, notes, readings, and examinations) are available online for free.
- Books - Learn Anything collects material with a focus on computer science, but you’ll also find topics ranging from neuroscience to philosophy.
- Learn to Program - This list aims to be a curated set of high-quality educational resources with a focus on the foundations of web development.
- awesome-dataviz - A curated list of awesome data visualization libraries and resources.