A Roadmap to Becoming a Self-Taught Data Scientist

Let’s say you are a complete newbie and have set a 6-month goal to learn data science. Here is the roadmap I have created to take you there from scratch. It covers Python programming, data science intuition and solving real-world problems:

  1. Dataquest (2 months) [Hands-on Tutorial & Coding] – For hands-on Python programming from scratch. The good thing is that you get to work on real-life projects and write the code yourself. You start from scratch with Python and then learn to clean and manipulate your data. All of this is done with real-world data, which gives you the intuition to approach any data problem. Highly recommended.
  2. Automate The Boring Stuff [Book] (2 weeks) – This book is good material that got me started with practical applications of Python. It teaches you how to use Python to automate manual tasks like renaming files, moving documents and creating a CSV file for data manipulation.
  3. Machine Learning A-Z™: Hands-On Python & R In Data Science – Udemy (2 weeks) – [Video] The reviews are great and it got me started on solving a real machine learning problem. It’s the best video tutorial I have taken on machine learning. By the end of this tutorial you will clearly understand most of the machine learning code you come across, code that once looked like Greek. It’s also the most upvoted machine learning course on Udemy.
  4. Andrew Ng’s Machine Learning Course – Coursera (2 months) – [Video] – This course is highly recommended. It will deepen your knowledge and fill in the gaps that were not covered in Machine Learning A-Z. It focuses on the underlying concepts, the maths and the crux of machine learning. You will learn how to calculate the loss function by hand and what a cost function means. Very intuitive and stimulating. It’s the most upvoted machine learning course on Coursera.
  5. Kaggle (1 month) – This is the most important part, as you will begin to work on real problems. I strongly suggest you replicate already solved problems and kernels before you work on new ones. You will have a good enough knowledge of data science once you have replicated all the projects in these materials. Kaggle is free.
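
To give a taste of the kind of automation mentioned in step 2 (renaming files and creating a CSV file), here is a minimal sketch using only the Python standard library. It is not taken from the book; the function names, the prefix and the folder layout are hypothetical and only there for illustration.

```python
import csv
import tempfile
from pathlib import Path


def rename_with_prefix(folder: Path, prefix: str) -> list:
    """Add `prefix` to the name of every .txt file in `folder`."""
    new_names = []
    # sorted() builds the full list first, so renaming does not disturb the loop.
    for path in sorted(folder.glob("*.txt")):
        target = path.with_name(prefix + path.name)
        path.rename(target)
        new_names.append(target.name)
    return new_names


def write_inventory(folder: Path, csv_path: Path) -> int:
    """Write one CSV row per file (name, size in bytes) and return the row count."""
    rows = [(p.name, p.stat().st_size) for p in sorted(folder.iterdir()) if p.is_file()]
    with csv_path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["name", "size_bytes"])
        writer.writerows(rows)
    return len(rows)


# Demo in a throwaway directory so no real files are touched.
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp) / "notes"  # hypothetical folder name
    folder.mkdir()
    for name in ("a.txt", "b.txt"):
        (folder / name).write_text("hello", encoding="utf-8")

    print(rename_with_prefix(folder, "2024_"))
    print(write_inventory(folder, Path(tmp) / "inventory.csv"))
```

The resulting CSV can then be loaded into whatever data manipulation tool you are learning at that stage.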

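To make the "calculate the loss function by hand" idea from step 4 concrete, here is a small sketch. I am assuming a one-variable linear model and the squared-error cost with a 1/(2m) factor, which is a common convention in introductory material; the function names and the toy data are my own and purely illustrative.

```python
def predict(theta0, theta1, x):
    """Linear hypothesis: a straight line with intercept theta0 and slope theta1."""
    return theta0 + theta1 * x


def squared_error_cost(theta0, theta1, xs, ys):
    """Average squared prediction error over m examples, halved (assumed convention)."""
    m = len(xs)
    total = sum((predict(theta0, theta1, x) - y) ** 2 for x, y in zip(xs, ys))
    return total / (2 * m)


# Toy data that lies exactly on the line y = 2x.
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]

# Slope 2 fits perfectly, so every error is zero and so is the cost.
print(squared_error_cost(0.0, 2.0, xs, ys))

# Slope 1 gives errors of -1, -2 and -3, so the cost is (1 + 4 + 9) / 6.
print(squared_error_cost(0.0, 1.0, xs, ys))
```

Working through the second case on paper first and then checking it against the code is a good way to confirm you understand what the cost is measuring.
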
According to Wikipedia, a technology roadmap is a flexible planning technique to support strategic and long-range planning, by matching short-term and long-term goals with specific technology solutions. In that spirit, I will update this roadmap as I see fit.
