A comprehensive Learning path to become a data scientist in 2018

1

January 2018
- Overview of Learning Path
- Introduction to Data Science
- Job of Data Scientist
- Exercise
- How to setup a machine?
- 1. Python Basics – Numbers and Maths
- 2. Variables and Inputs
- 3. Lists, sets and tuples
- Exercise
- Dictionary
- Exercise
- Conditional Statements
- Exercise
- Loops
- Excercise
- Reading and Writing
- Excercise
- DataHack Summit 2018
- Information- DHS 2018
- AI&ML Blackbelt Plus Program (Sponsored)
2

February 2018
- Overview
- Important applications of Statistics
- What is Descriptive Statistics?
- Optional
- Introduction to Design experiments
- Introduction to Design experiments
- Optional
- Exercise
- Visualizing Data
- Visualizing Data
- Central tendency
- Exercise
- Variability
- Exercise
- Normal distribution – Part 1
- Normal distribution – Part 2
- Exercise
- Z-Score
- Hypothesis Testing
- Exercise
- T-test
- Exercise
- One Way ANOVA
- Exercise
- Chi-square
- Chi-square - Exercise
- Part 1
- Part 2
- Exercise
- Data Exploration
- Exercise
- Git
- Blogs and Newsletters
3

March 2018
- What to expect - March 2018
- Overview
- Principal Of Counting
- Exercise
- Permutation
- Exercise
- Combination
- Exercise
- Conditional Probability – Part 1
- Conditional Probability – Part 2
- Exercise
- Binomial Distribution
- Random variable
- Expectation and variance
- Exercise
- Introduction to Machine Learning
- Linear Regression
- Exercise
- Logistic Regression- Part 1
- Logistic Regression – Part 2
- Exercise
- Decision Tree
- Exercise
- Naives Bayes
- Clustering algorithms
- Exercise
- KNN
- Exercise
4

April 2018
- Ensemble Learning Basics
- Different Ensemble Learning methods with code
- Bagging (Bootstrap Aggregation)
- Random Forest - Simplified
- Random Forest - Detailed with implementation
- Exercise
- Boosting - Simplified
- Boosting - Detailed with implementation
- Exercise
5

May 2018
- Introduction to validation
- Hold out cross validation
- Leave one out cross validation
- k-fold cross validation
- Implementation in Python
- Implementation in R
- Summary
- Exercise
- Different methods for finding best hyperparameters of an algorithm
- Hyperparameter tuning for Random Forest
- Hyperparameter tuning for GBM
- Hyperparameter tuning for XGBoost
- Hyperparameter tuning for LightGBM
- Black friday
- Loan Prediction
- Big mart sales
6

June 2018
- Image data
- Text data
- Audio data
- Projects
7

July 2018
- Factorisation machines
- Field-Aware Factorization Machines
- Implementation using XLearn
- Introduction to Vowpal Wabbit
- Projects
8

August 2018
- What is Neural Networks?
- Theory and Implementation
- Exercise
- Introduction to CNN
- Theory
- Implementation
- Exercise
- Project
- Theory
- Implementation
- Theory
- Implementation
- Project 1
- Project 2
9

September 2018
- Image Classification
- Project
- Object detection/Localisation
- Research papers
10

October 2018
- Audio classification - Theory and Implementation
- Project
- Speech recognition - Theory and implementation
- Project
- Speaker Identification - Theory and implementation
- Project
- DataHack Summit 2018
11

November 2018
- Text Classification
- Competition
- Text Summarization
- Author Identification
- Competition
- Machine Translation
12

December 2018
- Profile Building
- Introduction to Github
- Building your Resume
- Participating in Competitions
- Project and Certifications
- Jobs and Internships

A comprehensive Learning path to become a data scientist in 2018

Wanting to become a data scientist this year, but confused where to start and what to follow? This comprehensive learning path from Analytics Vidhya should provide you with all the answers you need.

About the course

Why take this course?

Pre-requisites

Course curriculum

January 2018

February 2018

March 2018

April 2018

May 2018

June 2018

July 2018

August 2018

September 2018

October 2018

November 2018

December 2018

Instructor

Analytics Vidhya

FAQ