## SCOPE OF DATA SCIENCE

Data Science is one of the fastest-evolving fields, and the Data Scientist role is one of the fastest-growing and highest-paid jobs in tech.

In a fiercely competitive market, top organizations are turning to data analytics to identify new market opportunities and to design their services and products. Surveys show that 75% of top organizations consider data analytics an essential component of business performance. This is where data scientists come in. They use their skills in math, statistics, programming, and related subjects to organize large data sets, then apply that knowledge to uncover solutions hidden in the data and address business challenges, contributing directly to their organization’s goals. Learning data science through effective training can therefore open up a promising career.

“Data Scientist” has been ranked the number one job on Glassdoor, and the average salary of a data scientist in the United States is over $120,000 according to Indeed! It is a rewarding career that allows you to solve some of the world’s most interesting problems!

As per Payscale.com, a Data Scientist (IT) with Big Data Analytics skills earns an average salary of Rs 706,750 per year in India.

## COURSE OBJECTIVE

Xilytica is delighted to offer a classroom program to help candidates build a career in the booming field of Data Science. The program focuses on building foundational skills in business analytics and data science, with in-depth training in statistical platforms and tools.

We train candidates to think analytically and solve business problems using data.

Our innovative approach to analytics training combines deep knowledge with a collaborative learning environment to help candidates develop real skills in analytics.

Candidates taking this course can expect to gain knowledge of the analytical techniques organizations need to make strategic decisions, and by solving case studies they will learn how those techniques are used to get real insights from real data. After the course, candidates will be able to forecast sales, identify customer segments, identify drivers of sales/profit, perform regression, analyze customer comments, and much more.

In-depth course coverage, hands-on experience of the entire data analytics project cycle, and case studies on real-world analytics problems cutting across different domains are the high points that make Skill Venue’s Data Science in R certification course a leap towards a successful analytics career.

## WHO SHOULD TAKE THIS COURSE

- IT professionals looking for a career or technology change to Data Science & Analytics.
- BPO industry or non-technical professionals who are looking for a career in Data Science & Analytics.
- Engineering graduates who want to build a career in Data Science & Analytics.
- Non-technical & MBA graduates who are interested in building a career in Data Science.
- Anyone who likes manipulating data & loves getting insights out of data.
- Prerequisite – There are no prerequisites for this course except hard work & dedication.

### Course Curriculum

Introduction

Introduction to the Course

Overview Of Course Curriculum

How To Ace Data Science- Pointers

What Is Data Science

Introduction to Programming Language

What is a Programming Language

Ecosystem Of Any Programming Language

Windows Installation Set-Up

How to Install R & R Studio on Windows Operating System

Linux Installation Set-Up

How to Install R & R Studio on Linux Operating System

Mac OS Installation Set-Up

How to Install R & R Studio on Mac OS Operating System

Introduction to Basic R

Introduction to R Basics

Arithmetic Operations in R

Variables

R Basic Data Types

Vector Basics

Vector Operations

Vector Indexing & Selecting

Getting Help in R & R Studio

Comparison Operators

Quiz 01

R Matrices

Introduction to R matrices

Creating a Matrix

Arithmetic Operations In Matrix

Matrix Operations

Matrix Indexing & Selecting

Factor & Categorical Matrices

Quiz 02

R Data Frames

Introduction to R Data Frames

Data Frame Basics

Data Frame Indexing & Selecting

Data Frame Operations

Quiz 03

List in R

Introduction To Lists in R

Creating Lists With Different Objects

Selecting Elements of a List

Quiz 03

Data Input & Output with R

Introduction to Data Input & Output with R

Reading Data from CSV Files

Writing Data to CSV Files

Reading Data from Excel Files

Writing Data to Excel Files

SQL with R

Web Scraping with R

Quiz 04

R Programming basics

Introduction to Programming Basics

Logical Operators

If, Else, Else If Statements

While Loop

For Loop

Functions

Quiz 05

Advanced R Programming

Introduction to Advanced R Programming

Built In R functions

Apply Function

Math Functions with R

Regular Expressions

Dates & Timestamps

Exceptions And Debugging In R

Quiz 06

Data Exploration & Data Wrangling

Data Manipulation with R Overview

Guide to Using dplyr

Pipe Operator

Guide to Using tidyr Package

Making Data Fit For Analysis Using Spread, Gather, Separate & Unite Functions

Working With Date & Time

Manipulating Strings

Understanding Messy Data

Tackling Missing Data

Dealing With Outliers

Case Study On EDA – Exploratory Data Analysis

Quiz 07

Data Visualization with R

Overview Of ggplot2 – Grammar Of Graphics

Different Layers Of Visualisation In ggplot2

7 Layers Of ggplot2 – Data, Aesthetics, Geometries, Facets, Statistics, Coordinates, Themes

Histograms

Scatterplots

Barplots – Simple, Stacked, Dodge

Boxplots

Line Charts

Pie Charts, Coxcomb Plot

2 Variable Plotting

Visualisation For Exploratory Analysis

Visualisation Best Practices

Case Study On Visualisation

Quiz 08

Statistics 1

Introduction To Statistics

Understanding Types Of Data

Measure Of Centers – Mean, Mode & Median

Measure Of Spread

Probability

Continuous Probability Distribution

Normal Distribution – Z Distribution

F Distribution

Student's T Distribution

Chi Square Distribution

Discrete Probability Distribution

Binomial Distribution

Poisson Distribution

Statistics 2

Point Estimation

Confidence & Significance Levels

Hypothesis Testing

Types Of Hypothesis

Parametric Test

One Sample, Two Sample T Test

One Sample Z Test

One Proportion, Two Proportion Test

One-Way ANOVA

Chi-square Test

Non – Parametric Test

One Sample Sign Test

Mann – Whitney Test

Kruskal – Wallis Test

Mood's Median Test

Wilcoxon Signed Rank Test

Friedman's Test

Hypothesis Testing For Population Means

Hypothesis Testing For Population Variance

Hypothesis Testing For Population Proportions

Quiz 09

Machine Learning Toolbox

Introduction To Machine Learning With R

Types Of Machine Learning

Supervised Learning

Unsupervised Learning

Reinforcement Learning

Regression Models

Classification Models

Quiz 10

Data Pre-processing

Dealing With Missing Data

Dealing With Categorical Data

Feature Scaling Data

Splitting Data Into Training & Validation & Test Sets

K – Fold Cross Validation

Quiz 11

Linear Regression

Introduction To Regression Models

Correlation Between Two Variables

Visualising Correlation Using Corrgram And Corrplot

Simple Linear Regression

Multiple Linear Regression

Non Linear Regression

Training & Evaluating Regression Models Performance

Linear Regression Assumptions – Homoscedasticity, Multicollinearity, Etc.

Interpretation Of Regression Plots – Residuals Vs Fitted Values, Normal Q-Q Plot, Scale-Location Plot, Residuals Vs Leverage Plot

Case Study Using Linear Regression, Multiple Linear Regression, Non Linear Regression

Project – Building Prediction Model

Quiz 12

Classification Models

Introduction To Classification Model

Training And Evaluating Classification Models

Confusion Matrix – Accuracy, Precision, Etc.

Interpreting ROC Curve

Fine Tuning Models Using Hyper Parameters

Quiz 13

Logistic Regression

Introduction To Logistic Regression

Building Logistic Regression Model

Visualising Logistic Regression

Interpreting Logistic Regression

Making Probabilistic Predictions

Logistic Regression Case Study

Project On Logistic Regression

Quiz 14

K Nearest Neighbour

Introduction To K Nearest Neighbour

Building K Nearest Neighbour Model

Visualising K Nearest Neighbour

Interpreting K Nearest Neighbour

Making Predictions

K Nearest Neighbour Case Study

Project On K Nearest Neighbour

Quiz 15

Decision Tree

Types Of Tree Based Models

Introduction To Decision Tree

Information Gain, Entropy Gain, Gini Index

Building Decision Tree Model

Visualising Decision Tree

Interpreting Decision Tree

Making Predictions

Decision Tree Case Study

Project On Decision Tree

Quiz 16

Bagged Trees

Introduction To Bagged Trees

Bootstrap Sampling

Building Bagged Tree Model

Visualising Bagged Tree

Interpreting Bagged Tree

Making Predictions

Fine Tuning Bagged Tree Using Hyper Parameters

Bagged Tree Case Study

Project On Bagged Tree

Quiz 17

Random Forests

Introduction To Random Forest

Building Random Forest Model

Visualising Random Forest

Interpreting Random Forest

Making Predictions

Fine Tuning Random Forest Using Hyper Parameters

Variable Selection And Variable Importance Plot

Random Forest Case Study

Project On Random Forest

Quiz 18

Boost Trees

Introduction To Boost Trees

Building Gradient Boost Models

Visualising Gradient Boost

Interpreting Gradient Boost

Making Predictions

Fine Tuning Gradient Boost Using Hyper Parameters

Variable Selection And Variable Importance Plot

XGBoost, LightGBM – Popular On Kaggle

Gradient Boost Case Study

Project On Gradient Boost

Quiz 19

Support Vector Machines

Introduction To Support Vector Machines

Building Support Vector Model

Understanding Kernel And Gamma In

Visualising Support Vector Machine

Interpreting Support Vector Machine

Making Predictions

Fine Tuning Support Vector Machine

Support Vector Machine Case Study

Project On Support Vector Machine

Quiz 20

K - Means Clustering

Understanding K – Means Clustering

Selecting Right Number Of Clusters

Elbow Plot

K Means Clustering Case Study

Project On K Means Clustering

Quiz 21

Hierarchical Clustering

Understanding Hierarchical Clustering

Selecting Right Number Of Clusters

Interpreting Dendrogram

Hierarchical Clustering Case Study

Project On Hierarchical Clustering

Quiz 22

Dimensionality Reduction - PCA

Understanding Dimensional Reduction

Applications Of Dimensional Reduction

PCA – Principal Component Analysis Intuition

PCA Calculations In R

PCA – Benefits

PCA Case Study

Project On PCA

Quiz 23

NLP - Text Mining - Bag Of Words

Understanding Text Mining

Cleaning And Preprocessing Text Data

Understanding Terminology

TDM And DTM Formats

Plotting Better

Extracting Amazon Reviews Data

NLP – Case Study

NLP – Project

Quiz 24

NLP - Text Mining - Sentiment Analysis

Understanding Sentiment Analysis

Cleaning And Preprocessing Text Data

Understanding Terminology

Visualising Sentiments

Connecting With Twitter

Extracting Twitter Data

Sentiment Analysis – Case Study

Sentiment Analysis – Project

Quiz 25

Association Rule

Understanding Association Rule

Understanding Apriori Intuition

Case Study Using Apriori

Understanding Eclat

Case Study Using Eclat

Market Basket Analysis

Quiz 26

Recommendation Engine

Introduction To Recommendation Systems Engine

How Recommendation Engine Works

Types Of Recommendation Systems

Building A Recommendation System Engine

Project On Recommendation System

Quiz 27

Time Series Analysis

Introduction To Time Series Analysis

Manipulating Time Series Data

Forecasting Using ARIMA

Exponential Smoothing Method

Visualising Time Series Data

Time Series – Case Study

Time Series – Project

Quiz 28

Data Wrangling With SQL

Introduction To SQL Queries

Connecting R With SQL Server

Fetching & Querying Data From SQL Server Using R

Quiz 29

Deep Learning & Neural Nets

Introduction To Neural Nets

Neural Nets With R

Project On Neural Nets

Quiz 30

