• No products in the cart.

203.4.1 Model Selection and Cross Validation

Starting off with introduction to Model Validation

After learning about the Tree Building Model Selection we will now study about The Model Selection and Cross Validation

In previous series-es we learned the most important and basic algorithms used in analytics industry.

In this series of posts, we will learn how to choose the best model out of many models we have created for specific problem. We will also learn how to improve our per-existing models.

Model Validation

What is model validation.

  • Checking how good is our model.
  • It is very important to report the ay of the model along with the final model.
  • The model validation in regression is done through R square and Adj R-Square.
  • Logistic Regression, Decision tree and other classification techniques have very similar validation measures.
  • Till now we have seen confusion matrix and accuracy. There are many more validation and model accuracy metrics for classification models.

Classification-Validation measures

  • Confusion matrix, Specificity, Sensitivity
  • ROC, AUC
  • KS, Gini
  • Concordance and discordance
  • Chi-Square, Hosmer and Lemeshow Goodness-of-Fit Test
  • Lift curve

All of them are measuring the model accuracy only. Some metrics work really well for certain class of problems. Confusion matrix, ROC and AUC will be sufficient for most of the business problems

Sensitivity and Specificity

Sensitivity and Specificity are derived from confusion matrix

  • Accuracy=(TP+TN)/(TP+FP+FN+TN)
  • Misclassification Rate=(FP+FN)/(TP+FP+FN+TN)
  • Sensitivity : Percentage of positives that are successfully classified as positive
  • Specificity : Percentage of negatives that are successfully classified as negatives

The next post is calculating Sensitivity and Specificity in R.

DV Analytics

DV Data & Analytics is a leading data science,  Cyber Security training and consulting firm, led by industry experts. We are aiming to train and prepare resources to acquire the most in-demand data science job opportunities in India and abroad.

Bangalore Center

DV Data & Analytics Bangalore Private Limited
#52, 2nd Floor:
Malleshpalya Maruthinagar Bengaluru.
Bangalore 560075
India
(+91) 9019 030 033 (+91) 8095 881 188
Email: info@dvanalyticsmds.com

Bhubneshwar Center

DV Data & Analytics Private Limited Bhubaneswar
Plot No A/7 :
Adjacent to Maharaja Cine Complex, Bhoinagar, Acharya Vihar
Bhubaneswar 751022
(+91) 8095 881 188 (+91) 8249 430 414
Email: info@dvanalyticsmds.com

top
© 2020. All Rights Reserved.