Lecture 7 (presentation)


Slides and text of this presentation


Slide 1

Intro to Machine Learning
Lecture 7
Adil Khan
a.khan@innopolis.ru

Slide 2

Recap
Decision Trees (in class)
for classification
Using categorical predictors
Using classification error as our metric
Decision Trees (in lab)
for regression
Using continuous predictors
Using entropy, Gini, and information gain

Slide 3

Impurity Measures: Covered in Lab Last Week
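For reference only (the slide itself just points back to the lab), the standard definitions of the impurity measures named in the recap, in my notation where p_k is the fraction of class-k points at a node:

```latex
% Standard impurity measures (reference notation, not taken from the slide)
\text{Entropy}(S) = -\sum_{k} p_k \log_2 p_k
\qquad
\text{Gini}(S) = 1 - \sum_{k} p_k^2
% Information gain of splitting S into children S_1, ..., S_m:
\text{IG}(S, \text{split}) = \text{Impurity}(S) - \sum_{j=1}^{m} \frac{|S_j|}{|S|}\,\text{Impurity}(S_j)
```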

Slide 4

Practice Yourself

Slide 5

Today's Objectives
Overfitting in Decision Trees (Tree Pruning)
Ensemble Learning (combine the power of multiple models in a single model while overcoming their weaknesses)
Bagging (overcoming variance)
Boosting (overcoming bias)

Slide 6

Overfitting in Decision Trees

Slide 7

Decision Boundaries at Different Depths

Slide 8

Generally Speaking

Slide 9

Decision Tree Overfitting on Real Data

Slide 10

Simple is Better
When two trees have the same classification error on the validation set, choose the one that is simpler

Slide 11

Modified Tree Learning Problem

Slide 12

Finding Simple Trees
Early Stopping: stop learning before the tree becomes too complex
Pruning: simplify the tree after the learning algorithm terminates

Slide 13

Criterion 1 for Early Stopping
Limit the depth: stop splitting after max_depth is reached
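A minimal sketch of this criterion with scikit-learn (the dataset and the value max_depth=3 are illustrative, not from the lecture):

```python
# Early stopping by limiting depth: a fully grown tree typically overfits,
# while a depth-limited tree generalizes better on held-out data.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

deep = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)                   # grown until leaves are pure
shallow = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)   # early stopping at depth 3

print("deep:    train %.3f  val %.3f" % (deep.score(X_train, y_train), deep.score(X_val, y_val)))
print("shallow: train %.3f  val %.3f" % (shallow.score(X_train, y_train), shallow.score(X_val, y_val)))
```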

Slide 14

Criterion 2 for Early Stopping
Use a threshold for the decrease in error from a split
Stop if a split does not decrease the error by more than the threshold
Mostly works, but may cause problems in some cases
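In scikit-learn, arguably the closest built-in knob to this criterion is min_impurity_decrease, which thresholds the decrease in impurity rather than in classification error; a brief illustrative sketch (the value 0.01 is a placeholder):

```python
# A node is only split if the split decreases the weighted impurity by at
# least this threshold -- similar in spirit to the slide's error-decrease rule.
from sklearn.tree import DecisionTreeClassifier

tree = DecisionTreeClassifier(min_impurity_decrease=0.01, random_state=0)
# tree.fit(X_train, y_train)  # reusing the split from the previous sketch
```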

Slide 15

Criterion 3 for Early Stopping

Slide 16

Early Stopping: Summary

Slide 17

Pruning

Slide 18

Which Tree is Simpler?

Slide 19

Which Tree is Simpler?

Slide 20

Thus, Our Measure of Complexity

Slide 21

New Optimization Goal
Total Cost = Measure of Fit + Measure of Complexity
Measure of Fit = Classification Error (large means a bad fit to the data)
Measure of Complexity = Number of Leaves (large means likely to overfit)
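Written out as a formula (the trade-off parameter λ is notation added here; the slide only names the two terms):

```latex
% Total cost of a tree T; lambda >= 0 trades off fit against complexity
C(T) = \text{Error}(T) + \lambda \, L(T)
% Error(T): classification error of T,  L(T): number of leaves of T
```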

Slide 22

Tree Pruning Algorithm
Let T be the final tree
Start at the bottom of T and traverse up, applying prune_split at each decision node M

Slide 23

prune_split
prune_split(T, M):
Compute the total cost of T
Let T_smaller be the tree obtained by pruning the split at M (replacing the subtree at M with a leaf)
Compute the total cost of T_smaller
If the total cost of T_smaller is lower, prune T to T_smaller
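A compact Python sketch of this bottom-up procedure. The tiny Node class, the λ-weighted cost, and evaluating the cost on a validation set are assumptions of this sketch, not the lecture's exact code:

```python
# Bottom-up pruning: collapse a decision node to a leaf whenever that does
# not increase total cost = classification error + lambda * number of leaves.
from dataclasses import dataclass
from typing import Optional

@dataclass
class Node:
    feature: Optional[int] = None      # index of the splitting feature (None for a leaf)
    threshold: float = 0.0             # split threshold
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    prediction: int = 0                # majority class at this node

def is_leaf(node):
    return node.left is None and node.right is None

def predict(node, x):
    while not is_leaf(node):
        node = node.left if x[node.feature] <= node.threshold else node.right
    return node.prediction

def num_leaves(node):
    return 1 if is_leaf(node) else num_leaves(node.left) + num_leaves(node.right)

def total_cost(root, X_val, y_val, lam):
    error = sum(predict(root, x) != y for x, y in zip(X_val, y_val)) / len(y_val)
    return error + lam * num_leaves(root)

def prune(root, node, X_val, y_val, lam):
    """prune_split applied bottom-up at every decision node."""
    if is_leaf(node):
        return
    prune(root, node.left, X_val, y_val, lam)
    prune(root, node.right, X_val, y_val, lam)
    before = total_cost(root, X_val, y_val, lam)
    left, right = node.left, node.right
    node.left = node.right = None              # tentatively collapse node to a leaf
    after = total_cost(root, X_val, y_val, lam)
    if after > before:                         # pruning hurt -> undo it
        node.left, node.right = left, right
    # ties are kept pruned, in the spirit of "simpler is better" (Slide 10)
```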

Slide 24

Ensemble Learning

Slide 25

Bias and Variance
A complex model could exhibit high variance
A simple model could exhibit high bias

Slide 26

Ensemble Classifier in General

Slide 27

Ensemble Classifier in General

Slide 28

Ensemble Classifier in General

Slide 29

Important
A necessary and sufficient condition for an ensemble of classifiers to be more accurate than any of its individual members is that the members are accurate and diverse (Hansen & Salamon, 1990)

Slide 30

Bagging: Reducing Variance Using an Ensemble of Classifiers from Bootstrap Samples

Slide 31

Aside: Bootstrapping

Slide 32

Bagging
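A minimal sketch of bootstrapping and bagging done by hand, with decision-tree base learners and a majority vote; the dataset, the 25 rounds, and the voting details are illustrative assumptions, not taken from the slides:

```python
# Bagging by hand: each tree is trained on a bootstrap sample (n draws with
# replacement) of the training set; predictions are combined by majority vote.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

rng = np.random.default_rng(0)
n = len(X_train)
trees = []
for _ in range(25):                                   # 25 bootstrap rounds (illustrative)
    idx = rng.integers(0, n, size=n)                  # bootstrap sample of the training set
    tree = DecisionTreeClassifier(random_state=0).fit(X_train[idx], y_train[idx])
    trees.append(tree)

votes = np.stack([t.predict(X_val) for t in trees])   # shape: (n_trees, n_val_points)
majority = (votes.mean(axis=0) > 0.5).astype(int)     # majority vote for 0/1 labels
print("bagged accuracy: %.3f" % (majority == y_val).mean())
```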

Slide 33

Why Does Bagging Work?
Averaging reduces variance
Let X_1, ..., X_n be i.i.d. random variables
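The standard calculation the slide sets up, written in my notation with common variance σ²:

```latex
% Variance of the average of n i.i.d. random variables X_1, ..., X_n
% with E[X_i] = \mu and Var(X_i) = \sigma^2:
\operatorname{Var}\!\left(\frac{1}{n}\sum_{i=1}^{n} X_i\right)
  = \frac{1}{n^2}\sum_{i=1}^{n}\operatorname{Var}(X_i)
  = \frac{\sigma^2}{n}
% Averaging n independent predictors divides the variance by n; bagging
% approximates this with bootstrap-trained (not fully independent) predictors.
```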

Slide 34

Bagging Summary
Bagging was first proposed by Leo Breiman in a technical report in 1994
He also showed that bagging can improve the accuracy of unstable models and decrease the degree of overfitting
I highly recommend you read about his research in: L. Breiman. Bagging Predictors. Machine Learning, 24(2):123–140, 1996.

Slide 35

Random Forests – An Example of Bagging
Draw a random bootstrap sample
Grow a decision tree from the bootstrap sample. At each node:
Randomly select d features without replacement (d smaller than the total number of features)
Split the node using the feature that provides the best split according to the objective function, for instance by maximizing the information gain
Repeat steps 1 to 2 k times
Aggregate the predictions of all trees to assign the class label by majority voting
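This procedure is what scikit-learn's RandomForestClassifier implements; a brief illustrative use (the hyperparameter values are placeholders, and scikit-learn combines trees by averaging predicted class probabilities, a soft variant of majority voting):

```python
# Random forest = bagging of decision trees + random feature subsampling at
# every split; max_features='sqrt' picks roughly sqrt(m) of the m features.
from sklearn.ensemble import RandomForestClassifier

forest = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
# forest.fit(X_train, y_train)        # reusing the data split from earlier sketches
# print(forest.score(X_val, y_val))   # validation accuracy
```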

Slide 36

Making a Prediction

Slide 37

Boosting: Converting Weak Learners to Strong Learners through Ensemble Learning

Slide 38

Boosting and Bagging
Boosting works in a similar way to bagging, except:
Models are built sequentially: each model is built using information from previously built models
Boosting does not involve bootstrap sampling; instead, each tree is fit on a modified version of the original data set

Slide 39

Boosting: (1) Train a Classifier

Slide 40

Boosting: (2) Train the Next Classifier by Focusing More on the Hard Points

Slide 41

What does it mean to focus more?

Slide 42

Example (Unweighted): Learning a Simple Decision Stump

Slide 43

Example (Weighted): Learning a Decision Stump on Weighted Data

Slide 44

Boosting

Slide 45

AdaBoost (an Example of Boosting)
Start with the same weight for every point (uniform weights)
For each round t:
Learn a weak classifier f_t(x) with the current data weights
Compute the coefficient of f_t
Recompute the data weights
Final model predicts by a weighted majority vote of the weak classifiers
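In symbols, using standard AdaBoost notation (the symbols here are mine, not necessarily the slide's), the initialization and the final prediction are:

```latex
% Initialization and final prediction of AdaBoost for labels y_i in {-1, +1}
w_i = \frac{1}{N}, \quad i = 1, \dots, N
\qquad\qquad
\hat{y}(x) = \operatorname{sign}\!\left(\sum_{t=1}^{T} \alpha_t \, f_t(x)\right)
% w_i: weight of training point i,  f_t: weak classifier learned in round t,
% alpha_t: the coefficient of f_t
```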

Slide 46

Slide 47

Weighted Classification Error
Total weight of the mistakes: the sum of the weights of the misclassified points
Total weight of all points: the sum of the weights of every point
The weighted error is the fraction: total weight of mistakes divided by total weight of all points
The best possible value is 0.0
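In formula form, with my notation (w_i the weight of point i, f its predictions, y_i the labels):

```latex
% Weighted classification error of a classifier f under data weights w_i
\epsilon = \frac{\sum_{i:\, f(x_i) \neq y_i} w_i}{\sum_{i=1}^{N} w_i}
% epsilon = 0 when every point is classified correctly (best possible value)
```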

Slide 48

AdaBoost: Computing a Classifier's Weight
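For reference, the standard AdaBoost coefficient in the notation used above (ε_t is the weighted error of the classifier learned in round t); this is the usual textbook form rather than a transcription of the slide:

```latex
% Coefficient of the weak classifier f_t as a function of its weighted error
\alpha_t = \frac{1}{2}\,\ln\!\left(\frac{1 - \epsilon_t}{\epsilon_t}\right)
% Accurate classifiers (epsilon_t < 0.5) get a positive weight; a classifier
% at chance level (epsilon_t = 0.5) gets weight 0.
```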

Slide 49

AdaBoost
Start with the same weight for every point (uniform weights)
For each round t:
Learn a weak classifier f_t(x) with the current data weights
Compute the coefficient α_t of f_t
Recompute the data weights
Final model predicts by a weighted majority vote of the weak classifiers

Slide 50

Slide 51

AdaBoost: Recomputing a Sample's Weight

Slide 52

AdaBoost: Recomputing a Sample's Weight
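The standard update these two slides describe, again in my notation: points that f_t got wrong have their weight increased, points it got right have it decreased:

```latex
% Weight update for training point i after round t (standard AdaBoost)
w_i \leftarrow
\begin{cases}
  w_i \, e^{-\alpha_t} & \text{if } f_t(x_i) = y_i \quad \text{(correct: weight shrinks)} \\
  w_i \, e^{+\alpha_t} & \text{if } f_t(x_i) \neq y_i \quad \text{(mistake: weight grows)}
\end{cases}
```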

Slide 53

AdaBoost

Slide 54

AdaBoost: Normalizing Sample Weights
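Normalization keeps the weights summing to one after each round; the usual form (notation mine) is:

```latex
% Normalize the data weights so they again sum to 1 after each boosting round
w_i \leftarrow \frac{w_i}{\sum_{j=1}^{N} w_j}
```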

Slide 55

AdaBoost
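Putting the pieces together, a compact from-scratch AdaBoost sketch with decision stumps as weak learners. It follows the formulas given above under the same notation; the dataset, the 50 rounds, and the early-exit rule are illustrative assumptions (in practice one would typically use scikit-learn's AdaBoostClassifier):

```python
# AdaBoost from scratch with decision stumps: weighted training, coefficient
# alpha_t, weight update, and normalization, as on the preceding slides.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
y = np.where(y == 1, 1, -1)                       # AdaBoost convention: labels in {-1, +1}
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

N = len(X_train)
w = np.full(N, 1.0 / N)                           # start with uniform weights
stumps, alphas = [], []

for t in range(50):                               # 50 boosting rounds (illustrative)
    stump = DecisionTreeClassifier(max_depth=1, random_state=0)
    stump.fit(X_train, y_train, sample_weight=w)  # learn f_t with current data weights
    pred = stump.predict(X_train)
    eps = w[pred != y_train].sum() / w.sum()      # weighted classification error
    if eps >= 0.5:                                # no better than chance: stop boosting
        break
    alpha = 0.5 * np.log((1 - eps) / max(eps, 1e-12))   # coefficient of f_t
    w *= np.exp(-alpha * y_train * pred)          # shrink correct, grow mistaken weights
    w /= w.sum()                                  # normalize so weights sum to 1
    stumps.append(stump)
    alphas.append(alpha)

# Final prediction: sign of the weighted vote of all stumps
scores = sum(a * s.predict(X_val) for a, s in zip(alphas, stumps))
print("AdaBoost accuracy: %.3f" % (np.sign(scores) == y_val).mean())
```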

Slide 56

Self Study
What is the effect of:
Increasing the number of classifiers in bagging
vs.
Increasing the number of classifiers in boosting

Slide 57

Boosting Summary

Slide 58

Summary
Decision Tree Pruning
Ensemble Learning
Bagging
Boosting


