Get Machine Learning Certification from the World’s top Universities. Earn Masters, Executive PGP, or Advanced Certificate Programs to fast-track your career.
Decision tree classification
The decision trees can be broadly classified into two categories, namely, Classification trees and Regression trees.
1. Classification trees
Classification trees are those types of decision trees that are based on answering “Yes” or “No” questions and using this information to come to a decision. So, a tree, which determines whether a person is fit or unfit by asking a bunch of related questions and using the answers to come to a viable solution, is a type of classification tree.
These types of trees are usually constructed by employing a process which is called binary recursive partitioning. The method of binary recursive partitioning involves splitting the data into separate modules or partitions, and then these partitions are further split into every branch of the decision tree classifier. One commonly used technique in classification trees is the Gini Index in decision trees, which helps measure the purity of data and determines the best splits.
2. Regression Trees
Now, a regression-type of decision tree is different from the classification-type of decision tree in one aspect. The data that has been fed into the two trees are very different. The classification trees handle the data, which is discrete, while the regression decision trees handle the continuous data type. A good example of regression trees would be the house price or how long a patient will typically stay in the hospital.
Learn more: Linear Regression in Machine Learning
How are the decision trees created?
Decision trees are created by taking the set of data that the model has to be trained on (decision trees are a part of supervised machine learning). This training dataset is to be continuously spliced into smaller data subsets. This process is complemented by the creation of an association tree that incrementally gets created side by side in the process of breaking down the data. After the machine has finished learning, the creation of a decision tree based on the training dataset that has been provided concludes, and this tree is then returned to the user.
The central idea behind using a decision tree is to separate the data into two primary regions: the region with the dense population (cluster) and the area, which are empty (or sparse) regions.
Decision Tree classification works on an elementary principle of division. It conquers where any new example that has been fed into the tree, after going through a series of tests, would be organized and given a class label. The algorithm of divide and conquer is discussed in detail below:
Divide and conquer
It is apparent that the decision tree classifier is based on and built by making use of a heuristic known as recursive partitioning, also known as the divide and conquer algorithm. It breaks down the data into smaller sets and continues to do so. Until it has determined that the data within each subset is homogenous, or if the user has defined another stopping criterion, that would put a stop to this algorithm.
How does the decision tree classifier work?
- The divide and conquer algorithm is used to create a decision tree classifier. By making the use of the algorithm, we always begin at the root of the tree, and we also split the dataset to reduce the uncertainty in the final decision.
- It happens to be an iterative process. So, we repeat this process at every node. This process is repeated until the time we don’t have the nodes of the purity we desire.
- Generally, to avoid overfitting, we set a limit on the purity to be achieved. This means the final result might not be 100% pure.
Basics of the divide and conquer algorithm:
- First comes choosing or selecting a test for the root node. Then begins the process of creating branches. The branches are designed with keeping in mind each possible outcome of the trial that has been defined.
- Next comes the splitting of the instances of data into smaller subsets. Each branch would have its own splice, which is connected to the node.
- This process then has to be repeated for each branch by using just the instances that come to the branch in question.
- This recursive process should be stopped if all the instances belong to the same class.
Advantages of using decision tree classification
- It does not require a tremendous amount of money to construct.
- It is a swift process of classification of records that are new or unknown.
- It can be very easily interpreted, especially if the tree is small in size.
- The accuracy of prediction using the decision tree classifier is comparable to other methods of prediction or classification.
- It also has the capability to exclude the features that are unimportant. This process of eliminating irrelevant features is done automatically.
Read: How to create a perfect decision tree?
Disadvantages of using the decision tree classifier
- Overfitting the dataset is very easy in this case.
- The boundary of the decision has a restriction. It can only be parallel to the axes, which contain the attributes.
- Models based on decision trees often have biased splits that have a massive number of levels.
- Any small changes made to the dataset can have a significant impact on the logic that governs the decision.
- Larger trees are challenging to understand because sometimes they might feel very counterintuitive.
Also read: Decision Trees in Machine Learning