Splitting Attribute measure in Decision Tree Learning (ML)
来源:互联网 发布:二叉树的先序遍历算法 编辑:程序博客网 时间:2024/05/29 16:17
Ref: chapter 8 in <<Data Mining. Concepts and Techs> 3rd Ed, by Han, etc.
Information Gain
strong: easy implement
weak: it prefers to select attribute with a large number of values, therefore, the selected splitting attribute might cause a large number of partitions, leading to bad purity. For example, in traffic classification, if using packet size attribute as splitting attribute, the partitions (internal nodes) can include such as 20, 50, 100, 1200, 1500, etc.
Information Gain+Gain Ratio
strong: improve the weakness of information Gain by using the gain ratio parameter to select splitting attribute with a relatively smaller size. It is a trade-off betweenrespecting to classes andrespecting to outcome partitions.
weak: if the split information approaches 0, the ratio is unstable. So, to avoid this, the information gain selected must be large.
Gini Index
measures the impurity of training data set or a partition. The subset that gives the minimum Gini index for that attribute is selected as its splitting subset.
weak: difficult when the number of classes is large
- Splitting Attribute measure in Decision Tree Learning (ML)
- [python for ML] Decision tree
- Machine Learning--Decision Tree
- Decision tree learning
- ML-Gradient Boost Decision Tree(+ Treelink)
- 【ML--09】决策树算法Decision Tree
- 决策树(Decision Tree)-机器学习ML
- decision tree Learning 决策树学习笔记
- OpenCV(4)ML库->Decision Tree决策树
- 【ML学习笔记】9:认识Decision Tree决策树
- 【ML】【python】Machine Learning in Action
- DataMing Papers:<The alternating decision tree learning algorithm>
- decision tree
- decision tree
- decision tree
- Decision Tree
- Decision Tree
- Decision Tree
- 浅谈大型web系统架构
- struts2+spring
- paip.c语言gtk开发环境CodeBlocks /QT建立最佳实践
- WIKIOI 1569 最佳绿草
- uva 10000 Longest Paths (SPFA)
- Splitting Attribute measure in Decision Tree Learning (ML)
- 微软云技术Windows Azure专题(一):如何利用Service Bus向Windows商店应用推送消息
- 优秀网站收集
- IOS XML解析
- MongoDB README
- Eclipse中如何关联Javadoc
- Matlab矩阵生成方式
- 博客已移至博客园
- 【leetcode】Balanced Binary Tree