DMX - SQL SERVER 数据挖掘决策树
来源:互联网 发布:qq业务宣传图ps源码 编辑:程序博客网 时间:2024/05/20 06:26
在SQL SERVER中, 决策树速度快,应用广泛,可以用于分类,回归,关联分析。
BOL上有详细教程,这里不赘述。
下面是一例预测查询:
select TM.fullname,vba!format(PredictProbability([Bike Buyer]),'Percent') as [Probability]from[TM Decision Tree]natural prediction joinopenquery([AdventureWorksDW2012],'select FirstName + '' '' + LastName as FullName, DateDiff(yy,BirthDate,GetDate()) as Age,Education, Gender, HouseOwnerFlag as [House Owner Flag],MaritalStatus as [Marital Status], NumberChildrenAtHomeas [Number Children At Home], Occupation, TotalChildren as [TotalChildren],NumberCarsOwned as [Number Cars Owned], YearlyIncome as [Yearly Income]from ProspectiveBuyer') as TMwhere Predict([Bike Buyer]) = 1order by PredictProbability([Bike Buyer]) desc
当模型建好,需要考虑准确,进行交叉验证。
CALL SystemGetCrossValidationResults([Targeted Mailing],[TM Decision Tree],[TM Naive Bayes],[TM Neural Net],2,0,'Bike Buyer',1,0.5)
然后,准确比较。
CALL SystemGetAccuracyResults ([Targeted Mailing],[TM Decision Tree],[TM Naive Bayes],[TM Neural Net],3,'Bike Buyer',1,0.5)
ModelName AttributeName AttributeState PartitionIndex PartitionSize Test Measure Value
TM Decision Tree Bike Buyer 1 0 18484 Classification True Positive 6828
TM Decision Tree Bike Buyer 1 0 18484 Classification False Positive 2355
TM Decision Tree Bike Buyer 1 0 18484 Classification True Negative 6997
TM Decision Tree Bike Buyer 1 0 18484 Classification False Negative 2304
TM Decision Tree Bike Buyer 1 0 18484 Likelihood Log Score -0.515976044561631
TM Decision Tree Bike Buyer 1 0 18484 Likelihood Lift 0.177100303313995
TM Decision Tree Bike Buyer 1 0 18484 Likelihood Root Mean Square Error 0.281766535304062
TM Naive Bayes Bike Buyer 1 0 18484 Classification True Positive 5591
TM Naive Bayes Bike Buyer 1 0 18484 Classification False Positive 3106
TM Naive Bayes Bike Buyer 1 0 18484 Classification True Negative 6246
TM Naive Bayes Bike Buyer 1 0 18484 Classification False Negative 3541
TM Naive Bayes Bike Buyer 1 0 18484 Likelihood Log Score -0.673703697378885
TM Naive Bayes Bike Buyer 1 0 18484 Likelihood Lift 0.019372650496705
TM Naive Bayes Bike Buyer 1 0 18484 Likelihood Root Mean Square Error 0.295231719425458
TM Neural Net Bike Buyer 1 0 18484 Classification True Positive 6165
TM Neural Net Bike Buyer 1 0 18484 Classification False Positive 2739
TM Neural Net Bike Buyer 1 0 18484 Classification True Negative 6613
TM Neural Net Bike Buyer 1 0 18484 Classification False Negative 2967
TM Neural Net Bike Buyer 1 0 18484 Likelihood Log Score -0.601339200639234
TM Neural Net Bike Buyer 1 0 18484 Likelihood Lift 0.091737147236361
TM Neural Net Bike Buyer 1 0 18484 Likelihood Root Mean Square Error 0.350182211614771
简单解释,
False positiveCorrect outcome
True negative
True positiveType II error
False negative
如果需要,可以计算敏感性和明确性。
LIFT正好,LOG SCORE近0好,因此,上面三个模型比较,优劣顺序,决策树-》神经元网络-》朴素贝叶斯。
- DMX - SQL SERVER 数据挖掘决策树
- DMX-SQL SERVER 数据挖掘简介一
- DMX- SQL SERVER 数据挖掘简介二
- DMX - SQL SERVER 数据挖掘聚类
- DMX-SQL SERVER 数据挖掘简介一
- SQL Server 2005数据挖掘
- [分享]微软BI专题-数据挖掘扩展插件语言:DMX
- SQL Server 2005数据挖掘开发者指南
- SQL Server 2005 数据挖掘(1)
- SQL Server 2005数据挖掘步骤
- SQL Server 2005数据挖掘开发者指南
- SQL Server 2008数据挖掘查询任务
- SQL Server 2005数据挖掘算法
- SQL Server 2008 数据挖掘算法浅析
- SQL Server 2008 数据挖掘算法浅析
- SQL Server 2008 R2数据挖掘即学即用
- 数据挖掘SSAS(Sql server analysis service)
- 数据挖掘常用技术 决策树
- sharepoint2010 导入AD数据怪现象
- 分享40个使用方便的免费智能手机UI套件
- 面试回答纪要
- Fab CEO:我在创办4家公司中学到的90件事
- webSQL 经常使用的几个必要函数
- DMX - SQL SERVER 数据挖掘决策树
- 超棒的响应式jQuery网格布局插件 - grid-a licious
- 快速保存网页中所有图片的方法
- CentOS下安装Oracle10g图文教程|Linux安装Oracle10g
- 漂泊的细胞
- Win 2003 基本设置
- 安装Win7时无法删除动态分区的解决方法
- Hibernate Annotation延迟加载的默认策略
- SQL Server中日期/时间值到字符类型的数据转换