[Kaggle Notes] House Prices: Advanced Regression Techniques
Source: Internet | Editor: 程序博客网 | Time: 2024/05/29 08:34
Competition task
- House price prediction
Code
# -*- coding: utf-8 -*-
"""
Created on Mon May 22 11:32:29 2017
"""
import pandas as pd
from sklearn.ensemble import GradientBoostingRegressor
# from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_extraction import DictVectorizer

train = pd.read_csv('train.csv')
test = pd.read_csv('test.csv')

# Select the features to use
selected_features = ['Foundation', 'Heating', 'Electrical', 'SaleType',
                     'SaleCondition', 'GarageArea', 'YearRemodAdd', 'YearBuilt',
                     '1stFlrSF', '2ndFlrSF', 'TotalBsmtSF', 'BsmtUnfSF',
                     'CentralAir']

# .copy() so the fills below act on independent frames, not on slices of train/test
X_train = train[selected_features].copy()
X_test = test[selected_features].copy()
y_train = train['SalePrice']

# Fill missing values: a fixed category for the categorical columns,
# and each frame's own column mean for the numeric columns
for df in (X_train, X_test):
    df['Electrical'] = df['Electrical'].fillna('SBrkr')
    df['SaleType'] = df['SaleType'].fillna('WD')
    for col in ('GarageArea', 'TotalBsmtSF', 'BsmtUnfSF'):
        df[col] = df[col].fillna(df[col].mean())

# DataFrame.info() prints its summary itself, so no print call is needed
X_train.info()
X_test.info()

# Vectorize the features with DictVectorizer (fit on train, reuse on test)
dict_vec = DictVectorizer(sparse=False)
X_train = dict_vec.fit_transform(X_train.to_dict(orient='records'))
X_test = dict_vec.transform(X_test.to_dict(orient='records'))

# Fit a gradient boosting regressor and predict
# (the random forest regressor is left commented out as an alternative)
# gbr = RandomForestRegressor()
gbr = GradientBoostingRegressor()
gbr.fit(X_train, y_train)
y_predict = gbr.predict(X_test)

# Write the submission file (file name kept from the original script)
submission = pd.DataFrame({'Id': test['Id'], 'SalePrice': y_predict})
submission.to_csv('rfr_submission.csv', index=False)
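To make the vectorization step above more concrete, here is a minimal sketch of what DictVectorizer does with a mix of string and numeric columns. The toy frame and its values are made up for illustration and are not taken from the competition data; only the column names CentralAir and GarageArea come from the feature list above.

```python
import pandas as pd
from sklearn.feature_extraction import DictVectorizer

# Toy frame with made-up values: one categorical and one numeric column.
toy = pd.DataFrame({
    'CentralAir': ['Y', 'N', 'Y'],
    'GarageArea': [548.0, 0.0, 460.0],
})

dict_vec = DictVectorizer(sparse=False)
# Each row becomes a {column: value} dict before vectorization.
encoded = dict_vec.fit_transform(toy.to_dict(orient='records'))

# String values become one indicator column per category;
# numeric values are passed through unchanged.
print(dict_vec.feature_names_)
print(encoded)

# A category that was not seen during fit (hypothetical value 'Maybe')
# is silently ignored, so its indicator columns are all zero.
unseen = dict_vec.transform([{'CentralAir': 'Maybe', 'GarageArea': 100.0}])
print(unseen)
```

Because categories that appear only at transform time are dropped rather than raising an error, the script fits the vectorizer on the training frame and only calls transform on the test frame, which keeps the column layout identical for both.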