Google Wide&&Deep Model
来源:互联网 发布:it技能 help desk 编辑:程序博客网 时间:2024/04/26 17:59
谷歌的GooglePlay的推荐模型Wide and Deep Learning 模型,目前已经提供了tensorflow的开源版本。
本文使用keras完成:
#coding: utf-8
'''
用keras写的google Wide&&Deep model
'''
import pandas as pd
from keras.models import Sequential
from keras.layers import Dense, Merge
from sklearn.preprocessing import MinMaxScaler
#所有的数据列
COLUMNS = [
"age", "workclass", "fnlwgt", "education", "education_num", "marital_status",
"occupation", "relationship", "race", "gender", "capital_gain", "capital_loss",
"hours_per_week", "native_country", "income_bracket"
]
#标签列
LABEL_COLUMN = "label"
#类别型特征变量
CATEGORICAL_COLUMNS = [
"workclass", "education", "marital_status", "occupation", "relationship",
"race", "gender", "native_country"
]
#连续值特征变量
CONTINUOUS_COLUMNS = [
"age", "education_num", "capital_gain", "capital_loss", "hours_per_week"
]
#加载文件
def load(filename):
with open(filename, 'r') as f:
skiprows = 1 if 'test' in filename else 0
df = pd.read_csv(
f, names=COLUMNS, skipinitialspace=True, skiprows=skiprows, engine='python'
)
#缺省值处理
df = df.dropna(how='any', axis=0)
return df
#预处理
def preprocess(df):
df[LABEL_COLUMN] = df['income_bracket'].apply(lambda x: ">50K" in x).astype(int)
df.pop("income_bracket")
y = df[LABEL_COLUMN].values
df.pop(LABEL_COLUMN)
df = pd.get_dummies(df, columns=[x for x in CATEGORICAL_COLUMNS])
# TODO: 对特征进行选择,使得网络更高效
# TODO: 特征工程,比如加入交叉与组合特征
# from sklearn.preprocessing import PolynomialFeatures
# X = PolynomialFeatures(degree=2, interaction_only=True, include_bias=False).fit_transform(X)
df = pd.DataFrame(MinMaxScaler().fit_transform(df), columns=df.columns)
X = df.values
return X, y
def main():
df_train = load('E:\\adult.data')
df_test = load('E:\\adult.test')
df = pd.concat([df_train, df_test])
train_len = len(df_train)
X, y = preprocess(df)
X_train = X[:train_len]
y_train = y[:train_len]
X_test = X[train_len:]
y_test = y[train_len:]
#Wide部分
wide = Sequential()
wide.add(Dense(1, input_dim=X_train.shape[1]))
#Deep部分
deep = Sequential()
# TODO: 添加embedding层
deep.add(Dense(input_dim=X_train.shape[1], output_dim=100, activation='relu'))
#deep.add(Dense(100, activation='relu'))
deep.add(Dense(input_dim=100, output_dim=32, activation='relu'))
#deep.add(Dense(50, activation='relu'))
deep.add(Dense(input_dim=32, output_dim=8))
deep.add(Dense(1, activation='sigmoid'))
#Wide和Deep拼接
model = Sequential()
model.add(Merge([wide, deep], mode='concat', concat_axis=1))
model.add(Dense(1, activation='sigmoid'))
#编译模型
model.compile(
optimizer='rmsprop',
loss='binary_crossentropy',
metrics=['accuracy']
)
#模型训练
model.fit([X_train, X_train], y_train, nb_epoch=10, batch_size=32)
#loss与准确率评估
loss, accuracy = model.evaluate([X_test, X_test], y_test)
print('\n', 'test accuracy:', accuracy)
if __name__ == '__main__':
main()
- Google Wide&&Deep Model
- The Wide and Deep Learning Model(译文+Tensorlfow源码解析)
- wide & deep论文-----2016.6.24
- 第十三课 wide&deep模型
- 《Wide & Deep Learning for Recommender Systems 》笔记
- Deep&Wide Learning论文阅读笔记
- tensorflow线性模型以及Wide deep learning
- tensorflow线性模型以及Wide deep learning
- 学习笔记:TensorFlow Wide & Deep Learning Tutorial
- 论文阅读——Wide & Deep Learning
- 《Wide & Deep Learning for Recommender Systems》
- The Wide and Deep Learning Model(译文+Tensorlfow源码解析) 原创 2017年11月03日 22:14:47 标签: 深度学习 / 谷歌 / tensorf
- deep learning pretrained model
- training deep learning model
- 《Wide and Deep Learning for Recommender Systems》学习笔记
- TensorFlow学习笔记9----TensorFlow Wide & Deep Learning Tutorial
- 论文笔记:Wide & Deep Learning for Recommender Systems
- TensorFlow Wide And Deep 模型详解与应用
- 4.4、数组与指针
- 如何在C++中调用C函数
- 暑假训练day1 : 2016 USP-ICMC
- ##公司合作项目--基于质差指标筛选结果的 spark-sql 查询系统Demo
- 第二天
- Google Wide&&Deep Model
- 自定义控件
- 3522. 【NOIP2013模拟11.7B组】迷宫花园(maze)
- 我的学习之路_第二十七章_jQuery
- python学习笔记(1)
- Android实现按钮点击传按钮的值到另一个按钮
- 自己写的HTML嵌入Android项目,运用webview控件与Fragment与activity之间的传值
- 线程本地存储ThreadLocal
- linux修改镜像的地址