deeplearning4j之GloVe实现实现
来源:互联网 发布:erp系统在哪下载 php 编辑:程序博客网 时间:2024/06/01 15:22
glove类似于word2vec,听说效果还比word2vec更加强大,可以用于做自然语言处理,正好学习deeplearning4j的时候看到了,顺便写在这,
文章用到的数据跟上一篇word2vec一样,看看效果吧,训练时间比word2vec要长太多,代码如下:
package com.meituan.deeplearning4j;import org.datavec.api.util.ClassPathResource;import org.deeplearning4j.models.glove.Glove;import org.deeplearning4j.text.sentenceiterator.BasicLineIterator;import org.deeplearning4j.text.sentenceiterator.SentenceIterator;import org.deeplearning4j.text.tokenization.tokenizer.preprocessor.CommonPreprocessor;import org.deeplearning4j.text.tokenization.tokenizerfactory.DefaultTokenizerFactory;import org.deeplearning4j.text.tokenization.tokenizerfactory.TokenizerFactory;import org.slf4j.Logger;import org.slf4j.LoggerFactory;import java.io.File;import java.io.FileNotFoundException;import java.util.Arrays;import java.util.Collection;public class GloVeRaw {public static void main(String[] args) throws FileNotFoundException {String filePath = "/Users/shuubiasahi/Desktop/bayies/deeplearning/part-00000";SentenceIterator iter = new BasicLineIterator(new File(filePath));TokenizerFactory t = new DefaultTokenizerFactory();t.setTokenPreProcessor(new CommonPreprocessor());Glove glove = new Glove.Builder().iterate(iter).tokenizerFactory(t).alpha(0.75).learningRate(0.1).epochs(25).xMax(100).batchSize(1000).shuffle(true).symmetric(true).build();glove.fit();System.out.println("和微信最接近的10个词汇:" + glove.wordsNearest("微信", 10));System.out.println(Arrays.toString(glove.getWordVector("微信")));System.out.println("微信和qq的相似度为:" + glove.similarity("微信", "腾讯聊天账号"));System.out.println("和美女最接近的10个词汇:" + glove.wordsNearest("腾讯聊天账号", 10));System.exit(0);}}
0 0
- deeplearning4j之GloVe实现实现
- deeplearning4j之卷积神经网络实现
- doc2vec用于分类 deeplearning4j实现
- Deeplearning4j 实战(2):Deeplearning4j 手写体数字识别Spark实现
- doc2vec用于训练文本向量-deeplearning4j实现
- 卷积文本分类(gpu)实现--deeplearning4j
- glove背后的计算原理(进一步理解glove实现细节)
- Deeplearning4j 实战(2):Deeplearning4j 手写体数字识别Spark实现【转】
- 深度学习Deeplearning4j 入门实战(2):Deeplearning4j 手写体数字识别Spark实现
- lstm用于文本分类(gpu实现)--deeplearning4j为例子
- Deeplearning4j源码研习(1): BP算法原理及源码实现
- deeplearning4j实现多感知器的手写数字识别
- deeplearning4j之word2vec学习
- deeplearning4j
- Deeplearning4j
- Deeplearning4j 实战(4):Deep AutoEncoder进行Mnist压缩的Spark实现
- Deeplearning4j 实战(5):基于多层感知机的Mnist压缩以及在Spark实现
- Deeplearning4j 实战(6):基于LSTM的文本情感识别及其Spark实现
- FreeMaker 截取指定长度字符串
- 打断点遇到的异常
- cookie和session
- springMVC模型驱动set,get再加工理解
- linux shell脚本 截取字符串时执行错误:bad substitution
- deeplearning4j之GloVe实现实现
- 基于SIFT特征的全景图像拼接
- 异常处理综合理解
- git-创建空白分支
- linux下安装zookeeper服务
- 有关springMvc跳转的特点
- equal hashcode compareTo使用时机
- struts2-Action-1-示例
- 关于mybatis传空值的处理