Mahout vector creation problem: There are too many documents that do not have a term vector
Source: Internet  Editor: 程序博客网  Time: 2024/06/06 03:12
bin/mahout lucene.vector --dir /home/hadoop/index --output /user/hadoop/out/part-out.vec --field title --idField id --dictOut /user/hadoop/out/dict.out \
--maxPercentErrorDocs 0.1
Exception in thread "main" java.lang.IllegalStateException: There are too many documents that do not have a term vector for ***
at org.apache.mahout.utils.vectors.lucene.LuceneIterator.computeNext(LuceneIterator.java:118)
at org.apache.mahout.utils.vectors.lucene.LuceneIterator.computeNext(LuceneIterator.java:41)
at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:141)
at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:136)
at org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter.write(SequenceFileVectorWriter.java:44)
at org.apache.mahout.utils.vectors.lucene.Driver.dumpVectors(Driver.java:109)
at org.apache.mahout.utils.vectors.lucene.Driver.main(Driver.java:250)
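Cause 1
*** is a field that does not exist in the index. Change --field to the correct field name.
Cause 2
*** is a field whose termVectors setting is false.
Fix: when building the index, set termVectors to true for that field.
Cause 3
The number of error documents (documents without a term vector) exceeds the allowed percentage.
You can add the parameter --maxPercentErrorDocs 0.1,
which allows up to 10% of the documents to be error documents.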
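To make the effect of --maxPercentErrorDocs more concrete, here is a minimal sketch of how such a threshold could behave. It is a simplified model, not Mahout's actual code: the class name TermVectorErrorBudget and both methods are hypothetical, and it assumes the limit is the fraction multiplied by the number of documents (rounded down) and that the exception is thrown as soon as the number of documents without a term vector reaches that limit.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical, simplified model of the error-document threshold (not Mahout's code). */
final class TermVectorErrorBudget {

    /** Assumed rule: limit = maxPercentErrorDocs * numDocs, rounded down. */
    static int maxErrorDocs(double maxPercentErrorDocs, int numDocs) {
        return (int) (maxPercentErrorDocs * numDocs);
    }

    /**
     * Walks the documents and skips those without a term vector.
     * Returns the number of usable documents, or throws once the number
     * of skipped documents reaches the limit (assumed ">=" comparison).
     */
    static int countUsableDocs(List<Boolean> hasTermVector, double maxPercentErrorDocs) {
        int limit = maxErrorDocs(maxPercentErrorDocs, hasTermVector.size());
        int errors = 0;
        int usable = 0;
        for (boolean ok : hasTermVector) {
            if (ok) {
                usable++;
                continue;
            }
            errors++;
            if (errors >= limit) {
                throw new IllegalStateException(
                    "There are too many documents that do not have a term vector");
            }
        }
        return usable;
    }

    public static void main(String[] args) {
        // Four documents, none missing a term vector: passes even with a 0.0 budget.
        List<Boolean> docs = Arrays.asList(true, true, true, true);
        System.out.println(countUsableDocs(docs, 0.0));
    }
}
```

Under these assumptions, a budget of 0.0 fails on the very first document without a term vector, which matches the symptom described in Cause 3. Raising the value only hides the problem if the real cause is a wrong field name or termVectors being false, so Causes 1 and 2 are worth checking first.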