SIGIR 2016 Improving Language Estimation with the Paragraph Vector Model for Ad-hoc Retrieval
Source: Internet  Published: java进阶路线  Editor: 程序博客网  Time: 2024/06/05 17:15
Venue: SIGIR'16
Abstract: Incorporating topic level estimation into language models has been shown to be beneficial for information retrieval (IR) models such as cluster-based retrieval and LDA-based document representation. Neural embedding models, such as paragraph vector (PV) models, on the other hand have shown their effectiveness and efficiency in learning semantic representations of documents and words in multiple Natural Language Processing (NLP) tasks. However, their effectiveness in information retrieval is mostly unknown. In this paper, we study how to effectively use the PV model to improve ad-hoc retrieval. We propose three major improvements over the original PV model to adapt it for the IR scenario: (1) we use a document frequency-based rather than the corpus frequency-based negative sampling strategy so that the importance of frequent words will not be suppressed excessively; (2) we introduce regularization over the document representation to prevent the model overfitting short documents along with the learning iterations; and (3) we employ a joint learning objective which considers both the document-word and word-context associations to produce better word probability estimation. By incorporating this enhanced PV model into the language modeling framework, we show that it can significantly outperform the state-of-the-art topic enhanced language models.
Download link: https://ciir-publications.cs.umass.edu/pub/web/getpdf.php?id=1227
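
Point (1) of the abstract contrasts two ways of building the noise distribution for negative sampling. The following is a minimal sketch of that contrast, not the paper's exact formulation: it assumes the noise probability of a word is simply proportional to its frequency raised to a power. The function name `noise_distributions`, the `power` parameter, and the toy corpus are all illustrative assumptions that do not come from the paper.

```python
from collections import Counter


def noise_distributions(docs, power=1.0):
    """Return (corpus-frequency, document-frequency) noise distributions.

    Assumption: P(w) is proportional to count(w) ** power. The paper may
    use a different normalization; this only illustrates the idea.
    """
    cf = Counter()  # total occurrences of each word across the corpus
    df = Counter()  # number of documents that contain each word
    for doc in docs:
        cf.update(doc)
        df.update(set(doc))  # each word counts at most once per document

    def normalize(counts):
        weights = {w: c ** power for w, c in counts.items()}
        total = sum(weights.values())
        return {w: v / total for w, v in weights.items()}

    return normalize(cf), normalize(df)


# Hypothetical toy corpus: "retrieval" repeats inside a single document,
# while "model" appears once in every document.
docs = [
    ["retrieval", "retrieval", "retrieval", "model"],
    ["language", "model"],
    ["paragraph", "vector", "model"],
]
p_cf, p_df = noise_distributions(docs)
print(p_cf["retrieval"], p_df["retrieval"])
```

In this toy corpus, "retrieval" has the same corpus frequency as "model" but appears in only one document, so its probability of being drawn as a negative sample is lower under the document-frequency distribution than under the corpus-frequency one.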