Use document modeling to enhance PMF_1: CTR Model.
来源:互联网 发布:淘宝app能修改差评吗 编辑:程序博客网 时间:2024/06/06 01:23
Drawbacks of PMF
- Matrix factorization only uses information from other users, it cannot generalize to completely unrated items.(They cannot be used for recommending new products which have yet to receive rating information from any user)
- The prediction accuracy often drops significantly when the ratings are very sparse.
- The learnt latent space is not easy to interpret.(CTR Model can do this)
Use LDA to Enhance PMF
LDA
Documents are represented as random mixtures over latent topics, where each topic is characterized by a distribution over words. LDA assumes the following generative process for a corpus consisting of documents each of length :
1. Choose , where and {\displaystyle \mathrm {Dir} (\alpha )} is the Dirichlet distribution for parameter
2. Choose, where
3. For each of the word positions , where , and
(Note that the Multinomial distribution here refers to the Multinomial with only one trial. It is formally equivalent to the categorical distribution.)
4. Plate notation are as follows:
Categorical Distribution
K-dimensional categorical distribution is the most general distribution over a K-way event; any other discrete distribution over a size-K sample space is a special case. The parameters specifying the probabilities of each possible outcome are constrained only by the fact that each must be in the range 0 to 1, and all must sum to 1.
is the Iverson bracket
Multinomial Distribution
When n is 1 and k is 2 the multinomial distribution is the Bernoulli distribution.
When k is 2 and number of trials are more than 1 it is the Binomial distribution.
When n is 1 it is the categorical distribution.
Combine LDA into PMF: CTR
For each item j,
(a) Draw topic proportions
θj ∼ Dirichlet(α).
(b) Draw item latent offsetj∼N(0;λ−1IK) and set the item latent vector asvj =ϵj +θj .
(c) For each wordwjn ,
i. Draw topic assignment zjn ∼ Mult(θ).
ii. Draw wordwjn ∼ Mult(βzjn ).For each user-item pair
(i;j) , draw the rating
rij∼N(uTivj;c−1ij)
The key property in CTR lies in how the item latent vector
This is why we call the model collaborative topic regression
- Plate Notation are as follows:
,
Next: Use SDAE to enhance PMF //TODO
the latent representation learned by CTR may not be very effective when the auxiliary information is very sparse.
references
- C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In KDD, pages 448-456, 2011.
- Use document modeling to enhance PMF_1: CTR Model.
- use mongoc add document to mongodb
- Model One-to-Many Relationships with Document References
- use-case modeling 培训记录
- How to use document library in Sharepoint 2010
- Recommended Doctype Declarations to use in your Web document
- Extending the ASP.Net Security model to use rights
- use mongoose to update documents with model and Schema
- How to use tensorflow pre-trained model in Android
- Using Header Files To Enhance Portability
- Modeling Project-Directed Graph Document[z]
- CTR
- DOM (Document Object Model)
- Document Object Model
- Document Object Model Events
- DOM -----Document Object Model
- DOM(Document Object Model)
- The Use Case Model
- 在甘心之前 能否再拼一把?
- 最长上升子序列
- 为什么多线程读写 shared_ptr 要加锁?
- 实现一个四则加法器运算
- 深泽小舟,失落的十年
- Use document modeling to enhance PMF_1: CTR Model.
- 100天土鸡饲养计划(86)
- 微信小程序之注册配置
- 深入理解Java虚拟机之走近Java
- Demo02数学运算__自增自减__取余运算
- xml的生成和解析
- Redis--The PHP Extension Installed
- ffmpeg framework
- Exception:DatePickerDialog和TimePickerDialog会执行两次的异常