Active Learning Literature Survey

Source: Internet. Editor: 程序博客网. Time: 2024/05/27 03:27

Three main active learning scenarios

Membership query synthesis:

The learner may request labels for any unlabeled instance in the input space.

Uncertainty sampling:

An active learner queries the instances about which it is least certain how to label.

Entropy

Single classifier:

Magnitude of the entropy of the predicted label distribution
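
As a minimal sketch of entropy-based uncertainty sampling for a single classifier: assuming the model already outputs a label distribution for each unlabeled instance, query the instance whose distribution has the highest entropy. The names `entropy`, `most_uncertain`, and the toy `pool_probs` are made up for illustration.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a discrete label distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def most_uncertain(pool_probs):
    """Index of the instance whose predicted distribution has the highest entropy."""
    return max(range(len(pool_probs)), key=lambda i: entropy(pool_probs[i]))

# Hypothetical predicted distributions for three unlabeled instances.
pool_probs = [
    [0.95, 0.03, 0.02],  # confident prediction
    [0.40, 0.35, 0.25],  # close to uniform, so high entropy
    [0.70, 0.20, 0.10],
]
query_index = most_uncertain(pool_probs)
```

Here the second instance is the one selected, since its distribution is closest to uniform.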

Multiple classifiers (query-by-committee):

The committee members vote, and the instance they most disagree on is selected

minimizing the version space

1. be able to construct a committee of models that represent different regions of the version space

2. have some measure of disagreement among committee members

vote entropy / average Kullback-Leibler (KL) divergence
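
The two disagreement measures can be sketched as follows. This is a rough illustration, assuming hard votes for vote entropy and per-member label distributions for the average KL divergence to the committee consensus (taken here as the mean distribution); all function names are invented for the example.

```python
import math

def vote_entropy(votes, labels):
    """Entropy of the committee's hard votes over the possible labels."""
    c = len(votes)
    total = 0.0
    for y in labels:
        frac = votes.count(y) / c  # fraction of members voting for label y
        if frac > 0.0:
            total -= frac * math.log(frac)
    return total

def kl_divergence(p, q):
    """KL(p || q) for discrete distributions; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def average_kl(member_probs):
    """Mean KL divergence of each member's distribution from the consensus."""
    c = len(member_probs)
    n_labels = len(member_probs[0])
    consensus = [sum(m[k] for m in member_probs) / c for k in range(n_labels)]
    return sum(kl_divergence(m, consensus) for m in member_probs) / c

# A unanimous committee shows no disagreement; an evenly split one shows the most.
unanimous = vote_entropy(["a", "a", "a", "a"], ["a", "b"])
split = vote_entropy(["a", "b", "a", "b"], ["a", "b"])
```

Under either measure, the instance with the largest value is the one to query.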

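3.3 expected model change

Add the instances that, once their labels are known, would bring the greatest change to the model

For neural networks, select the instance that produces the largest change in the gradient
This method has achieved good results, but it is computationally expensive when the feature space and the label set are large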
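
A small sketch of the gradient-based idea, using binary logistic regression instead of a neural network to keep it short (that substitution is an assumption of this example, as are all the names). Since the true label is unknown, the gradient norm is averaged over both possible labels, weighted by the model's current prediction.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss_gradient(w, x, y):
    """Gradient of the log-loss of a logistic model at one labeled example (x, y)."""
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
    return [(p - y) * xi for xi in x]

def norm(v):
    return math.sqrt(sum(vi * vi for vi in v))

def expected_gradient_length(w, x):
    """Gradient norm averaged over both labels, weighted by the model's belief."""
    p1 = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
    return ((1.0 - p1) * norm(loss_gradient(w, x, 0))
            + p1 * norm(loss_gradient(w, x, 1)))

# Hypothetical current weights and unlabeled pool.
w = [1.0, -1.0]
pool = [[3.0, 0.0], [1.0, 1.0], [0.1, 0.0]]
scores = [expected_gradient_length(w, x) for x in pool]
query_index = max(range(len(pool)), key=lambda i: scores[i])
```

The cost concern is visible even here: every candidate needs one gradient per possible label, and each gradient has as many entries as the model has parameters.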

3.4 variance reduction: minimize the variance; a closed-form expression is not always available for a given model

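3.5 estimated error reduction

Estimate the expected error after a given instance is added

most prohibitively expensive query selection framework

1. it requires computing the expected error after adding each possible query,

2. different queries give rise to different combinations, so the computation has to be iterated repeatedly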
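
One common way to write the 0/1-loss variant of this criterion is given below. The notation is assumed here rather than taken from these notes: θ is the current model, θ^{+⟨x, y_i⟩} is the model re-trained after adding the candidate x with label y_i, U is the size of the unlabeled pool, and ŷ is the re-trained model's most probable label for x^{(u)}.

```latex
x^{*}_{0/1} = \operatorname*{argmin}_{x} \sum_{i} P_{\theta}(y_i \mid x)
  \left( \sum_{u=1}^{U} 1 - P_{\theta^{+\langle x, y_i \rangle}}\big(\hat{y} \mid x^{(u)}\big) \right)
```

Read this way, the cost is clear: each candidate x requires one re-training per possible label y_i, and each re-trained model must then be evaluated over the whole pool.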

3.6 density-weighted methods

Uncertainty sampling and QBC strategies both select data lying on the decision boundary; this method selects representative data, so that the selection is better overall

informative instances should not only be those which are uncertain, but also those which are representative of the input distribution
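
A rough sketch of density weighting, assuming a base uncertainty score per instance and cosine similarity as the similarity measure (both are choices made for this example): the base score is multiplied by the instance's average similarity to the pool, raised to a power `beta`.

```python
import math

def cosine(a, b):
    dot = sum(ai * bi for ai, bi in zip(a, b))
    na = math.sqrt(sum(ai * ai for ai in a))
    nb = math.sqrt(sum(bi * bi for bi in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0

def information_density(uncertainty, x, pool, beta=1.0):
    """Base uncertainty weighted by the average similarity of x to the pool.

    Assumes the average similarity is non-negative when beta is not an integer.
    """
    avg_sim = sum(cosine(x, u) for u in pool) / len(pool)
    return uncertainty * avg_sim ** beta

# Hypothetical pool: three clustered points plus one outlier (the last one).
pool = [[1.0, 0.1], [1.0, 0.2], [0.9, 0.15], [0.0, 1.0]]
uncertainties = [0.5, 0.6, 0.55, 0.9]  # the outlier is the most uncertain

plain_choice = max(range(len(pool)), key=lambda i: uncertainties[i])
scores = [information_density(u, x, pool) for u, x in zip(uncertainties, pool)]
weighted_choice = max(range(len(pool)), key=lambda i: scores[i])
```

Plain uncertainty picks the outlier, while the density-weighted score prefers an uncertain point inside the cluster.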

4.1 caveats about active learning

1. the actively selected training dataset depends on the model and cannot fully reflect the underlying distribution of the data

2.

active-learning with costs

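Different data are not equally hard to obtain; if the goal is to reduce the overall cost of training, simply reducing the number of training samples is not enough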

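Semi-supervised learning: add the most confident instances to the training set

Active learning: uncertainty sampling

multi-view learning and co-training: different models are trained on the labeled data and then classify the unlabeled data; each model passes the samples it is most confident about to the other models for training, and selects the ones it is least certain about for its own re-training

Semi-supervised learning focuses on what the learner already knows, while active learning focuses on what the learner does not know. The two can be combined.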
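
A minimal sketch of combining the two, under the assumption that the model gives a label distribution for every unlabeled instance: confident predictions are kept as pseudo-labels (the semi-supervised side), and the least confident of the rest are sent to the annotator (the active side). The function name and the threshold are hypothetical.

```python
def split_pool(pool_probs, confident_at=0.9, n_queries=1):
    """Return (pseudo_labeled, to_query) for a pool of predicted distributions.

    pseudo_labeled: (index, predicted label) pairs with confidence >= confident_at.
    to_query: indices of the least confident remaining instances.
    """
    confidences = [max(p) for p in pool_probs]
    pseudo_labeled = [
        (i, p.index(max(p)))
        for i, p in enumerate(pool_probs)
        if confidences[i] >= confident_at
    ]
    remaining = [i for i in range(len(pool_probs)) if confidences[i] < confident_at]
    to_query = sorted(remaining, key=lambda i: confidences[i])[:n_queries]
    return pseudo_labeled, to_query

# Hypothetical binary predictions for four unlabeled instances.
pool_probs = [[0.97, 0.03], [0.55, 0.45], [0.10, 0.90], [0.70, 0.30]]
pseudo_labeled, to_query = split_pool(pool_probs)
```

Instances that are neither confident enough nor selected for querying simply stay in the pool for the next round.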

Reinforcement learning

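Its relation to active learning is that, in order to perform well, the learner needs to be proactive.

Reinforcement learning often takes actions that are the best policy according to past experience but are not the optimal policy. To improve, it needs to try risky steps. This is often called the exploration-exploitation tradeoff.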
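
The tradeoff can be illustrated with an epsilon-greedy rule on a toy multi-armed bandit. This is a sketch chosen only for illustration; the names `epsilon_greedy` and `run_bandit` and the Gaussian reward model are assumptions of the example.

```python
import random

def epsilon_greedy(value_estimates, epsilon, rng):
    """With probability epsilon explore a random action, otherwise exploit the best estimate."""
    if rng.random() < epsilon:
        return rng.randrange(len(value_estimates))
    return max(range(len(value_estimates)), key=lambda a: value_estimates[a])

def run_bandit(true_means, epsilon, steps, seed=0):
    """Play a Gaussian bandit and return (pull counts, estimated mean rewards)."""
    rng = random.Random(seed)
    counts = [0] * len(true_means)
    estimates = [0.0] * len(true_means)
    for _ in range(steps):
        a = epsilon_greedy(estimates, epsilon, rng)
        reward = rng.gauss(true_means[a], 1.0)
        counts[a] += 1
        # Incremental update of the running mean reward for action a.
        estimates[a] += (reward - estimates[a]) / counts[a]
    return counts, estimates

counts, estimates = run_bandit([0.2, 0.8, 0.5], epsilon=0.1, steps=500)
```

With `epsilon=0` the learner only exploits what looked best in the past; a small positive `epsilon` makes it occasionally try actions that currently look worse.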

Equivalence query learning:

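The learner proposes a hypothesis about the labels of instances, and the annotator states whether the hypothesis is correct. If it is not, the annotator must supply a counter-example, i.e. an instance whose true label differs from the one the hypothesis assigns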

Active class selection

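Traditional active learning assumes that obtaining data is easy but labeling it incurs a cost. In the opposite situation, the class label is known and the learner needs to query for an instance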

Active feature acquisition and classification

using incomplete symptom information as the feature set

active feature acquisition seeks to alleviate these problems by allowing the learner to request more complete feature information

select the most informative features to obtain

Model parroting and compression

0 0