Deep Learning Face Attributes in the Wild
来源:互联网 发布:思迅天店收银软件 编辑:程序博客网 时间:2024/04/28 04:23
文章要解决的问题
Predicting face attributes from web images
方法的主要想法
It cascades two CNNs (LNet and ANet) forface localization and attribute prediction respectively.
贡献(吹牛逼)
(1) It shows how LNet and ANet can be improved by different pre-trainingstrategies.
(2) It reveals that although filters of LNet are fine-tuned by attributelabels, their response maps over the entire image have strong indication offace’s location.
(3)It also demonstrates that the high-level hidden neurons of ANetautomatically discover semantic concepts after pretraining, and such concepts aresignificantly enriched after fine-tuning.
pre-train andfine-tuned
LNet and ANet are first pretrained differently and then jointly trainedwith attribute labels.
LNet is pre-trained by classifying massive general object categories.Thus, its pre-trained features have good generalization capability on handlingvarious background clutters. LNet is then fine-tuned by predicting attributes.
ANet is pre-trained by classifying massive face identities, to obtaindiscriminative face representation. Then it is fine-tuned by the attributeprediction task.
Pre-train的原因和人脸定位的理论:
A filter (or a group of filters) functions as a detector of an attribute. Whena subset of neurons are activated, they indicate the existence of face images,which have a particular attribute configuration. The neurons at differentlayers can form many activation patterns, implying that the whole set of face imagescan be divided into many subsets based on attribute configurations, and eachactivation pattern corresponds to one subset (e.g. ‘pointy nose’, ‘rosy cheek’, and‘smiling’). Therefore, it is not surprising that filters learned by attribute predictionlead to effective representations for face localization. By simply averagingand thresholding response maps, good face localization is achieved.
With this strategy, each face attribute is well explained by a sparselinear combination of these sematic concepts. By analyzing the coefficients ofsuch combinations, attributes show clear grouping patterns, which could be wellinterpreted semantically.
Structure Of Framework
再加上最后的SVM分类器,一共四个过程
1 LNeto定位头部和肩部
2 LNets定位脸(更准确的定位)
3 ANet表达和预测人脸特性
这个只有一个FC,所以只有一个Attribute(这妈蛋的就是接着好玩,坑爹)。训练网络时有用,预测时没用。
4 SVM人脸特征分类
这个分类出来的是Attributes的线性组合(因为有多组FC)
Framework内部说明(此处运用了参数局部共享和全局共享的混合策略)
本人不懂局部共享,以下是文献,请参考之
参数局部分享文章:
开始慢慢一点点往后面看详细介绍
3.1.Coarse-to-fine Face Localization
- Deep Learning Face Attributes in the Wild
- Deep Learning Face Attributes in the Wild
- Deep Learning Face Attributes in the Wild
- Discriminative Deep Metric Learning for Face Verification in the Wild(文献泛读)
- Learning Face Hallucination in the Wild--阅读笔记
- Pose-Aware Face Recognition in the Wild
- 人脸识别 - Pose-Aware Face Recognition in the Wild
- Pose-Aware Face Recognition in the Wild--填坑1
- 人脸对齐--Face Alignment In-the-Wild: A Survey
- High-Fidelity Pose and Expression Normalization for Face Recognition in the Wild
- 17.2.22 Pose-Aware Face Recognition in the Wild 小感
- 大角度人脸转正--Towards Large-Pose Face Frontalization in the Wild
- deep learning for face detection
- deep learning for face detection
- deep learning for face detection
- 【深度学习论文笔记】Recover Canonical-View Faces in the Wild with Deep Neural Network
- 【论文笔记】Recover Canonical-View Faces in the Wild with Deep Neural Network
- Breakout detection in the wild
- 我的游戏框架基础构建篇(A* 寻路算法实现 )
- Ng机器学习系列补充:4、关联分析算法FP_Growth
- android 启动过程
- 优化SQL查询:如何写出高性能SQL语句
- 参数修饰符ref,out ,params的区别
- Deep Learning Face Attributes in the Wild
- mediapalyer各种属性和方法
- 解决UnicodeDecodeError: 'ascii' codec can't decode byte 0xe9 in position 0: ordinal not in range(128)
- SqlServer查找指定数据库数据库下的所有表名
- Linux——Centos底部面板图标不见的解决办法
- android动画总结
- hdu4143 A Simple Problem(数论)
- iOS设计模式——单例模式
- Sqlite的一些操作 获取新加自增id 创建触发器