CS231n Assignment3--Q1
Source: Internet | Editor: 程序博客网 | Time: 2024/04/29 02:41
Q1: Image Captioning with Vanilla RNNs
The assignment code has been uploaded to my GitHub: https://github.com/jingshuangliu22/cs231n. Feel free to use it as a reference, discuss it, and point out mistakes.
RNN_Captioning.ipynb
Microsoft COCO
idx_to_word <type 'list'> 1004
train_captions <type 'numpy.ndarray'> (400135, 17) int32
val_captions <type 'numpy.ndarray'> (195954, 17) int32
train_image_idxs <type 'numpy.ndarray'> (400135,) int32
val_features <type 'numpy.ndarray'> (40504, 512) float32
val_image_idxs <type 'numpy.ndarray'> (195954,) int32
train_features <type 'numpy.ndarray'> (82783, 512) float32
train_urls <type 'numpy.ndarray'> (82783,) |S63
val_urls <type 'numpy.ndarray'> (40504,) |S63
word_to_idx <type 'dict'> 1004
Look at the data
Vanilla RNN: step forward
next_h error: 6.29242142647e-09
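For readers who want to see what this check exercises, here is a minimal sketch of a single vanilla RNN timestep with a tanh nonlinearity. It is not necessarily the code in the repository linked above; the argument names and shapes (x of shape (N, D), prev_h of shape (N, H)) are assumptions chosen to match the gradient names printed in this post.

```python
import numpy as np

def rnn_step_forward(x, prev_h, Wx, Wh, b):
    """One vanilla RNN timestep (assumed shapes).

    x: (N, D) input, prev_h: (N, H) previous hidden state,
    Wx: (D, H), Wh: (H, H), b: (H,).
    """
    next_h = np.tanh(x.dot(Wx) + prev_h.dot(Wh) + b)
    cache = (x, prev_h, Wx, Wh, next_h)  # kept for the backward pass
    return next_h, cache

# Toy sizes, chosen only for illustration.
rng = np.random.RandomState(0)
N, D, H = 3, 4, 5
x = rng.randn(N, D)
prev_h = rng.randn(N, H)
Wx = rng.randn(D, H)
Wh = rng.randn(H, H)
b = rng.randn(H)

next_h, cache = rnn_step_forward(x, prev_h, Wx, Wh, b)
zero_h, _ = rnn_step_forward(np.zeros((N, D)), np.zeros((N, H)), Wx, Wh, np.zeros(H))
print(next_h.shape)
```

Because tanh squashes its input, every entry of next_h lies strictly between -1 and 1, and all-zero inputs with a zero bias give an all-zero hidden state.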
Vanilla RNN: step backward
dx error: 6.88735954327e-11
dprev_h error: 5.28932394133e-10
dWx error: 1.12554920911e-10
dWh error: 4.84496557569e-10
db error: 2.72330774095e-11
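The five errors above compare analytic gradients with numerical ones. The following is a hedged sketch of how such a check can be reproduced: a tanh step, its backward pass, and a centered finite-difference gradient. The helper names numeric_grad and rel_error, the step size 1e-5, and the toy sizes are assumptions for illustration, not taken from the author's repository.

```python
import numpy as np

def rnn_step_forward(x, prev_h, Wx, Wh, b):
    next_h = np.tanh(x.dot(Wx) + prev_h.dot(Wh) + b)
    return next_h, (x, prev_h, Wx, Wh, next_h)

def rnn_step_backward(dnext_h, cache):
    x, prev_h, Wx, Wh, next_h = cache
    dz = dnext_h * (1.0 - next_h ** 2)   # derivative of tanh
    dx = dz.dot(Wx.T)
    dprev_h = dz.dot(Wh.T)
    dWx = x.T.dot(dz)
    dWh = prev_h.T.dot(dz)
    db = dz.sum(axis=0)
    return dx, dprev_h, dWx, dWh, db

def rel_error(a, b):
    # Maximum elementwise relative error, guarded against division by zero.
    return np.max(np.abs(a - b) / np.maximum(1e-8, np.abs(a) + np.abs(b)))

def numeric_grad(f, arr, df, step=1e-5):
    # Centered differences; perturbs arr in place and restores it.
    grad = np.zeros_like(arr)
    for ix in np.ndindex(*arr.shape):
        old = arr[ix]
        arr[ix] = old + step
        pos = f()
        arr[ix] = old - step
        neg = f()
        arr[ix] = old
        grad[ix] = np.sum((pos - neg) * df) / (2 * step)
    return grad

rng = np.random.RandomState(0)
N, D, H = 4, 5, 6
x = rng.randn(N, D)
prev_h = rng.randn(N, H)
Wx = 0.3 * rng.randn(D, H)
Wh = 0.3 * rng.randn(H, H)
b = rng.randn(H)
dnext_h = rng.randn(N, H)

_, cache = rnn_step_forward(x, prev_h, Wx, Wh, b)
grads = rnn_step_backward(dnext_h, cache)
f = lambda: rnn_step_forward(x, prev_h, Wx, Wh, b)[0]

errors = {}
names = ['dx', 'dprev_h', 'dWx', 'dWh', 'db']
for name, arr, g in zip(names, [x, prev_h, Wx, Wh, b], grads):
    errors[name] = rel_error(numeric_grad(f, arr, dnext_h), g)
    print('%s error: %e' % (name, errors[name]))
```

Small errors of this kind indicate that the analytic backward pass agrees with finite differences; the exact values depend on the random inputs.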
Vanilla RNN: forward
h error: 7.72846618019e-08
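The forward pass over a whole sequence is the single step applied T times, feeding each hidden state into the next step. A minimal sketch, assuming inputs of shape (N, T, D) and an initial state h0 of shape (N, H); the function below is illustrative and may differ from the repository's implementation.

```python
import numpy as np

def rnn_forward(x, h0, Wx, Wh, b):
    """Run a vanilla RNN over T timesteps (assumed shapes).

    x: (N, T, D), h0: (N, H). Returns h of shape (N, T, H).
    """
    N, T, D = x.shape
    H = h0.shape[1]
    h = np.zeros((N, T, H))
    prev_h = h0
    for t in range(T):
        prev_h = np.tanh(x[:, t].dot(Wx) + prev_h.dot(Wh) + b)
        h[:, t] = prev_h   # store the hidden state for timestep t
    return h

rng = np.random.RandomState(1)
N, T, D, H = 2, 3, 4, 5
x = rng.randn(N, T, D)
h0 = rng.randn(N, H)
Wx = rng.randn(D, H)
Wh = rng.randn(H, H)
b = rng.randn(H)

h = rnn_forward(x, h0, Wx, Wh, b)
first_step = np.tanh(x[:, 0].dot(Wx) + h0.dot(Wh) + b)
print(h.shape)
```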
Vanilla RNN: backward
dx error: 2.70104774724e-08
dh0 error: 1.7454525052e-09
dWx error: 3.40035760677e-10
dWh error: 2.01095678956e-09
db error: 3.23168709094e-10
Word embedding: forward
out error: 1.00000000947e-08
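The embedding forward pass is a table lookup: each integer word index selects one row of the embedding matrix. A minimal sketch, assuming indices x of shape (N, T) and a matrix W of shape (V, D); the toy values are made up for illustration.

```python
import numpy as np

def word_embedding_forward(x, W):
    """x: (N, T) integer word indices in [0, V); W: (V, D).

    Returns out of shape (N, T, D) and a cache for the backward pass.
    """
    out = W[x]              # integer-array indexing picks rows of W
    cache = (x, W.shape)
    return out, cache

W = np.arange(12, dtype=np.float64).reshape(4, 3)   # V=4, D=3
x = np.array([[0, 3], [1, 1]])                      # N=2, T=2
out, cache = word_embedding_forward(x, W)
print(out.shape)
```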
Word embedding: backward
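The backward pass of the embedding layer has to accumulate gradients for words that appear more than once in a minibatch. One way to do this in NumPy is np.add.at, sketched below under the assumption that the forward pass cached the index array and the shape of W. A plain `dW[x] += dout` would count each repeated index only once.

```python
import numpy as np

def word_embedding_backward(dout, cache):
    """dout: (N, T, D) upstream gradient; cache: (x, W_shape)."""
    x, W_shape = cache
    dW = np.zeros(W_shape, dtype=dout.dtype)
    np.add.at(dW, x, dout)   # unbuffered add, so repeated indices accumulate
    return dW

x = np.array([[0, 3], [1, 1]])        # word 1 appears twice, word 2 never
dout = np.ones((2, 2, 3))
dW = word_embedding_backward(dout, (x, (4, 3)))
print(dW)
```

In this toy case the row for word 1 receives a gradient of 2 per entry, the rows for words 0 and 3 receive 1, and the unused row for word 2 stays at 0.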
Temporal Affine layer
dx error: 4.98623200795e-11
dw error: 7.54622091734e-11
db error: 5.76987410469e-12
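A temporal affine layer applies the same affine map at every timestep, which can be done by folding the time axis into the batch axis. The sketch below assumes x of shape (N, T, D), w of shape (D, M) and b of shape (M,); it is an illustration rather than the repository's exact code.

```python
import numpy as np

def temporal_affine_forward(x, w, b):
    N, T, D = x.shape
    M = b.shape[0]
    out = x.reshape(N * T, D).dot(w).reshape(N, T, M) + b
    return out, (x, w, b)

def temporal_affine_backward(dout, cache):
    x, w, b = cache
    N, T, D = x.shape
    M = b.shape[0]
    dout_flat = dout.reshape(N * T, M)
    dx = dout_flat.dot(w.T).reshape(N, T, D)
    dw = x.reshape(N * T, D).T.dot(dout_flat)
    db = dout.sum(axis=(0, 1))
    return dx, dw, db

rng = np.random.RandomState(2)
N, T, D, M = 2, 3, 4, 5
x = rng.randn(N, T, D)
w = rng.randn(D, M)
b = rng.randn(M)
dout = rng.randn(N, T, M)

out, cache = temporal_affine_forward(x, w, b)
dx, dw, db = temporal_affine_backward(dout, cache)
print(out.shape, dx.shape, dw.shape, db.shape)
```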
Temporal Softmax loss
2.30256439876
23.025705242
2.32606402665
dx error: 2.45647211476e-08
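The three loss values above sit close to log(10) ≈ 2.3026 and 10·log(10) ≈ 23.026. That is what one would expect if the checks use V = 10 classes with near-uniform scores, a loss summed over T timesteps and averaged over N, and a mask that keeps roughly 10% of the entries in the third case. Those settings are an assumption about the notebook and are not stated in this post. The sketch below implements a masked temporal softmax loss and reproduces the first two values exactly by using all-zero scores.

```python
import numpy as np

def temporal_softmax_loss(x, y, mask):
    """x: (N, T, V) scores, y: (N, T) int labels, mask: (N, T) bool.

    Loss is summed over unmasked timesteps and averaged over N.
    """
    N, T, V = x.shape
    x_flat = x.reshape(N * T, V)
    y_flat = y.reshape(N * T)
    mask_flat = mask.reshape(N * T).astype(x.dtype)
    rows = np.arange(N * T)

    shifted = x_flat - x_flat.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(shifted)
    probs /= probs.sum(axis=1, keepdims=True)

    loss = -np.sum(mask_flat * np.log(probs[rows, y_flat])) / N
    dx_flat = probs.copy()
    dx_flat[rows, y_flat] -= 1
    dx_flat *= mask_flat[:, None] / N
    return loss, dx_flat.reshape(N, T, V)

rng = np.random.RandomState(3)
V = 10
losses = {}
for N, T in [(100, 1), (100, 10)]:
    x = np.zeros((N, T, V))                  # uniform predictions
    y = rng.randint(V, size=(N, T))
    mask = np.ones((N, T), dtype=bool)
    losses[T], dx = temporal_softmax_loss(x, y, mask)
    print('T=%d loss=%f expected=%f' % (T, losses[T], T * np.log(V)))
```

With uniform predictions each unmasked timestep contributes log(V), so the loss scales linearly with T and with the fraction of unmasked entries.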
RNN for image captioning
loss: 9.83235591003
expected loss: 9.83235591003
difference: 2.61124455392e-12
W_embed relative error: 2.006287e-09
W_proj relative error: 2.435961e-09
W_vocab relative error: 2.411310e-09
Wh relative error: 2.055948e-08
Wx relative error: 3.195020e-07
b relative error: 1.777874e-09
b_proj relative error: 1.159276e-09
b_vocab relative error: 1.960674e-10
Overfit small data
(Iteration 1 / 100) loss: 82.463010
(Iteration 11 / 100) loss: 27.939999
(Iteration 21 / 100) loss: 8.880015
(Iteration 31 / 100) loss: 1.921411
(Iteration 41 / 100) loss: 0.639671
(Iteration 51 / 100) loss: 0.340682
(Iteration 61 / 100) loss: 0.287836
(Iteration 71 / 100) loss: 0.180632
(Iteration 81 / 100) loss: 0.187963
(Iteration 91 / 100) loss: 0.179619
Test-time sampling
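No output is shown for this step, so as a rough illustration here is a sketch of greedy test-time sampling with a vanilla RNN. The parameter names (W_proj, b_proj, W_embed, Wx, Wh, b, W_vocab, b_vocab) are taken from the gradient check earlier in this post. The function name sample_greedy, the start-token index 1, the toy sizes and the random weights are all assumptions for illustration and do not come from the repository.

```python
import numpy as np

def sample_greedy(features, params, start=1, max_length=5):
    """Greedy decoding: at each step pick the highest-scoring word.

    features: (N, D_img) image features. Returns (N, max_length) word indices.
    """
    N = features.shape[0]
    h = features.dot(params['W_proj']) + params['b_proj']   # initial hidden state
    word = np.full(N, start, dtype=np.int64)                # assumed start token
    captions = np.zeros((N, max_length), dtype=np.int64)
    for t in range(max_length):
        emb = params['W_embed'][word]                       # (N, W)
        h = np.tanh(emb.dot(params['Wx']) + h.dot(params['Wh']) + params['b'])
        scores = h.dot(params['W_vocab']) + params['b_vocab']
        word = scores.argmax(axis=1)                        # feed prediction back in
        captions[:, t] = word
    return captions

# Hypothetical toy sizes; the real data uses 512-d features and 1004 words.
rng = np.random.RandomState(4)
D_img, V, W_dim, H = 8, 12, 6, 7
params = {
    'W_proj': rng.randn(D_img, H), 'b_proj': np.zeros(H),
    'W_embed': rng.randn(V, W_dim),
    'Wx': rng.randn(W_dim, H), 'Wh': rng.randn(H, H), 'b': np.zeros(H),
    'W_vocab': rng.randn(H, V), 'b_vocab': np.zeros(V),
}
features = rng.randn(2, D_img)
captions = sample_greedy(features, params, max_length=5)
print(captions.shape)
```

The resulting indices would then be mapped back to words through idx_to_word; with untrained random weights, as here, the sampled sequence is meaningless and only the mechanics are shown.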