Convolutional Sequence to Sequence Learning
来源:互联网 发布:php 7 加密 编辑:程序博客网 时间:2024/05/23 00:01
Convolutional Sequence to Sequence Learning
(Submitted on 8 May 2017)
The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to recurrent models, computations over all elements can be fully parallelized during training and optimization is easier since the number of non-linearities is fixed and independent of the input length. Our use of gated linear units eases gradient propagation and we equip each decoder layer with a separate attention module. We outperform the accuracy of the deep LSTM setup of Wu et al. (2016) on both WMT'14 English-German and WMT'14 English-French translation at an order of magnitude faster speed, both on GPU and CPU.
Submission history
From: Michael Auli [view email][v1] Mon, 8 May 2017 23:25:30 GMT (1489kb,D)
0 0
- Convolutional Sequence to Sequence Learning
- Convolutional Sequence to Sequence Learning笔记
- 【论文阅读】Convolutional Sequence to Sequence Learning (未完待续)
- <模型汇总-7>基于CNN的Seq2Seq模型-Convolutional Sequence to Sequence Learning
- Sequence to Sequence Learning with Neural Networks
- 【Deep Learning】genCNN: A Convolutional Architecture for Word Sequence Prediction
- (翻译)Sequence to Sequence Learning with Neural Networks
- 【论文笔记】Sequence to sequence Learning with Neural Networks
- [ACL2016] Incorporating Copying Mechanism in Sequence-to-Sequence Learning
- [2014]Sequence to Sequence Learning with Neural Networks
- Sequence to Sequence Learning with Neural Networks论文笔记
- Deep learning From Image to Sequence
- Deep learning From Image to Sequence
- sequence to sequence
- Sequence to Sequence 模型
- Sequence to Sequence model
- 深度学习Deep learning From Image to Sequence
- "Sequence to Sequence Learning for Optical Character Recognition"——Devendra Kunar Sahu
- c++作业6
- 深入理解linux下write()和read()函数
- JavaScript 中为 JSON 字符串创建对象
- 线索化二叉树的构造及遍历
- 守护进程(精灵进程)&调用fork一次和两次的区别
- Convolutional Sequence to Sequence Learning
- Bitlocker 参数错误导致打不开移动硬盘的解决方法
- django用户注册
- maven发布时在不同的环境使用不同的配置文件
- CString,string,char*之间的转换(转)
- 欢迎使用CSDN-markdown编辑器
- Java 多线程
- java从入门到弃坑第十三天0A0
- 统计学习方法概论