word2vec

来源:互联网 发布:畅捷通软件好用吗 编辑:程序博客网 时间:2024/06/05 01:09

word2vec1 (Mikolov et al., 2013a) toolkit can  pre-train the
character embeddings on the Chinese corpus. The obtained embeddings are used to initialize the character lookup table instead of random initialization. Inspired by (Pei et al., 2014), we also can utilize bigram character embeddings which is simply
initialized as the average of embeddings of two
consecutive characters.
0 0