Two frameworks PyTorch and DyNet are released recently. They both feature dynamic graph construction, which is more flexible than current frameworks like Tensorflow and Theano, with other features of their own. (though Tensorflow has some functions for dynamic NN architecture construction)
Update on Feb 16:
Tensorflow accounced a new feature called Tensorflow Fold, which supports dynamic computation graphs. More importantly, different from the above two, it also supports dynamic batching, which makes it more powerful. I would expect other frameworks like MXNet/Pytorch/others will support the function as well very soon.
More to be continued on these topics soon.
This is a post that reviews the recent advancements of machine translation, especially neural machine translation (NMT).
Prerequisites: this post assumes some prior knowledge about machine learning, artificial neural networks, CNN, RNN (LSTM, GRU) encoder-decoder architecture, seq-to-seq models, etc. Continue reading
This is a post that summarizes the fundamental approaches to representing a word in NLP. I found that it’ll make me better understand a concept if I write it down and make some reviewing on it.
Basically, this post tries to answer the question: how to represent a word in a sentence for various applications of NLP?
Here is a link to a literature review I made on the topic of rare word issue and its approaches in neural machine translation. It briefly introduces neural machine translation, and then focuses on the specific present solutions to rare word issue in NMT. Feel free to make comments on it.
What is Single Source Shortest Path? Find the shortest paths from a source s to all other vertices v in graph G Continue reading
The post is based on <<Introduction to Algorithms>>, Third Edition