Linguistic Regularities in Continuous Space Word Representations
WHY?
Vector space word representations capture syntactic and semantic regularities in language: many lexical relations show up as consistent vector offsets, so analogy questions can be answered with simple vector arithmetic.
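A minimal sketch of the vector-offset analogy idea, using a tiny hypothetical embedding dictionary (the vectors and vocabulary below are placeholders for illustration, not the paper's trained model):

```python
import numpy as np

# Hypothetical toy embeddings; real experiments use vectors trained on large corpora.
emb = {
    "king":  np.array([0.8, 0.6, 0.1]),
    "man":   np.array([0.7, 0.1, 0.0]),
    "woman": np.array([0.6, 0.2, 0.9]),
    "queen": np.array([0.7, 0.7, 1.0]),
}

def analogy(a, b, c, emb):
    """Answer 'a is to b as c is to ?' with the vector-offset method:
    return the word whose vector is closest (cosine) to b - a + c."""
    target = emb[b] - emb[a] + emb[c]
    best, best_sim = None, -1.0
    for w, v in emb.items():
        if w in (a, b, c):
            continue
        sim = v @ target / (np.linalg.norm(v) * np.linalg.norm(target))
        if sim > best_sim:
            best, best_sim = w, sim
    return best

print(analogy("man", "king", "woman", emb))  # with good embeddings: "queen"
```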
Synchronization is an important issue in distributed SGD: synchronizing too infrequently among nodes causes unstable training, while synchronizing too frequently causes high communication cost.
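As a rough illustration of this trade-off, here is a generic local-SGD / parameter-averaging sketch (not necessarily the scheme the summarized paper proposes; the gradients are random placeholders). The `sync_every` interval is the knob: a larger interval means less communication but more divergence between worker copies.

```python
import numpy as np

def local_sgd(params, n_workers, sync_every, n_steps, lr=0.1):
    """Toy local SGD: each worker updates its own copy of the parameters,
    and copies are averaged every `sync_every` steps (one communication round)."""
    copies = [params.copy() for _ in range(n_workers)]
    for step in range(1, n_steps + 1):
        for w in range(n_workers):
            grad = np.random.randn(*params.shape)  # stand-in for a real minibatch gradient
            copies[w] -= lr * grad
        if step % sync_every == 0:                 # synchronize: average all copies
            avg = np.mean(copies, axis=0)
            copies = [avg.copy() for _ in range(n_workers)]
    return np.mean(copies, axis=0)

final = local_sgd(np.zeros(4), n_workers=8, sync_every=10, n_steps=100)
```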
Skip-Gram with Negative Sampling (SGNS) showed impressive performance compared to traditional word embedding methods. However, it was not clear what its embeddings actually converge to.
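For reference, the standard per-pair SGNS objective (w and c denote word and context vectors, k the number of negative samples drawn from a noise distribution P_n); the question above is what the optimum of this objective corresponds to:

$$
\ell(w, c) = \log \sigma(\vec{w} \cdot \vec{c}) + k \, \mathbb{E}_{c_N \sim P_n}\!\left[\log \sigma(-\vec{w} \cdot \vec{c}_N)\right]
$$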
Traditional continuous word embeddings are based on linear contexts; in other words, they consider only the words in a fixed window around the target word as its context.
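A minimal sketch of what a linear (window-based) context means in practice; the window size and example sentence below are made up for illustration:

```python
def linear_contexts(tokens, window=2):
    """For each position, the context is simply the surrounding words
    within a fixed window, regardless of syntactic structure."""
    pairs = []
    for i, word in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((word, tokens[j]))
    return pairs

print(linear_contexts("australian scientist discovers star".split()))
```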
Models with a huge number of parameters, or trained on huge amounts of data, do not fit in the GPU memory of a single machine.
Word embeddings trained with neural networks (e.g., skip-gram) seem to outperform traditional count-based distributional models. However, this paper points out that word2vec's current superiority comes not from the algorithm itself, but from system design choices and hyperparameter optimizations.
A bilinear model can capture rich interactions between two vectors. However, its computational complexity is huge due to the high dimensionality of its weight tensor. To make bilinear models more practical, this paper suggests low-rank bilinear pooling using the Hadamard product.
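A minimal sketch of low-rank bilinear pooling with a Hadamard product (the dimensions and the tanh nonlinearity are illustrative assumptions; exact details vary by model): instead of a full d_x x d_y weight matrix per output unit, both inputs are projected to a shared rank-d space, fused element-wise, and projected to the output.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: d_x, d_y input dims, d the low rank, c the output dim.
d_x, d_y, d, c = 2048, 300, 512, 1000
U = rng.normal(size=(d_x, d))   # projects x into the rank-d space
V = rng.normal(size=(d_y, d))   # projects y into the rank-d space
P = rng.normal(size=(d, c))     # maps the fused feature to the output

def low_rank_bilinear(x, y):
    """Fuse the two projected vectors with an element-wise (Hadamard) product,
    then project to the output dimension."""
    return P.T @ (np.tanh(U.T @ x) * np.tanh(V.T @ y))

f = low_rank_bilinear(rng.normal(size=d_x), rng.normal(size=d_y))
print(f.shape)  # (1000,)
```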
The visual question answering task is to answer natural language questions about images. To handle questions that require multi-step reasoning, stacked attention networks (SANs) stack several attention layers that attend to parts of the image based on the query.
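A rough sketch of one attention hop stacked twice, following the general shape of the stacked-attention idea (dimensions, region count, and the random image/question features below are placeholder assumptions, not the paper's exact architecture):

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, m = 512, 256, 196          # feature dim, attention hidden dim, number of image regions

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def attention_layer(V, u, Wv, Wu, w):
    """One attention hop: score each image region against the current query
    vector u, take a weighted sum of regions, and refine the query with it."""
    h = np.tanh(Wv @ V + (Wu @ u)[:, None])   # (k, m) joint scores
    p = softmax(w @ h)                        # attention weights over the m regions
    v_att = V @ p                             # (d,) attended visual summary
    return u + v_att                          # refined query for the next hop

V = rng.normal(size=(d, m))                   # placeholder image-region features
u = rng.normal(size=d)                        # placeholder question embedding
for _ in range(2):                            # two stacked attention hops
    Wv, Wu, w = rng.normal(size=(k, d)), rng.normal(size=(k, d)), rng.normal(size=k)
    u = attention_layer(V, u, Wv, Wu, w)
```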