Yong-Yeol Ahn, An implicit statistical bias of word2vec model and why it may be a good thing

https://kaist.zoom.us/j/98984764325

Abstract: Neural language models have revolutionized how we model text data as well as a broad range of machine learning methods, even beyond methods for natural language processing. One of the first, simplest, and most widely used methods is the skip-gram negative sampling model, or simply word2vec, which allows us to obtain vector representations of