Topics
It aims to make the dot product of word vectors (plus biases) approximate the logarithm of the words’ co-occurrence count.
- Working with very large numbers (like raw co-occurrence counts) can lead to numerical instability in computations. The log transformation helps mitigate this issue
- Treats the target and context words symmetrically. This is reflected in the objective function where and play similar roles
- is the weighting function in GloVe and plays an important role in stabilizing training