Topics
GloVe (Global Vectors for Word Representation) is a method for creating word embeddings that can be interpreted through the lens of word co-occurrence probabilities.
Key points:
- GloVe uses the ratio of co-occurrence probabilities to capture word relationships
- The co-occurrence ratio helps distinguish between related and unrelated words
- A function is designed to fit this ratio using word vectors
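The co-occurrence probability ratio is defined as:
$$\frac{p_{ij}}{p_{ik}}, \qquad p_{ij} = P(w_j \mid w_i)$$
Where:
- $w_i$ is the center word
- $w_j$ and $w_k$ are context words
In terms of co-occurrence counts:
$$p_{ij} = \frac{x_{ij}}{x_i}$$
Where:
- $x_{ij}$ is the number of times word $w_j$ appears in the context of word $w_i$
- $x_i$ is the total number of times any word appears in the context of word $w_i$
Therefore, the ratio can be expressed as:
$$\frac{p_{ij}}{p_{ik}} = \frac{x_{ij}/x_i}{x_{ik}/x_i} = \frac{x_{ij}}{x_{ik}}$$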
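The count-based definitions above can be checked on a small example. The following is a minimal sketch, not a reference implementation: the toy corpus, the symmetric window of size 2, and the helper names `cooccurrence_counts`, `cond_prob`, and `prob_ratio` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def cooccurrence_counts(tokens, window=2):
    """Return x[i][j]: how often word j appears within `window` positions of center word i."""
    x = defaultdict(Counter)
    for pos, center in enumerate(tokens):
        lo = max(0, pos - window)
        hi = min(len(tokens), pos + window + 1)
        for ctx_pos in range(lo, hi):
            if ctx_pos != pos:  # a word is not its own context
                x[center][tokens[ctx_pos]] += 1
    return x


def cond_prob(x, i, j):
    """Co-occurrence probability: count of j in the context of i, divided by all context counts of i."""
    total = sum(x[i].values())
    return x[i][j] / total if total else 0.0


def prob_ratio(x, i, j, k):
    """Ratio of co-occurrence probabilities; the total for i cancels, leaving a ratio of counts.

    Raises ZeroDivisionError if k never appears in the context of i.
    """
    return x[i][j] / x[i][k]


# Toy corpus, made up for illustration only.
tokens = "ice is solid ice is cold steam is gas steam is hot ice is solid".split()
x = cooccurrence_counts(tokens, window=2)

print(cond_prob(x, "ice", "solid"), cond_prob(x, "ice", "cold"))
print(prob_ratio(x, "ice", "solid", "cold"))
```

A ratio well above 1 suggests the first context word is more strongly associated with the center word than the second, and a ratio near 1 suggests the two context words are about equally related (or unrelated) to it.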
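The goal is to design a function $f$ that approximates this ratio:
$$f(\mathbf{u}_j, \mathbf{u}_k, \mathbf{v}_i) \approx \frac{p_{ij}}{p_{ik}}$$
Where $\mathbf{u}_j$ and $\mathbf{u}_k$ are the vectors of the two context words and $\mathbf{v}_i$ is the vector of the center word.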
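One candidate for such a function, commonly used when deriving GloVe, is the ratio of exponentiated dot products between each context vector and the center vector. The sketch below assumes that form. The name `ratio_fn` and the hand-picked 2-dimensional vectors are hypothetical and serve only to show that the function is always positive and that swapping the two context words inverts the ratio.

```python
import math


def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(p * q for p, q in zip(a, b))


def ratio_fn(u_j, u_k, v_i):
    """Candidate ratio function: exp(u_j . v_i) / exp(u_k . v_i).

    Written as a single exponential of the difference for numerical stability.
    """
    return math.exp(dot(u_j, v_i) - dot(u_k, v_i))


# Hypothetical vectors chosen by hand, not learned embeddings.
v_i = [1.0, 0.5]   # center word vector
u_j = [0.8, 0.2]   # first context word vector
u_k = [-0.3, 0.4]  # second context word vector

print(ratio_fn(u_j, u_k, v_i))  # ratio for (j, k)
print(ratio_fn(u_k, u_j, v_i))  # swapping j and k gives the reciprocal
```

Training then amounts to adjusting the vectors so that this value tracks the observed count ratio for every triple of center word and two context words.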
This interpretation helps us to derive the GloVe objective function.
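The usual route from the ratio to the objective can be sketched as follows. This outline follows the standard GloVe derivation rather than anything stated in these notes: the exponential form of $f$, the constant $\alpha$, the bias terms $b_i$ and $c_j$, and the weight function $h$ are all assumptions brought in from that derivation. Here $\mathbf{v}_i$ is the center word vector, $\mathbf{u}_j$ and $\mathbf{u}_k$ are context word vectors, $x_{ij}$ is the co-occurrence count, and $x_i$ is the total context count for the center word.

$$
\begin{aligned}
f(\mathbf{u}_j, \mathbf{u}_k, \mathbf{v}_i)
  = \frac{\exp(\mathbf{u}_j^\top \mathbf{v}_i)}{\exp(\mathbf{u}_k^\top \mathbf{v}_i)}
  &\approx \frac{p_{ij}}{p_{ik}} \\
\exp(\mathbf{u}_j^\top \mathbf{v}_i)
  &\approx \alpha\, p_{ij} = \alpha\, \frac{x_{ij}}{x_i} \\
\mathbf{u}_j^\top \mathbf{v}_i
  &\approx \log \alpha + \log x_{ij} - \log x_i \\
\mathbf{u}_j^\top \mathbf{v}_i + b_i + c_j
  &\approx \log x_{ij}
\end{aligned}
$$

In the last step the terms $-\log \alpha + \log x_i$ are absorbed into a center word bias $b_i$ and a context word bias $c_j$. Measuring the squared error of this approximation, weighted by a function $h(x_{ij})$ of the count, gives a loss of the form

$$
\sum_{i} \sum_{j} h(x_{ij}) \left( \mathbf{u}_j^\top \mathbf{v}_i + b_i + c_j - \log x_{ij} \right)^2 .
$$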