ZettelsProblemSetsResumeContact
Altamash Khan

Altamash Khan

Did the war forge the spear that remained? No. All it did was identify the spear that wouldn't break

Altamash Khan

Altamash Khan

Did the war forge the spear that remained? No. All it did was identify the spear that wouldn't break

ZettelsProblemSetsResumeContact
redbuffs

no non-linearity is used in word2vec

Oct 22, 20241 min read

Topics

  • word2vec

Mainly because the network is shallow with just embedding or linear layers, so no need to use non-linearity. Also the loss function adds some non-linearity to the logits (e.g., sigmoid for skip-gram with negative sampling, softmax for cross-entropy).

Related


Backlinks

  • vanilla skip-gram spelled out

Created with Quartz v4.4.0 © 2025

  • Source