Loss Function: how good a model is. The lower the better, it means that is very accurate in predicting

Minimize = loss function = Maximum accuracy!

We can achieve this loss function by:

  • Softmax: soft probability smaller than x, max amplifies probabilities of target . Maps arbitrary values to probability distribution
  • more optimized solutions
  • negative sampling