GumbelMax
GumbelMax refers to the Gumbel-max trick, a technique used in probability theory and machine learning to sample from a discrete distribution given by a set of log-probabilities or logits. The method works by adding independent Gumbel noise to each log-probability and selecting the index of the largest resulting value.
Formally, given logits l1, l2, ..., lK, one samples g1, g2, ..., gK independently from a standard Gumbel
Implementation notes: a standard way to generate a Gumbel sample is gi = -log(-log(Ui)) where Ui are
Relation to related methods: the Gumbel-Max trick provides exact sampling, while the Gumbel-Softmax (or Concrete distribution)
Applications include discrete sampling in probabilistic models, variational inference, reinforcement learning, and other areas where sampling