Tagged “research”

The Gumbel-max trick for the Bernoulli distribution July 2025
Zero truncated count distributions and their negative log likelihoods April 2025
Hacking "vanilla" FlashAttention for variable-length inputs August 2024
Visualizing equivariances in transformer neural networks May 2024