Tagged “research”

  1. Zero truncated count distributions and their negative log likelihoods.
  2. Hacking "vanilla" FlashAttention for variable-length inputs
  3. Visualizing equivariances in transformer neural networks
