FlashAttention 2 — Making Transformers 800% Faster W/O Approximation - With Tri Dao of Together AI
Sep 16, 2023 · 1 min read
1min Snip · Time 0:07:32
Author: Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and al…
Type: Podcast
Listen to episode (share.snipd.com)
Highlights