| Episode | Status |
|---|---|
| Tri Dao, Chief Scientist at Together AI and Princeton professor who created Flash Attention and Mamba, discusses how inference optimization has driven costs down 100x since ChatGPT's launch through me... | This episode has not been transcribed yet. |