The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693
Albert Gu joins the TWIML AI Podcast to discuss his research on post-transformer architectures, including Mamba and Mamba-2 state-space models. The conversation covers the efficiency of attention mechanisms, transformer limitations, and the future of foundation models.