The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) - Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel

Efficiency of Multitask and Multidomain Learning

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Today we’re joined by Markus Nagel, research scienti... more

Dec 26 2023 46m

Chapter 1 15 mins

Transformer Efficiency With Qualcomm AI Research

Chapter 2 12 mins

Model Efficiency Through Quantization and Pruning

Chapter 3 10 mins

Equivariance, Transformers, and LLMs

Chapter 4 8 mins

On-Device AI and Full Stack Optimization

Clip

Transcript

Read Transcript

Chapters

About This Episode

Play Full

Get the future of podcasts.