The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) - Localizing and Editing Knowledge in LLMs with Peter Hase

Model Performance Based on Task Specification

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Today we're joined by Peter Hase, a fifth-year PhD s... more

Apr 8 2024 49m

Chapter 1 11 mins

Interpretability, Model Editing, and Scalable Oversight

Chapter 2 13 mins

Model Editing and Knowledge Updates

Chapter 3 7 mins

Ensuring Model Safety and Oversight

Chapter 4 5 mins

Exploring Easy to Hard Generalization

Chapter 5 8 mins

Task Specification in Language Models

Chapter 6 1 min

Internal Models and Consistency in AI

Clip

Transcript

Read Transcript

Chapters

About This Episode

Play Full

Get the future of podcasts.