The Inside View

En podcast av Michaël Trazzi

54 Avsnitt

Owain Evans - AI Situational Awareness, Out-of-Context Reasoning
Publicerades: 2024-08-23
[Crosspost] Adam Gleave on Vulnerabilities in GPT-4 APIs (+ extra Nathan Labenz interview)
Publicerades: 2024-05-17
Ethan Perez on Selecting Alignment Research Projects (ft. Mikita Balesni & Henry Sleight)
Publicerades: 2024-04-09
Emil Wallner on Sora, Generative AI Startups and AI optimism
Publicerades: 2024-02-20
Evan Hubinger on Sleeper Agents, Deception and Responsible Scaling Policies
Publicerades: 2024-02-12
[Jan 2023] Jeffrey Ladish on AI Augmented Cyberwarfare and compute monitoring
Publicerades: 2024-01-27
Holly Elmore on pausing AI
Publicerades: 2024-01-22
Podcast Retrospective and Next Steps
Publicerades: 2024-01-09
Kellin Pelrine on beating the strongest go AI
Publicerades: 2023-10-04
Paul Christiano's views on "doom" (ft. Robert Miles)
Publicerades: 2023-09-29
Neel Nanda on mechanistic interpretability, superposition and grokking
Publicerades: 2023-09-21
Joscha Bach on how to stop worrying and love AI
Publicerades: 2023-09-08
Erik Jones on Automatically Auditing Large Language Models
Publicerades: 2023-08-11
Dylan Patel on the GPU Shortage, Nvidia and the Deep Learning Supply Chain
Publicerades: 2023-08-09
Tony Wang on Beating Superhuman Go AIs with Advesarial Policies
Publicerades: 2023-08-04
David Bau on Editing Facts in GPT, AI Safety and Interpretability
Publicerades: 2023-08-01
Alexander Pan on the MACHIAVELLI benchmark
Publicerades: 2023-07-26
Vincent Weisser on Funding AI Alignment Research
Publicerades: 2023-07-24
[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment
Publicerades: 2023-07-19
Nina Rimsky on AI Deception and Mesa-optimisation
Publicerades: 2023-07-18

1 / 3

The goal of this podcast is to create a place where people discuss their inside views about existential risk from AI.

Visit the podcast's native language site

54 Avsnitt

Owain Evans - AI Situational Awareness, Out-of-Context Reasoning

[Crosspost] Adam Gleave on Vulnerabilities in GPT-4 APIs (+ extra Nathan Labenz interview)

Ethan Perez on Selecting Alignment Research Projects (ft. Mikita Balesni & Henry Sleight)

Emil Wallner on Sora, Generative AI Startups and AI optimism

Evan Hubinger on Sleeper Agents, Deception and Responsible Scaling Policies

[Jan 2023] Jeffrey Ladish on AI Augmented Cyberwarfare and compute monitoring

Holly Elmore on pausing AI

Podcast Retrospective and Next Steps

Kellin Pelrine on beating the strongest go AI

Paul Christiano's views on "doom" (ft. Robert Miles)

Neel Nanda on mechanistic interpretability, superposition and grokking

Joscha Bach on how to stop worrying and love AI

Erik Jones on Automatically Auditing Large Language Models

Dylan Patel on the GPU Shortage, Nvidia and the Deep Learning Supply Chain

Tony Wang on Beating Superhuman Go AIs with Advesarial Policies

David Bau on Editing Facts in GPT, AI Safety and Interpretability

Alexander Pan on the MACHIAVELLI benchmark

Vincent Weisser on Funding AI Alignment Research

[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

Nina Rimsky on AI Deception and Mesa-optimisation