Best AI papers explained

A podcast by Enoch H. Kang

550 Episodes

Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)
Published: 2025-05-29
LLM Populations Form Social Conventions and Collective Bias
Published: 2025-05-29
LLM Generated Persona is a Promise with a Catch
Published: 2025-05-29
Large Language Models for Digital Twin Simulation
Published: 2025-05-29
From RL Distillation to Autonomous LLM Agents
Published: 2025-05-29
Prompting, Auto-Prompting, and Human-AI Communication
Published: 2025-05-29
Textual Gradients for LLM Optimization
Published: 2025-05-29
Large Language Models as Markov Chains
Published: 2025-05-28
Metastable Dynamics of Chain-of-Thought Reasoning: Provable Benefits of Search, RL and Distillation
Published: 2025-05-28
Selective induction heads: how transformers select causal structures in context
Published: 2025-05-28
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains
Published: 2025-05-28
How Transformers Learn Causal Structure with Gradient Descent
Published: 2025-05-28
Planning anything with rigor: general-purpose zero-shot planning with LLM-based formalized programming
Published: 2025-05-28
Automated Design of Agentic Systems
Published: 2025-05-28
What’s the Magic Word? A Control Theory of LLM Prompting
Published: 2025-05-28
BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
Published: 2025-05-27
RL with KL penalties is better viewed as Bayesian inference
Published: 2025-05-27
Asymptotics of Language Model Alignment
Published: 2025-05-27
Qwen 2.5, RL, and Random Rewards
Published: 2025-05-27
Theoretical guarantees on the best-of-n alignment policy
Published: 2025-05-27


Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

