Best AI papers explained

Ein Podcast von Enoch H. Kang

Podimo 90!!! Tage kostenlos! testen

Ein Universum voller exklusiver Podcasts und Hörbücher. Klicken Sie hier um loszulegen!

515 Folgen

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning
Vom: 29.5.2025
Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL
Vom: 29.5.2025
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Vom: 29.5.2025
Shallow Preference Signals: Large Language model aligns even better without truncated data?
Vom: 29.5.2025
Gaming Tool Preferences in Agentic LLMs
Vom: 29.5.2025
Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)
Vom: 29.5.2025
LLM Populations Form Social Conventions and Collective Bias
Vom: 29.5.2025
LLM Generated Persona is a Promise with a Catch
Vom: 29.5.2025
Large Language Models for Digital Twin Simulation
Vom: 29.5.2025
From RL Distillation to Autonomous LLM Agents
Vom: 29.5.2025
Prompting, Auto-Prompting, and Human-AI Communication
Vom: 29.5.2025
Textual Gradients for LLM Optimization
Vom: 29.5.2025
Large Language Models as Markov Chains
Vom: 28.5.2025
Metastable Dynamics of Chain-of-Thought Reasoning: Provable Benefits of Search, RL and Distillation
Vom: 28.5.2025
Selective induction heads: how transformers select causal structures in context
Vom: 28.5.2025
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains
Vom: 28.5.2025
How Transformers Learn Causal Structure with Gradient Descent
Vom: 28.5.2025
Planning anything with rigor: general-purpose zero-shot planning with llm-based formalized programming
Vom: 28.5.2025
Automated Design of Agentic Systems
Vom: 28.5.2025
What’s the Magic Word? A Control Theory of LLM Prompting
Vom: 28.5.2025

12 / 26

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

515 Folgen

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL

Value-Guided Search for Efficient Chain-of-Thought Reasoning

Shallow Preference Signals: Large Language model aligns even better without truncated data?

Gaming Tool Preferences in Agentic LLMs

Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)

LLM Populations Form Social Conventions and Collective Bias

LLM Generated Persona is a Promise with a Catch

Large Language Models for Digital Twin Simulation

From RL Distillation to Autonomous LLM Agents

Prompting, Auto-Prompting, and Human-AI Communication

Textual Gradients for LLM Optimization

Large Language Models as Markov Chains

Metastable Dynamics of Chain-of-Thought Reasoning: Provable Benefits of Search, RL and Distillation

Selective induction heads: how transformers select causal structures in context

The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains

How Transformers Learn Causal Structure with Gradient Descent

Planning anything with rigor: general-purpose zero-shot planning with llm-based formalized programming

Automated Design of Agentic Systems

What’s the Magic Word? A Control Theory of LLM Prompting