Best AI papers explained

Ein Podcast von Enoch H. Kang

Podimo 90!!! Tage kostenlos! testen

Ein Universum voller exklusiver Podcasts und Hörbücher. Klicken Sie hier um loszulegen!

508 Folgen

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
Vom: 27.9.2025
Learning without training: The implicit dynamics of in-context learning
Vom: 24.9.2025
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model
Vom: 24.9.2025
Open Problems in Mechanistic Interpretability
Vom: 21.9.2025
Maestro: Joint Graph & Config Optimization for Reliable AI Agents
Vom: 21.9.2025
Thought Anchors: Which LLM Reasoning Steps Matter?
Vom: 21.9.2025
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Vom: 9.9.2025
RL's Razor: Why Online RL Forgets Less
Vom: 7.9.2025
Why Language Models Hallucinate
Vom: 6.9.2025
ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
Vom: 6.9.2025
Sample Efficient Preference Alignment in LLMs via Active Exploration
Vom: 6.9.2025
Adventures in Demand Analysis Using AI
Vom: 4.9.2025
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Vom: 1.9.2025
On the Theoretical Limitations of Embedding-Based Retrieval
Vom: 31.8.2025
Performance Prediction for Large Systems via Text-to-Text Regression
Vom: 30.8.2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models
Vom: 30.8.2025
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Vom: 30.8.2025
Compute-Optimal Scaling for Value-Based Deep RL
Vom: 25.8.2025
LLM-based Conversational Recommendation Agents with Collaborative Verbalized Experience
Vom: 23.8.2025
Signal and Noise: Evaluating Language Model Benchmarks
Vom: 23.8.2025

4 / 26

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

508 Folgen

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

Learning without training: The implicit dynamics of in-context learning

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model

Open Problems in Mechanistic Interpretability

Maestro: Joint Graph & Config Optimization for Reliable AI Agents

Thought Anchors: Which LLM Reasoning Steps Matter?

Sample Complexity and Representation Ability of Test-time Scaling Paradigms

RL's Razor: Why Online RL Forgets Less

Why Language Models Hallucinate

ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Sample Efficient Preference Alignment in LLMs via Active Exploration

Adventures in Demand Analysis Using AI

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

On the Theoretical Limitations of Embedding-Based Retrieval

Performance Prediction for Large Systems via Text-to-Text Regression

Demystifying the Visual Quality Paradox in Multimodal Large Language Models

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Compute-Optimal Scaling for Value-Based Deep RL

LLM-based Conversational Recommendation Agents with Collaborative Verbalized Experience

Signal and Noise: Evaluating Language Model Benchmarks