Talking Papers Podcast

About the Talking Papers Podcast

The Talking Papers Podcast is where research meets conversation: deep dives into research papers in computer vision, 3D, machine learning, and AI, with the authors who wrote them. By researchers, for researchers.

Each episode is structured like the paper itself: a TL;DR / abstract to set the stage, then related work, approach, results, conclusions, and future work. We close with a bonus segment called "What did Reviewer 2 say?", where the author shares the candid peer-review story behind the publication.

Guests are PhD students, postdocs, and faculty from leading labs across academia and industry. Episodes are aimed at fellow researchers and graduate students who want the candid version of the work, not a polished press release. I started the show because I wanted the conversations I wished I'd had earlier in my own PhD, and to put a human voice behind the papers that often feel anonymous on arXiv.

Podcast

Choosing a PhD Advisor: Questions to Ask + Red Flags

2025-02-17

A frank conversation with Derek Liu (returning guest) on what to actually ask a prospective PhD advisor, the red flags worth taking seriously, and the lessons most students learn the hard way.

Listen →

Podcast

3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation

2024-07-12

Talking Papers with Dale Decatur on 3D Paintbrush — local texture editing of 3D meshes from text via Cascaded Score Distillation. Outputs drop straight into standard graphics pipelines. CVPR 2024.

Listen →

Podcast

3DInAction Explained: Recognizing Human Actions Directly from 3D Point Cloud Sequences (CVPR 2024)

2024-06-03

Talking Papers with myself (host as guest!) on 3DInAction — recognizing human actions directly from 3D point cloud sequences via 't-patches' that move coherently in time. CVPR 2024.

Listen →

Podcast

Cameras as Rays: Pose Estimation via Ray Diffusion

2024-03-14

Talking Papers with Jason Zhang (CMU) on Cameras as Rays — reformulating sparse-view camera pose estimation as a per-ray prediction problem with a diffusion-style generative head. ICLR 2024.

Listen →

Podcast

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

2024-02-16

Talking Papers with Jiahao Li (TTIC) on Instant3D — feed-forward text-to-3D that generates assets in ~20 seconds, two orders of magnitude faster than score-distillation. ICLR 2024.

Listen →

Podcast

Variational Barycentric Coordinates

2023-12-14

Talking Papers with Ana Dodik on Variational Barycentric Coordinates — replacing mesh- and formula-based generalized barycentric coordinates with a neural field, unlocking new objective-function freedom. SIGGRAPH Asia 2023.

Listen →

Podcast

Reverse Engineering Self-Supervised Learning

2023-11-22

Talking Papers with Ravid Shwartz-Ziv on Reverse Engineering SSL — an empirical look at what self-supervised learning actually learns. Surprise: semantic clustering comes from the regularization term. NeurIPS 2023.

Listen →

Podcast

Constructive Solid Geometry on Neural Signed Distance Fields

2023-11-09

Talking Papers with Zoë Marschner on Constructive Solid Geometry for Neural SDFs — fixing the 'Pseudo-SDF' problem when applying booleans and CSG operations to learned signed distance fields. SIGGRAPH Asia 2023.

Listen →

Podcast

HMD-NeMo: Online 3D Avatar Motion Generation From Sparse Observations

2023-11-01

Talking Papers with Sadegh Aliakbarian on HMD-NeMo — online full-body avatar motion generation from sparse head-and-hands HMD signals. Handles hand occlusion that prior work assumed away. ICCV 2023.

Listen →

Podcast

CC3D: Layout-Conditioned Generation of Compositional 3D Scenes

2023-09-28

Talking Papers with Jeong Joon Park on CC3D — a 3D GAN that synthesizes compositional scenes with multiple objects, conditioned on a 2D semantic layout. Beyond the single-object 3D-GAN limitation. ICCV 2023.

Listen →

Podcast

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

2023-09-07

Talking Papers with Chenfeng Xu on NeRF-Det — multi-view 3D object detection that reasons in a NeRF-style volumetric backbone instead of projecting back to 2D. ICCV 2023.

Listen →

Podcast

🎠 MagicPony: Learning Articulated 3D Animals in the Wild

2023-08-10

Talking Papers with Tomas Jakab on MagicPony — reconstructing articulated 3D animals (shape, texture, lighting, pose) from a single in-the-wild RGB image, no 3D supervision. CVPR 2023.

Listen →

Podcast

Word-As-Image for Semantic Typography

2023-07-20

Talking Papers with Shir Iluz on Word-As-Image — automatic semantic typography that morphs letter outlines (as Bézier curves) to convey a word's meaning while staying readable. SIGGRAPH 2023 Honorable Mention.

Listen →

Podcast

Panoptic Lifting for 3D Scene Understanding with Neural Fields

2023-07-10

Talking Papers with Yawar Siddiqui on Panoptic Lifting — 3D-consistent panoptic segmentation from in-the-wild 2D images by lifting masks into a unified neural-field volume. CVPR 2023 highlight.

Listen →

Podcast

MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices

2023-06-14

Talking Papers with Kejie Li on MobileBrick — LEGO objects with known per-brick geometry as physical ground truth for evaluating mobile-device 3D reconstruction. 153 captured models. CVPR 2023.

Listen →

Podcast

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

2023-05-17

Talking Papers with Jiahao Zhang on the IKEA Assembly in the Wild dataset — aligning step-by-step assembly diagrams to YouTube videos, a less-explored multimodal alignment problem. CVPR 2023.

Listen →

Podcast

CLIPasso: Semantically-Aware Object Sketching

2023-03-29

Talking Papers with Yael Vinker on CLIPasso — converting photos into minimal semantic sketches by optimizing Bézier curves against CLIP embeddings. SIGGRAPH 2022 Best Paper.

Listen →

Podcast

INR2Vec: Deep Learning on Implicit Neural Representations of Shapes

2023-03-29

Talking Papers with Luca De Luigi on INR2Vec — treating an Implicit Neural Representation itself as an input signal that downstream networks can consume, no discrete conversion needed. ICLR 2023.

Listen →

Podcast

Random Walks for Adversarial Meshes

2022-12-14

Talking Papers with Amir Belder on Random Walks for Adversarial Meshes — generating adversarial examples against triangle-mesh classifiers via random-walk surrogates. First general mesh-domain attack. SIGGRAPH 2022.

Listen →

Podcast

Stochastic Poisson Surface Reconstruction

2022-12-13

Talking Papers with Silvia Sellán on Stochastic Poisson Surface Reconstruction — a Gaussian-Process-based statistical reformulation of PSR that supports posterior uncertainty queries. SIGGRAPH Asia 2022.

Listen →

Podcast

Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs

2022-12-06

Talking Papers with Sameera Ramasinghe on Beyond Periodicity — a unifying framework for understanding which activation functions actually make coordinate-MLPs work, beyond the SIREN sinusoid story. ECCV 2022 oral.

Listen →

Podcast

KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

2022-10-19

Talking Papers with Marko Mihajlovic on KeypointNeRF — generalizable volumetric human avatars from 2–3 RGB images, conditioned on relative spatial encodings of detected keypoints. ECCV 2022.

Listen →

Podcast

BACON: Band-Limited Coordinate Networks for Multiscale Scene Representation

2022-08-09

Talking Papers with David Lindell on BACON — band-limited coordinate networks with analytically-known Fourier spectra, finally making multiscale neural-field behavior predictable. CVPR 2022.

Listen →

Podcast

Learning Smooth Neural Functions via Lipschitz Regularization

2022-07-29

Talking Papers with Hsueh-Ti Derek Liu on Lipschitz regularization for neural fields — penalizing an upper bound on the Lipschitz constant to get smooth latent spaces for shape editing. SIGGRAPH 2022.

Listen →

Podcast

DiGS Explained: Divergence-Guided Implicit Surface Reconstruction from Unoriented Point Clouds (CVPR 2022)

2022-07-18

Talking Papers with Chamin Hewa Koneputugodage on DiGS — divergence-guided implicit surface reconstruction from unoriented point clouds. No normal-vector supervision required. CVPR 2022.

Listen →

Podcast

ICON: Implicit Clothed humans Obtained from Normals

2022-07-18

Talking Papers with Yuliang Xiu on ICON — animatable clothed-human avatars from in-the-wild images, robust across unconstrained poses via SMPL-X-conditioned local features. CVPR 2022.

Listen →

Podcast

Neural RGB-D Surface Reconstruction

2022-07-18

Talking Papers with Dejan Azinović on Neural RGB-D Surface Reconstruction — combining NeRF-style implicit fields with depth supervision in a truncated SDF for more accurate, complete reconstructions. CVPR 2022.

Listen →

Podcast

Panoptic 3D Scene Reconstruction From a Single RGB Image

2022-07-18

Talking Papers with Manuel Dahnert on Panoptic 3D Scene Reconstruction — unifying geometric reconstruction with semantic and instance segmentation from a single RGB image. NeurIPS 2021.

Listen →

Podcast

SampleNet: Differentiable Point Cloud Sampling

2022-07-18

Talking Papers with Itai Lang on SampleNet — a differentiable, task-aware point-cloud sampling layer that retains downstream performance at 3% of the original points. CVPR 2020.

Listen →

Podcast

Shape As Points: A Differentiable Poisson Solver

2022-07-18

Talking Papers with Songyou Peng on Shape As Points — a differentiable Poisson Surface Reconstruction layer that bridges explicit point/mesh representations with implicit fields. NeurIPS 2021.

Listen →

Podcast

VLN BERT: A Recurrent Vision-and-Language BERT for Navigation

2022-02-24

Talking Papers with Yicong Hong on VLN BERT — a time-aware recurrent BERT that finally makes vision-and-language navigation work over partially observable trajectories. CVPR 2021 SOTA.

Listen →

Podcast

Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks

2022-02-11

Talking Papers with Despoina Paschalidou on Neural Parts — 3D shape abstractions that stay both geometrically accurate and semantically consistent, thanks to Invertible Neural Networks per part. CVPR 2021.

Listen →

Podcast

Deep Declarative Networks

2022-02-05

Talking Papers with Dylan Campbell on Deep Declarative Networks — neural-network layers defined implicitly as the solution to an optimization problem, with gradients via the implicit function theorem. TPAMI.

Listen →

Podcast

DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video

2022-02-05

Talking Papers with Cristian Rodriguez-Opazo on DORi — temporal moment localization in long videos from natural-language queries, using a language-conditioned graph over discovered object relationships. WACV 2021.

Listen →

Podcast

Talking Papers Podcast – The Beginning

2022-01-15

Announcing the Talking Papers Podcast — a podcast by researchers, for researchers, helping you finally get through that 'to-read' paper pile. The mission, the format, and what's coming.

Listen →

Where to Listen and Subscribe

Podcast apps (Apple Podcasts, Spotify, Overcast, and more): talking.papers.podcast.itzikbs.com
YouTube channel: subscribe for the video episodes
Mailing list: get notified when new episodes drop
Twitter / X: @talking_papers

Frequently Asked Questions

What is the Talking Papers Podcast?

It is an interview podcast where the authors of cutting-edge research papers in computer vision, 3D, machine learning, and AI discuss their own work. Each episode follows the structure of a research paper and closes with a "What did Reviewer 2 say?" segment about the peer-review story behind it. Hosted by Itzik Ben-Shabat.

How is each episode structured?

Each episode follows the structure of a research paper: a TL;DR / abstract to set the stage, then related work, approach, results, conclusions, and future work. Every episode also closes with a bonus "What did Reviewer 2 say?" segment, where the author shares the candid peer-review story behind the publication.

Who hosts the Talking Papers Podcast?

I host it. My name is Itzik Ben-Shabat. I am a researcher working in computer vision and 3D learning; you can read more on the About page or my homepage.

What topics does the podcast cover?

Computer vision, 3D vision and reconstruction, neural rendering and radiance fields, point clouds, generative models, self-supervised learning, machine learning, and adjacent areas of AI. Most guests work at the intersection of geometry, deep learning, and perception.

Who are the guests?

PhD students, postdocs, and faculty from research labs in academia and industry. Guests are typically the lead authors of the paper being discussed, so the conversation is grounded in their own experience writing and publishing the work.

Where can I listen to the Talking Papers Podcast?

On any podcast app via the show page, which lists Apple Podcasts, Spotify, Overcast, and other platforms. The video versions of episodes are on the YouTube channel.

About the Talking Papers Podcast

Podcast Episodes

Choosing a PhD Advisor: Questions to Ask + Red Flags

3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation

3DInAction Explained: Recognizing Human Actions Directly from 3D Point Cloud Sequences (CVPR 2024)

Cameras as Rays: Pose Estimation via Ray Diffusion

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

Variational Barycentric Coordinates

Reverse Engineering Self-Supervised Learning

Constructive Solid Geometry on Neural Signed Distance Fields

HMD-NeMo: Online 3D Avatar Motion Generation From Sparse Observations

CC3D: Layout-Conditioned Generation of Compositional 3D Scenes

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

🎠 MagicPony: Learning Articulated 3D Animals in the Wild

Word-As-Image for Semantic Typography

Panoptic Lifting for 3D Scene Understanding with Neural Fields

MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

CLIPasso: Semantically-Aware Object Sketching

INR2Vec: Deep Learning on Implicit Neural Representations of Shapes

Random Walks for Adversarial Meshes

Stochastic Poisson Surface Reconstruction

Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs

KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

BACON: Band-Limited Coordinate Networks for Multiscale Scene Representation

Learning Smooth Neural Functions via Lipschitz Regularization

DiGS Explained: Divergence-Guided Implicit Surface Reconstruction from Unoriented Point Clouds (CVPR 2022)

ICON: Implicit Clothed humans Obtained from Normals

Neural RGB-D Surface Reconstruction

Panoptic 3D Scene Reconstruction From a Single RGB Image

SampleNet: Differentiable Point Cloud Sampling

Shape As Points: A Differentiable Poisson Solver

VLN BERT: A Recurrent Vision-and-Language BERT for Navigation

Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks

Deep Declarative Networks

DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video

Talking Papers Podcast – The Beginning

Where to Listen and Subscribe

Frequently Asked Questions

What is the Talking Papers Podcast?

How is each episode structured?

Who hosts the Talking Papers Podcast?

What topics does the podcast cover?

Who are the guests?

Where can I listen to the Talking Papers Podcast?