Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.
Episode | Date |
---|---|
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos
|
Dec 07, 2024 |
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
|
Nov 12, 2024 |
D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation
|
Nov 11, 2024 |
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
|
Nov 09, 2024 |
HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots
|
Nov 04, 2024 |
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning
|
Nov 03, 2024 |
Local Policies Enable Zero-shot Long Horizon Manipulation
|
Nov 02, 2024 |
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
|
Oct 30, 2024 |
SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment
|
Oct 29, 2024 |
LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias
|
Oct 28, 2024 |
Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling
|
Oct 27, 2024 |
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation
|
Oct 25, 2024 |
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
|
Oct 24, 2024 |
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
|
Oct 23, 2024 |
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints
|
Oct 22, 2024 |
L3DG: Latent 3D Gaussian Diffusion
|
Oct 21, 2024 |
The Ingredients for Robotic Diffusion Transformers
|
Oct 20, 2024 |
Estimating Body and Hand Motion in an Ego-sensed World
|
Oct 19, 2024 |
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
|
Oct 19, 2024 |
One Step Diffusion via Shortcut Models
|
Oct 19, 2024 |
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
|
Oct 18, 2024 |
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos
|
Oct 18, 2024 |
nGPT: Normalized Transformer with Representation Learning on the Hypersphere
|
Oct 18, 2024 |
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
|
Oct 18, 2024 |
Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
|
Oct 18, 2024 |