Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.
| Episode | Date |
|---|---|
|
GAIA-2 Controllable Multi-View Generative World Model for Autonomous Driving
|
May 06, 2025 |
|
Distillation Scaling Laws
|
Feb 19, 2025 |
|
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
|
Feb 19, 2025 |
|
Streaming DiLoCo: Efficient Distributed Training of Large Language Models
|
Feb 06, 2025 |
|
Efficiently Scaling Transformer Inference
|
Feb 06, 2025 |
|
Tülu 3: Pushing Frontiers in Open Language Model Post-Training
|
Feb 06, 2025 |
|
Bytedance: UI-TARS: End-to-End Model for Automated GUI Interaction
|
Jan 22, 2025 |
|
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
|
Jan 20, 2025 |
|
DeepSeek-V3: Advancements in Open-Source Large Language Models
|
Jan 19, 2025 |
|
Titans: Learning to Memorize at Test Time
|
Jan 18, 2025 |
|
Transformer2: Self-Adaptive Large Language Models
|
Jan 18, 2025 |
|
Learning to Learn Optimization Algorithms with LSTM Networks
|
Jan 18, 2025 |
|
Trust Region Policy Optimization
|
Jan 18, 2025 |
|
Efficient Deep Learning Parallelization using SOAP Search Space and FlexFlow Framework
|
Aug 31, 2024 |
|
Deep Retrieval: Learning Efficient Structures for Large-Scale Recommendation Systems
|
Aug 31, 2024 |
|
Scaling User Modeling for Personalized Advertising at Meta
|
Aug 31, 2024 |
|
LiNR: Revolutionizing Large-Scale Retrieval for Recommendation Systems
|
Aug 31, 2024 |
|
Comprehensive Guide to Real-Time Bidding (RTB): Challenges and Opportunities
|
Aug 31, 2024 |
|
Efficient Inference for Large Language Models with LLM.int8()
|
Aug 14, 2024 |
|
Enhancing Language Models with a Massive Datastore
|
Aug 14, 2024 |
|
In-Context Policy Iteration: Enhancing Reinforcement Learning with Large Language Models
|
Aug 14, 2024 |
|
Optimizing Quantization of Large Language Models for Efficiency and Accuracy
|
Aug 12, 2024 |
|
AutoPruner: End-to-End Trainable Filter Pruning for Efficient Deep Neural Networks
|
Aug 11, 2024 |
|
SparseGPT: One-shot Pruning of Large Language Models
|
Aug 11, 2024 |
|
Efficient Compression of Large Language Models using LLM-Pruner
|
Aug 11, 2024 |
|
ScreenAgent: A Vision Language Model-driven Computer Control Agent
|
Aug 10, 2024 |
|
Supervised Pretraining for In-Context Reinforcement Learning with Transformers
|
Aug 10, 2024 |
|
Decision-Pretrained Transformer: Bridging Supervised Learning and Reinforcement Learning
|
Aug 10, 2024 |
|
How Transformers Learn In-Context Beyond Simple Functions
|
Aug 10, 2024 |
|
In-Context Learning Capabilities of Transformers
|
Aug 10, 2024 |
|
Spider2-V: Automated Multimodal Agents for Data Science Workflows
|
Aug 10, 2024 |
|
Generalization Patterns of Transformers in In-Weights Learning and In-Context Learning
|
Aug 10, 2024 |
|
Unmasking the Lottery Ticket Hypothesis
|
Aug 09, 2024 |
|
Rethinking Scale for In-Context Learning in Large Language Models
|
Aug 09, 2024 |
|
Ferret-UI: Multimodal Large Language Model for Mobile User Interface Understanding
|
Aug 08, 2024 |
|
Grounded SAM: A Novel Approach to Open-Set Segmentation
|
Aug 08, 2024 |
|
SAM 2: Segment Anything in Images and Videos
|
Aug 06, 2024 |
|
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning
|
Aug 05, 2024 |
|
Evolutionary Optimization of Model Merging Recipes
|
Aug 05, 2024 |
|
Exploring Weight Agnostic Neural Networks
|
Aug 05, 2024 |
|
Speculative Execution for Efficient Inference in Large Language Models on Consumer Devices
|
Aug 05, 2024 |
|
In-context Learning and Induction Heads
|
Aug 02, 2024 |
|
On the Measure of Intelligence
|
Aug 02, 2024 |
|
Geometric Properties of Data Representations in Deep Neural Networks
|
Aug 02, 2024 |
|
The Case for Learned Index Structures
|
Aug 02, 2024 |
|
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
|
Aug 02, 2024 |
|
Constitutional AI: Harmlessness from AI Feedback
|
Aug 02, 2024 |
|
Proximal Policy Optimization Algorithms
|
Aug 02, 2024 |
|
Graph Isomorphism Networks: A Theoretical Framework and Architecture
|
Aug 02, 2024 |
|
Rethinking the Value of Network Pruning
|
Aug 02, 2024 |
|
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
|
Aug 02, 2024 |
|
Adding Conditional Control to Text-to-Image Diffusion Models
|
Aug 02, 2024 |
|
Denoising Diffusion Probabilistic Models
|
Aug 02, 2024 |
|
Practical Research Problems in AI Safety
|
Aug 02, 2024 |
|
Segment Anything: A Paradigm Shift in Image Segmentation
|
Aug 02, 2024 |
|
Learning Transferable Visual Models From Natural Language Supervision
|
Aug 02, 2024 |
|
Language Models are Few-Shot Learners
|
Aug 02, 2024 |
|
Training Deep Reinforcement Learning Systems with Human Preferences
|
Aug 02, 2024 |
|
Playing Atari with Deep Reinforcement Learning
|
Aug 02, 2024 |
|
Single Path One-Shot (SPOS): Efficient Neural Architecture Search with Simplified Supernet
|
Aug 01, 2024 |
|
Long-CLIP: Extending Text Length for Improved Vision-Language Modeling
|
Aug 01, 2024 |
|
𝑓VDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence
|
Aug 01, 2024 |
|
Unraveling the Connection between In-Context Learning and Gradient Descent in Transformers
|
Jul 24, 2024 |
|
Gradient Low-Rank Projection (GaLore): Revolutionizing Memory-Efficient LLM Training
|
Jul 24, 2024 |
|
Retrieval-Enhanced Transformers (RETRO): A Semi-Parametric Approach to Enhance Performance of Large Language Models
|
Jul 20, 2024 |
|
Foundation Models in Decision Making: Roles, Challenges, and Opportunities
|
Jul 20, 2024 |
|
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
|
Jul 19, 2024 |
|
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel
|
Jul 19, 2024 |
|
Hyper Networks: A Novel Approach to Learning Weights in Deep Neural Networks
|
Jul 18, 2024 |
|
DARTS: Differentiable Architecture Search
|
Jul 18, 2024 |
|
TiTok: A Transformer-based 1D Tokenization Approach for Image Generation
|
Jul 18, 2024 |
|
NerfBaselines: A Framework for Standardized Evaluation of Novel View Synthesis Methods in Computer Vision
|
Jul 18, 2024 |
|
Survey on reinforcement learning in reccomender systems
|
Jul 18, 2024 |
|
Models tell you what to discard
|
Jul 18, 2024 |
|
Training Large Language Models for Compiler Optimization
|
Jul 18, 2024 |
|
Metadata-based Color Harmonization for Multi-camera Surround View Systems
|
Jul 18, 2024 |
|
Extrapolated View Synthesis for Urban Scene Reconstruction
|
Jul 18, 2024 |
|
Planning-Oriented Autonomous Driving
|
Jul 18, 2024 |
|
SafePathNet: Learning a Distribution of Trajectories for Safe and Comfortable Autonomous Driving
|
Jul 18, 2024 |
|
Unsupervised Occupancy Fields for Perception and Forecasting
|
Jul 18, 2024 |
|
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
|
Jul 18, 2024 |
|
RT-DETR: Real-Time Object Detection with Transformer
|
Jul 18, 2024 |
|
Robustness Evaluation of HD Map Constructors under Sensor Corruptions for Autonomous Driving
|
Jul 18, 2024 |
|
DriveVLM: Vision-Language Models for Autonomous Driving in Urban Environments
|
Jul 18, 2024 |
|
ZeRO Memory Optimizations: Toward Training Trillion Parameter Models
|
Jul 08, 2024 |
|
No-Transaction Band Network A Neural Network Architecture for Efficient Deep Hedging
|
Jul 08, 2024 |
|
NeuralProphet Explainable Forecasting at Scale
|
Jul 08, 2024 |
|
AutoEmb Automated Embedding Dimensionality Searchg in Streaming Recommendations
|
Jul 08, 2024 |
|
A Better Match for Drivers and Riders Reinforcement Learning at Lyft
|
Jul 08, 2024 |
|
The limits to learning a diffusion model
|
Jul 08, 2024 |
|
Zero Bubble Pipeline Parallelism
|
Jul 08, 2024 |
|
TransAct Transformer-based Realtime User Action Model for Recommendation at Pinterest
|
Jul 08, 2024 |