Publications

NeuralSSD: A Neural Solver for Signed Distance Surface Reconstruction
ArXiv 2025
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
ArXiv 2025
TerraCraft: City-scale Generative Procedural Modeling with Natural Language
Graphical Models 2025
ViPE: Video Pose Engine for 3D Geometric Perception
NVIDIA Technical Report
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
ICCV 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
ArXiv 2025
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Computer Vision and Pattern Recognition (CVPR) 2025 - Highlight
Cosmos World Foundation Model Platform for Physical AI
NVIDIA Technical Report
STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
ArXiv 2025
Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos
NeurIPS 2025
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
NeurIPS 2024
OmniRe: Omni Urban Scene Reconstruction
International Conference on Learning Representations (ICLR) 2025 - Spotlight
fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence
SIGGRAPH 2024
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Computer Vision and Pattern Recognition (CVPR) 2024 - Highlight
Approximately Piecewise E(3) Equivariant Point Networks
International Conference on Learning Representations (ICLR) 2024
DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion
International Conference on Computer Vision (ICCV) 2023
Neural Kernel Surface Reconstruction
Computer Vision and Pattern Recognition (CVPR) 2023 - Highlight
A Neural Galerkin Solver for Accurate Surface Reconstruction
SIGGRAPH Asia 2022
Multiway Non-rigid Point Cloud Registration via Learned Functional Map Synchronization
T-PAMI 2022
Real-Time Globally Consistent 3D Reconstruction with Semantic Priors
IEEE Transactions on Visualization and Computer Graphics 2021
CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-scale Indoor Scene
ArXiv 2021
Subdivision-Based Mesh Convolution Networks
ACM Transactions on Graphics 2021
MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization
Computer Vision and Pattern Recognition (CVPR) 2021 - Oral
DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors
Computer Vision and Pattern Recognition (CVPR) 2021
WallNet: Reconstructing General Room Layouts from RGB Images
Graphical Models 2020
ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings
Computer Vision and Pattern Recognition (CVPR) 2020
Shallow2Deep: Indoor Scene Modeling by Single Image Understanding
Pattern Recognition 2020
ClusterSLAM: A SLAM Backend for Simultaneous Rigid Body Clustering and Motion Estimation
International Conference on Computer Vision (ICCV) 2019
Interactive Modeling of Lofted Shapes from a Single Image
Computational Visual Media (CVM) 2019
DeepSpline: Data-Driven Reconstruction of Parametric Curves and Surfaces
ArXiv 2019
DeepPrimitive: Image decomposition by layered primitive detection
Computational Visual Media (CVM) 2018
Controllable Dendritic Crystal Simulation Using Orientation Field
Eurographics (EG) 2018
MUSA: Wi-Fi AP-assisted video prefetching via Tensor Learning
International Symposium on Quality of Service (IWQoS) 2017