About

Publications

MOSIV: Multi-Object System Identification from Videos
"identify multi-object geometry and physics from video"
Chunjiang Liu, Xiaoyuan Wang, Qingran Lin, Albert Xiao, Haoyu Chen, Shizheng Wen, Hao Zhang, Lu Qi, Ming-Hsuan Yang, Laszlo A. Jeni, Min Xu, Yizhou Zhao
International Conference on Learning Representations (ICLR), 2026

Paper Data


HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis
"deformable Gaussian Splatting for free-viewpoint rendering of long videos"
Xiaoyuan Wang, Yizhou Zhao, Botao Ye, Xiaojun Shan, Weijie Lyu, Lu Qi, Kelvin CK Chan, Yinxiao Li, Ming-Hsuan Yang
Advances in Neural Information Processing Systems (NeurIPS), 2025

Paper


MASIV: Toward Material-Agnostic System Identification from Videos
"identify geometry and physics from video without material assumptions"
Yizhou Zhao, Haoyu Chen, Chunjiang Liu, Zhenyang Li, Charles Herrmann, Junhwa Hur, Yinxiao Li, Ming-Hsuan Yang, Bhiksha Raj, Min Xu
International Conference on Computer Vision (ICCV), 2025

Paper Code Data


Total-Editing: Head Avatar with Editable Appearance, Motion, and Lighting
"edit appearance, motion, and lighting of head avatars with one-shot demonstration"
Yizhou Zhao, Chunjiang Liu, Haoyu Chen, Bhiksha Raj, Min Xu, Tadas Baltrusaitis, Mitch Rundle, HsiangTao Wu, Kamran Ghasedi
International Conference on Computer Vision Workshops (ICCVW), 2025

Paper


Metric from Human: Zero-shot Monocular Metric Depth Estimation via Test-time Adaptation
"paint humans as a metric ruler for zero-shot monocular metric depth estimation"
Yizhou Zhao, Hengwei Bian, Kaihua Chen, Pengliang Ji, Liao Qu, Shao-yu Lin, Weichen Yu, Haoran Li, Hao Chen, Jun Shen, Bhiksha Raj, Min Xu
Advances in Neural Information Processing Systems (NeurIPS), 2024

Paper Code


CryoSAM: Training-Free CryoET Tomogram Segmentation with Foundation Models
"foundation models segment cryo-ET tomograms with no training"
Yizhou Zhao, Hengwei Bian, Michael Mu, Mostofa Rafid Uddin, Zhenyang Li, Xiang Li, Tianyang Wang, Min Xu
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024

Paper Code


SynCHMR: Synergistic Global-space Camera and Human Reconstruction from Videos
"jointly reconstruct camera, human, and scene in global space from video"
Yizhou Zhao, Tuanfeng Yang Wang, Bhiksha Raj, Min Xu, Jimei Yang, Chun-Hao Paul Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

Project Page Paper


Alignment-guided Temporal Attention for Video Action Recognition
"frame alignment guides temporal attention for video action recognition"
Yizhou Zhao, Zhenyang Li, Xun Guo, Yan Lu
Advances in Neural Information Processing Systems (NeurIPS), 2022

Paper


Semantic-aligned Fusion Transformer for One-shot Object Detection
"align query and support features across scale and space for one-shot detection"
Yizhou Zhao, Xun Guo, Yan Lu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

Paper


Awards

Mentees