I am a Research Scientist at ByteDance working on GenAI. Previously, I earned my Ph.D. in Computer Science from Johns Hopkins University, advised by Bloomberg Distinguished Professor Alan L. Yuille. I have broad research experience across computer vision and artificial intelligence, including but not limited to Video Generation 1 2 3, 3D Vision 4 5 6, Robust Vision 7 8, Differentiable Rendering 9, and Medical Image Diagnosis 10 11.
My main focus is on developing high-quality, controllable, and temporally consistent video generation models. My work covers localized control of video synthesis, ID/IP-conditioned video generation, and long-form video generation, as well as large-scale video foundation model training. I have been deeply involved in building the complete training system for video generation, from raw video collection and processing pipelines to data loading, model design (DiT-based architectures), and large-scale distributed training. Our research has been successfully deployed in TikTok effect products such as AI Mermaid (best AI effects on TikTok since 2023).
arXiv Preprint
arXiv Preprint
NIPS
arXiv Preprint
AAAI
WACV
ICLR
NIPS
ICLR
CVPR
CVPR
ICLR
ICLR
WACV
ICCV
WACV
ECCV
ECCV
IJCV
MICCAI
ECCV
arXiv Preprint
arXiv Preprint
arXiv Preprint
arXiv Preprint