I am a Research Scientist at ByteDance working on GenAI. Previously, I earned my Ph.D. in Computer Science from Johns Hopkins University, advised by Bloomberg Distinguished Professor Alan L. Yuille. I have broad research experience across various areas of computer vision and artificial intelligence, including but not limited to video generation [1, 2, 3], 3D vision [4, 5, 6], robust vision [7, 8], differentiable rendering [9], and medical image diagnosis [10, 11].
My main focus is on developing high-quality, controllable, and temporally consistent video generation models. My work covers localized control of video synthesis, ID/IP-conditioned video generation, long-form video generation, and large-scale video foundation model training. I have been deeply involved in building the complete training system for video generation, from raw video collection and processing pipelines to data loading, model design (DiT-based architectures), and large-scale distributed training. Our research has been successfully deployed in TikTok effect products such as AI Mermaid (best AI effects on TikTok since 2023).