Wenhao Chai
Ph.D. Student in Computer Science, Princeton University
Wenhao Chai is a first-year Ph.D. student in Computer Science at Princeton University and a student researcher at Google DeepMind. He received his master's degree from the University of Washington and his bachelor's degree from Zhejiang University. His research spans a wide range of topics in computer vision and machine learning, with a focus on long-context multimodal modeling and reasoning. He has interned at Pika Labs, working with Professor Christopher D. Manning, and at Microsoft Research Asia. He leads MovieChat, one of the first large multimodal models and benchmarks for hour-long video understanding. He co-leads LiveCodeBench Pro. His work has been featured by MIT Technology Review. He has organized workshops and competitions at CVPR 2024 and CVPR 2025.
News and Highlights
- Chat. To junior master's/undergraduate students: if you would like to chat about life, career plans, or research ideas related to AI/ML, I will dedicate at least 30 minutes every week to such meetings. I encourage students from underrepresented groups to reach out. Also check this.
- Join Discord. We host a Discord server for professors and students for daily sharing and research discussion.
- Calendar. View my live-updated availability and upcoming events.
- 05/2026: I join Google DeepMind as a student researcher.
- 02/2026: Two papers accepted by CVPR 2026.
- 01/2026: Five papers accepted by ICLR 2026.
- 12/2025: One paper accepted by IEEE TIP.
- 12/2025: I give an oral presentation at NeurIPS 2025 about Benchmarking Reasoning-Informed Visual Editing. Slides.
- 10/2025: Video-MMLU received the Outstanding Paper Award at the ICCV 2025 Workshop @ Knowledge-Intensive Multimodal Reasoning, with a travel grant.
- 09/2025: One paper accepted by NeurIPS 2025, two papers accepted by NeurIPS 2025 Datasets and Benchmarks Track with one Oral.
- 09/2025: Invited talk at Abaka AI and 2077AI titled Better and Longer Video Understanding. Slides.
- 09/2025: I join Princeton University as a CS Ph.D. student. Fall 2025 application record.
- 08/2025: One paper accepted by IEEE TPAMI.
- 08/2025: LiveCodeBench Pro presented at the Open AGI Symposium at the University of California, Berkeley. Slides.
- 07/2025: Interviewed by DeepTech and MIT Technology Review China. Post.
- 06/2025: One paper accepted by ICCV 2025.
- 06/2025: Featured in MIT Technology Review as one of the lead authors of LiveCodeBench Pro.
- 06/2025: One paper accepted by IROS 2025.
- 05/2025: One paper accepted by ACL 2025.
- 04/2025: We host CVPR 2025 Video Understanding Challenge @ LOVEU sponsored by Lambda.
- 03/2025: Graduated from the University of Washington with a Master's thesis on Large Multimodal Models for Video Captioning, nominated for the Distinguished Thesis Award by the ECE Department.
- 02/2025: Three papers accepted by CVPR 2025.
- 01/2025: Two papers accepted by ICLR 2025.
- 07/2024: Two papers accepted by ECCV 2024.
- 06/2024: I work with Pika Labs as an intern to develop next-generation video understanding and generation models.
- 04/2024: We host CVPR 2024 Long-form Video Understanding Challenge @ LOVEU.
- 04/2024: Invited talk at the AgentX seminar on our STEVE series of works.
- 02/2024: Two papers accepted by CVPR 2024 with one highlight (2.81%).
- 02/2024: Invited talk at AAAI 2024 workshop @ IMAGEOMICS.
- 12/2023: One paper accepted by AAAI 2024.
- 07/2023: Two papers accepted by ICCV 2023.
- 02/2023: I become a research intern at Microsoft Research Asia (MSRA), advised by principal researcher Xun Guo.