§ Homepage — Princeton, NJ · 2026

Wenhao Chai

Wenhao Chai is a first-year Ph.D. student in Computer Science at Princeton University and student researcher at Google DeepMind. He received his master's degree from University of Washington and bachelor's degree from Zhejiang University.

His research spans a wide range of topics in computer vision and machine learning, with a focus on long-context multimodal modeling and reasoning. He has interned at Pika Labs working with Professor Christopher D. Manning, and Microsoft Research Asia.

He leads MovieChat, one of the first large multimodal models and benchmarks for hour-long video understanding with memory mechanism. He co-leads LiveCodeBench Pro, which has been listed as evaluation benchmarks by frontier models like Google Gemini and Meta Muse Spark. He has organized workshops and competitions at CVPR 2024, CVPR 2025, and CVPR 2026. His work has been featured by MIT Technology Review.

§ News & Highlights

News & Highlights

View all
Office Hours

Chat

To junior master/undergraduate students: if you would like to chat about life, career plan, or research ideas related to AI/ML. I will dedicate at least 30 mins every week for such meetings. I encourage students from underrepresented groups to reach out. Also check my calendar.

Toolbox

Claude Code Plugins

I maintain a personal marketplace of Claude Code plugins for daily workflows, such as TODO management, calendar / email triage, and conversation wrap-up. Install via /plugin marketplace add wenhaochai/claude-plugins.

§ Feature

AuroraCap. Efficient, performant video detailed captioning, and a new benchmark.

Introduction to AuroraCap and the VDC benchmark. ICLR 2025, in collaboration with Pika Labs, Stanford, MIT, Harvard, and NYU.

Watch on YouTube
2:14 · AuroraCap / ICLR '25