§ Code · Data · Templates

Downloads & Resources

Open-source codebases, datasets and benchmarks we've released, survey write-ups, and LaTeX templates I've built for my own papers, posters, and slides.

Featured Codebases

SAMURAI
Contributor·Tracking

SAMURAI

Zero-shot visual object tracking with motion-aware memory, built on SAM.

LMMs-Eval
Contributor·Eval

LMMs-Eval

Evaluation suite accelerating the development of large multimodal models.

MovieChat
Lead·CVPR 2024

MovieChat

Large multimodal models for long-form video understanding with memory mechanism.

Featured Datasets & Benchmarks

FrontierCS
2026·Benchmark

FrontierCS

An open-ended benchmark for challenging computer science problems with objective, fine-grained evaluation.

LiveCodeBench Pro
NeurIPS 2025·Benchmark

LiveCodeBench Pro

Models like o3-high, o4-mini, and Gemini 2.5 Pro score 0% on hard competitive programming problems.

Video-MMLU
ICCVW 2025·Benchmark

Video-MMLU

A massive benchmark designed to evaluate LMMs in understanding Multi-Discipline Lectures.

TEMPURA
2025·Dataset

TEMPURA

1M reasoning samples about causal event relationships with fine-grained, timestamped descriptions of untrimmed videos.

Science T2I
CVPR 2025·Dataset

Science T2I

Over 20k image pairs for training a language-guided reward model for text-to-image alignment with scientific knowledge.

VDC & AuroraCap Trainset
ICLR 2025·Dataset

VDC & AuroraCap Trainset

First benchmark for detailed video captioning — 1k+ videos with significantly longer captions plus training recipes.

RT-Pose
ECCV 2024·Dataset

RT-Pose

Human pose estimation dataset with calibrated radar ADC data, 4D radar tensors, stereo RGB images, and LiDAR.

MovieChat-1K
CVPR 2024·Dataset

MovieChat-1K

Manually labeled long-video QA and caption dataset — 1,000 videos, each longer than ten thousand frames.