Blog
Featured
Blog

Introduction to Claude Code and Codex
Sep 13, 2025
A practical introduction focusing on Claude Code and Codex, with clear explanations of memory, permissions, MCP, custom commands, and best practices for dual-tool collaboration.
Slides
ODE Perspective on Neural Networks
Jul 26, 2025
In this blog, we cover the neural ODE perspective in terms of optimization and architecture design.
PDF
View Transformer Layers from Online Optimization Perspective
First posted: May 20, 2025
Last updated: Jul 17, 2025
In this blog, we cover mesa-optimization, test-time-training (TTT), and broad view of fast weight programming in transformer models.
PDF
What is the Intrinsic Dimension of Your Data?
Jan 15, 2025
In this blog, we introduce the concept of intrinsic dimension and provide a method to estimate it. It is amazing that ImageNet has only 50 of the intrinsic dimension.
Slides