Blog

Featured

Introduction to Claude Code and Codex

Sep 13, 2025

A practical introduction focusing on Claude Code and Codex, with clear explanations of memory, permissions, MCP, custom commands, and best practices for dual-tool collaboration.

Slides
ODE Perspective on Neural Networks

Jul 26, 2025

In this blog, we cover the neural ODE perspective in terms of optimization and architecture design.

PDF
View Transformer Layers from Online Optimization Perspective

First posted: May 20, 2025
Last updated: Jul 17, 2025

In this blog, we cover mesa-optimization, test-time training (TTT), and a broad view of fast weight programming in transformer models.

PDF
What is the Intrinsic Dimension of Your Data?

Jan 15, 2025

In this blog, we introduce the concept of intrinsic dimension and provide a method to estimate it. Remarkably, ImageNet has an intrinsic dimension of only 50.

Slides
Bridging the Parallel Decoding of LLMs with the Diffusion Process

Oct 30, 2024

In this blog, we introduce Jacobi Decoding, a parallel decoding algorithm for LLMs, and its high-level conceptual connection to the diffusion process.

English
Chinese