Blogs | Slides

Featured

Blogs and Slides

Introducing muP

Mar 13, 2025

In this blog, we introduce muP (Maximal Update Parametrization), which aims at studying the transfer patterns of hyperparameters across model scales.

Blogs
What is the Intrinsic Dimension of Your Data?

Jan 15, 2025

In this blog, we introduce the concept of intrinsic dimension and provide a method to estimate it. It is amazing that ImageNet has only 50 of the intrinsic dimension.

Slides
Bridging the Parallel Decoding of LLMs with the Diffusion Process

Oct 30, 2024

In this blog, we introduce Jacobi Decoding, a parallel decoding algorithm for LLMs and its connection to the diffusion process in terms of high-level concepts.

English
Chinese