
FrontierCS: Evolving Challenges for Evolving Intelligence
An open-ended benchmark for evolving computer science challenges with objective and fine-grained evaluation of agent capabilities.
Notes on topics I'm chewing on, not necessarily topics I'm working on.

An open-ended benchmark for evolving computer science challenges with objective and fine-grained evaluation of agent capabilities.

A practical introduction focusing on Claude Code and Codex, with clear explanations of memory, permissions, MCP, custom commands, and best practices for dual-tool collaboration.

We cover the neural ODE perspective in terms of optimization and architecture design.

We cover mesa-optimization, test-time-training (TTT), and a broad view of fast weight programming in transformer models.

We introduce the concept of intrinsic dimension and provide a method to estimate it. It is amazing that ImageNet has only 50 of the intrinsic dimension.

We introduce Jacobi Decoding, a parallel decoding algorithm for LLMs, and its connection to the diffusion process in terms of high-level concepts.