Publications
Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient
Wang et al.
Loss landscape geometry reveals stagewise development of transformers
Wang et al.
Estimating the Local Learning Coefficient at Scale
Furman and Lau
The Developmental Landscape of In-Context Learning
Hoogland et al.
Quantifying Degeneracy in Singular Models via the Learning Coefficient
Lau et al.