Notes

On papers, ideas, and things I'm thinking about.

On the Subtleties of Software Optimization Evals

3 min

Small metric choices in software optimization benchmarks can create brittle aggregates and misleading capability conclusions. A look at how harmonic mean aggregation and correctness penalties interact in optimization evals.

benchmarks evals software optimization

Building Blocks to AI Timelines

6 min

Notes and intuitions on economic growth models, functional forms, and how they connect to forecasting AI progress. Part of a series on economics basics for AI forecasting.

economics AI progress forecasting growth models functional forms