Paper Feed: December 2024

Highlighting research I find interesting (as of 12/06/24) and think may deserve more attention from academia, government, or the AI safety community.

For the latest edition, see here.

Science of DL

Scaling Laws and Compute

Misc. Safety/Elicitation

Security/Control

Evals