Highlighting research I find interesting and think may deserve more attention (as of 12/06/24) in either academia, government, or the AI safety community.
For the latest edition, see here.
Science of DL
Scaling Laws and Compute
Misc. Safety/Elicitation
Security/Control
Evals