Maintaining Alignment during RSI as a Feedback Control Problem

Recent advances have shown the ML field moving beyond pretrained amortized models and supervised learning and we are now moving into the realm of online reinforcement learning and hence the creation of hybrid direct and amortized optimizing agents. While we generally have found that purely amortized pretrained models are an... [Read More]

Review of Mind Children By Hans Moravec (1988)

I’ve had this book on my reading list for a while since this is the classic book everyone cites about predicting the singularity ahead of time and describing a ‘merging’ of AI and human minds into the future as a positive singularity. Since I had some time this afternoon, I... [Read More]

Intellectual Progress in 2024

As always, 2024 has been an interesting year marked by extremely rapid and impressive AI progress. Every year since 2020 has felt like a rollercoaster of AI surging past our expectations, which makes you think there is no way it can possibly go any faster, and then the next year... [Read More]

A Retrospective on Active Inference

Active Inference is a theory of adaptive action selection for agents proposed by Karl Friston initially and now expanded upon by many authors and forms a small academic subfield of research. The core claims of the theory are that action selection and decision-making can be usefully understood as inference problems... [Read More]

Right to Left (R2L) Integer Tokenization

This is a guest post by Max Buckley, a software engineer at Google and fellow AI researcher1. Contributions: Max wrote a draft on this post and did the experiments, Beren provided editorial review. ↩ [Read More]