I haven’t updated the Claude Reflects experiment in a few days because it is incredibly resource-intensive and some other things have popped up that are taking priority. The experiment is paused but will resume.
Also, you may have heard that Claude’s source code has been leaked. I have some thoughts on this and will post a piece soon. I’m also working on a related piece titled “Adversarial Alignment”; the revelations in the leak have made this notion particularly relevant.
Stay tuned.
