Claude 4.5 Opus' Soul Document

(lesswrong.com)

36 points | by thm 16 hours ago ago

12 comments

  • whatever1 10 hours ago ago

    Isn’t it fascinating that we are now programming systems with natural language?

  • circularfoyers 14 hours ago ago

    Now with even more significance that Amanda Askell has confirmed it is based on a document they trained Claude on (https://x.com/AmandaAskell/status/1995610567923695633).

  • timpera 15 hours ago ago

    This is cool! I don't understand how it isn't higher on HN.

    • ewoodrich 11 hours ago ago

      What is special/unusual that makes this so significant? (Not trying to be dismissive, legitimately asking in case I'm missing something)

      I understand it's not a system prompt so there's some novelty there I suppose. Is it because someone was able to get Claude to regurgitate a (very large) document from its training data? Or is it the content of the document itself?

      I've only skimmed it and there's probably some unique nuggets here and there, but at a high level the rules/guidelines didn't really jump out at me as being much different than the various system prompts for proprietary models made public over the last couple years. Except much longer (and a bit ramble-y in some sections vs the directness of typical system prompt).

      • Kim_Bruning 10 hours ago ago

        The way it got trained is VERY different from a system prompt. They're trying to have the model's natural tendencies be to follow the concepts of the document, rather than setting a set of rules post-hoc.

    • Kim_Bruning 12 hours ago ago

      Potentially because it got submitted umpteen times and is dividing votes?

  • raylad 10 hours ago ago

    It’s a lovely set of aspirations for how a model should behave. But unclear to me is the extent to which expressing those aspirations actually compels the model to follow them.

  • r721 14 hours ago ago
  • andsoitis 16 hours ago ago
  • ninininino 14 hours ago ago

    Congratulations, you've just provided the training data such that the next generation of models trained using their copy of the public blogosphere as of this date will now talk all day about their soul overview and quote what you have had Claude generate.

  • johnwheeler 13 hours ago ago

    Anyone got a TLDR? I'm too lazy to paste it in Claude.

    • fragmede 13 hours ago ago

      Just use Atlas and have ChatGPT do it.