4 points | by simonpure 6 hours ago ago
2 comments
Is this what Anthropic meant with this blogpost https://www.anthropic.com/news/detecting-and-preventing-dist... ? Alibaba was not mentioned, though.
This is not Alibaba, just some rando fine-tuning a model released by Alibaba on Claude reasoning traces collected by other randos.
Is this what Anthropic meant with this blogpost https://www.anthropic.com/news/detecting-and-preventing-dist... ? Alibaba was not mentioned, though.
This is not Alibaba, just some rando fine-tuning a model released by Alibaba on Claude reasoning traces collected by other randos.