I think there was a major jump in AI capabilities from Anthropic and OpenAI between the end of 2025 and the start of 2026 that made them far more reliable at programming correctly. I wonder what changed in the secret sauce.
Nothing drastic I'd say. It's a continuous stream of small improvements just accumulating with each release, and someone just noticed a few releases away from a previous publicized-bad-capabilities release that there's major improvement between those points. So it looks like something major only due to the spacing between the capability surveys on the release timeline.
I think there was a major jump in AI capabilities from Anthropic and OpenAI between the end of 2025 and the start of 2026 that made them far more reliable at programming correctly. I wonder what changed in the secret sauce.
Nothing drastic I'd say. It's a continuous stream of small improvements just accumulating with each release, and someone just noticed a few releases away from a previous publicized-bad-capabilities release that there's major improvement between those points. So it looks like something major only due to the spacing between the capability surveys on the release timeline.
[dead]