6 comments

  • noemit 7 hours ago

    To answer the headline - No.

    I find that working on and adjusting my prompts and context is way higher value than A/B testing LLMs.

    After all, I never expect 100% accuracy.

    I feel like they are reaching commodity status, and the result quality is so similar that it just doesn't really matter which one you use.

    • k11kirky 6 hours ago

      Thanks! Do you think there is value in evals? And do you use evals while testing prompts?

      • hypoxia87 3 hours ago

        Jumping in based on our experience in case it's helpful:

        - Evals are very useful and a core part of best practice for creating LLM apps (minimal sketch after this list)

        - There are already excellent solutions for model and prompt eval (etc.), including Parea from YC and many others
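
        For concreteness, here's a minimal sketch of the kind of eval harness we mean: plain exact-match scoring over a tiny dataset via the OpenAI Python SDK. The model name, prompt template, and test cases are illustrative placeholders, not from any client project.

          # Minimal prompt-eval sketch. Assumptions: OpenAI Python SDK
          # (openai >= 1.0) installed, OPENAI_API_KEY set in the
          # environment, and placeholder model/prompt/cases.
          from openai import OpenAI

          client = OpenAI()

          PROMPT = "Answer with a single word. Question: {q}"
          CASES = [
              {"q": "What is the capital of France?", "expected": "Paris"},
              {"q": "What is 2 + 2?", "expected": "4"},
          ]

          passed = 0
          for case in CASES:
              resp = client.chat.completions.create(
                  model="gpt-4o",
                  messages=[{"role": "user", "content": PROMPT.format(q=case["q"])}],
              )
              answer = resp.choices[0].message.content.strip()
              # Loose exact-match: pass if the expected token appears.
              if case["expected"].lower() in answer.lower():
                  passed += 1

          print(f"{passed}/{len(CASES)} cases passed")

        When exact match is too brittle, swap the scoring line for a regex or an LLM-as-judge call; the loop structure stays the same.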

    • hypoxia87 7 hours ago

      +1

      There are already many prompt/LLM routers available.

      We've never found value in them, for reasons similar to those mentioned above.

      • k11kirky 6 hours ago

        That's great feedback, thank you! I would then assume most people would end up using open-source models, as they tend to be cheaper and have faster inference.

        • hypoxia87 3 hours ago

          We've actually found the opposite: every client project has been based on GPT-4 or Gemini, with one exception, a highly sensitive use case that used Llama 3.1.

          The main reason is that the APIs represent an excellent cost/performance/complexity tradeoff.

          Every project has relied primarily on the big models because the small models just aren't as capable in a business context.

          We have found that GPT-4o is very fast when that's necessary (often it's not), and it's also very cheap (GPT-4o via the Batch API is ~96% cheaper than the original GPT-4). And where cost is a concern and reasoning doesn't need to be as good as possible, GPT-4o mini has been excellent too.
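
          Back-of-the-envelope on that ~96% figure, using list prices we're assuming here (per 1M input tokens), not numbers anyone posted in this thread:

            # Rough savings estimate. Assumed list prices per 1M input
            # tokens (our assumption, not quoted from this thread):
            # original GPT-4 at $30.00, GPT-4o at $2.50, and the Batch
            # API at a 50% discount.
            gpt4 = 30.00
            gpt4o_batch = 2.50 * 0.5  # $1.25 with the batch discount

            print(f"~{1 - gpt4o_batch / gpt4:.0%} cheaper")  # ~96% cheaper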