Save Claude Code Tokens with Smart Routing

(github.com)

11 points | by FrancescoMassa 14 hours ago ago

4 comments

nithiink 5 hours ago ago
How do you handle prompt caching? A lot of cost savings for a single model chat come from cache hits on the conversation context, and switching models invalidates that cache — the new model has to reprocess everything at full input price.
patch_dev 4 hours ago ago
What does this solve that well used subagents doesn't solve already?
[-]
14 hours ago ago
[deleted]