Two Leaps to 1000 Tokens/s on a 1T-Parameter Model

(tilert.ai)

7 points | by __natty__ 20 hours ago

No comments yet.