HN
New
Show
Ask
Jobs
Built with Solid
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
(arxiv.org)
6 points | by
PaulHoule
2 days ago ago
No comments yet.
No comments yet.