HN
New
Show
Ask
Jobs
Built with Solid
TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B
(github.com)
3 points | by
trykhlieb
16 hours ago ago
1 comments
16 hours ago ago
[deleted]
1 comments