HN
New
Show
Ask
Jobs
Built with Qwik
Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change
(andreaborio.substack.com)
6 points | by
andreaborio
5 hours ago ago
1 comments
andreaborio
5 hours ago
[dead]
[dead]