LocalFTW
Tagged "hybrid-inference"
Krasis Hybrid MoE Runtime Achieves 3,324 tok/s Prefill on Single RTX 5080
28 February 2026
Why AI Models Fail at Iterative Reasoning and What Could Fix It
20 February 2026