LocalFTW

Why Local
All Posts
Guides
Contribute
Clinic
Topic Graph
Bookmarks

Tagged "agent-benchmarking"

GLM 5.1 Dominates Agentic Benchmarks, Outperforming Most Models at 1/3 Opus Cost 11 April 2026
AI Agent Reliability Tracker 8 March 2026

© Mike Doyle. Published with Eleventy.

Privacy · Takedown