GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark

23 February 2026 · 1 min read

GLM-5 has emerged as the top-performing open-weights model on the Extended NYT Connections benchmark with a score of 81.8, edging out Kimi K2.5 Thinking's 78.3 by 3.5 points. The benchmark is built on the New York Times Connections word-grouping puzzle, which makes it a useful test of how well locally run models handle complex reasoning tasks.

For local LLM practitioners, this result supports the viability of open-weights alternatives that can run on consumer hardware. The performance gap between open and proprietary models continues to narrow, making self-hosted deployments increasingly attractive for reasoning-intensive workloads. The scores give practitioners a concrete reference point when choosing which models to deploy locally.

Read the full article on r/LocalLLaMA.

Source: r/LocalLLaMA · Relevance: 8/10