Tagged "local-deployment"
- Qwen3.5-35B-A3B Emerges as Game-Changer for Agentic Coding Tasks
- PyTorch Foundation Announces New Members as Agentic AI Demand Grows
- Show HN: MCP-Enabled File Storage for AI Agents, Auth via Ethereum Wallet
- Show HN: A Human-Curated, CLI-Driven Context Layer for AI Agents
- No, Local LLMs Can't Replace ChatGPT or Gemini — I Tried
- Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
- Making Wolfram Technology Available as Foundation Tool for LLM Systems
- Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
- Custom Portable Workstation Optimized for Local AI Inference Builds
- nanollama: Open-Source Framework for Training Llama 3 from Scratch with One-Command GGUF Export
- A Tool to Tell You What LLMs Can Run on Your Machine
- GPT-OSS 20B Demonstrates Practical Agentic Capabilities Running Fully Locally
- GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark
- Ouro 2.6B Thinking Model GGUFs Released with Q8_0 and Q4_K_M Quantization
- Ollama 0.17 Released With Improved OpenClaw Onboarding
- How Slow Local LLMs Are on My Framework 13 AMD Strix Point
- Claude Code Open – AI Coding Platform with Web IDE and Agents
- 24 Simultaneous Claude Code Agents on Local Hardware
- LayerScale Launches Inference Engine Faster Than vLLM, SGLang, and TRT-LLM
- Why My Country's AI Scene Is Built on Sand
- Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach
- AMD Announces Day 0 Support for Qwen 3.5 LLM on Instinct GPUs
- Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
- Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
- Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release
- WinClaw: Windows-Native AI Assistant with Office Automation
- Simile AI Raises $100M Series A for Local AI Infrastructure
- MiniMax M2.5: 230B Parameter MoE Model Coming to HuggingFace
- Ming-flash-omni-2.0: 100B MoE Omni-Modal Model Released
- Qwen Coder Next Shows Specialized Agent Performance
- GLM-5 Released: 744B Parameter MoE Model Targeting Complex Tasks
- Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
- Energy-Based Models Compared Against Frontier AI for Sudoku Solving
- DeepSeek Launches Model Update with 1M Context Window
- Anthropic Releases Claude Opus 4.6 Sabotage Risk Assessment