Tagged "edge-device"
-
New Era of On-Device AI Driven by High-Speed UFS 5.0 Storage
-
PyTorch Foundation Announces New Members as Agentic AI Demand Grows
-
Mirai Announces $10M to Advance On-Device AI Performance for Consumer Devices
-
Show HN: 100% LLM Accuracy–No Fine-Tuning, JSON Only
-
How AI is Redefining Price and Performance in Modern Laptops
-
What Breaks When AI Agent Frameworks Are Forced Into <1MB RAM and Sub-ms Startup
-
Show HN: A Ground Up TLS 1.3 Client Written in C
-
Mirai Tech Raises $10 Million for On-Device AI Innovation
-
No, Local LLMs Can't Replace ChatGPT or Gemini — I Tried
-
Kioxia Sampling UFS 5.0 Embedded Flash Memory for Next-Generation Mobile Applications
-
Enhanced Interface Speed Enables High-Performance On-Device AI Features in Smartphones
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers
-
Which Web Frameworks Are Most Token-Efficient for AI Agents?
-
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference
-
South Korea to Launch $687 Million Project to Develop On-Device AI Semiconductors
-
Qwen3 Demonstrates Advanced Voice Cloning via Embeddings
-
Custom Portable Workstation Optimized for Local AI Inference Builds
-
Nvidia Could Launch Its First Laptops With Its Own Processors
-
Future of Mobile AI: What On-Device Intelligence Means for App Developers
-
The Complete Stack for Local Autonomous Agents: From GGML to Orchestration
-
Yet Another Fix Coming for Older AMD GPUs on Linux – Thanks to Valve Developer
-
AI-Powered Reverse-Engineering of Rosetta 2 for Linux
-
AI Is Stress Testing Processor Architectures and RISC-V Fits the Moment
-
How Slow Local LLMs Are on My Framework 13 AMD Strix Point
-
Google Open-Sources NPU IP, Synaptics Implements It for Hardware Acceleration
-
GGML Joins Hugging Face: What This Means for Local Model Optimization
-
DietPi Releases Version 10.1
-
AI PCs Explained: 7 Critical Truths About NPUs and Privacy
-
Taalas Etches AI Models onto Transistors to Rocket Boost Inference
-
Strix Halo Performance Benchmarks: Minimax M2.5, Step 3.5 Flash, Qwen3 Coder
-
Qwen3 Coder Next Remains Effective at Aggressive Quantization Levels
-
[Release] Ouro-2.6B-Thinking: ByteDance's Recurrent Model Now Runnable Locally
-
GGML.AI Acquired by Hugging Face
-
Apple Researchers Develop On-Device AI Agent That Interacts With Apps for You
-
Sarvam Brings AI to Feature Phones, Cars, and Smart Glasses
-
Running Local LLMs and VLMs on Arduino UNO Q with yzma
-
Mihup and Qualcomm Collaborate to Advance Secure On-Device Voice AI for BFSI
-
Complete Offline AI System: Voice Control and Smart Home via Local LLM and Radio Without Internet
-
LayerScale Launches Inference Engine Faster Than vLLM, SGLang, and TRT-LLM
-
Kitten TTS V0.8 Released: State-of-the-Art Super-Tiny Text-to-Speech Model Under 25MB
-
Show HN: Shiro.computer Static Page, Unix/NPM Shimmed to Host Claude Code
-
Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach
-
Qualcomm Ventures Positions India as Blueprint for Affordable On-Device AI Infrastructure
-
OpenClaw Refactored in Go, Runs on $10 Hardware
-
Same INT8 Model Shows 93% to 71% Accuracy Variance Across Snapdragon Chipsets
-
Cloudflare Releases Agents SDK v0.5.0 with Rust-Powered Infire Engine for Edge Inference
-
Can We Leverage AI/LLMs for Self-Learning?
-
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
-
Cohere Releases Tiny Aya: Efficient 3.3B Multilingual Model for 70+ Languages
-
Chinese AI Chipmaker Axera Semiconductor Plans $379 Million Hong Kong IPO for Edge Inference Hardware
-
ASUS Zenbook 14 Launches in India with AI-Capable Hardware, Starting at Rs 1,15,990
-
Asus ExpertBook B3 G2 Laptop Features Ryzen AI 9 HX 470 CPU in 1.41kg Ultraportable Form Factor
-
GPU-Accelerated DataFrame Library for Local Inference Workloads
-
Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release
-
First Vibecoded AI Operating System for Local Deployment
-
Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues
-
Samsung's REAM: Alternative Model Compression Technique
-
Running Mistral-7B on Intel NPU Achieves 12.6 Tokens/Second
-
Memio Launches AI-Powered Knowledge Hub for Android with Local Processing
-
NAS System Achieves 18 tok/s with 80B LLM Using Only Integrated Graphics
-
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
-
Energy-Based Models Compared Against Frontier AI for Sudoku Solving
-
Arm SME2 Technology Expands CPU Capabilities for On-Device AI