Unlock Peak Mobile Performance: A Deep Dive into PowerInfer-2’s Neuron-Aware Runtime
Date: August 26, 2025 · Author: Writings, Papers and Blogs on Text Models
Categories: Edge AI, heterogeneous-computing, llm-inference-optimization, mobile-computing, neuron-cluster, on-device-ai, power-infer-2, system-for-ml

The Conductor in Your Pocket: How PowerInfer-2 Orchestrates Smartphone Hardware for LLM Inference
Date: August 26, 2025 · Author: Writings, Papers and Blogs on Text Models
Categories: Edge AI, heterogeneous-computing, llm-inference, mobile-computing, neuron-cluster, on-device-ai, power-infer-2, system-for-ml

Why Your Phone’s AI is Slow: A Story of Sparse Neurons and Finicky Flash Storage
Date: August 26, 2025 · Author: Writings, Papers and Blogs on Text Models
Categories: edge-computing, llm-inference, mobile-system, on-device-ai, performance-analysis, sparse-activation, system-for-ml, ufs4

Mobile AI with ONNX Runtime: How to Build Real-Time Noise Suppression That Works
Date: August 3, 2025 · Author: Sergey Drymchenko
Categories: android-ai-sdk, dtln-noise-reduction, lightweight-ai-deployment, mobile-ai, mobile-ai-performance, on-device-ai, onnx-runtime, onnx-runtime-android

On-Device AI Models and Core ML Tools: Insights From WWDC 2024
Date: June 25, 2024 · Author: Maksim Niagolov
Categories: apple-wwdc, core-ml, new-apple-updates, on-device-ai, on-device-language-models, palettization-explained, what-are-core-ml-tools, what-is-quantization