Comparing Efficiency Strategies for LLM Deployment and Summarizing PowerInfer‑2's Impact
Post date: November 3, 2025
Post author: Writings, Papers and Blogs on Text Models
Post categories: edge-computing, mobile-ai, model-optimization, neural-efficiency, on-device-llm, power-infer, quantization, speculative-decoding
Performance Evaluation of PowerInfer‑2: Offloading, Prefill, and In‑Memory Efficiency
Post date: November 3, 2025
Post author: Writings, Papers and Blogs on Text Models
Post categories: ai-infrastructure, benchmarking, Edge AI, mobile-inference, model-optimization, on-device-llm, power-infer, sparse-computing
How PowerInfer‑2 Turns Your Smartphone Into an AI Workstation
Post date: November 3, 2025
Post author: Writings, Papers and Blogs on Text Models
Post categories: ai-infrastructure, edge-computing, machine-learning-optimization, model-serving, neural-computation, on-device-llm, power-infer, system-engineering