PowerInfer-2 Achieves 29x Speedup, Running 47-Billion Parameter LLMs on Smartphones
Post date: August 26, 2025
Post author: Writings, Papers and Blogs on Text Models
Post categories: Edge AI, efficient-ai, heterogeneous-computing, mobile-ai, on-device-language-models, power-infer-2, system-for-ml

Android Function Examples That You Should Know
Post date: April 9, 2025
Post author: Language Models (dot tech)
Post categories: ai-agents-for-edge-devices, efficient-edge-computing, function-calling-models, lm-latency-models, low-latency-ai-inference, on-device-language-models, privacy-focused-ai-models, small-scale-ai-models

The Future of Octopus v2: What Does it Entail?
Post date: April 9, 2025
Post author: Language Models (dot tech)
Post categories: ai-agents-for-edge-devices, efficient-edge-computing, function-calling-models, lm-latency-models, low-latency-ai-inference, on-device-language-models, privacy-focused-ai-models, small-scale-ai-models

Detailing the Primary Methodology Implemented in Our Models: Octopus v2
Post date: April 3, 2025
Post author: Language Models (dot tech)
Post categories: ai-agents-for-edge-devices, efficient-edge-computing, function-calling-models, lm-latency-models, low-latency-ai-inference, on-device-language-models, privacy-focused-ai-models, small-scale-ai-models

Efficient On-Device LLMs: Function Calling and Fine-Tuning Strategies
Post date: April 3, 2025
Post author: Language Models (dot tech)
Post categories: ai-agents-for-edge-devices, efficient-edge-computing, function-calling-models, lm-latency-models, low-latency-ai-inference, on-device-language-models, privacy-focused-ai-models, small-scale-ai-models

Octopus v2: An On-Device Language Model for Super Agent
Post date: April 1, 2025
Post author: Language Models (dot tech)
Post categories: ai-agents-for-edge-devices, efficient-edge-computing, function-calling-models, llms, lm-latency-models, on-device-language-models, privacy-focused-ai-models, small-scale-ai-models

On-Device AI Models and Core ML Tools: Insights From WWDC 2024
Post date: June 25, 2024
Post author: Maksim Niagolov
Post categories: apple-wwdc, core-ml, new-apple-updates, on-device-ai, on-device-language-models, palettization-explained, what-are-core-ml-tools, what-is-quantization