Articles in this category
All 14 articles, sorted alphabetically
Advertisement
ARTICLE · 01
Distillation for Production Inference
Teacher-student in 2026: when distillation is worth it.
Read article →ARTICLE · 02
Fine-Tuning Phi-4
QLoRA recipe for narrow tasks.
Read article →ARTICLE · 03
On-Device AI: Running SLMs on Smartphones and the Death of Cloud Dependency
Read article →ARTICLE · 04
On-Device LLM Inference
What runs on phones, laptops, and edge boxes in 2026.
Read article →ARTICLE · 05
Phi, Qwen, Gemma: Small Models Compared
The 1B-9B parameter sweet spot in 2026.
Read article →ARTICLE · 06
SLM Distillation Data Recipe
Teacher outputs filtering and CoT capture.
Read article →ARTICLE · 07
Evaluating Small Models: Common Pitfalls
Why your small model looks worse than it is.
Read article →ARTICLE · 08
Small Models for Tool Calling
When a 3B model beats GPT-4 for the agent's tool layer.
Read article →ARTICLE · 09
SLMs on Mobile in 2026
iOS Foundation Models and Android Gemini Nano.
Read article →ARTICLE · 10
SLMs in IoT: Giving 'Dumb' Appliances a Voice with Local 1B Parameter Models
Read article →ARTICLE · 11
Small Models for Classification
Cheap inference for high-volume routing.
Read article →ARTICLE · 12
Structured Output with Small Models
JSON schemas at 3B parameters.
Read article →ARTICLE · 13
The Economics of SLMs: Why Startups Are Saving Millions by Switching to Smaller Footprints
Read article →ARTICLE · 14