Why Small Language Models Are the Future of AI-led Products

Andrew Cheung
2 min read · May 28, 2024


The AI industry is witnessing a significant shift as leading companies like Microsoft, Meta, and Google focus on developing small language models (SLMs). These models are designed with fewer parameters, making them more cost-effective and easier to build and train compared to their larger counterparts.

Traditionally, AI performance has been linked to parameter count, with larger models handling more complex and nuanced tasks. For example, OpenAI’s GPT-4 and Google’s Gemini 1.5 Pro are each reported to exceed a trillion parameters, while Meta is training its largest Llama model with around 400 billion parameters. However, these large models come with substantial costs and operational complexity, creating barriers for many enterprises looking to adopt generative AI.

Given the high costs of large language models, how can businesses effectively leverage AI technology while managing expenses and ensuring regulatory compliance?

Small language models (SLMs) offer a transformative solution. With just a few billion parameters, they are more affordable, energy-efficient, and customizable. They require significantly less computational power, reducing training and operational costs. Additionally, SLMs can be tailored to specific business needs and process tasks locally on devices, avoiding cloud-based data transfer and ensuring data remains secure and compliant with regulatory standards.
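
To make the on-device point concrete, here is a minimal sketch of local SLM inference using the Hugging Face transformers library. The model name (microsoft/Phi-3-mini-4k-instruct, roughly 3.8 billion parameters) and the prompt are illustrative choices on my part, not a setup prescribed by any of the vendors mentioned above.

```python
# Minimal sketch: running a small language model entirely on local hardware.
# Assumes the Hugging Face transformers library; the model below is one
# example of an SLM — substitute any small model your machine can hold.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "microsoft/Phi-3-mini-4k-instruct"

# Weights and tokenizer are downloaded once and cached locally;
# after that, inference needs no network access.
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# A hypothetical business prompt — sensitive text stays on the device.
prompt = "Summarise the key obligations in this contract clause: ..."
inputs = tokenizer(prompt, return_tensors="pt")

# Generation runs on the local CPU/GPU, so no data transits a cloud API.
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Because everything above runs in-process, the same pattern extends naturally to fine-tuning on proprietary data: the few-billion-parameter scale is what makes both steps feasible on commodity hardware.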

Understanding and implementing SLMs is crucial. These models not only deliver cost and efficiency benefits but also offer a pathway to innovation within regulatory constraints.


Written by Andrew Cheung

AI Consultant & Product Manager | Specialising in AI Strategy & Transformation | Enhancing Business and User Journeys