In its Explore small language models for specific AI scenarios report, published in August 2024, Gartner examines how the definitions of “small” and “large” in AI language models have evolved.
Gartner notes that GPT-4 (OpenAI – March 2023), Gemini 1.5 (Google – February 2024), Llama 3.1 405B (Meta – July 2024) and Claude 3 Opus (Anthropic – March 2024) are estimated to have between half a trillion and two trillion parameters. At the other end of the spectrum, models such as Mistral 7B (Mistral AI – September 2023), Phi-3-mini 3.8B and Phi-3-small 7B (Microsoft – April 2024), Llama 3.1 8B (Meta – July 2024) and Gemma 2 9B (Google – June 2024) are estimated to have 10 billion parameters or fewer.
As an example of the difference in computational resources, Gartner reports that Llama 3 8B (eight billion parameters) requires 27.8GB of graphics processing unit (GPU) memory, whereas Llama 3 70B (70 billion parameters) requires 160GB.
The more GPU memory needed, the greater the cost. For instance, at current GPU prices, a server capable of running the complete 671 billion-parameter DeepSeek-R1 model in-memory would cost over $100,000.
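As a rough illustration of where such figures come from – this is a back-of-the-envelope rule of thumb, not Gartner’s methodology, and the 20% overhead factor is an assumption – serving memory can be approximated from the parameter count and the bytes stored per parameter:

```python
def estimate_gpu_memory_gb(params_billions: float,
                           bytes_per_param: float = 2.0,  # 2 = FP16/BF16, 1 = INT8, 0.5 = 4-bit
                           overhead: float = 1.2) -> float:
    """Weight memory (parameters x bytes each) plus a crude ~20%
    allowance for KV cache, activations and runtime buffers."""
    return params_billions * bytes_per_param * overhead

# Illustrative FP16 estimates for the models discussed above
for name, size in [("Llama 3 8B", 8), ("Llama 3 70B", 70), ("DeepSeek-R1", 671)]:
    print(f"{name}: ~{estimate_gpu_memory_gb(size):.0f}GB")
```

Published figures such as Gartner’s also reflect the serving stack, context length and quantisation in use, so they can differ noticeably from this naive estimate.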
Knowledge distillation
The fact that a large language model is several times larger than a small language model – in terms of the parameters learned during training and then used for AI inference – implies that SLMs are trained on a subset of data. This suggests there are likely to be gaps in their knowledge, so they will sometimes be unable to provide the best answer to a particular query.
Knowledge distillation, in which a smaller model is trained to reproduce the behaviour of a larger one, offers a way to narrow those gaps. “This knowledge transfer represents one of the most promising approaches to democratising advanced language capabilities without the computational burden of billion-parameter models,” says Jarrod Vawdrey of Domino Data Lab. “Distilled SLMs improve response quality and reasoning while using a fraction of the compute of LLMs.”
Vawdrey says knowledge distillation from LLMs to SLMs begins with two key components: a pre-trained LLM that serves as the “teacher”, and a smaller architecture that will become the SLM “student”. The smaller architecture is typically initialised either randomly or with basic pre-training.
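The heart of that process is a training loss that pulls the student’s output distribution towards the teacher’s. Below is a minimal sketch of the widely used soft-target formulation, assuming PyTorch; the temperature and blending weight are illustrative defaults rather than values from Vawdrey:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      T: float = 2.0,       # temperature: assumed, tuned in practice
                      alpha: float = 0.5    # soft/hard blend: assumed, tuned in practice
                      ) -> torch.Tensor:
    """Blend of a soft-target loss (match the teacher's softened output
    distribution) and the usual hard-label cross-entropy."""
    # KL divergence between temperature-softened distributions, scaled by
    # T^2 to keep gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```

During training, the teacher runs in inference mode over the same inputs to produce its logits, while only the student’s weights are updated.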
Augmenting SLMs
Neither an LLM nor an SLM alone may deliver everything an organisation needs. Enterprise users will typically want to combine the data held in their corporate IT systems with an AI model.
According to Dominik Tomicevic, CEO of graph database provider Memgraph, context lies at the core of the entire model debate. “For very general, homework-level problems, an LLM works fine, but the moment you need a language-based AI to be truly useful, you have to go with an SLM,” he says.
For instance, the way a company mixes paint, builds internet of things (IoT) networks or schedules deliveries is unique. “The AI doesn’t need to recall who won the World Cup in 1930,” he adds. “You need it to help you optimise for a particular problem in your corporate domain.”
As Tomicevic notes, an SLM can be trained to detect queries about orders in an e-commerce system and, within the supply chain, gain deep knowledge of that specific area – making it far better at answering relevant questions. Another benefit is that for mid-sized and smaller operations, training an SLM is significantly cheaper – considering the cost of GPUs and power – than training an LLM.
However, according to Tomicevic, getting supply chain data into a focused small language model is technically a major hurdle. “Until the basic architecture that both LLMs and SLMs share – the transformer – evolves, updating a language model remains difficult,” he says. “These models prefer to be trained in one big batch, absorbing all the data at once and then reasoning only within what they think they know.”
This means updating or keeping an SLM fresh, no matter how well-focused it is on the use cases for the business, remains a challenge. “The context window still needs to be fed with relevant information,” he adds.
For Tomicevic, this is where an additional element comes in – organisations repeatedly find that a knowledge graph is the best data model to sit alongside a domain-trained SLM, acting as its constant tutor and interpreter.
Retrieval augmented generation (RAG) powered by graph technology can bridge structured and unstructured data. Tomicevic says this allows AI systems to retrieve the most relevant insights with lower costs and higher accuracy. “It also enhances reasoning by dynamically fetching data from an up-to-date database, eliminating static storage and ensuring responses are always informed by the latest information,” he says.
“This transforms how organisations deploy AI, bringing powerful capabilities to environments previously considered impractical for advanced computing and democratising access across geographical and infrastructure barriers,” he adds.
According to Mahl, RAG provides a pipeline that cuts through the noise to deliver precise, relevant context to small language models.
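As a concrete sketch of the kind of pipeline described above – with `run_cypher` and `slm_generate` as hypothetical stand-ins for a graph database driver and a locally hosted small model, and an invented e-commerce schema – retrieval, augmentation and generation might look like this:

```python
def run_cypher(query: str, params: dict) -> list[dict]:
    """Placeholder for a graph-database call (e.g. via a Bolt driver)."""
    return [{"order_id": "A-1042", "status": "delayed", "eta": "2025-07-02"}]

def slm_generate(prompt: str) -> str:
    """Placeholder for a call to a locally hosted small language model."""
    return "Order A-1042 is delayed; new ETA 2 July 2025."

def answer(question: str, customer_id: str) -> str:
    # 1. Retrieve: fetch only the facts relevant to this customer from the
    #    live graph, so the model reasons over current data, not stale weights.
    rows = run_cypher(
        "MATCH (c:Customer {id: $cid})-[:PLACED]->(o:Order) "
        "RETURN o.id AS order_id, o.status AS status, o.eta AS eta",
        {"cid": customer_id},
    )
    # 2. Augment: serialise the retrieved facts into the prompt context.
    context = "\n".join(str(r) for r in rows)
    # 3. Generate: the SLM answers strictly from the supplied context.
    prompt = (f"Answer using only this data:\n{context}\n\n"
              f"Question: {question}")
    return slm_generate(prompt)

print(answer("Where is my order?", "C-77"))
```

Because the facts are fetched from the live graph at question time, answers reflect current data rather than whatever was frozen into the model’s weights during training.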
Reducing errors and hallucinations
While LLMs are regarded as incredibly powerful, they suffer from errors known as hallucinations, whereby they effectively make things up.
Rami Luisto, healthcare AI lead data scientist at Digital Workforce, a provider of business automation and technology solutions, says SLMs offer greater transparency into their inner workings and outputs. “When explainability and trust are crucial, auditing an SLM can be much simpler compared to trying to extract reasons for an LLM’s behaviour,” he says.
While there is a lot of industry hype around agentic AI, a major barrier to using AI agents to automate complex workflows is that these systems are prone to errors, leading to incorrect decisions being automated. Accuracy will improve over time, but there is little evidence that enterprise applications are being developed with tolerance for the errors agentic AI systems may introduce.
In a recent Computer Weekly podcast, Anushree Verma, a director analyst at Gartner, noted that there is a shift towards domain-specific language models and lighter models that can be fine-tuned. Over time, it is likely these smaller AI models will work like experts to complement more general agentic AI systems, which may help to improve accuracy.
It is rather like someone who is not a specialist in a particular field asking an expert for advice – akin to the “phone a friend” lifeline in the TV game show Who Wants to Be a Millionaire?
DeepMind CEO Demis Hassabis envisages a world where multiple AI agents coordinate their activities to deliver a goal. So, while an SLM may have received its knowledge from an LLM through knowledge distillation, techniques such as RAG and its suitability for domain-specific optimisation mean the SLM may eventually be called on as an expert to help a more general LLM answer a domain-specific question.