Glossary · Term

Compact language model

Also known as: Small language model, sLM, sLLM, lightweight model

Compact language model is a language model whose size has been significantly reduced compared to large models. It is cheap and fast, so it is in the spotlight for specific tasks.

A small language model (SLM) is a language model that maintains the structure of a large language model while greatly reducing the number of parameters, allowing it to run with fewer resources. Instead of a luxury hotel kitchen that does all the cooking, it can be likened to a specialty restaurant that makes one menu item quickly and inexpensively.

It attracted attention because it was recognized that using a large, high-end model for all tasks was a waste of money. For tasks with a defined scope, such as document classification, summarization, and customer service, it is cheaper and faster to use small models by fine-tuning them, and they are also the mainstay of on-device AI that is installed directly on devices such as smartphones.

Small models do not necessarily have lower quality, but there is a gap with large models in terms of extensive knowledge and complex reasoning. Therefore, a combination strategy where SLM takes on easy tasks and a large model takes on difficult tasks is commonly used.

✅ Why it matters

Low operating costs and fast response speed
When fine-tuned for specific tasks, it performs as well as larger models
Can be mounted on devices, making it suitable for on-device AI and security needs

⚠️ Limits and debates

When it comes to broad knowledge and complex inference, it lags behind large models
There is a management burden of selecting and tuning models for each task
Smaller models may have higher hallucinations and error frequencies