Knowledge distillation

Knowledge distillation transfers knowledge from a large model to a smaller one without loss of validity

Image: U.S. Navy Photo by Mass Communication Specialist 3rd Class Jeff Johnstone, Public domain, via Wikimedia Commons

Knowledge distillation

Knowledge distillation transfers knowledge from a large model to a smaller one without loss of validity

Knowledge distillation is a technique used in machine learning to transfer knowledge from a large model to a smaller one. This process allows the smaller model to retain the knowledge capacity of the larger model without the computational expense.

Example

A large neural network trained on a dataset can be distilled into a smaller network that performs nearly as well on the same tasks, making it more efficient for deployment on devices with limited resources.

Knowledge distillation enables the use of smaller, more efficient models while maintaining high performance, which is crucial for deploying machine learning applications on devices with limited computational power.

Related concepts

One email a day: 5 concepts + the 5 stories that matter →

Swipe through 100 ML concepts daily

Open TickerNews