Batch normalization

Batch normalization formula: Y = (X - μ) / σ * γ + β

Image: Typhoon_Mawar_2005_computer_simulation.gif: Atmoz derivative work: Atmoz (talk), CC BY-SA 3.0, via Wikimedia Commons

Batch normalization

Batch normalization formula: Y = (X - μ) / σ * γ + β

Batch normalization adjusts the inputs to each layer in a neural network. It re-centers them around zero and re-scaales them to a standard size, making training faster and more stable.

Example

If X = [1, 2, 3], μ = 2, and σ = 1, then Y = [(1-2)/1, (2-2)/1, (3-2)/1] * γ + β = [-1, 0, 1] * γ + β = [-1*γ + β, 0*γ + β, 1*γ + β].

Understanding the formula for batch normalization is crucial for implementing this technique correctly in neural networks.

Related concepts

One email a day: 5 concepts + the 5 stories that matter →

Swipe through 100 ML concepts daily

Open TickerNews