Before we learn Backprop, let's remember sensitivity. If a machine multiplies by 3, nudging the input by 1 moves the output by 3.
Sensitivity = 3 (Constant)
When machines are chained, their sensitivities multiply.
Why is the gradient of b² equal to 2*b? The slope gets steeper as the number gets bigger.
In a = x * y, why is the derivative of x equal to y? Use the "Hourly Wage" analogy.
1 more hour (+1 to x) → Pay increases by $10 (y).
$1 raise (+1 to y) → Pay increases by $5 (x).